The General Parallel File System (GPFS) is a high-performance shared-disk clustered file system developed by IBM. It is used by some of the supercomputers on the TOP500 list.[1] For example, GPFS is the filesystem of the ASC Purple supercomputer,[2] which is composed of more than 12,000 processors and has 2 petabytes of total disk storage spanning more than 11,000 disks.
In common with typical cluster filesystems, GPFS provides concurrent high-speed file access to applications running on multiple nodes of a cluster. It can be used with AIX 5L clusters, Linux clusters, Microsoft Windows Server clusters, or a heterogeneous cluster of AIX, Linux and Windows nodes. In addition to providing filesystem storage capabilities, GPFS provides tools for management and administration of the GPFS cluster, and allows shared access to file systems from remote GPFS clusters.
GPFS has been available on IBM's AIX since 1998, on Linux since 2001 and on Microsoft Windows Server since 2008, and is offered as part of the IBM System Cluster 1350.
GPFS provides high performance by allowing data to be accessed by multiple computers at once. Most existing file systems are designed for a single-server environment, and adding more file servers does not improve performance. GPFS provides higher input/output performance by "striping" blocks of data from individual files over multiple disks and reading and writing these blocks in parallel. Other features provided by GPFS include high availability, support for heterogeneous clusters, disaster recovery, security, DMAPI, HSM and ILM.
Striping results in high read and write speeds for a single file, since the combined bandwidth of the many physical drives is high. Striping alone, however, makes the filesystem vulnerable to disk failures: any one disk failing would be enough to lose data. To prevent data loss, the filesystem nodes have RAID controllers, which write multiple copies of each block to the physical disks on the individual nodes. It is also possible to opt out of RAID-replicated blocks and instead store two copies of each block on different filesystem nodes.
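The interplay of these two ideas can be shown with a short sketch: blocks of a file are spread round-robin over several disks so they can be read and written in parallel, and each block is also placed on a second disk so that no single disk failure loses data. This is a minimal Python illustration, not GPFS code; the block size, disk count, and replica count are arbitrary values chosen for the example.

```python
# Minimal sketch (not GPFS code): stripe a file's blocks round-robin across
# several disks, replicating each block on a second disk so that the failure
# of any one disk loses no data.

BLOCK_SIZE = 4   # bytes per block; real filesystems use far larger blocks
NUM_DISKS = 4
REPLICAS = 2

def stripe(data: bytes) -> dict:
    """Return {disk_id: [(block_no, payload), ...]} block placements."""
    placements = {d: [] for d in range(NUM_DISKS)}
    blocks = [data[i:i + BLOCK_SIZE] for i in range(0, len(data), BLOCK_SIZE)]
    for block_no, payload in enumerate(blocks):
        # Primary copy on disk (block_no mod NUM_DISKS); each replica on the
        # next disk over, so the two copies always land on distinct disks.
        for r in range(REPLICAS):
            placements[(block_no + r) % NUM_DISKS].append((block_no, payload))
    return placements

def read_back(placements: dict, failed_disk: int) -> bytes:
    """Reassemble the file even though one disk is unavailable."""
    recovered = {}
    for disk, blocks in placements.items():
        if disk == failed_disk:
            continue
        for block_no, payload in blocks:
            recovered.setdefault(block_no, payload)
    return b"".join(recovered[i] for i in sorted(recovered))

data = b"The quick brown fox jumps over the lazy dog"
assert read_back(stripe(data), failed_disk=2) == data
```

Reading the surviving copies from several disks at once is what multiplies bandwidth; placing the copies on distinct disks is what makes any single failure survivable.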
Other features of the filesystem include:
Distributed metadata, including the directory tree. There is no single "directory controller" or "index server" in charge of the filesystem. This is in contrast to Apache Hadoop's HDFS, whose NameNode is a single point of failure.
Efficient indexing of directory entries for very large directories. Many filesystems are limited to a small number of files in a single directory (often 65,536, or a similar small power of two). GPFS does not have such limits (a directory-indexing sketch appears after this list).
Distributed locking. This allows for full POSIX filesystem semantics, including locking for exclusive file access (see the locking example after this list).
Partition awareness. A failure of the network may partition the filesystem into two or more groups of nodes that can only see the nodes in their own group. This can be detected through a heartbeat protocol, and when a partition occurs, the filesystem remains live for the largest partition formed. This offers graceful degradation of the filesystem: some machines will remain working (see the partition sketch after this list).
Filesystem maintenance can be performed online. Most filesystem maintenance chores (adding new disks, rebalancing data across disks) can be performed while the filesystem is live. This maximizes availability of the filesystem, and so keeps the supercomputer cluster itself available for longer (see the rebalancing sketch after this list).
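On the directory-indexing point: hashing names into buckets lets a lookup in a huge directory avoid scanning every entry. GPFS's design papers describe extensible hashing for large directories; the Python sketch below only gestures at that idea with a hash table whose bucket array doubles as it fills. All names, capacities, and sizes here are invented for the example.

```python
# Minimal sketch (not GPFS code) of a hash-indexed directory: a lookup costs
# one hash plus one small bucket scan, regardless of how many entries the
# directory holds, instead of a linear scan over all of them.

import hashlib

BUCKET_CAPACITY = 4   # entries per directory block, kept tiny for the demo

class HashedDirectory:
    def __init__(self):
        self.depth = 1                                  # hash bits in use
        self.count = 0                                  # total entries stored
        self.buckets = [dict() for _ in range(1 << self.depth)]

    def _bucket(self, name: str) -> dict:
        h = int.from_bytes(hashlib.sha1(name.encode()).digest()[:8], "big")
        return self.buckets[h & ((1 << self.depth) - 1)]

    def insert(self, name: str, inode: int) -> None:
        if self.count >= BUCKET_CAPACITY * len(self.buckets):
            self._grow()                                # add one hash bit
        bucket = self._bucket(name)
        if name not in bucket:
            self.count += 1
        bucket[name] = inode

    def lookup(self, name: str) -> int:
        return self._bucket(name)[name]                 # no directory scan

    def _grow(self) -> None:
        entries = [e for b in self.buckets for e in b.items()]
        self.depth += 1
        self.buckets = [dict() for _ in range(1 << self.depth)]
        for name, inode in entries:
            self._bucket(name)[name] = inode

d = HashedDirectory()
for i in range(100_000):          # well past a 65,536-entry limit
    d.insert(f"file{i}.dat", i)
print(d.lookup("file99999.dat"))  # -> 99999
```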
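The distributed-locking item is what lets ordinary POSIX locking calls behave correctly across the cluster: a byte-range lock taken on one node excludes conflicting access from every other node. The snippet below uses only the standard POSIX lockf interface as exposed by Python; the /gpfs/scratch path is a hypothetical mount point, and on a local filesystem the identical calls still coordinate the processes of a single machine.

```python
# Standard POSIX byte-range locking via fcntl; on a cluster filesystem with
# full POSIX semantics, the lock manager must honor these locks on all nodes.

import fcntl
import os

path = "/gpfs/scratch/example.dat"   # hypothetical GPFS-mounted path

fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o644)
try:
    # Take an exclusive lock on the first 4 KiB of the file; this call
    # blocks until no other process holds a conflicting lock on that range.
    os.lseek(fd, 0, os.SEEK_SET)
    fcntl.lockf(fd, fcntl.LOCK_EX, 4096)
    os.write(fd, b"exclusive update to the first 4 KiB")
finally:
    # Release the range and close the descriptor.
    os.lseek(fd, 0, os.SEEK_SET)
    fcntl.lockf(fd, fcntl.LOCK_UN, 4096)
    os.close(fd)
```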
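Partition awareness can be made concrete with a toy quorum check: every node tracks which peers it has recently heard heartbeats from, and it keeps the filesystem live only if its group holds a majority of the configured cluster, which in a two-way split is precisely the largest partition. This is an illustrative sketch, not the actual GPFS protocol, and the node names and reachability table are invented.

```python
# Toy partition/quorum check (not the GPFS protocol): nodes in the majority
# group keep serving; nodes in a minority group suspend to avoid split-brain.

ALL_NODES = {"n1", "n2", "n3", "n4", "n5"}

# heartbeats[n] = nodes that n currently hears from (itself included)
heartbeats = {
    "n1": {"n1", "n2", "n3"},   # partition A: three nodes see each other
    "n2": {"n1", "n2", "n3"},
    "n3": {"n1", "n2", "n3"},
    "n4": {"n4", "n5"},         # partition B: two nodes split off
    "n5": {"n4", "n5"},
}

def stays_live(node: str) -> bool:
    """Serve only with a strict majority of the configured cluster; in a
    two-way split, exactly one side (the largest) can hold a majority."""
    return len(heartbeats[node]) > len(ALL_NODES) / 2

for node in sorted(heartbeats):
    print(node, "live" if stays_live(node) else "suspended")
# n1, n2, n3 stay live (group of 3 out of 5); n4 and n5 suspend.
```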
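Finally, on online maintenance: rebalancing after a disk is added can proceed as a stream of small block migrations, with ordinary reads and writes continuing between steps instead of the filesystem being taken offline. The sketch below illustrates that incremental pattern only; it does not depict GPFS's actual rebalancing machinery, and the disk names and block counts are invented.

```python
# Incremental-rebalance sketch (not GPFS code): one block moves per step,
# so normal I/O can be serviced between steps while the layout converges.

def rebalance_step(disk_blocks: dict) -> bool:
    """Move one block from the fullest disk to the emptiest; return False
    once the layout is as balanced as it can get."""
    fullest = max(disk_blocks, key=lambda d: len(disk_blocks[d]))
    emptiest = min(disk_blocks, key=lambda d: len(disk_blocks[d]))
    if len(disk_blocks[fullest]) - len(disk_blocks[emptiest]) <= 1:
        return False
    disk_blocks[emptiest].append(disk_blocks[fullest].pop())
    return True

# Three populated disks, plus a fourth added empty while the filesystem is live.
disks = {"d1": list(range(0, 12)), "d2": list(range(12, 24)),
         "d3": list(range(24, 36)), "d4": []}

while rebalance_step(disks):
    pass   # in a real system, application I/O proceeds between steps

print({d: len(blocks) for d, blocks in disks.items()})  # 9 blocks per disk
```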
Storage Systems - Projects - GPFS
Parallel I/O for High Performance Computing