High performance computing recently introduced as a concept in common, in IT market. CompecTA serves its clients as a solution and hardware provider along with HPC software development tools consulting. HPC tools include compilers, parallel programming libraries, high-performance libraries of mathematical operations, debuggers, performance analyzers, and the profilers.
Here is the sample list of the HPC tools that you can get consultancy and support from CompecTA:
Intel® Parallel Studio XE: Intel® Parallel Studio XE parallel software development suite combines Intel's industry-leading C/C++ compiler and Fortran compiler; performance and parallel libraries; error checking, code robustness, and performance profiling tools into a single suite offering.
Intel® C++ Studio XE: Intel C++ Studio XE combines an optimizing C++ compiler with high-performance libraries, performance profiling, thread and memory checking, static security analysis, and more, speeding and simplifying threading and performance.
Intel® Fortran Studio XE: Intel Fortran Studio XE combines an optimizing Fortran compiler with high-performance libraries, performance profiling, thread and memory checking, static security analysis, and more, speeding and simplifying threading and performance.
Intel® Parallel Studio: Intel Parallel Studio XE combines optimizing C, C++ and Fortran compilers with high-performance libraries, performance profiling, thread and memory checking, static security analysis that simplify creating faster executing, more reliable applications.
Intel Cluster Studio: Intel® Cluster Studio cluster software provides exceptional value at a significantly discounted price for clusters running either Linux* or Windows*. Easy to install and easy to use, this cluster suite helps you develop, analyze, and optimize performance of parallel applications for cluster development using Intel® 32- and 64-bit architectures.
Intel Composer XE: Intel® Composer XE combines optimizing compilers with high-performance libraries, advanced vectorization, and Intel® Parallel Building Blocks, speeding and simplifying threading and performance on Linux* and Windows* with the same code base.
Intel C++ Composer XE: Intel C++ Composer XE combines an optimizing C++ compiler with high-performance libraries, advanced vectorization, and Intel® Parallel Building Blocks, speeding and simplifying threading and performance.
Intel Fortran Composer XE: Intel Fortran Composer XE combines an optimizing Fortran compiler with high-performance libraries, advanced vectorization, and Intel® Parallel Building Blocks, speeding and simplifying threading and performance.
Intel Parallel Composer: Intel Parallel Composer equips novice and advanced developers to create faster applications that exploit multicore and enable portability, reliability, scalability, and simplicity.
Intel® Integrated Performance Primitives: Intel® Integrated Performance Primitives (Intel® IPP) is an extensive library of multicore-ready, highly optimized software functions for multimedia, data processing, and communications applications. Intel IPP offers thousands of optimized functions covering frequently used fundamental algorithms.
Intel® Math Kernel Library: Intel® Math Kernel Library (Intel® MKL) is a computing math library of highly optimized, extensively threaded math routines for applications that require maximum performance. Core math functions include BLAS, LAPACK, ScaLAPACK1, sparse solvers, fast Fourier transforms, vector math, and more.
Offering performance optimizations for current and next-generation Intel® processors, it includes improved integration with Microsoft Visual Studio*, Eclipse*, and XCode*. The Intel® MKL computing math library allows for full integration of the Intel® Compatibility OpenMP* runtime library for greater Windows*/Linux* cross-platform compatibility.
Intel® Threading Building Blocks: Intel® Threading Building Blocks 4.0 (Intel® TBB) is a widely used, award-winning C++ template library for creating reliable, portable, and scalable parallel applications. Use Intel® TBB for a simple and rapid way of developing robust task-based parallel applications that scale to available processor cores, are compatible with multiple environments, and are easier to maintain. Intel® TBB is the most proficient way to implement future-proof parallel applications that tap into the power and performance of multicore and manycore hardware platforms.
Intel® VTune™ Amplifier XE: Intel® VTuneTM Amplifier XE is a powerful threading and performance optimization tool for C/C++ and Fortran developers who need to understand an application's serial and parallel behavior to improve performance and scalability.
The powerful performance profiling tool removes the guesswork and analyzes performance behavior in Windows*and Linux* applications, providing quick access to scaling information for faster and improved decision making.
Use Intel VTune Amplifier XE to fine-tune for optimal performance, ensuring cores are fully exploited and new processor capabilities are supported to the fullest.
Intel® Parallel Amplifier 2011: Intel Parallel Amplifier 2011 makes it simple to quickly find multicore performance bottlenecks - without needing to know the processor architecture or assembly code. Intel Parallel Amplifier eliminates the guesswork and analyzes performance behavior in Windows* applications, providing quick access to scaling information for faster and improved decision making.
Intel® MPI Library: Intel® MPI Library focuses on making applications perform better on IA-based clusters—implementing the high-performance MPI-2 specification on multiple fabrics. It enables you to quickly deliver maximum end-user performance, even if you change or upgrade to new interconnects, without requiring major changes to the software or operating environment.
Use this high-performance MPI message library to develop applications that can run on multiple cluster fabric interconnects chosen by the user at runtime. Intel also provides a free runtime environment kit for products developed with the Intel MPI Library.
NOTE: Most of the Intel tools listed above, have non-commercial and academic licenses for the Linux operating system with no fee.
gcc
g++
g77
gfortran
OpenMPI
MVAPICH1/MVAPICH2
MPICH1/MPICH2
BLAS (Basic Linear Algebra Subroutines): Contains subroutines and functions for a number of basic linear algebra operations:
– Dot product
– Givens rotation generation and application
– Vector updates
– Matrix-vector product update
– Triangular system solve (with single or multiple right hand sides)
– Matrix-matrix product update – ...
The routines operate on various storage formats and on four data types (single, double, complex, double complex).
LAPACK (Linear Algebra PACKage): High level algorithms and tricky numerical algorithms go into LAPACK.
– Factorizing matrices
– Applying factored-form orthogonal matrices
– Solving linear equations
– Solving linear least squares problems
– Decomposing matrices
SVD,Schur,... – Computing eigenvalues and eigenvectors
Symmetric,non-symmetric,... – Error bounds, condition estimation
BLACS (Basic Linear Algebra Communication Subroutines): BLACS’ purpose is to create a linear algebra oriented message passing interface that may be implemented efficiently and uniformly across a large range of distributed memory platforms.
FFTW (Fastest Fourier Transform in the West): FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions, of arbitrary input size, and of both real and complex data (as well as of even/odd data, i.e. the discrete cosine/sine transforms or DCT/DST). We believe that FFTW, which is free software, should become the FFT library of choice for most applications.
MS-MPI Library
Virtual Studio
HPC Debugger