User Tools

Site Tools


hardware

This is an old revision of the document!


Popular Libraries for Scientific Computing, Machine Learning, and Data Science

This document provides an overview of widely used libraries across various domains such as scientific computing, machine learning, data science, and high-performance computing.


Compiler Libraries

    1. Domain: Compiler
    2. Description: GCC is a collection of compilers for various programming languages, including C, C++, and Fortran. It's essential for compiling code in high-performance computing and software development.
    1. Domain: Compiler Infrastructure
    2. Description: LLVM is a collection of modular and reusable compiler and toolchain technologies. It’s used for developing compilers, optimizers, and other code analysis tools.

GPU Computing & Deep Learning

    1. Domain: GPU Computing
    2. Description: CUDA is a parallel computing platform and application programming interface model created by NVIDIA. It allows developers to utilize GPUs for general-purpose processing.
    1. Domain: Deep Learning
    2. Description: cuDNN is a GPU-accelerated library for deep neural networks. It's optimized for high performance on NVIDIA hardware and is widely used in machine learning and AI applications.
    1. Domain: Machine Learning
    2. Description: PyTorch is an open-source machine learning library based on the Torch library. It is widely used in deep learning and research due to its flexibility and ease of use.
    1. Domain: Machine Learning
    2. Description: TensorFlow is an open-source platform for machine learning. It provides a comprehensive ecosystem of tools, libraries, and community resources to facilitate the development of ML models.

Scientific Computing & Numerical Libraries

    1. Domain: Mathematics & Numerical Computation
    2. Description: GSL is a collection of numerical routines for scientific computing, including statistics, linear algebra, and numerical integration.
    1. Domain: Signal Processing
    2. Description: FFTW is a C library for computing discrete Fourier transforms (DFT). It is highly efficient and widely used for signal processing and numerical simulations.
    1. Domain: Data Storage
    2. Description: HDF5 is a file format and set of tools for storing and managing large amounts of data. It's commonly used in scientific computing for high-volume data storage.
    1. Domain: Scientific Computing & Data Science
    2. Description: NumPy is the fundamental package for numerical computing in Python. It provides support for large multi-dimensional arrays and matrices, along with a collection of mathematical functions to operate on these arrays.
    1. Domain: Scientific Computing
    2. Description: SciPy is a Python library used for scientific and technical computing. It builds on NumPy and provides a collection of algorithms for optimization, integration, interpolation, eigenvalue problems, and more.
    1. Domain: Linear Algebra
    2. Description: LAPACK is a library for solving linear algebra problems such as systems of linear equations, least squares problems, eigenvalue problems, and singular value decomposition.
    1. Domain: Linear Algebra
    2. Description: OpenBLAS is an optimized BLAS (Basic Linear Algebra Subprograms) library that provides highly optimized matrix operations and is used for high-performance numerical computing.

Image Processing

    1. Domain: Image Processing
    2. Description: libjpeg-turbo is a high-speed JPEG image compression library that provides faster encoding and decoding of JPEG images compared to other implementations.
    1. Domain: Image Processing
    2. Description: libpng is a reference library for the PNG (Portable Network Graphics) image format. It’s used extensively for handling PNG images in web development, image processing, and graphics applications.

High-Performance Computing (HPC)

    1. Domain: Linear Algebra & HPC
    2. Description: MAGMA is a library for high-performance linear algebra routines on multi-core and multi-GPU systems. It's a key tool in high-performance computing (HPC) applications.
    1. Domain: Parallel Computing
    2. Description: MPICH is an implementation of the Message Passing Interface (MPI) standard for parallel computing. It is widely used in high-performance parallel applications.
    1. Domain: Parallel Computing
    2. Description: OpenMPI is an open-source implementation of the Message Passing Interface (MPI) for parallel and distributed computing. It is used in high-performance computing environments for multi-node computations.

Data Science & Data Manipulation

    1. Domain: Data Science
    2. Description: Pandas is a fast, powerful, flexible, and easy-to-use data analysis and manipulation library for Python. It’s used extensively in data science and data analysis.
    1. Domain: Programming Language
    2. Description: Python is a versatile, high-level programming language that is widely used in web development, data science, automation, machine learning, and more.
    1. Domain: Python Package Management
    2. Description: Setuptools is a package development and distribution library for Python. It simplifies packaging Python projects and managing their dependencies.

Data Compression

    1. Domain: Data Compression
    2. Description: LZ4 is a fast compression algorithm and library that allows extremely high compression and decompression speeds. It is used in high-performance data storage and transmission applications.
    1. Domain: Data Compression
    2. Description: Zlib is a software library used for data compression. It is used in many applications for compressing data to reduce storage or transmission size.

Additional Libraries

    1. Domain: Compiler
    2. Description: GCC is a collection of compilers for various programming languages, including C, C++, and Fortran. It's essential for compiling code in high-performance computing and software development.
    1. Domain: Programming Language
    2. Description: Java is a widely used programming language that is object-oriented and designed to have as few implementation dependencies as possible. It is used across various industries, including enterprise applications and big data.
    1. Domain: High-Performance Computing & Data Science
    2. Description: Julia is a high-performance programming language for technical computing. It’s used for scientific computing, data science, and machine learning, thanks to its speed and ease of use.
    1. Domain: Compiler Infrastructure
    2. Description: LLVM is a collection of modular and reusable compiler and toolchain technologies. It’s used for developing compilers, optimizers, and other code analysis tools.
    1. Domain: Machine Learning
    2. Description: TensorFlow is an open-source platform for machine learning. It provides a comprehensive ecosystem of tools, libraries, and community resources to facilitate the development of ML models.
    1. Domain: Data Storage
    2. Description: HDF5 is a file format and set of tools for storing and managing large amounts of data. It's commonly used in scientific computing for high-volume data storage.
hardware.1744065115.txt.gz · Last modified: 2025/04/07 22:31 by nshegunov

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki