NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda · 627 stars · 104 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 392 stars · 62 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.5k stars · 1.5k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.7k stars · 231 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 3.9k stars · 455 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.7k stars · 941 forks

Repositories

Showing 10 of 645 repositories
  • cuda-tile Public

    CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA tensor core units.

    MLIR · 130 stars · 6 forks · 1 issue · 0 pull requests · Updated Dec 20, 2025
  • doca-platform Public

    DOCA Platform manages provisioning and service orchestration for BlueField DPUs

    Go · 64 stars · Apache-2.0 · 16 forks · 0 issues · 0 pull requests · Updated Dec 20, 2025
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    Python · 12,434 stars · 1,969 forks · 524 issues · 478 pull requests · Updated Dec 20, 2025
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    C++ · 367 stars · 74 forks · 210 issues (15 need help) · 215 pull requests · Updated Dec 20, 2025
  • trt-samples-for-hackathon-cn Public

    Simple samples for TensorRT programming

    Python · 1,650 stars · Apache-2.0 · 351 forks · 65 issues · 2 pull requests · Updated Dec 20, 2025
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    C++ · 876 stars · 313 forks · 404 issues (16 need help) · 79 pull requests · Updated Dec 20, 2025
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    Python · 1,692 stars · Apache-2.0 · 218 forks · 55 issues · 52 pull requests · Updated Dec 20, 2025
  • nv-ingest Public

    NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.

    Python · 2,789 stars · Apache-2.0 · 280 forks · 101 issues (1 needs help) · 32 pull requests · Updated Dec 20, 2025
  • NVFlare Public

    NVIDIA Federated Learning Application Runtime Environment

    Python · 852 stars · Apache-2.0 · 226 forks · 15 issues · 19 pull requests · Updated Dec 20, 2025
  • TransformerEngine Public

    A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

    Python · 3,016 stars · Apache-2.0 · 583 forks · 281 issues · 101 pull requests · Updated Dec 20, 2025