NVIDIA Corporation
- 20.6k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- https://nvidia.com
Pinned Repositories
- cuda-tile Public
CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA Tensor Core units.
- doca-platform Public
DOCA Platform manages provisioning and service orchestration for NVIDIA BlueField DPUs.
- TensorRT-LLM Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
- cuda-quantum Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows.
- Model-Optimizer Public
A unified library of state-of-the-art model optimization techniques, such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks, such as TensorRT-LLM, TensorRT, and vLLM, to optimize inference speed.
- nv-ingest Public
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. It uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts, and images for use in downstream generative applications.
- TransformerEngine Public
A library for accelerating Transformer models on NVIDIA GPUs, including 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada, and Blackwell GPUs, providing better performance with lower memory utilization in both training and inference.