Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

Installation

We publish nightly wheel packages on https://pypi.nvidia.com

| built-env           | cuda 11.8             | cuda 12.1             |
|---------------------|-----------------------|-----------------------|
| torch 2.2           | nvfuser-cu118-torch22 | nvfuser-cu121-torch22 |
| torch nightly wheel | nvfuser-cu118         | nvfuser-cu121         |

Note that nvfuser built against torch 2.2 is not compatible with the nightly PyTorch wheel, so pick the package that matches your environment.
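The lookup in the table above can be sketched as a small helper. The wheel names are copied from the table; the function name `pick_nvfuser_wheel` and the `"nightly"` key are hypothetical and are not part of nvfuser.

```python
# Wheel names copied from the compatibility table above.
# The helper itself is a hypothetical convenience, not an nvfuser API.
WHEELS = {
    ("2.2", "11.8"): "nvfuser-cu118-torch22",
    ("2.2", "12.1"): "nvfuser-cu121-torch22",
    ("nightly", "11.8"): "nvfuser-cu118",
    ("nightly", "12.1"): "nvfuser-cu121",
}


def pick_nvfuser_wheel(torch_build: str, cuda_version: str) -> str:
    """Return the wheel name for a torch build ("2.2" or "nightly") and a CUDA version."""
    try:
        return WHEELS[(torch_build, cuda_version)]
    except KeyError:
        # Combinations outside the table are rejected rather than guessed.
        raise ValueError(
            f"no wheel listed for torch {torch_build!r} with CUDA {cuda_version!r}"
        ) from None


if __name__ == "__main__":
    print(pick_nvfuser_wheel("2.2", "12.1"))
```

Raising on an unlisted combination is a deliberate choice here: a wheel built against the wrong torch version is not expected to work, so failing early is safer than falling back to a default.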

nightly nvfuser pip wheel

You can install a nightly nvfuser pip package built against the torch nightly code base with: pip install --pre nvfuser-cu121 --extra-index-url https://pypi.nvidia.com

Because we build against the nightly torch wheel, and nightly wheels carry no compatibility guarantee, we have explicitly marked the nightly torch wheel as an optional dependency. You can choose to install the torch wheel along with the nvfuser package, e.g. pip install --pre "nvfuser-cu121[torch]" --extra-index-url https://pypi.nvidia.com. Note that this may uninstall your local PyTorch installation and install a compatible nightly PyTorch.
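For scripted setups, the two nightly install variants above can be assembled programmatically. This is a minimal sketch: the function name `nightly_install_command` and its parameters are hypothetical, and it invokes pip as `python -m pip` for the current interpreter instead of the bare `pip` shown above, on the assumption that both resolve to the same environment.

```python
import sys


def nightly_install_command(cuda_tag: str = "cu121", with_torch: bool = False) -> list[str]:
    """Build the pip argument list for the nightly nvfuser wheel.

    cuda_tag:   "cu118" or "cu121", matching the nightly wheel names above.
    with_torch: request the optional [torch] extra, which may replace a
                locally installed PyTorch with a compatible nightly build.
    """
    package = f"nvfuser-{cuda_tag}"
    if with_torch:
        package += "[torch]"
    return [
        sys.executable, "-m", "pip", "install", "--pre", package,
        "--extra-index-url", "https://pypi.nvidia.com",
    ]


if __name__ == "__main__":
    cmd = nightly_install_command(with_torch=False)
    print(" ".join(cmd))
    # To actually run the install in the current environment:
    # import subprocess; subprocess.run(cmd, check=True)
```

Keeping the command as a list rather than a single string avoids shell quoting problems with the square brackets in `nvfuser-cu121[torch]`.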

nvfuser pip wheel against pytorch stable release

nvfuser pip wheels built against stable torch releases are published on pypi.org. Pick the CUDA toolkit version that matches your torch installation, e.g. pip install nvfuser-cu121-torch22

PyPI: https://pypi.org/project/nvfuser/

Developer

Docs: https://github.com/NVIDIA/Fuser/wiki

Supported compilers:

GCC:

We support all "supported releases" of gcc as specified on the official site. As of 6/20/2024, they are:

  • gcc 11.4
  • gcc 12.4
  • gcc 13.2
  • gcc 14.1

Clang:

  • clang 14+
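A build script could check a compiler version string against the lists above before configuring. The sketch below is illustrative only: the helper names are hypothetical, and treating each listed gcc release as its whole major series (11 through 14) is an assumption on top of what the list states.

```python
# Release series taken from the lists above (gcc list as of 6/20/2024).
# Matching on the major number only is an assumption of this sketch.
LISTED_GCC_MAJORS = {11, 12, 13, 14}
MIN_CLANG_MAJOR = 14


def major_version(version: str) -> int:
    """Extract the major number from a dotted version string such as "13.2.0"."""
    return int(version.strip().split(".")[0])


def compiler_is_listed(name: str, version: str) -> bool:
    """Return True if the compiler falls in one of the series listed above."""
    major = major_version(version)
    if name == "gcc":
        return major in LISTED_GCC_MAJORS
    if name == "clang":
        return major >= MIN_CLANG_MAJOR
    # Any other compiler is not mentioned in the lists.
    return False


if __name__ == "__main__":
    for name, version in [("gcc", "13.2.0"), ("gcc", "10.5.0"), ("clang", "17.0.6")]:
        print(name, version, compiler_is_listed(name, version))
```

The version string is passed in rather than probed, so the check stays independent of how a given toolchain reports its version.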

Supported C++ standard:

  • C++17
  • C++20

We are actively considering dropping C++17 support.
