Cutlass Operators

Install from Source

git clone 

# For Ampere(sm80) GPU
./build.sh --arch 80 --jobs 6
# For Ada Lovelace(sm89) GPU
./build.sh --arch 89 --jobs 6
# For Hopper(sm90) GPU
./build.sh --arch 90 --jobs 6

compute-sanitizer --tool memcheck python tools/test*.py

Dependencies

CUTLASS: Flux leverages CUTLASS to generate high-performance GEMM kernels. We currently use CUTLASS 3.7.0 and a tiny patch should be applied to CUTLASS.

Quick Start

# Generate search_space_gemmnormal.cu 
# Move it to src/ops/gemm_normal/tuning_config, and compile the library again.
python3 tools/gen_search_space.py --schema=GemmNormal

# Generate tuned_config_gemmnormal.cu
# Move it to src/ops/gemm_normal/tuning_config, and compile the library again.
python3 tools/tuning/tune_gemm_normal.py --schema=GemmNormal

# Now you can test it.
python3 tools/test_gemm_normal.py 100 12288 6144 --dtype=float16

Name		Name	Last commit message	Last commit date
Latest commit History 144 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
3rdparty		3rdparty
include/ctlop		include/ctlop
python/ctlop		python/ctlop
src		src
test		test
tools		tools
.clang-format		.clang-format
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
autotuner.log		autotuner.log
build.sh		build.sh
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Cutlass Operators

Install from Source

Dependencies

Quick Start

About

Uh oh!

Releases

Packages

Languages

License

cjmcv/flux

Folders and files

Latest commit

History

Repository files navigation

Cutlass Operators

Install from Source

Dependencies

Quick Start

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages