8000 [INFER] Add cutlass fp8 gemm auto tune by Sunny-bot1 · Pull Request #9020 · PaddlePaddle/PaddleNLP · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

[INFER] Add cutlass fp8 gemm auto tune #9020

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 20 commits into from

Conversation

Sunny-bot1
Copy link
Contributor

PR types

PR changes

Description

Copy link
paddle-bot bot commented Aug 27, 2024

Thanks for your contribution!

8000 Copy link
codecov bot commented Sep 4, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 53.29%. Comparing base (db270d9) to head (a4c05f8).
Report is 256 commits behind head on develop.

Additional details and impacted files
@@           Coverage Diff            @@
##           develop    #9020   +/-   ##
========================================
  Coverage    53.29%   53.29%           
========================================
  Files          652      652           
  Lines       105483   105483           
========================================
  Hits         56222    56222           
  Misses       49261    49261           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@ZHUI ZHUI changed the title add cutlass fp8 gemm auto tune [INFER] Add cutlass fp8 gemm auto tune Sep 13, 2024
@@ -0,0 +1,583 @@
# Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

我建议在csrc/目录下建一个utils或者tools目录,把csrc/generate_code_dual_gemm_fused_kernels.py移进去,另外generate_code_dual_gemm_fused_kernels文件的命名也有点不太准确,改为auto_gen_fp8_fp8_dual_gemm_fused_kernels.py ?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

另外顺便把csrc/test_tune_cublaslt_gemm.py也移到utils或者tools目录下,test_tune_cublaslt_gemm命名改为tune_cublaslt_int8_gemm.py吧

@yuanlehome yuanlehome closed this Sep 20, 2024
@yuanlehome
Copy link
Collaborator

move to #9151

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants
0