GitHub - Elfsong/Venus: Instruction Tuning for Code Efficiency Improvement

Elfsong/Venus


Venus



  • 🎉 What is Venus? Venus is the dataset used to train Afterburner (WIP). It extends the original Mercury dataset and currently covers six languages: Python3, C++, JavaScript, Go, Rust, and Java.
  • 🚧 What is the current progress? We are in the process of expanding the dataset to include more programming languages.
  • 🔮 Why does Venus stand out? A key contribution of Venus is that it provides runtime and memory distributions covering multiple solutions for each problem, significantly more than existing datasets. It can potentially be used for Reinforcement Learning or Instruction Tuning.
  • 🌠 Acknowledgement: Please consider upvoting and citing our work if you find it useful. If you have any questions or issues with the dataset, feel free to email me at mingzhe@nus.edu.sg. Thank you! 😀
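
As a rough illustration of how a per-problem runtime distribution might be used as a reward or ranking signal, the sketch below scores a candidate solution by the fraction of reference solutions it beats. The function name `runtime_percentile` and the sample runtimes are hypothetical and are not taken from the Venus schema or tooling.

```python
from bisect import bisect_right


def runtime_percentile(candidate_ms: float, distribution_ms: list[float]) -> float:
    """Return the fraction of reference runtimes strictly slower than the candidate.

    1.0 means the candidate beats every reference solution; 0.0 means it beats none.
    """
    if not distribution_ms:
        raise ValueError("distribution must contain at least one runtime")
    ordered = sorted(distribution_ms)
    # Count reference runtimes that are less than or equal to the candidate.
    not_slower = bisect_right(ordered, candidate_ms)
    return (len(ordered) - not_slower) / len(ordered)


# Hypothetical runtimes (in milliseconds) for one problem; not real Venus data.
reference_runtimes_ms = [120.0, 95.0, 210.0, 88.0, 150.0]
print(runtime_percentile(100.0, reference_runtimes_ms))  # prints 0.6
```

The same scoring could be applied to a memory distribution by passing peak memory values instead of runtimes.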

🪐 Venus Dataset <- TLDR: here is the dataset:)

🔍 Venus Annotation System

If you find this resource useful, please consider citing our paper. Thank you!

@inproceedings{du2024mercury,
  title={Mercury: A code efficiency benchmark for code large language models},
  author={Du, Mingzhe and Luu, Anh Tuan and Ji, Bin and Liu, Qian and Ng, See-Kiong},
  booktitle={The Thirty-eighth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
  year={2024}
}
