-
CoDesc Public archive
A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.
-
TransCoder Public archive
Forked from facebookresearch/TransCoderPublic release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf
-
CodeSearchNet Public archive
Forked from github/CodeSearchNetDatasets, tools, and benchmarks for representation learning of code.