Highlights
- Pro
Starred repositories
OmniCode [CodeArena]: Is a diverse A Diverse Software Engineering Benchmark for Evaluating Large Language Models
In this repository we investigate the capabilities of selected Large Language Models on understanding structured code execution.
[ICSME '24 NIER] Artifact for GlueTest: Testing Code Translation via Language Interoperability
A manually vetted dataset for security vulnerability detection in Java projects
Reproducing BugsInPy: Benchmarking Bugs in Python Projects
FANC is a tool for the proof transfer of incomplete verification
Stan development repository. The master branch contains the current release. The develop branch contains the latest stable development. See the Developer Process Wiki for details.