GitHub - arjunguha/longbench: I cannot remember the point of this repository. I'd delete it, but it looks like @cassanof did some work on it.

Use builder.pynb to create the benchmark. The script relies on randomness, so commit the generated benchmark to the repository.

Use completions.py to geenerate completions.

python3 completions.py --input benchmark.jsonl --output completions.jsonl --model-name /home/arjun/models/starcoderbase --batch-size 50 --num-completions 1 --max-tokens 8192

Use executions.py to execution completions. (Needs an update.)

python3 executions.py --input completions.jsonl --output executions.jsonl

Use pass1.ipynb to look at the results. (Needs an update.)

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
attic		attic
key_retrieval		key_retrieval
selfextend		selfextend
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
benchmark.jsonl		benchmark.jsonl
benchmark_builder.ipynb		benchmark_builder.ipynb
completions.py		completions.py
completions_starcoderbase15b.jsonl		completions_starcoderbase15b.jsonl
completions_starcoderbase1b.jsonl		completions_starcoderbase1b.jsonl
executions.py		executions.py
executions_starcoderbase15b.jsonl		executions_starcoderbase15b.jsonl
executions_starcoderbase1b.jsonl		executions_starcoderbase1b.jsonl
mutant_dataset_builder.py		mutant_dataset_builder.py
pass1.ipynb		pass1.ipynb
pass1.py		pass1.py
py_high_temp.tar.gz		py_high_temp.tar.gz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Contributors 2

Uh oh!

Languages

License

arjunguha/longbench

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors 2

Uh oh!

Languages