arjunguha/longbench: I cannot remember the point of this repository. I'd delete it, but it looks like @cassanof did some work on it.
This repository was archived by the owner on Jun 7, 2025. It is now read-only.

  1. Use builder.pynb to create the benchmark. The script relies on randomness, so commit the generated benchmark to the repository.

  2. Use completions.py to generate completions.

    python3 completions.py --input benchmark.jsonl --output completions.jsonl --model-name /home/arjun/models/starcoderbase --batch-size 50 --num-completions 1 --max-tokens 8192
    
  3. Use executions.py to execute the completions. (Needs an update.)

    python3 executions.py --input completions.jsonl --output executions.jsonl
    
  4. Use pass1.ipynb to look at the results. (Needs an update.)
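Since pass1.ipynb is marked as needing an update, the following is a minimal sketch of how a pass@1 score could be computed from executions.jsonl. It does not come from the repository: the field names `name` and `passed` are assumptions about the output of executions.py, and the helper names `load_jsonl` and `pass_at_1` are hypothetical. Adjust them to the actual schema.

```python
import json
from collections import defaultdict
from typing import Iterable


def load_jsonl(path: str) -> list[dict]:
    """Read one JSON object per non-empty line."""
    with open(path) as f:
        return [json.loads(line) for line in f if line.strip()]


def pass_at_1(records: Iterable[dict]) -> float:
    """Average, over problems, of the fraction of completions that passed.

    Assumes each record has a problem identifier under "name" and a
    boolean under "passed". Both keys are hypothetical.
    """
    passed = defaultdict(int)
    total = defaultdict(int)
    for rec in records:
        name = rec["name"]
        total[name] += 1
        passed[name] += bool(rec["passed"])
    if not total:
        raise ValueError("no execution records found")
    # For k = 1, the per-problem pass@k estimate is simply passed / total.
    return sum(passed[n] / total[n] for n in total) / len(total)
```

With `--num-completions 1`, as in step 2, this reduces to the fraction of problems whose single completion passed. A call might look like `pass_at_1(load_jsonl("executions.jsonl"))`.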
