uservan/speculative_thinking

speculative_thinking

eval normal model

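  • To evaluate a single model, run the command below. The example uses DeepSeek-R1-Distill-Qwen-32B; to evaluate a different checkpoint (for example, a DeepSeek 7B model), change the --model value.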
python ./skythought_evals/eval.py --model deepseek-ai/DeepSeek-R1-Distill-Qwen-32B \
    --evals amc23 --n 1 --result-dir ./eval1/amc2323 \
    --tp 2 --output-file ./eval1/amc2323/32B.txt
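
For scripted runs, the command above can be assembled in Python. This is a minimal sketch and not part of the repository: `build_eval_command` is a hypothetical helper, and it passes only the flags that appear in the command above.

```python
import shlex
import sys


def build_eval_command(model, evals, n, result_dir, tp, output_file,
                       script="./skythought_evals/eval.py"):
    """Return the argv list for one eval.py run (hypothetical helper)."""
    return [
        sys.executable, script,
        "--model", model,
        "--evals", evals,
        "--n", str(n),
        "--result-dir", result_dir,
        "--tp", str(tp),
        "--output-file", output_file,
    ]


cmd = build_eval_command(
    model="deepseek-ai/DeepSeek-R1-Distill-Qwen-32B",
    evals="amc23",
    n=1,
    result_dir="./eval1/amc2323",
    tp=2,
    output_file="./eval1/amc2323/32B.txt",
)

# Print the script path and its arguments without the interpreter path.
print(shlex.join(cmd[1:]))
# To launch the run: import subprocess; subprocess.run(cmd, check=True)
```

Building the command as a list, rather than one string, avoids shell quoting problems when a path contains spaces.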

eval speculative thinking

  • First, prepare a config file. Example configs are in speculative/config:
mode: "vllm" # 'hf' or "vllm"
target_model_name: "deepseek-ai/DeepSeek-R1-Distill-Qwen-14B"
speculative_model_name: "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"
target_model_gpu: 2
speculative_model_gpu: 1
...
  • Then run the command, passing the config file with --spe_config:
python ./skythought_evals/eval.py \
    --evals amc23 --n 1 --result-dir ./eval1/amc2323 \
    --tp 3 --output-file ./eval1/amc2323/1b_14b.txt --spe_config ./speculative/config/1b_14b.yml
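
In the example above, `--tp 3` equals `target_model_gpu` plus `speculative_model_gpu` (2 + 1). Whether eval.py requires this is an assumption, since the README does not say so. Under that assumption, a small pre-flight check might look like the sketch below. `read_flat_config` and `expected_tp` are hypothetical helpers; the former handles only flat `key: value` lines like the ones shown and is not a YAML parser.

```python
def read_flat_config(text):
    """Parse flat `key: value` lines; a stand-in for a real YAML loader."""
    config = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if ":" not in line:
            continue  # skip blank lines and anything that is not key: value
        key, value = line.split(":", 1)
        config[key.strip()] = value.strip().strip('"')
    return config


def expected_tp(config):
    """Assumption: --tp should equal the total GPUs used by both models."""
    return int(config["target_model_gpu"]) + int(config["speculative_model_gpu"])


# Only the keys shown in the config excerpt; the real file contains more.
example = """
mode: "vllm" # 'hf' or "vllm"
target_model_name: "deepseek-ai/DeepSeek-R1-Distill-Qwen-14B"
speculative_model_name: "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"
target_model_gpu: 2
speculative_model_gpu: 1
"""

config = read_flat_config(example)
print(expected_tp(config))  # → 3
```

For real config files, a proper YAML loader is the safer choice, since this parser would mishandle nested keys or a `#` inside a quoted value.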
