8000 Workflow runs · stanford-crfm/helm · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Actions: stanford-crfm/helm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
3,743 workflow runs
3,743 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Set up logging in Slurm runner
Test #8828: Pull request #3691 opened by yifanmai
June 24, 2025 23:48 11m 22s yifanmai/slurm-runner-logging
June 24, 2025 23:48 11m 22s
Add Scenario for evaluating LLMs in replicating undergraduate student code
Test #8826: Pull request #3644 synchronize by martinakaduc
June 24, 2025 15:38 11m 17s Kazf28:main
June 24, 2025 15:38 11m 17s
Scenario tests
Scenario tests #421: Scheduled
June 24, 2025 15:38 9m 40s main
June 24, 2025 15:38 9m 40s
Add Scenario for evaluating LLMs in replicating undergraduate student code
Test #8825: Pull request #3644 synchronize by martinakaduc
June 24, 2025 14:30 Action required Kazf28:main
June 24, 2025 14:30 Action required
Add InfiniteBench En.MC scenario (#3687)
Test #8824: Commit 7448960 pushed by yifanmai
June 24, 2025 03:29 11m 1s main
June 24, 2025 03:29 11m 1s
Update requirements.txt (#3688)
Test #8823: Commit e115c06 pushed by yifanmai
June 23, 2025 21:28 11m 26s main
June 23, 2025 21:28 11m 26s
LMKT: Language model cultural alignment transfer (#3682)
Update requirements.txt #127: Commit 67c5e15 pushed by yifanmai
June 23, 2025 21:01 11m 17s main
June 23, 2025 21:01 11m 17s
LMKT: Language model cultural alignment transfer (#3682)
Test #8822: Commit 67c5e15 pushed by yifanmai
June 23, 2025 21:01 10m 53s main
June 23, 2025 21:01 10m 53s
Add InfiniteBench En.MC scenario
Test #8821: Pull request #3687 opened by yifanmai
June 23, 2025 20:34 10m 43s yifanmai/en-mc
June 23, 2025 20:34 10m 43s
Add support for Brazilian Models
Test #8820: Pull request #3686 synchronize by IriedsonSouto
June 23, 2025 18:52 Action required llm-pt-ibm:feat/add_brazilian_models
June 23, 2025 18:52 Action required
Add Scenario for evaluating LLMs in replicating undergraduate student code
Test #8819: Pull request #3644 synchronize by martinakaduc
June 23, 2025 17:51 11m 11s Kazf28:main
June 23, 2025 17:51 11m 11s
Scenario tests
Scenario tests #420: Scheduled
June 23, 2025 15:38 12m 18s main
June 23, 2025 15:38 12m 18s
Upgrade requirements.txt
Upgrade requirements.txt #27: Scheduled
June 23, 2025 15:38 7m 22s main
June 23, 2025 15:38 7m 22s
Add Scenario for evaluating LLMs in replicating undergraduate student code
Test #8817: Pull request #3644 synchronize by Kazf28
June 23, 2025 13:30 Action required Kazf28:main
June 23, 2025 13:30 Action required
Add Scenario for evaluating LLMs in replicating undergraduate student code
Test #8816: Pull request #3644 synchronize by martinakaduc
June 22, 2025 16:25 Action required Kazf28:main
June 22, 2025 16:25 Action required
Scenario tests
Scenario tests #419: Scheduled
June 22, 2025 15:35 10m 33s main
June 22, 2025 15:35 10m 33s
LMKT: Language model cultural alignment transfer
Test #8815: Pull request #3682 synchronize by martinakaduc
June 22, 2025 02:26 10m 47s martinakaduc:lmkt-en
June 22, 2025 02:26 10m 47s
Scenario tests
Scenario tests #418: Scheduled
June 21, 2025 15:35 11m 44s main
June 21, 2025 15:35 11m 44s
LMKT: Language model cultural alignment transfer
Test #8812: Pull request #3682 synchronize by martinakaduc
June 21, 2025 08:22 10m 26s martinakaduc:lmkt-en
June 21, 2025 08:22 10m 26s
Add o3-pro support (#3671)
Test #8811: Commit 2493ac9 pushed by yifanmai
June 21, 2025 00:43 11m 8s main
June 21, 2025 00:43 11m 8s
0