-
Notifications
You must be signed in to change notification settings - Fork 589
[Evaluation] Migrate LM Harness integration point from simple_evaluate
to evaluate
#1455
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@kaisopos if we don't have them already this is a good opportunity to add unit tests !
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you add in your PR description the steps you took to test this PR? Would be good to verify that the results didn't change as a result of this migration, and that the output platform_task_config looks correct.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 to adding unit tests before you submit here.
Yep, will add unit tests before checkin, as discussed. |
Up to you! Personally I like adding integration tests separately, but adding them here is fine as well. |
…umi-ai/oumi into kostas/evaluation_lm_harness_migration
…umi-ai/oumi into kostas/evaluation_lm_harness_migration
Description
simple_evaluate
toevaluate
, to support (follow-up PR) LM Harness custom tasks.Related issues
Towards OPE-1029, OPE-1097
Before submitting
Reviewers
At least one review from a member of
oumi-ai/oumi-staff
is required.