Revamped Eval Function

During investigation with Sakana's Kernel in #25, we created a stronger eval function to avoid that kind of exploits that some observed.
I didn't merge it in (sit on a branch) because we want to make sure our paper result didn't change from such an update (for ICML rebuttal).

During ICML rebuttal, I have also checked if any of our existing kernels have similar kind of exploits. Luckily, none of our kernels are smart enough to do that yet.

Now that ICML is over, I plan to merge the more robust eval function in.

In particular, the simple fix is

compute reference, Model
clear cache
compute reference, ModelNew
check if they are equivalent

AND

compute reference, ModelNew
clear cache
compute reference, Model
check if they are equivalent

Check both directions to be extra sure!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions