Questions about LDS Calculation and Examples in CIFAR and QNLI · Issue #79 · MadryLab/trak · GitHub
Open
hhf-hd opened this issue Nov 28, 2024 · 1 comment
hhf-hd commented Nov 28, 2024

I would like to express my admiration for your outstanding work. The methods and examples you provided are highly intuitive and leave a lasting impression. The effectiveness of your approach is evident, and I am genuinely inspired by your contributions to the field.

I have a few questions and would greatly appreciate your clarification:

  1. LDS calculation in examples/cifar_quickstart.ipynb:
    In the final step of the notebook, where the LDS is calculated, I noticed that the LDS value remains very low even after training numerous models. Could you please explain why this happens? Is it an expected outcome, or am I overlooking something?
    [image]

  2. Comparison between image and text examples:
    While the results on image datasets are visually striking and highly effective, some of the examples on the QNLI dataset seemed less convincing to me. I understand that textual data is inherently more diverse, rich in hidden semantic information, and subjective. Could you provide additional QNLI examples that better showcase the effectiveness of your approach in the text domain, similar to the pre-computed TRAK scores for CIFAR-10 (Google Colab notebook)?

  3. Reproducibility of LDS on QNLI:
    For QNLI, I am particularly interested in how the LDS metric is calculated. Since it requires subset masks and models trained on those subsets (similar to the approach in cifar_quickstart.ipynb), would it be possible for you to share the training process and the code for calculating LDS on QNLI? This would be immensely helpful for reproducing the results in your paper efficiently.

Thank you again for your remarkable work and for making your research open to the community. I sincerely hope you can provide some insights into these questions.

Looking forward to your response!

@eichinflo

Thanks from my side as well for the paper and this amazing code repo! I'm hijacking this issue since I'm also currently looking for a way to achieve point 3. Is there any chance we could get access to the dataset masks and data model scores of the QNLI and ImageNet experiments you carried out?

Thanks for your time and help.
