fixed model output function when computing gradients in float16 by AlaaKhaddaj · Pull Request #36 · MadryLab/trak · GitHub

fixed model output function when computing gradients in float16 #36


Merged — 3 commits merged into 0.2.0 on May 31, 2023

Conversation

AlaaKhaddaj

When computing the margins for the image_classification task, the default dtype of ch.tensor(-ch.inf) is float32. This leads to a dtype mismatch when the model gradients and outputs are computed in float16.
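A minimal sketch of the mismatch and the fix (the variable names here are illustrative, not TRAK's actual code; TRAK imports torch as ch). Assigning a float32 `-inf` tensor into float16 logits fails, while creating the value with the logits' own dtype and device does not:

```python
import torch as ch  # TRAK aliases torch as ch

# float16 model outputs, as produced when gradients are computed in half precision
logits = ch.randn(4, 10, dtype=ch.float16)
labels = ch.randint(0, 10, (4,))

# ch.tensor(-ch.inf) defaults to float32; masking the correct-class logit
# with it would mix dtypes. Creating -inf with the logits' dtype (and
# device) keeps everything in float16:
neg_inf = ch.tensor(-ch.inf, dtype=logits.dtype, device=logits.device)

masked = logits.clone()
masked[ch.arange(logits.shape[0]), labels] = neg_inf

assert masked.dtype == ch.float16
assert ch.isinf(masked[ch.arange(logits.shape[0]), labels]).all()
```

The same pattern generalizes: any constant tensor that interacts with model outputs should inherit their dtype and device rather than rely on torch defaults.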

@kristian-georgiev
Member

Great catch, thanks!

@kristian-georgiev kristian-georgiev merged commit e18838a into 0.2.0 May 31, 2023
@kristian-georgiev kristian-georgiev deleted the 0.2.0_float16 branch May 31, 2023 17:12
kristian-georgiev added a commit that referenced this pull request Jun 1, 2023
* clean up old nb

* trak scores quickstart fig

* clean up quickstart

* minor docs updates

* no-op projector

* bump version

* test for scoring in shards

* test for featurizing in shards

* tie experiment name to scoring targets; simplify saver; add logging

* support dataset sharding during featurizing and scoring

* save scores as mmap

* migrate to torch.func

* bump torch dep requirement to 2.0.0 bc of torch.func

* project and store in float16 by default

* test autocast vs .half() on the model with functional_call

* test_install function

* minor edits in tests and install docs

* pass in an instance of a class for tasks, rather than init inside of gradientcomputer

* bug fix

* normalization factor for numerical stability

* fixed model output function when computing gradients in float16 (#36)

* fixed model output function when computing gradients in float16


* also fix for text clsf MOF

* instantiate on device directly

---------

Co-authored-by: alaakh <alaakh@mit.edu>
Co-authored-by: Kristian Georgiev <krisgrg@mit.edu>

* _is_featurized array

* handle pre-emption for featurizing

* vectorize without stacking to save memory

* add assertion to load ckpt

* python >=3.8 for pytorch 2.0

* make it easy to use GPU with smaller cuda mem

* pytest cuda markers

* fix CLIP modelout function

* bring back iter gradient computer

---------

Co-authored-by: alaakh <alaakh@mit.edu>