Drop incomplete batches for Ray and Pandas to prevent Batchnorm computation errors #2778
This PR implements logic to drop the last batch for Pandas and Ray dataset batchers to prevent issues when there is only 1 row in a batch.
A single row in a batch breaks Batchnorm computation (such as in FC layers, Tabnet, etc.) because a meaningful per-batch stddev cannot be computed from a single sample. Additionally, Ludwig has existing Tabnet logic with a conditional check for sample_size == 1 that we can now safely remove.
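To illustrate the failure mode (this is a minimal pure-Python sketch, not Ludwig's or PyTorch's actual code), per-batch statistics degenerate when the batch has only one row, which is why frameworks refuse to train batchnorm on a single-sample batch:

```python
import math

def batch_norm_stats(column):
    """Per-batch mean/stddev as a batchnorm layer computes them in training.

    Illustrative sketch only: with a single row, the biased variance is
    always 0 and the unbiased variance (dividing by n - 1) is undefined,
    so normalization degenerates either way.
    """
    n = len(column)
    mean = sum(column) / n
    if n < 2:
        raise ValueError("batchnorm needs more than 1 row per batch")
    var = sum((x - mean) ** 2 for x in column) / (n - 1)
    return mean, math.sqrt(var)
```

For example, `batch_norm_stats([1.0, 3.0])` returns a usable mean/stddev pair, while `batch_norm_stats([5.0])` raises, mirroring the error this PR avoids by never emitting a one-row training batch.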
Skipping new tests, since the current test suite already exercises the batcher classes during model training; if those tests pass, this logic works as well. I modified individual tests locally to confirm.
Note: This drops incomplete training batches only when the batch size is smaller than the total number of rows in the dataset, so small datasets still yield at least one batch. Incomplete batches are always kept for validation/test sets.
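The gating rule above can be sketched as follows (a hypothetical helper, not the PR's actual batcher code):

```python
def iter_batch_ranges(num_rows, batch_size, drop_last=False):
    """Yield (start, end) row ranges, optionally dropping a short final batch.

    Mirrors the rule described in this PR: a trailing incomplete batch is
    dropped only when drop_last is set (training) AND batch_size is smaller
    than the dataset, so tiny datasets still produce one batch.
    """
    for start in range(0, num_rows, batch_size):
        end = min(start + batch_size, num_rows)
        if drop_last and end - start < batch_size and batch_size < num_rows:
            continue  # skip the incomplete trailing training batch
        yield (start, end)
```

With 10 rows and a batch size of 4, training mode yields only the two full batches, while evaluation mode keeps the trailing 2-row batch; with 3 rows and a batch size of 4, the single short batch is kept even in training mode.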
To follow: