Add a rule about DLRM training data shuffling #441
Closed
The DLRM data shuffling rules were not clear enough in the v0.7 round and left a lot of room for interpretation. This update adds a clear, easy-to-follow rule that should not impact the convergence or performance of DLRM implementations.
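The rule text itself lives in the diff rather than in this description. For illustration only, here is a minimal sketch of one common way to satisfy a training data shuffling requirement: a seeded, per-epoch global shuffle of sample indices. This assumes a NumPy-based input pipeline; the function name and seeding scheme are hypothetical and not taken from the rule.

```python
import numpy as np

def shuffled_epoch_order(num_samples: int, epoch: int, base_seed: int = 0) -> np.ndarray:
    """Return a deterministic global shuffle of sample indices for one epoch.

    Seeding with (base_seed, epoch) keeps the order reproducible for
    compliance checking while still reshuffling every epoch.
    """
    rng = np.random.default_rng((base_seed, epoch))
    order = np.arange(num_samples)
    rng.shuffle(order)  # in-place permutation of all sample indices
    return order

# Example: iterate two epochs over a toy dataset of 10 samples.
for epoch in range(2):
    order = shuffled_epoch_order(num_samples=10, epoch=epoch)
    print(f"epoch {epoch}: {order}")
```

A global shuffle like this avoids the ordering bias of partial or windowed shuffles, which is the kind of ambiguity a clearer rule is meant to remove.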
This was originally part of #411, which we discussed, but I mistakenly closed that PR thinking it was only about packing, which we are no longer using it for. It is cleaner to break data shuffling out into its own PR anyway.