Skip to content

Commit

Permalink
fixing eval frequancy per taskforce decision
Browse files Browse the repository at this point in the history
  • Loading branch information
itayhubara committed Mar 6, 2024
1 parent c01a50b commit bafb600
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion training_rules.adoc
Original file line number Diff line number Diff line change
Expand Up @@ -456,7 +456,7 @@ CLOSED: The same quality measure as the reference implementation must be used. T
| |Text to image |Stable Diffusion v2.0 | See <<benchmark_specific_rules>>
|Language|Speech recognition |RNN-T|Every 1 epoch
| |NLP |BERT| eval_interval_samples=FLOOR(0.05*(230.23*GBS+3000000), 25000), skipping 0
| |large Language Model |Llama2_70B_LoRA| Every 512 sequences, CEIL(512 / global_batch_size) steps if 512 is not divisible by GBS. skipping 0
| |large Language Model |Llama2_70B_LoRA| Every 384 sequences, CEIL(384 / global_batch_size) steps if 384 is not divisible by GBS. skipping 0
| |large Language Model |GPT3| Every 24576 sequences. CEIL(24576 / global_batch_size) if 24576 is not divisible by GBS
|Commerce|Recommendation |DLRMv2 (DCNv2)|Every FLOOR(TOTAL_TRAINING_SAMPLES / (GLOBAL_BATCH_SIZE * NUM_EVAL) samples, where TOTAL_TRAINING_SAMPLES = 4195197692 and NUM_EVAL = 20
|===
Expand Down

0 comments on commit bafb600

Please sign in to comment.