throughput logger #798

galrotem · 2024-04-24T23:17:13Z

Summary:
Introduce throughput logger.

Internal

Context

The stack adds a throughput logger that can be used to log generic throughput per second, based on user config.

This diff will add the throughput logger including logging per step. The next diff will add throughput on an epoch granularity.

This diff

Adds throughput logger:

It uses the already collected iteration time and data wait time timers to get the step time.
It's slightly confusing but when on_train_step_end is called, the iteration time timer hasn't been populated yet, while the data wait time timer has been populated, hence there's a difference between the two when we are logging for (step-1). On the on_train_end both lists are fully populated so we can just use the last element safely.

Differential Revision: D56496451

Differential Revision: D56496429

Summary: Introduce throughput logger. Internal # Context The stack adds a throughput logger that can be used to log generic throughput per second, based on user config. This diff will add the throughput logger including logging per step. The next diff will add throughput on an epoch granularity. # This diff Adds throughput logger: 1. It uses the already collected iteration time and data wait time timers to get the step time. 2. It's slightly confusing but when `on_train_step_end` is called, the iteration time timer hasn't been populated yet, while the data wait time timer has been populated, hence there's a difference between the two when we are logging for (step-1). On the `on_train_end` both lists are fully populated so we can just use the last element safely. Reviewed By: JKSenthil Differential Revision: D56496451

facebook-github-bot added the cla signed label Apr 24, 2024

galrotem force-pushed the export-D56496451 branch 2 times, most recently from b77c415 to 33fc106 Compare April 25, 2024 20:26

galrotem and others added 2 commits April 25, 2024 13:31

state helper - active phase state

3fee0b5

Differential Revision: D56496429

galrotem force-pushed the export-D56496451 branch from 33fc106 to da1e8d2 Compare April 25, 2024 20:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

throughput logger #798

throughput logger #798

galrotem commented Apr 24, 2024

throughput logger #798

Are you sure you want to change the base?

throughput logger #798

Conversation

galrotem commented Apr 24, 2024

Context

This diff