add unbalanced param_sync example. #126

Open
wants to merge 8 commits into main

Conversation

charles9304 (Collaborator):

No description provided.

haolin-nju previously approved these changes Oct 18, 2024

@haolin-nju (Collaborator) left a comment:

LGTM


if __name__ == "__main__":
    chatlearn.init()
    args = chatlearn.get_args()

Collaborator:

You could set debug=True here, the same way it is done in https://github.com/alibaba/ChatLearn/blob/main/examples/megatron/tests/test_parameter_sync.py#L37. Also, is the validate function in the parameter_sync file still not supported?

Collaborator Author:

We will add a separate one to do verification with the validate function; this one is used to test whether the str outputs of the first episode are correct.

Collaborator Author:

done
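
For reference, a minimal sketch of how that suggestion could be applied in this example's entry point; the flag location (runtime_args.debug) is an assumption and should be checked against what test_parameter_sync.py#L37 actually sets:

if __name__ == "__main__":
    chatlearn.init()
    args = chatlearn.get_args()
    # Assumed attribute path for enabling debug mode; mirror test_parameter_sync.py#L37.
    args.runtime_args.debug = True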

reward_load_iteration=${REWARD_LOAD_ITERATION} \
reward_load=${REWARD_LOAD} \
tokenizer_model=${TOKENIZER_MODEL} \
num_episode=${num_ppo_episode:-0} \

Collaborator:

This could be set to 2 here.

Collaborator Author:

Same as below.

reward_load=${REWARD_LOAD} \
tokenizer_model=${TOKENIZER_MODEL} \
num_episode=${num_ppo_episode:-0} \
data_path=${DATASET_PATH} \

Collaborator:

Set the environment variable validate_param_sync to True.

Collaborator Author:

This is not an e2e test; it only checks the correctness of a single param sync. An e2e test would have to switch to the rlhf (or another alignment-format) logic. With multiple episodes, the ppo_policy forward step would run into the problem of having insufficient parameters.

Collaborator:

The validate_param_sync flag only triggers the validate function during parameter sync; it is unrelated to whether the test is e2e.

Collaborator Author:

done.
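
A minimal sketch of the suggested setup, assuming ChatLearn reads the validate_param_sync flag from the environment; setting it in Python before chatlearn.init() is shown here for illustration, and exporting it in the launch script should work just as well:

import os
import chatlearn

# Assumption: this flag only triggers the validate function during parameter sync,
# as noted above; it does not turn the example into an e2e test.
os.environ["validate_param_sync"] = "True"

if __name__ == "__main__":
    chatlearn.init()
    args = chatlearn.get_args()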

haolin-nju previously approved these changes Oct 28, 2024

@haolin-nju (Collaborator) left a comment:

LGTM
