Normalize Trainer #271
Conversation
Summary of Changes
Hello @pan-x-c, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!
This pull request refactors the core Trainer class to centralize control over key training lifecycle events. It introduces asynchronous operations for train_step and sync_weight to enhance performance and responsiveness, while also standardizing metric collection. Furthermore, checkpoint saving logic is consolidated within the Trainer, removing previous redundancies and preparing for more robust and configurable checkpointing.
Highlights
- **Centralized Checkpoint Management**: The `save_checkpoint` operation has been extracted from the trainer engine and is now managed directly by the `Trainer` class, allowing for more unified control over checkpointing frequency and logic.
- **Asynchronous Training Operations**: The `train_step`, `sync_weight`, and `need_sync` methods within the `Trainer` have been converted to asynchronous functions, enabling non-blocking execution and improved concurrency.
- **Enhanced Metric Recording**: Metric collection has been optimized across the training process, with `Timer` usage updated to use more descriptive keys (e.g., `"time/read_experience"`), and `train_step` and `sync_weight` now returning metrics for consolidated logging.
- **Streamlined verl_trainer Checkpointing**: The `verl_trainer`'s internal checkpointing logic has been simplified by removing redundant save calls from `train_step` and eliminating the direct saving of dataloader state, delegating these responsibilities to the main `Trainer` class.
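To make the control flow described above concrete, here is a minimal sketch (assumed names only; `engine`, `save_interval`, and `logger` are not taken from the repository) of a trainer loop that awaits `train_step` and `sync_weight`, merges their returned metrics into one dict, and saves checkpoints at a configurable interval:

```python
from typing import Dict


class TrainerSketch:
    """Minimal illustration of the described control flow; not trinity's actual Trainer."""

    def __init__(self, engine, save_interval: int, total_steps: int, logger):
        self.engine = engine              # underlying training engine (assumed interface)
        self.save_interval = save_interval
        self.total_steps = total_steps
        self.logger = logger              # any object with a .log(metrics, step=...) method

    async def train_step(self, step: int) -> Dict[str, float]:
        # Awaited so other coroutines (e.g. experience reading) can run concurrently.
        return await self.engine.train_step(step)

    async def sync_weight(self) -> Dict[str, float]:
        return await self.engine.sync_weight()

    async def need_sync(self, step: int) -> bool:
        return await self.engine.need_sync(step)

    async def train(self) -> None:
        for step in range(1, self.total_steps + 1):
            metrics = await self.train_step(step)
            if await self.need_sync(step):
                metrics.update(await self.sync_weight())
            # Checkpoint frequency is decided here, not inside the engine.
            if step % self.save_interval == 0:
                self.engine.save_checkpoint(step)
            self.logger.log(metrics, step=step)
```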
/unittest-module-synchronizer
Code Review
This pull request effectively refactors the Trainer class by making train_step and sync_weight asynchronous, centralizing checkpoint saving logic, and improving metric collection. The main training loop in train is now much cleaner and easier to follow. The changes to use await instead of ray.get are correctly implemented. I've found one issue with a redundant call which I've commented on. Overall, this is a great improvement to the codebase.
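The `await`-instead-of-`ray.get` pattern mentioned in the review can be illustrated with a small, self-contained example (the `Engine` actor below is invented for illustration); Ray object refs are awaitable inside coroutines, so remote calls no longer block the event loop:

```python
import asyncio
import ray


@ray.remote
class Engine:
    def train_step(self, step: int) -> dict:
        return {"step": step, "loss": 0.1}


async def main():
    ray.init()
    engine = Engine.remote()

    # Blocking style: ray.get() stalls the whole event loop while waiting.
    # metrics = ray.get(engine.train_step.remote(1))

    # Non-blocking style: Ray object refs can be awaited directly,
    # letting other coroutines (e.g. experience reading) make progress.
    metrics = await engine.train_step.remote(1)
    print(metrics)
    ray.shutdown()


if __name__ == "__main__":
    asyncio.run(main())
```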
Summary
Tests
Github Test Reporter by CTRF 💚

/unittest-module-manager
Summary
Failed Tests
Tests
Github Test Reporter by CTRF 💚

/unittest-module-manager
Summary
Tests
Github Test Reporter by CTRF 💚

/unittest-module-trainer
Summary
Failed Tests
Skipped
Tests
Github Test Reporter by CTRF 💚

/unittest-all
Pull Request Overview
This PR normalizes the Trainer implementation by extracting checkpoint saving operations from the trainer engine, making certain operations async, and optimizing metrics recording.
- Extract `save_checkpoint` operation from trainer engine to the main `Trainer` class
- Convert `train_step` and `sync_weight` methods to async and update timing metrics
- Add save interval control and optimize metrics collection
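As a rough illustration of the timing-metric convention mentioned above (this `Timer` is a generic stand-in, not necessarily trinity's own helper, and the `time/train_step` key is an assumption), prefixed keys such as `"time/read_experience"` let every phase's duration land in a single metrics dict:

```python
import time
from contextlib import contextmanager


@contextmanager
def Timer(metrics: dict, key: str):
    """Record the elapsed wall-clock time under `key` in `metrics`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        metrics[key] = time.perf_counter() - start


metrics: dict = {}
with Timer(metrics, "time/read_experience"):
    time.sleep(0.01)  # stand-in for reading an experience batch
with Timer(metrics, "time/train_step"):
    time.sleep(0.02)  # stand-in for one optimization step
print(metrics)  # e.g. {"time/read_experience": 0.01..., "time/train_step": 0.02...}
```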
Reviewed Changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated 1 comment.
Show a summary per file
| File | Description |
|---|---|
| trinity/trainer/verl_trainer.py | Remove checkpoint saving from train_step, add save_as_hf parameter |
| trinity/trainer/verl/megatron_workers.py | Add save_as_hf parameter to checkpoint save methods |
| trinity/trainer/verl/fsdp_workers.py | Add save_as_hf parameter to checkpoint save methods |
| trinity/trainer/trainer.py | Convert methods to async, extract checkpoint saving, add timing metrics |
| trinity/algorithm/sample_strategy/sample_strategy.py | Update timer metric names for consistency |
| tests/trainer/trainer_test.py | Add trainer cache validation in test |
| tests/manager/synchronizer_test.py | Update train_step method signature in test mock |
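The `save_as_hf` flag listed for the worker files above can be pictured as a keyword argument threaded from the `Trainer` down to the engine's checkpoint routine; the function below is a hypothetical sketch with made-up names, not the actual verl worker API:

```python
import os


def save_checkpoint(local_path: str, global_step: int, save_as_hf: bool = False) -> None:
    """Hypothetical checkpoint entry point, for illustration only.

    `save_as_hf` defaults to False so existing callers keep the old behavior;
    when True, an additional HuggingFace-format export would be written.
    """
    step_dir = os.path.join(local_path, f"global_step_{global_step}")
    os.makedirs(step_dir, exist_ok=True)
    # ... save native (FSDP/Megatron) checkpoint shards into step_dir ...
    if save_as_hf:
        # ... additionally export the model weights in HuggingFace format ...
        pass
```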
Comments suppressed due to low confidence (1)
trinity/trainer/verl_trainer.py:378
- The removed import statement `from verl.utils.fs import local_mkdir_safe` was only needed by the dataloader-saving code; since that code has also been removed, dropping the import is consistent.
```python
def _save_checkpoint(self, save_as_hf: bool = False):
    # path: given_path + `/global_step_{global_steps}` + `/actor`
    local_global_step_folder = os.path.join(
        self.config.trainer.default_local_dir, f"global_step_{self.global_steps}"
    )
```
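For illustration, with assumed values (`default_local_dir` and `global_steps` below are invented, not taken from the repository's config), the path construction above resolves as follows:

```python
import os

default_local_dir = "/tmp/checkpoints"  # assumed example value
global_steps = 100                      # assumed example value

local_global_step_folder = os.path.join(default_local_dir, f"global_step_{global_steps}")
print(local_global_step_folder)  # -> /tmp/checkpoints/global_step_100
```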
Summary
Skipped
Tests
Github Test Reporter by CTRF 💚

/unittest-module-trainer
Summary
Failed Tests
Skipped
Tests
Github Test Reporter by CTRF 💚

/unittest-module-trainer
Summary
Skipped
Tests
Github Test Reporter by CTRF 💚

/unittest-all

/unittest-module-manager
Summary
Tests
Github Test Reporter by CTRF 💚

/unittest-module-trainer
Summary
Skipped
Tests
Github Test Reporter by CTRF 💚

/unittest-module-common
Summary
Tests
Github Test Reporter by CTRF 💚
Description
- Extract the `save_checkpoint` operation from the trainer engine to the `Trainer`
- Make `train_step` and `sync_weight` in the `Trainer` use async

Checklist
Please check the following items before code is ready to be reviewed.