mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2025-10-09 13:40:09 +03:00
* feat: Add optional curriculum support to dataset registration and creation * docs: Add docstrings to create_curriculum() and register_dataset() * feat: Add curriculum configuration classes for CurriculumExperiment * feat: Add weight parameter to CurriculumAttributeConfig and use in DatasetSpec * refactor: Simplify CurriculumAttributeConfig with "*" attribute level support * test: Add unit tests for CurriculumExperiment class * feat: Add from_yaml() method to CurriculumExperimentConfig with unit test
10 lines
205 B
Bash
Executable File
10 lines
205 B
Bash
Executable File
#!/bin/bash
|
|
|
|
export N_GPUS=4
|
|
export BASE_MODEL=meta-llama/Llama-3.2-1B-Instruct
|
|
export ROLLOUT_TP_SIZE=2
|
|
export EXPERIMENT_NAME=chain_sum_llama
|
|
export VLLM_ATTENTION_BACKEND=XFORMERS
|
|
|
|
bash ./train_grpo.sh
|