reasoning-gym

mirror of https://github.com/open-thought/reasoning-gym.git synced 2025-10-09 13:40:09 +03:00

Files

Oliver Stanley 224532f12a first inter-domain generalisation experiments (#412 )

* tweak len reward

* first inter-generalisation experiment config

* update inter algorithmic config

* default to empty config

* fix typo

* change config to match experiment script

* long prompt fixes

* algorithmic training config tweaks

* imports

* update algorithmic training cfgs

* first logic composite config

* fix dset name

* tweaks

* fix syllogisms dataset

* rm temp print

* initial algebra config

* algebra cfg tweaks

* add gc

* add initial games cfg

* rename games cfg

* fix dset name

* fix sokoban metadata

* remove boxnet

* games cfg tweak

2025-04-14 21:06:40 +01:00

__init__.py

Feat/curr adj (#394 )

2025-04-02 06:39:14 +01:00

reward.py

first inter-domain generalisation experiments (#412 )

2025-04-14 21:06:40 +01:00