49 Commits

Author SHA1 Message Date
Andreas Köpf
dd3117bbaf bump version to v0.1.25.dev0 (#509) 2025-09-29 14:36:30 +01:00
Andreas Köpf
be2babea9c Use raw URLs for images in README.md (#459)
On pypi images were not correctly rendered because the old img src urls in README.md
pointed to files on github with UI.
2025-06-06 21:23:59 +01:00
Oliver Stanley
1a727ecf4e support python 3.10 (#450)
* support python 3.10

* add 3.10 to tests

* new StrEnum
2025-06-04 10:34:01 +01:00
rasdani
dca8117e7e Feat: Add script to save datasets on HuggingFace (#416)
* feat: add script to save datasets on HuggingFace

* fix

* refactor

* fix formatting

---------

Co-authored-by: Oliver Stanley <olivergestanley@gmail.com>
2025-04-28 18:04:53 +01:00
Andreas Köpf
7853263650 bump dev version to 0.1.19 2025-04-03 22:19:07 +01:00
Andreas Koepf
f25820a0c7 bump version 2025-03-16 15:54:03 +01:00
Andreas Koepf
8c12fe86e2 bump version 2025-03-14 16:21:34 +01:00
Andreas Koepf
1b004bf888 bump version 2025-03-10 00:32:57 +01:00
Oliver Stanley
f490b9f760 Tolerant scoring for CodeI/O based on edit distances (#277)
* add zss dep

* codeio edit distance-based scoring

* edit distance tweaks
2025-03-07 22:49:35 +01:00
Andreas Koepf
fb06038e88 update gallery 2025-03-07 16:24:47 +01:00
Andreas Koepf
2802066233 remove data/ from main .gitignore 2025-03-07 16:16:40 +01:00
Andreas Koepf
862617b7e0 update gallery, pypi release, bump version 2025-03-05 23:45:45 +01:00
Andreas Koepf
72233fc2ea bump version, pypi release of 0.1.12 2025-02-26 18:25:16 +01:00
Andreas Koepf
b5f6f7d753 bump version, update gallery 2025-02-23 22:36:39 +01:00
Andreas Koepf
e4102a44f6 dev minor version one ahead of PyPI released version 2025-02-22 16:54:05 +01:00
Andreas Koepf
f1daa8d1da update GALLERY.my, bump version 2025-02-20 23:03:54 +01:00
Andreas Köpf
e2702092f4 reasoning-gym-server & cli tool (#154)
* feat: Add initial server structure with configuration, registry, and middleware

* feat: Add chain_sum dataset to experiment registry test

* fix: Update test_registry to use DatasetSpec for composite config validation

* refactor: Update Pydantic config to use json_schema_extra and ConfigDict

* feat: Add Pydantic models for API request/response data

* feat: Implement basic experiment management endpoints with tests

* feat: Implement composite configuration endpoints for experiments

* fix: Add missing DatasetConfigUpdate import in server.py

* refactor: Update dataset config update method to properly merge config updates

* fix: Correctly retrieve current dataset config in composite endpoint

* feat: Add basic CLI structure with experiments and config commands

* feat: Add initial CLI tool with basic experiment management commands

* refactor: Reorganize CLI package structure and fix import paths

* refactor: Implement initial CLI commands for experiment management

* feat: Implement HTTP client for Reasoning Gym server in RGC CLI tool

* fix: Move print statements inside try block to resolve SyntaxError

* fix: Resolve SyntaxError in edit_config function by adding missing except block

* feat: Add default app instance in server module for easier uvicorn startup

* docs: Add README.md with server and RGC tool documentation

* remove unused files

* refactor: Remove unsupported type annotation in registry.py

* refactor: Move ExperimentRegistry to coaching module and add Experiment class

* fix: Add missing CompositeDataset import in test_registry.py

* refactor: Implement lazy ASGI app creation for server initialization

* feat: Add health check command to RGC CLI for server connection

* feat: Add version tracking support to CompositeDataset

* feat: Add DatasetVersionManager for tracking dataset versions

* feat: Add entry_id metadata and score_answer_with_id method to CompositeDataset

* feat: Add entry_id metadata combining version and index

* fix: Resolve undefined variable by storing version_id before use

* test: Add comprehensive unit tests for score_answer_with_id() function

* test: Add comprehensive version tracking test for dataset config updates

* feat: Validate dataset weights are positive in CompositeDataset initialization

* feat: Add weight update and normalization methods to CompositeDataset

* refactor: Centralize weight normalization in CompositeDataset and allow zero-weight datasets

* feat: Add negative weight validation to CompositeDataset constructor

* feat: Add duplicate dataset name check in CompositeDataset and update test

* refactor: Move duplicate dataset name check inside dataset iteration loop

* refactor: Update CompositeDataset weight management to use config as source of truth

* refactor: Move duplicate dataset name check to CompositeConfig.validate()

* test: Update composite dataset weight test assertions and validation

* feat: Add methods to add and remove datasets in CompositeDataset

* refactor: Remove weight normalization and use unnormalized weights directly

* refactor: Remove redundant total weight check in update_dataset_weights

* feat: Add batch generation and scoring endpoints to server

* fix: Import BatchEntry in server.py to resolve undefined name error

* refactor: Update ReasoningGymDataset to use server for batch generation and scoring

* fix: Add missing List and Dict type imports

* feat: Add get_batch() and score_outputs() methods to RGClient

* test: Add unit tests for generate_batch and score_outputs endpoints

* refactor: Add DatasetVersionManager to Experiment class and CompositeDataset constructor

* feat: Add validation for base_index and batch_size in generate_batch endpoint

* refactor: Remove unused BatchRequest type from imports

* refactor: Convert models to use Pydantic exclusively

* test: Update scoring endpoint tests to use correct request model format

* refactor: Rename ScoreItem to AnswerItem and update related code

* feat: Update scoring endpoint to return ordered ScoringResponse with scores and entry_ids

* fix: Add missing ScoringResponse import in server.py

* move verl ppo sample with server into own file

* refactor: Use Pydantic models for get_batch() and score_outputs() in RGClient

* refactor: Update client methods to use Pydantic models for type safety

* refactor: Use Pydantic models for experiment and dataset config operations

* refactor: Clean up duplicate methods and improve error handling in main.py

* first bits of rg server use for verl

* refactor: Optimize scoring with single HTTP request in _score_output

* fix: Correct experiment creation with ExperimentCreate object

* grpo tests with server
2025-02-19 22:41:33 +01:00
Andreas Koepf
f7e928711e bump version, update GALLERY.md 2025-02-14 23:34:59 +01:00
Andreas Koepf
56ba500959 update GALLERY.md & bump version 2025-02-11 23:43:56 +01:00
Andreas Koepf
1f9d9d27ab bump version 2025-02-09 00:39:48 +01:00
Andreas Koepf
127f505798 add ArcAgiDataset class, fix score_entry() metadata params 2025-02-08 23:18:18 +01:00
Andreas Koepf
ff74dfb5f2 fix tool.hatch.build section in pyproject.toml 2025-02-07 19:02:43 +01:00
Andreas Koepf
d3752a0d76 bump version to 0.1.14 2025-02-07 18:28:06 +01:00
Andreas Koepf (aider)
0c08ee5daa feat: Add pyyaml dependency to project configuration 2025-02-04 19:07:52 +01:00
Rich Jones
0c9094e9f4 adds zebrapuzzles 2025-02-03 14:34:57 +01:00
Andreas Koepf
82196bd2df bump version to 0.1.3, uploaded to pypi 2025-02-02 22:26:24 +01:00
Andreas Koepf (aider)
9eabc01e23 deps: Add pytz dependency to pyproject.toml 2025-02-01 02:15:45 +01:00
Andreas Koepf
7eb1466340 Add time interval dataset class 2025-02-01 02:10:48 +01:00
Andreas Koepf (aider)
4ac41bd174 fix: Remove duplicate project section and reorganize pyproject.toml 2025-01-31 12:15:27 +01:00
Andreas Koepf (aider)
69fb127b4e fix: Move project metadata to correct section in pyproject.toml 2025-01-31 12:13:41 +01:00
Andreas Koepf (aider)
0f362920d5 ci: Add GitHub Actions workflow for running tests 2025-01-31 00:10:12 +01:00
Andreas Koepf
c6634fd538 bump version 2025-01-30 23:19:58 +01:00
Andreas Koepf
ebb88e6c6a lint 2025-01-30 22:55:04 +01:00
Rich Jones
fc81bbec06 game of life via cellpylib 2025-01-30 20:08:44 +01:00
Rich Jones
2393ae0525 difficulty levels 2025-01-30 16:24:28 +01:00
Andreas Köpf
41d0f973d0 Merge branch 'main' into miserlou/figlet 2025-01-29 23:59:27 +01:00
Rich Jones
32e229954e dep typo 2025-01-29 22:23:00 +01:00
Rich Jones
cbf5087887 figlet fonts 2025-01-29 20:38:38 +01:00
Rich Jones
0d67fc102b inital stab at rubikscube 2025-01-29 15:53:29 +01:00
Andreas Koepf
cae7f0f98b min python 3.11 to support StrEnum 2025-01-26 22:17:43 +01:00
Andreas Koepf
ecbb155184 fix unit tests, lower python dependency to 3.9 2025-01-26 16:55:17 +01:00
Andreas Koepf
862f4f3c4d add Apache-2.0 license info to pyproject.toml 2025-01-25 17:48:11 +01:00
Andreas Koepf
fc08050de8 add sympy dep 2025-01-24 15:15:32 +01:00
Andreas Koepf
4112f57ea2 bump version, remove accidentially checked in file 2025-01-23 23:37:24 +01:00
Andreas Koepf (aider)
22538be4c5 feat: Add data file packaging configuration and __init__.py for reasoning_gym.data 2025-01-23 18:57:34 +01:00
Andreas Koepf (aider)
324e84897a feat: Add Black and isort pre-commit hooks with line length configuration 2025-01-23 11:02:13 +01:00
Andreas Koepf (aider)
8d53002e1b chore: Update project homepage and bug tracker URLs 2025-01-23 10:58:29 +01:00
Andreas Koepf (aider)
ba45a1e6a0 chore: Update license to Apache 2.0 and author information 2025-01-23 10:55:01 +01:00
Andreas Koepf (aider)
3f80fd7b80 build: Initialize reasoning_gym package structure with packaging and development setup 2025-01-23 10:50:54 +01:00