mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2025-10-09 13:40:09 +03:00
fix(README): Arxiv link (#447)
This commit is contained in:
committed by
GitHub
parent
17a8431013
commit
b3f81a6609
@@ -8,7 +8,7 @@
|
||||
<!-- badges -->
|
||||
<p align="center">
|
||||
<a href="https://arxiv.org/abs/2505.24760">
|
||||
<img src="https://img.shields.io/badge/arXiv-2306.00777-b31b1b.svg?style=for-the-badge" alt="Paper PDF">
|
||||
<img src="https://img.shields.io/badge/arXiv-2505.24760-b31b1b.svg?style=for-the-badge" alt="Paper PDF">
|
||||
</a>
|
||||
</p>
|
||||
</p>
|
||||
@@ -80,6 +80,12 @@ If you have ideas for dataset generators please create an issue here or contact
|
||||
[](https://discord.gg/gpumode)
|
||||
|
||||
|
||||
## 🚀 Projects Using Reasoning Gym
|
||||
|
||||
Following is a list of awesome projects building on top of Reasoning Gym:
|
||||
- [Verifiers: Reinforcement Learning with LLMs in Verifiable Environments](https://github.com/willccbb/verifiers)
|
||||
- [(NVIDIA) ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models](https://arxiv.org/abs/2505.24864)
|
||||
|
||||
## 📝 Citation
|
||||
|
||||
If you use this library in your research, please cite the paper:
|
||||
|
||||
Reference in New Issue
Block a user