Default Branch

2c4e45d9a9 · Update spiral_matrix.py (#511) · Updated 2025-10-06 14:02:32 +03:00

Branches

678622faec · add imports · Updated 2025-08-03 18:00:49 +03:00 · alihan · 10 behind, 5 ahead

70af0ad699 · Update load_fsdp_to_hf.py · Updated 2025-07-28 17:57:15 +03:00 · alihan · 20 behind, 3 ahead

d73d881073 · reps · Updated 2025-07-28 14:52:31 +03:00 · alihan · 16 behind, 9 ahead

37697e2421 · Update README.md · Updated 2025-07-27 06:59:58 +03:00 · alihan · 16 behind, 1 ahead

c44ff8c542 · updated failing hooks · Updated 2025-06-27 10:59:58 +03:00 · alihan · 20 behind, 2 ahead

a4006f6d0e · Update README.md - Add Synthetic-2 · Updated 2025-06-24 15:07:55 +03:00 · alihan · 24 behind, 1 ahead

02e0fc1c22 · pull fra main · Updated 2025-06-06 14:43:29 +03:00 · alihan · 33 behind, 4 ahead

b7d8832267 · cfg · Updated 2025-04-29 21:23:16 +03:00 · alihan · 58 behind, 40 ahead

cc04297995 · added llama 3b training conf · Updated 2025-04-26 22:30:46 +03:00 · alihan · 62 behind, 2 ahead

bf9d3b9bab · Changed params in knight_swap and make some clean up · Updated 2025-04-03 17:10:48 +03:00 · alihan · 71 behind, 5 ahead

4c1a5926dd · Delete files from git history · Updated 2025-04-02 08:26:46 +03:00 · alihan · 1325 behind, 1254 ahead

089943b710 · removed results from pr · Updated 2025-04-01 22:40:42 +03:00 · alihan · 77 behind, 38 ahead

3babfbbd29 · wip eval script · Updated 2025-03-27 23:42:53 +03:00 · alihan · 82 behind, 5 ahead

eb67c7be9c · add max_model_len to qwen config · Updated 2025-03-21 13:43:43 +03:00 · alihan · 84 behind, 1 ahead

e4d54c9e5b · merge · Updated 2025-03-20 15:14:50 +03:00 · alihan · 85 behind, 45 ahead

37170afb50 · Revert "figlet font curriculum" · Updated 2025-03-19 01:36:06 +03:00 · alihan · 88 behind, 2 ahead

60d9a29d38 · fix dice · Updated 2025-03-11 02:44:48 +03:00 · alihan · 160 behind, 3 ahead

789fa11815 · add count to gallery · Updated 2025-03-07 16:58:32 +03:00 · alihan · 205 behind, 1 ahead

8dc6cb5228 · fix: Move EpochTrackingDataLoader after ReasoningGymDataset to resolve undefined name error · Updated 2025-02-23 00:18:38 +03:00 · alihan · 334 behind, 7 ahead

18b6e71fa9 · Refactor LetterJumble · Updated 2025-02-09 15:38:16 +03:00 · alihan · 944 behind, 21 ahead