Commit Graph

  • df9b4fb90a Updated README with new information 刘一博 2024-03-20 14:11:28 +08:00
  • bea31b9b12 Update wechat.jpg hiyouga 2024-03-18 16:48:32 +08:00
  • 8e04794b2d fix packages hiyouga 2024-03-17 22:32:03 +08:00
  • 85c376fc1e fix patcher hiyouga 2024-03-15 19:18:42 +08:00
  • 113cc04719 Merge pull request #2849 from S3Studio/DockerizeSupport hoshi-hiyouga 2024-03-15 19:16:02 +08:00
  • 6bc2c23b6d fix export hiyouga 2024-03-15 15:06:30 +08:00
  • e75407febd Use official Nvidia base image S3Studio 2024-03-14 18:03:33 +08:00
  • 6a5693d11d improve Docker build and runtime parameters S3Studio 2024-03-12 14:05:10 +08:00
  • 6ebde4f23e tiny fix hiyouga 2024-03-14 21:19:06 +08:00
  • 3b4a59bfb1 fix export hiyouga 2024-03-14 18:17:01 +08:00
  • 8172530d54 fix bug hiyouga 2024-03-13 23:55:31 +08:00
  • 714d936dfb fix bug hiyouga 2024-03-13 23:43:42 +08:00
  • 72367307df improve lora+ impl. hiyouga 2024-03-13 23:32:51 +08:00
  • 4e5e99af43 Merge pull request #2830 from qibaoyuan/lora_plus hoshi-hiyouga 2024-03-13 20:15:46 +08:00
  • a0965cd62c [FEATURE]: ADD LORA+ ALGORITHM 齐保元 2024-03-13 19:43:27 +08:00
  • dfd451b722 Update wechat.jpg hiyouga 2024-03-13 19:03:00 +08:00
  • 0b4a5bf509 fix #2817 hiyouga 2024-03-13 12:42:03 +08:00
  • b9f87cdc11 fix #2802 hiyouga 2024-03-13 12:33:45 +08:00
  • 96ce76cd27 fix kv cache hiyouga 2024-03-13 01:21:50 +08:00
  • 19ef482649 support QDoRA hiyouga 2024-03-12 22:12:42 +08:00
  • 70a3052dd8 patch for gemma cpt hiyouga 2024-03-12 21:21:54 +08:00
  • 60cc17f3a8 fix plot issues hiyouga 2024-03-12 18:41:35 +08:00
  • b3247d6a16 support olmo hiyouga 2024-03-12 18:30:38 +08:00
  • 8d8956bad5 fix #2802 hiyouga 2024-03-12 17:08:34 +08:00
  • 06c97083e1 fix #2803 hiyouga 2024-03-12 16:57:39 +08:00
  • 07f9b754a7 fix #2782 #2798 hiyouga 2024-03-12 15:53:29 +08:00
  • c901aa63ff Merge pull request #2743 from S3Studio/DockerizeSupport hoshi-hiyouga 2024-03-12 00:05:49 +08:00
  • e874c00906 fix #2775 hiyouga 2024-03-11 00:42:54 +08:00
  • 352693e2dc tiny fix hiyouga 2024-03-11 00:17:18 +08:00
  • be99799413 update parser hiyouga 2024-03-10 13:35:20 +08:00
  • 8664262cde support layerwise galore hiyouga 2024-03-10 00:24:11 +08:00
  • 18ffce36b5 fix #2732 hiyouga 2024-03-09 22:37:16 +08:00
  • bdb496644c allow non-packing pretraining hiyouga 2024-03-09 22:21:46 +08:00
  • 412c52e325 fix #2766 hiyouga 2024-03-09 21:35:24 +08:00
  • af0e370fb1 use default arg for freeze tuning hiyouga 2024-03-09 06:08:48 +08:00
  • 818726e9bc add GaLore results hiyouga 2024-03-09 04:11:55 +08:00
  • 393c2de27c update hardware requirements hiyouga 2024-03-09 03:58:18 +08:00
  • 4c00bcdcae update examples hiyouga 2024-03-09 02:30:37 +08:00
  • e8dd38b7fd fix #2756 , patch #2746 hiyouga 2024-03-09 02:01:26 +08:00
  • 516d0ddc66 Merge pull request #2746 from stephen-nju/main hoshi-hiyouga 2024-03-09 01:37:00 +08:00
  • 74ff8664d7 Update setup.py hiyouga 2024-03-09 00:14:48 +08:00
  • 10be2f0ecc fix aqlm version hiyouga 2024-03-09 00:09:09 +08:00
  • 8a45213440 fix example params hiyouga 2024-03-08 20:41:43 +08:00
  • aa71571b77 update stephen_zhu 2024-03-08 12:47:44 +08:00
  • cdb7f82869 fix ppo runtime error stephen 2024-03-08 11:48:26 +08:00
  • 3d911ae713 Add dockerize support S3Studio 2024-03-08 10:47:28 +08:00
  • 4a2cc60b94 update readme hiyouga 2024-03-08 03:06:21 +08:00
  • 5d956e2a51 fix chat engine, update webui hiyouga 2024-03-08 03:01:53 +08:00
  • 5cd4947650 Update setup.py hiyouga 2024-03-08 01:23:00 +08:00
  • 0ac6b40a47 update galore args hiyouga 2024-03-08 01:17:32 +08:00
  • 33a4c24a8a fix galore hiyouga 2024-03-08 00:44:51 +08:00
  • 57452a4aa1 add Yi-9B model hiyouga 2024-03-07 23:11:57 +08:00
  • 7230e1177d add galore examples hiyouga 2024-03-07 22:53:45 +08:00
  • 28f7862188 support galore hiyouga 2024-03-07 22:41:36 +08:00
  • 725f7cd70f update readme hiyouga 2024-03-07 20:34:49 +08:00
  • 77211d9843 tiny fix hiyouga 2024-03-07 20:29:34 +08:00
  • a0dc721816 Merge pull request #2739 from hiyouga/dev-vllm hoshi-hiyouga 2024-03-07 20:28:18 +08:00
  • d07ad5cc1c support vllm hiyouga 2024-03-07 20:26:31 +08:00
  • f74f804a71 fix #2735 hiyouga 2024-03-07 16:15:53 +08:00
  • 2185855bdb Merge pull request #2730 from cx2333-gt/main hoshi-hiyouga 2024-03-07 14:37:18 +08:00
  • 94b7a1b915 revert choice name cx2333 2024-03-07 14:28:55 +08:00
  • 921ee82267 fix chatglm3 template hiyouga 2024-03-07 14:26:16 +08:00
  • 08d7dc06f2 Update wechat.jpg hiyouga 2024-03-07 13:14:10 +08:00
  • a8889498fa fix flash_attn in train_web cx2333 2024-03-07 10:13:55 +08:00
  • 0048a2021e tiny fix hiyouga 2024-03-06 17:25:08 +08:00
  • 3e84f430b1 export use balanced gpu hiyouga 2024-03-06 16:33:14 +08:00
  • 9658c63cd9 fix add tokens hiyouga 2024-03-06 15:04:02 +08:00
  • 3016e65657 fix version checking hiyouga 2024-03-06 14:51:51 +08:00
  • d1587c80de update examples hiyouga 2024-03-06 13:14:57 +08:00
  • e0c47358f9 fix arg dtype hiyouga 2024-03-05 20:53:30 +08:00
  • 259af60d28 improve aqlm optim hiyouga 2024-03-05 20:49:50 +08:00
  • d3d3dac707 optimize aqlm training hiyouga 2024-03-05 18:35:41 +08:00
  • ddf352f861 fix dora inference hiyouga 2024-03-05 11:51:41 +08:00
  • e5edcf440f fix export model hiyouga 2024-03-05 11:05:41 +08:00
  • df9e6bb063 update readme hiyouga 2024-03-05 03:20:23 +08:00
  • 76f31b18eb add examples hiyouga 2024-03-05 03:16:35 +08:00
  • 9e56eaf2d3 auto set chat template hiyouga 2024-03-05 02:41:20 +08:00
  • 24a79bd50f update readme hiyouga 2024-03-04 19:29:26 +08:00
  • cda2ff8727 fix export on cpu device hiyouga 2024-03-04 17:35:09 +08:00
  • 9c10854b46 fix sub-process error in thread hiyouga 2024-03-03 15:04:35 +08:00
  • 7c227e07dd update readme hiyouga 2024-03-03 01:41:07 +08:00
  • 894d183214 update readme, add starcoder2, cosmopedia hiyouga 2024-03-03 01:01:46 +08:00
  • 1006f372ae Update README_zh.md hoshi-hiyouga 2024-03-03 00:49:08 +08:00
  • 4bf7eb72e0 Update README.md hoshi-hiyouga 2024-03-03 00:48:47 +08:00
  • 585c884ea9 Update README.md hoshi-hiyouga 2024-03-03 00:48:06 +08:00
  • 318315c76d add colab demo hiyouga 2024-03-02 19:58:21 +08:00
  • 32884523c5 update data hiyouga 2024-03-02 19:37:18 +08:00
  • a736b349f0 move git files hiyouga 2024-03-02 18:30:11 +08:00
  • 46a06e2362 Update wechat.jpg hiyouga 2024-03-02 17:48:16 +08:00
  • 4e5fae2fac fix #2649 hiyouga 2024-03-01 13:02:41 +08:00
  • 396fd47947 tiny fix hiyouga 2024-02-29 21:03:48 +08:00
  • 1bfa70ce8e fix webui hiyouga 2024-02-29 20:09:09 +08:00
  • c0be617195 fix #2642 hiyouga 2024-02-29 18:32:54 +08:00
  • bb16502c33 add twitter hiyouga 2024-02-29 17:45:30 +08:00
  • 4a871e80e2 tiny fix hiyouga 2024-02-29 17:28:50 +08:00
  • ece3b3737e tiny fix and release v0.5.3 hiyouga 2024-02-29 00:46:47 +08:00
  • 7c87532476 Merge pull request #2575 from lungothrin/feature/chatter-with-role hoshi-hiyouga 2024-02-29 00:39:47 +08:00
  • 4cc2781efe fix #2629 hiyouga 2024-02-29 00:37:29 +08:00
  • fa5ab21ebc release v0.5.3 hiyouga 2024-02-29 00:34:19 +08:00
  • 804c1e7083 add examples hiyouga 2024-02-28 23:19:25 +08:00