27 Commits

Author SHA1 Message Date
Dane Madsen
647182125c Update README.md 2023-11-27 10:43:22 +10:00
Dane Madsen
a03e9bacd5 Update README.md 2023-11-23 07:21:59 +10:00
Dane Madsen
c95b5b4553 Update README.md 2023-11-19 22:47:52 +10:00
Dane Madsen
b93fd0eca4 Update README.md 2023-11-14 15:59:02 +10:00
Dane Madsen
1b5fe57ba8 Update README.md 2023-11-14 15:58:40 +10:00
dane madsen
b94ba8b9c9 update readme 2023-11-14 15:57:28 +10:00
Dane Madsen
5e63dfa1eb Update README.md 2023-11-13 23:54:34 +10:00
Dane Madsen
0e63d230d6 Update README.md 2023-11-12 10:42:56 +10:00
Dane Madsen
eee9eedb5e Update README.md 2023-11-06 11:18:01 +10:00
dane madsen
67dc897bdc Move non-build images out of assets 2023-10-30 18:39:13 +10:00
dane madsen
a592b2b883 Update readme and delete redundant code 2023-10-29 12:18:37 +10:00
Dane Madsen
67a38eb2ed Update README.md 2023-10-25 22:54:18 +10:00
dane madsen
1854a9854a Actions Test 2023-10-20 16:40:29 +10:00
dane madsen
a2ea524cb4 Readme 2023-10-20 16:23:53 +10:00
Dane Madsen
091ade9b1b Update README.md 2023-10-20 15:46:17 +10:00
Dane Madsen
b3d40448bb Update README.md 2023-10-15 12:37:03 +00:00
dane madsen
628687697e Image 2023-10-12 23:08:29 +10:00
dane madsen
1915f6a722 Add demo Image 2023-10-12 23:06:52 +10:00
dane madsen
5918298158 Fixed issues with trailing spaces 2023-10-12 13:39:54 +10:00
dane madsen
c310e81c46 update readme 2023-10-12 13:10:29 +10:00
dane madsen
c4358cb06e rename project 2023-10-05 11:22:08 +10:00
Daniel Drake
e4834993f6 Update llama.cpp and move core processing to native code
Update llama.cpp to the latest version as part of an effort to make this
app usable on my Samsung Galaxy S10 smartphone.

The newer llama.cpp includes a fix for a double-close bug that was causing
the app to crash immediately upon starting the AI conversation (llama.cpp
commit 47f61aaa5f76d04).

It also adds support for 3B models, which are considerably smaller. The
llama-7B models were causing Android's low memory killer to terminate
Sherpa after just a few words of conversation, whereas new models such as
orca-mini-3b.ggmlv3.q4_0.bin work on this device without quickly exhausting
all available memory.

llama.cpp's model compatibility has changed with this update, so ggml
files that worked with the previous version are unlikely to work now;
they need converting. However, the orca-mini offering is already in the
new format and works out of the box.

llama.cpp's API has changed in this update. Rather than rework the Dart
code, I opted to keep the core processing in C++, using llama.cpp's example
code as a base. This solution is included in a new "llamasherpa" library
which calls into llama.cpp. Since lots of data is passed around in large
arrays, I expect running this in Dart had significant overhead, and this
native approach should perform considerably faster.

This eliminates the need for Sherpa's Dart code to call llama.cpp directly,
so there's no need to separately maintain a modified version of llama.cpp
and we can use the official upstream.
2023-07-01 21:22:38 +02:00
tibzejoker
eb7e898a53 Update README.md 2023-04-15 01:28:48 +02:00
ThibautLEAUX
98a172e15f Update README.md 2023-03-27 23:43:28 +02:00
maxime-guerin-biprep
6bdd5ebfe5 Update README.md 2023-03-27 20:32:13 +02:00
Maxime GUERIN
b339c07cd0 initial commit of sherpa 2023-03-27 20:16:28 +02:00
ThibautLEAUX
85936d43bb Initial commit 2023-03-27 12:50:03 +02:00