14 Commits (15f06f6b4f2074da448851adfc1d887ea7cb76f0)

Author                 SHA1        Message                                                          Date
wizard                 15f06f6b4f  buffering utf-8 output to make it complete for spliting output.  2 years ago
wizard                 86e967c54b  buffering output for UTF-8 encoded token                         2 years ago
wizard                 1b87fe1e90  call a standalone function to untokenize output                  2 years ago
wizard                 307dba3dd2  first try to intergrate sentencepiece                            2 years ago
Matvey Soloviev        96ea727f47  Add interactive mode (#61)                                       2 years ago
Ben Garney             f385f8dee8  Allow using prompt files (#59)                                   2 years ago
beiller                02f0c6fe7f  Add back top_k (#56)                                             2 years ago
Sebastián A            eb062bb012  Windows fixes (#31)                                              2 years ago
beiller                129c7d1ea8  Add repetition penalty (#20)                                     2 years ago
Georgi Gerganov        007a8f6f45  Support all LLaMA models + change Q4_0 quantization storage      2 years ago
Jean-Michaël Celerier  9dcf4dba45  Add missing headers for memcpy and assert (#3)                   2 years ago
Georgi Gerganov        70bc0b8b15  Fix a bug in the rope calculation                                2 years ago
Georgi Gerganov        319cdb3e1f  Final touches                                                    2 years ago
Georgi Gerganov        26c0846629  Initial release                                                  2 years ago