6 Commits (2d555e5b42922cda6dfc0c3ff54df7b1ee4d0ff4)

Author SHA1 Message Date
beiller 129c7d1ea8
Add repetition penalty (#20)
2 years ago
Georgi Gerganov 007a8f6f45
Support all LLaMA models + change Q4_0 quantization storage
2 years ago
Jean-Michaël Celerier 9dcf4dba45
Add missing headers for memcpy and assert (#3)
2 years ago
Georgi Gerganov 70bc0b8b15
Fix a bug in the rope calculation
2 years ago
Georgi Gerganov 319cdb3e1f
Final touches
2 years ago
Georgi Gerganov 26c0846629
Initial release
2 years ago