3 Commits (f60fa9e50afce35e7ebe1fedf34d4a9327353927)

Author           SHA1        Message                                                      Date
Georgi Gerganov  007a8f6f45  Support all LLaMA models + change Q4_0 quantization storage  1 year ago
Georgi Gerganov  70bc0b8b15  Fix a bug in the rope calculation                            1 year ago
Georgi Gerganov  26c0846629  Initial release                                              1 year ago