4 Commits (1ed5c7c75b486d26e326f3945ac5247393d08f09)

Author SHA1 Message Date
deepdiffuser 1ed5c7c75b use weights_only in conversion script
2 years ago
Georgi Gerganov 007a8f6f45
Support all LLaMA models + change Q4_0 quantization storage
2 years ago
Georgi Gerganov 70bc0b8b15
Fix a bug in the rope calculation
2 years ago
Georgi Gerganov 26c0846629
Initial release
2 years ago