7 Commits (b8f20713b9637a5466ef613413a607a98f7b3416)

Author SHA1 Message Date
Jed Fox 34af8a97e8
FIx parsing single-byte UTF-8 tokens by manually parsing the protobuf
2 years ago
Georgi Gerganov 7c9e54e55e
Revert "weights_only" arg - this causing more trouble than help
2 years ago
Oleksandr Nikitin b9bd1d0141
python/pytorch compat notes (#44)
2 years ago
deepdiffuser a93120236f
use weights_only in conversion script (#32)
2 years ago
Georgi Gerganov 007a8f6f45
Support all LLaMA models + change Q4_0 quantization storage
2 years ago
Georgi Gerganov 70bc0b8b15
Fix a bug in the rope calculation
2 years ago
Georgi Gerganov 26c0846629
Initial release
2 years ago