Juraj Bednar
|
faad7f1464
|
Add oneliner for batch quantization
|
2 years ago |
Juraj Bednar
|
6b2cb6302f
|
Fix a typo in model name (#16)
|
2 years ago |
Georgi Gerganov
|
4235e3d5b3
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
f1eaff4721
|
Add AVX2 support for x86 architectures thanks to @Const-me !
|
2 years ago |
Georgi Gerganov
|
a9e58529ea
|
Fix un-initialized FP16 tables on x86 (#15, #2)
|
2 years ago |
Georgi Gerganov
|
7d9ed7b25f
|
Bump memory buffer
|
2 years ago |
Georgi Gerganov
|
0c6803321c
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
f60fa9e50a
|
.gitignore models/
|
2 years ago |
Georgi Gerganov
|
7211862c94
|
Update Makefile var + add comment
|
2 years ago |
Georgi Gerganov
|
a5c5ae2f54
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
ea977e85ec
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
007a8f6f45
|
Support all LLaMA models + change Q4_0 quantization storage
|
2 years ago |
Simon Willison
|
5f2f970d51
|
Include Python dependencies in README (#6)
|
2 years ago |
Georgi Gerganov
|
73c6ed5e87
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
01eeed8fb1
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
6da2df34ee
|
Update README.md
|
2 years ago |
Jean-Michaël Celerier
|
9dcf4dba45
|
Add missing headers for memcpy and assert (#3)
|
2 years ago |
Georgi Gerganov
|
920a7fe2d9
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
3a57ee59de
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
b85028522d
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
8a01f565ff
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
70bc0b8b15
|
Fix a bug in the rope calculation
|
2 years ago |
Georgi Gerganov
|
18ebda34d6
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
319cdb3e1f
|
Final touches
|
2 years ago |
Georgi Gerganov
|
775328064e
|
Create README.md
|
2 years ago |
Georgi Gerganov
|
26c0846629
|
Initial release
|
2 years ago |