55 Commits (9425a21db54a7b8ad47c19416b50ec0b3be6cad9)
 

Author SHA1 Message Date
beiller 9425a21db5
Bugfix and back to printing as normal
2 years ago
beiller 703571861f
undo complicated printing until its fixed sadly
2 years ago
beiller ee36313770
Ah -std=c++17 is needed
2 years ago
beiller 07771aab81
ensure cmake is proper version
2 years ago
beiller 3e2327c96a
Try manually adding CXX flag
2 years ago
beiller 3c04dfb436
run build in shell
2 years ago
beiller 7deae8a2ca
fix build procedure
2 years ago
beiller 67b1c842d9
Use sentencepiece tokenization
2 years ago
beiller 96dc6a0c68
work towards tokenizer integration
2 years ago
Georgi Gerganov c80e2a8f2a
Revert "10% performance boost on ARM"
2 years ago
Georgi Gerganov 54a0e66ea0
Check for vdotq_s32 availability
2 years ago
Georgi Gerganov 543c57e991
Ammend to previous commit - forgot to update non-QRDMX branch
2 years ago
Georgi Gerganov 113a9e83eb
10% performance boost on ARM
2 years ago
Matvey Soloviev 404fac0d62
Fix color getting reset before prompt output done (#65)
2 years ago
Georgi Gerganov 1a0a74300f
Update README.md
2 years ago
Matvey Soloviev 96ea727f47
Add interactive mode (#61)
2 years ago
Marc Köhlbrugge 9661954835
Fix typo in README (#45)
2 years ago
Ben Garney f385f8dee8
Allow using prompt files (#59)
2 years ago
beiller 02f0c6fe7f
Add back top_k (#56)
2 years ago
Sebastián A eb062bb012
Windows fixes (#31)
2 years ago
Georgi Gerganov 7027a97837
Update README.md
2 years ago
Georgi Gerganov 2d555e5b42
Add CI (#60)
2 years ago
Georgi Gerganov 7c9e54e55e
Revert "weights_only" arg - this causing more trouble than help
2 years ago
Oleksandr Nikitin b9bd1d0141
python/pytorch compat notes (#44)
2 years ago
beiller 129c7d1ea8
Add repetition penalty (#20)
2 years ago
Georgi Gerganov 702fddf5c5
Clarify meaning of hacking
2 years ago
Georgi Gerganov 7d86e25bf6
README: add "Supported platforms" + update hot topics
2 years ago
deepdiffuser a93120236f
use weights_only in conversion script (#32)
2 years ago
Pavol Rusnak 6a9a67f0be
Add LICENSE (#21)
2 years ago
Georgi Gerganov da1a4ff01f
Update README.md
2 years ago
Juraj Bednar 6b2cb6302f
Fix a typo in model name (#16)
2 years ago
Georgi Gerganov 4235e3d5b3
Update README.md
2 years ago
Georgi Gerganov f1eaff4721 Add AVX2 support for x86 architectures thanks to @Const-me !
2 years ago
Georgi Gerganov a9e58529ea Fix un-initialized FP16 tables on x86 (#15, #2)
2 years ago
Georgi Gerganov 7d9ed7b25f
Bump memory buffer
2 years ago
Georgi Gerganov 0c6803321c
Update README.md
2 years ago
Georgi Gerganov f60fa9e50a
.gitignore models/
2 years ago
Georgi Gerganov 7211862c94
Update Makefile var + add comment
2 years ago
Georgi Gerganov a5c5ae2f54
Update README.md
2 years ago
Georgi Gerganov ea977e85ec
Update README.md
2 years ago
Georgi Gerganov 007a8f6f45
Support all LLaMA models + change Q4_0 quantization storage
2 years ago
Simon Willison 5f2f970d51
Include Python dependencies in README (#6)
2 years ago
Georgi Gerganov 73c6ed5e87
Update README.md
2 years ago
Georgi Gerganov 01eeed8fb1
Update README.md
2 years ago
Georgi Gerganov 6da2df34ee
Update README.md
2 years ago
Jean-Michaël Celerier 9dcf4dba45
Add missing headers for memcpy and assert (#3)
2 years ago
Georgi Gerganov 920a7fe2d9
Update README.md
2 years ago
Georgi Gerganov 3a57ee59de
Update README.md
2 years ago
Georgi Gerganov b85028522d
Update README.md
2 years ago
Georgi Gerganov 8a01f565ff
Update README.md
2 years ago