Commit Graph

52 Commits (054d97e0e1642ef5dfe2d233dec8993cc96e8130)
 

Author SHA1 Message Date
Georgi Gerganov 054d97e0e1
gq : method 4 (AVX2)
2 years ago
Georgi Gerganov 37dcfad83b
gq : progress on method 2
2 years ago
Georgi Gerganov bf709e45de
gq : add amax based method 3
2 years ago
Georgi Gerganov 0a7debb7bf
gq : attempt at n-bit quantization
2 years ago
katsu560 4c2f924553
cmake : update CMakeLists.txt to add correct flags (#26)
2 years ago
Georgi Gerganov ba3e8a3d7f
readme : update Roadmap
2 years ago
Georgi Gerganov 2546cb7780
readme : add Roadmap section
2 years ago
Georgi Gerganov 8f8a5aca99
sync : latest whisper.cpp
2 years ago
Georgi Gerganov efa2cc36a2
tests : fix cblas_sgemm call
2 years ago
Georgi Gerganov 3b3ad42906
tests : add SVD experiments
2 years ago
Georgi Gerganov a6acb3318a
sync : latest whisper.cpp (scratch buffers in ggml)
2 years ago
Georgi Gerganov 47b297224e
Update README.md
2 years ago
Takuya Takeuchi 0467385010
cmake : configure CMAKE_C_FLAGS and target_link_libraries for MSVC (#15)
2 years ago
Georgi Gerganov fb64edddb7
gpt : fix sampling to use the temperature (close #16)
2 years ago
Georgi Gerganov c40a5b51a0
ggml : sync latest whisper.cpp
2 years ago
Georgi Gerganov a0f2f68cdb
gpt-2 : fix broken prompt due to recent experiments
2 years ago
Georgi Gerganov dee3684fec
ggml : sync latest whisper.cpp
2 years ago
Georgi Gerganov 6ed4da0b03
cmake : disable warnings about unused functions
2 years ago
Georgi Gerganov 06e2a3b721
ggml : bugfix in new soft max computation
2 years ago
Georgi Gerganov 78af1420bf
tests : change test2 eps
2 years ago
Georgi Gerganov 1af4cf0102
ggml : sync with latest whisper.cpp
2 years ago
Georgi Gerganov 73a7916d30
tests : some more quantization experiments
2 years ago
Georgi Gerganov e0abac1be7
sync : forgot to sync ggml.h
2 years ago
Georgi Gerganov 45fc4fed0b
sync : latest changes from whisper.cpp
2 years ago
Georgi Gerganov deb0c486c7
tests : wip quantized matrix multiplication method 2
2 years ago
Georgi Gerganov d677c7f61d tests : minor fixes for x86
2 years ago
Georgi Gerganov 446ccf3ab1
tests : experiments with n-bit quantized matrix multiplication
2 years ago
Georgi Gerganov bd9f710a45
sync : latest changes from whisper.cpp
2 years ago
Georgi Gerganov 1dcbe86a0c
gpt-2 : experimenting with attention mask
2 years ago
Georgi Gerganov 99f1afb613
gpt-2 : fix off-by-one error in batching logic
2 years ago
Georgi Gerganov 64efeceabd
examples : redirect download scripts to HF
2 years ago
Georgi Gerganov ed09c7190e
gpt : add support for gpt-jt + fix unicode support
2 years ago
Georgi Gerganov f56828ed78
ggml : sync with latest code from whisper.cpp
2 years ago
Georgi Gerganov 90ee5c6358
sync : latest changes from whisper.cpp
2 years ago
Georgi Gerganov db13973820
Update README.md
2 years ago
Georgi Gerganov 6feeca262f
sync : latest changes from whisper.cpp
2 years ago
Georgi Gerganov 624e4f5313
whisper : fix timestamp sampling
2 years ago
Georgi Gerganov 7094be1f37
sync : whisper.cpp
2 years ago
Georgi Gerganov 270829aa9f
sync : whisper.cpp
2 years ago
Georgi Gerganov 7b70c5a561
Minor fixes
2 years ago
Georgi Gerganov d8f64bce3d
Improve mul_mat performance for big matrices using Accelerate framework
2 years ago
Georgi Gerganov ea0ef2a41e
Performance tests - trying to optimize mul_mat
2 years ago
Georgi Gerganov 67ac34fcfa
sync : whisper.cpp
2 years ago
Georgi Gerganov e2f39f4b52
whisper : sync with whisper.cpp
2 years ago
Georgi Gerganov 8e3c634b27
whisper : various improvements
2 years ago
Georgi Gerganov 8ca553add4
whisper : add C-style API
2 years ago
Georgi Gerganov dd1f4dfbab
whisper : various fixes
2 years ago
Georgi Gerganov 0116c03fb7
whisper : various updates and improvements
2 years ago
Georgi Gerganov 787efb4d2e
Adding Whisper inference example
2 years ago
Georgi Gerganov f21b84cd21
Update README.md + minor stuff
2 years ago