36 Commits (3adf02e311eaf3ec03d327394d53c1668faf5572)

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Georgi Gerganov | 05e7d26ba4 | ggml : add WASM SIMD for Q4_0 | 2 years ago |
| Georgi Gerganov | b7621b4fda | ggml : fixes for rpi4 | 2 years ago |
| Georgi Gerganov | 98f6a4bf94 | ggml : fix ggml_is_contiguous() to take into account blck size | 2 years ago |
| Georgi Gerganov | eaa4006047 | gpt : fix memory usage computation | 2 years ago |
| Georgi Gerganov | fde29bd005 | ggml : add ggml_compute_forward_rope_f16() | 2 years ago |
| Georgi Gerganov | e052167772 | ggml : GGML_ASSERT() instead of assert() where appropriate | 2 years ago |
| Georgi Gerganov | 7d5889475a | ggml : minor indentations | 2 years ago |
| Georgi Gerganov | e89cb32625 | ggml : simplify mad q4_0 (ARM) | 2 years ago |
| Georgi Gerganov | 6309a60bac | ggml : vectorized quantize_row_q4_0 (ARM) | 2 years ago |
| Georgi Gerganov | ea97a5f469 | ggml : vectorized mad q4_0 (ARM) | 2 years ago |
| Georgi Gerganov | cc94fdafe7 | ggml : 4-bit quantization works (only scalar for now) | 2 years ago |
| Georgi Gerganov | a366dd31cc | ggml : q4_1 quantization support (seems to work for bigger models) | 2 years ago |
| Georgi Gerganov | a37776ddc0 | ggml : q4_0 quantization support | 2 years ago |
| Georgi Gerganov | 38faca7efe | ggml : Q4_0 quantization support (ggml_get_rows()) | 2 years ago |
| katsu560 | 4c2f924553 | cmake : update CMakeLists.txt to add correct flags (#26) | 2 years ago |
| Georgi Gerganov | a6acb3318a | sync : latest whisper.cpp (scratch buffers in ggml) | 2 years ago |
| Takuya Takeuchi | 0467385010 | cmake : configure CMAKE_C_FLAGS and target_link_libraries for MSVC (#15) | 2 years ago |
| Georgi Gerganov | dee3684fec | ggml : sync latest whisper.cpp | 2 years ago |
| Georgi Gerganov | 6ed4da0b03 | cmake : disable warnings about unused functions | 2 years ago |
| Georgi Gerganov | 06e2a3b721 | ggml : bugfix in new soft max computation | 2 years ago |
| Georgi Gerganov | 1af4cf0102 | ggml : sync with latest whisper.cpp | 2 years ago |
| Georgi Gerganov | 45fc4fed0b | sync : latest changes from whisper.cpp | 2 years ago |
| Georgi Gerganov | bd9f710a45 | sync : latest changes from whisper.cpp | 2 years ago |
| Georgi Gerganov | f56828ed78 | ggml : sync with latest code from whisper.cpp | 2 years ago |
| Georgi Gerganov | 90ee5c6358 | sync : latest changes from whisper.cpp | 2 years ago |
| Georgi Gerganov | 6feeca262f | sync : latest changes from whisper.cpp | 2 years ago |
| Georgi Gerganov | 624e4f5313 | whisper : fix timestamp sampling | 2 years ago |
| Georgi Gerganov | 7094be1f37 | sync : whisper.cpp | 2 years ago |
| Georgi Gerganov | 270829aa9f | sync : whisper.cpp | 2 years ago |
| Georgi Gerganov | d8f64bce3d | Improve mul_mat performance for big matrices using Accelerate framework | 2 years ago |
| Georgi Gerganov | 67ac34fcfa | sync : whisper.cpp | 2 years ago |
| Georgi Gerganov | e2f39f4b52 | whisper : sync with whisper.cpp | 2 years ago |
| Georgi Gerganov | 8e3c634b27 | whisper : various improvements | 2 years ago |
| Georgi Gerganov | 787efb4d2e | Adding Whisper inference example | 2 years ago |
| Georgi Gerganov | f21b84cd21 | Update README.md + minor stuff | 2 years ago |
| Georgi Gerganov | fb558f78d9 | Initial release | 2 years ago |