| Author | Commit | Message | Age |
|---|---|---|---|
| Georgi Gerganov | 05e7d26ba4 | ggml : add WASM SIMD for Q4_0 | 2 years ago |
| Georgi Gerganov | b7621b4fda | ggml : fixes for rpi4 | 2 years ago |
| Georgi Gerganov | 98f6a4bf94 | ggml : fix ggml_is_contiguous() to take into account blck size | 2 years ago |
| Georgi Gerganov | eaa4006047 | gpt : fix memory usage computation | 2 years ago |
| Georgi Gerganov | fde29bd005 | ggml : add ggml_compute_forward_rope_f16() | 2 years ago |
| Georgi Gerganov | e052167772 | ggml : GGML_ASSERT() instead of assert() where appropriate | 2 years ago |
| Georgi Gerganov | 7d5889475a | ggml : minor indentations | 2 years ago |
| Georgi Gerganov | e89cb32625 | ggml : simplify mad q4_0 (ARM) | 2 years ago |
| Georgi Gerganov | 6309a60bac | ggml : vectorized quantize_row_q4_0 (ARM) | 2 years ago |
| Georgi Gerganov | ea97a5f469 | ggml : vectorized mad q4_0 (ARM) | 2 years ago |
| Georgi Gerganov | cc94fdafe7 | ggml : 4-bit quantization works (only scalar for now) | 2 years ago |
| Georgi Gerganov | a366dd31cc | ggml : q4_1 quantization support (seems to work for bigger models) | 2 years ago |
| Georgi Gerganov | a37776ddc0 | ggml : q4_0 quantization support | 2 years ago |
| Georgi Gerganov | 38faca7efe | ggml : Q4_0 quantization support (ggml_get_rows()) | 2 years ago |
| katsu560 | 4c2f924553 | cmake : update CMakeLists.txt to add correct flags (#26)<br>* modify src/CMakeLists.txt from whisper.cpp<br>* cmake : remove OpenBLAS stuff<br>---------<br>Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | 2 years ago |
| Georgi Gerganov | a6acb3318a | sync : latest whisper.cpp (scratch buffers in ggml) | 2 years ago |
| Takuya Takeuchi | 0467385010 | cmake : configure CMAKE_C_FLAGS and target_link_libraries for MSVC (#15) | 2 years ago |
| Georgi Gerganov | dee3684fec | ggml : sync latest whisper.cpp | 2 years ago |
| Georgi Gerganov | 6ed4da0b03 | cmake : disable warnings about unused functions | 2 years ago |
| Georgi Gerganov | 06e2a3b721 | ggml : bugfix in new soft max computation | 2 years ago |
| Georgi Gerganov | 1af4cf0102 | ggml : sync with latest whisper.cpp | 2 years ago |
| Georgi Gerganov | 45fc4fed0b | sync : latest changes from whisper.cpp | 2 years ago |
| Georgi Gerganov | bd9f710a45 | sync : latest changes from whisper.cpp | 2 years ago |
| Georgi Gerganov | f56828ed78 | ggml : sync with latest code from whisper.cpp | 2 years ago |
| Georgi Gerganov | 90ee5c6358 | sync : latest changes from whisper.cpp<br>- Documentation<br>- whisper : token-level timestamps<br>- ggml : Windows build fixes<br>- etc. | 2 years ago |
| Georgi Gerganov | 6feeca262f | sync : latest changes from whisper.cpp | 2 years ago |
| Georgi Gerganov | 624e4f5313 | whisper : fix timestamp sampling | 2 years ago |
| Georgi Gerganov | 7094be1f37 | sync : whisper.cpp<br>- Add MSVC header<br>- FP16 GELU<br>- C interface fixes (no unions)<br>- Minor CMake updates | 2 years ago |
| Georgi Gerganov | 270829aa9f | sync : whisper.cpp | 2 years ago |
| Georgi Gerganov | d8f64bce3d | Improve mul_mat performance for big matrices using Accelerate framework<br>Also:<br>- Speedup GELU operator via F16 cast<br>- Multi-thread NORM operator<br>- Disable FLASH_FF in whisper example | 2 years ago |
| Georgi Gerganov | 67ac34fcfa | sync : whisper.cpp | 2 years ago |
| Georgi Gerganov | e2f39f4b52 | whisper : sync with whisper.cpp | 2 years ago |
| Georgi Gerganov | 8e3c634b27 | whisper : various improvements | 2 years ago |
| Georgi Gerganov | 787efb4d2e | Adding Whisper inference example | 2 years ago |
| Georgi Gerganov | f21b84cd21 | Update README.md + minor stuff<br>- Changed default threads to 4<br>- Added GGML_PERF for enabling runtime performance timings | 2 years ago |
| Georgi Gerganov | fb558f78d9 | Initial release | 2 years ago |