Georgi Gerganov
|
d8f64bce3d
|
Improve mul_mat performance for big matrices using Accelerate framework
Also:
- Speedup GELU operator via F16 cast
- Multi-thread NORM operator
- Disable FLASH_FF in whisper example
|
2 years ago |
Georgi Gerganov
|
ea0ef2a41e
|
Performance tests - trying to optimize mul_mat
|
2 years ago |
Georgi Gerganov
|
67ac34fcfa
|
sync : whisper.cpp
|
2 years ago |
Georgi Gerganov
|
e2f39f4b52
|
whisper : sync with whisper.cpp
|
2 years ago |
Georgi Gerganov
|
8e3c634b27
|
whisper : various improvements
|
2 years ago |
Georgi Gerganov
|
8ca553add4
|
whisper : add C-style API
|
2 years ago |
Georgi Gerganov
|
dd1f4dfbab
|
whisper : various fixes
|
2 years ago |
Georgi Gerganov
|
0116c03fb7
|
whisper : various updates and improvements
|
2 years ago |
Georgi Gerganov
|
787efb4d2e
|
Adding Whisper inference example
|
2 years ago |
Georgi Gerganov
|
f21b84cd21
|
Update README.md + minor stuff
- Changed default threads to 4
- Added GGML_PERF for enabling runtime performance timings
|
2 years ago |
Georgi Gerganov
|
0f4e99b1cc
|
Update README.md
|
2 years ago |
Georgi Gerganov
|
fb558f78d9
|
Initial release
|
2 years ago |