Georgi Gerganov
75171c2b79
ggml : multi-thread the ggml_add operator
2 years ago
Georgi Gerganov
137321915f
ggml : fix the check for NEON support (#7)
Was using the wrong preprocessor macro
2 years ago
Syed Jafri
24cd12f647
Cross compilation (#121)
* Cross compile windows
* set env properly
* rm log
* fix review
* Add back space
2 years ago
Mikhail Grigorev
8dac3c6e10
Fixed sched_yield
2 years ago
Mikhail Grigorev
6417e59aad
Implemented sched_yield function for Windows
2 years ago
Georgi Gerganov
e5044f87d9
ggml : fix barrier
2 years ago
Georgi Gerganov
a272f10b2e
ggml : fix thread-safety of ggml_init and ggml_free
2 years ago
Georgi Gerganov
fbd513b813
Add OpenBLAS support
Supported via CMake - just add:
cmake .. -DWHISPER_SUPPORT_OPENBLAS=ON
On Ubuntu, you have to install the library like this:
apt install libopenblas-dev
Unfortunately, I don't observe any benefit compared to the
original AVX2 + FP16 implementation. Maybe I'm missing something
2 years ago
Georgi Gerganov
34bb3ab0cf
ggml : add system info functions
2 years ago
Georgi Gerganov
c6710efde2
refactoring : move main + stream in examples + other stuff
2 years ago
Georgi Gerganov
db460b78ff
wip : WASM 128-bit SIMD support
2 years ago
Georgi Gerganov
e905c6f827
wip : initial WASM port
Works but it is very slow because no SIMD is used.
For example, jfk.wav is processed in ~23 seconds using "tiny.en" model
2 years ago
Georgi Gerganov
19817711b4
Add reference to FP16 repo
2 years ago
Georgi Gerganov
e36aabe00d
Correct implementation of FP16 GELU
Can toggle it via the GGML_GELU_FP16 macro
2 years ago
Georgi Gerganov
91632eb6ea
Revert GELU change
Seems it does not work on x86 for some reason
2 years ago
Georgi Gerganov
72d967bce4
Use Accelerate framework on Apple silicon
Huge performance improvement in the Encode (almost x2 on MacBook M1 Pro)
Also various extra optimizations:
- Multi-threaded NORM operator
- Faster GELU via F16 cast
2 years ago
Georgi Gerganov
0e858f080d
close #56 : build on FreeBSD
Thanks to @abelbabel for the contribution
2 years ago
Borislav Stanimirov
0b45d25151
Building with MSVC
2 years ago
lnyan
4bbb8a587b
Add MinGW support
2 years ago
Georgi Gerganov
e29a5dacc6
ref #11, #18, #26 : fix CACHE_LINE_SIZE constant
2 years ago
Georgi Gerganov
167324584b
wip : rpi4 support
2 years ago
Georgi Gerganov
f888c2373d
Flash + language support (ref #2)
- Achieved big performance improvement + memory usage reduction
- Can now translate / transcribe different languages
2 years ago
Georgi Gerganov
b0a11594ae
Initial release
2 years ago