Georgi Gerganov
3b1aacbe6d
talk : talk with AI in the terminal
2 years ago
Georgi Gerganov
50a061b313
ggml : add alternative cblas_sgemm call
2 years ago
Al Hoang
04a16bbf11
fix compilation on haiku
2 years ago
Georgi Gerganov
b6597539f9
ggml : fix typo in previous commit
2 years ago
Georgi Gerganov
9a4b7a916e
ggml : use macros to inline FP16 <-> FP32 conversions
2 years ago
Georgi Gerganov
f8ec718b76
ggml : add F16C CPU flag check
2 years ago
katsu560
35b40a93b9
add fp16/fp32 convert intrinsics
2 years ago
Georgi Gerganov
061fc81bd6
ggml : remove inline specifier from fp16 <-> fp32 converters
2 years ago
Georgi Gerganov
388e9f79ad
ggml : fix the fix
2 years ago
Georgi Gerganov
35cd29ce1f
ggml : fix cross-compile Linux -> Windows with mingw (#168)
2 years ago
katsu560
804f36aa2c
ggml: change inline ggml_fp16_to_fp32, ggml_fp16_t ggml_fp32_to_fp16
2 years ago
katsu560
83456076f0
add AVX support
2 years ago
Georgi Gerganov
2065572a11
ggml : fix Windows build
2 years ago
boolemancer
0bfe728b84
Fix the Windows pthread_create shim
The current implementation doesn't actually set the out parameter,
and it returns 0 on failure instead of on success.
2 years ago
Georgi Gerganov
75171c2b79
ggml : multi-thread the ggml_add operator
2 years ago
Georgi Gerganov
137321915f
ggml : fix the check for NEON support (#7)
Was using the wrong preprocessor macro
2 years ago
Syed Jafri
24cd12f647
Cross compilation (#121)
* Cross compile windows
* set env properly
* rm log
* fix review
* Add back space
2 years ago
Mikhail Grigorev
8dac3c6e10
Fixed sched_yield
2 years ago
Mikhail Grigorev
6417e59aad
Implemented sched_yield function for Windows
2 years ago
Georgi Gerganov
e5044f87d9
ggml : fix barrier
2 years ago
Georgi Gerganov
a272f10b2e
ggml : fix thread-safety of ggml_init and ggml_free
2 years ago
Georgi Gerganov
fbd513b813
Add OpenBLAS support
Supported via CMake - just add:
cmake .. -DWHISPER_SUPPORT_OPENBLAS=ON
On Ubuntu, you have to install the library like this:
apt install libopenblas-dev
Unfortunately, I don't observe any benefit compared to the
original AVX2 + FP16 implementation. Maybe I'm missing something
2 years ago
Georgi Gerganov
34bb3ab0cf
ggml : add system info functions
2 years ago
Georgi Gerganov
c6710efde2
refactoring : move main + stream in examples + other stuff
2 years ago
Georgi Gerganov
db460b78ff
wip : WASM 128-bit SIMD support
2 years ago
Georgi Gerganov
e905c6f827
wip : initial WASM port
Works but it is very slow because no SIMD is used.
For example, jfk.wav is processed in ~23 seconds using "tiny.en" model
2 years ago
Georgi Gerganov
19817711b4
Add reference to FP16 repo
2 years ago
Georgi Gerganov
e36aabe00d
Correct implementation of FP16 GELU
Can toggle it via the GGML_GELU_FP16 macro
2 years ago
Georgi Gerganov
91632eb6ea
Revert GELU change
Seems it does not work on x86 for some reason
2 years ago
Georgi Gerganov
72d967bce4
Use Accelerate framework on Apple silicon
Huge performance improvement in the Encode (almost x2 on MacBook M1 Pro)
Also various extra optimizations:
- Multi-threaded NORM operator
- Faster GELU via F16 cast
2 years ago
Georgi Gerganov
0e858f080d
close #56 : build on FreeBSD
Thanks to @abelbabel for the contribution
2 years ago
Borislav Stanimirov
0b45d25151
Building with MSVC
2 years ago
lnyan
4bbb8a587b
Add MinGW support
2 years ago
Georgi Gerganov
e29a5dacc6
ref #11, #18, #26 : fix CACHE_LINE_SIZE constant
2 years ago
Georgi Gerganov
167324584b
wip : rpi4 support
2 years ago
Georgi Gerganov
f888c2373d
Flash + language support (ref #2)
- Achieved big performance improvement + memory usage reduction
- Can now translate / transcribe different languages
2 years ago
Georgi Gerganov
b0a11594ae
Initial release
2 years ago