Georgi Gerganov
f3ee4a9673
whisper : reduce memory usage during inference ( #431 )
...
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
2 years ago
Georgi Gerganov
60337f5306
wasm : check if navigator.storage.estimate() is available
...
Safari does not support it
2 years ago
Georgi Gerganov
206fc93396
whisper.wasm : add small and small.en models
2 years ago
Georgi Gerganov
fafd78945d
bench.wasm : print system info
2 years ago
Georgi Gerganov
44efbf7ff1
cmake : add -Wno-unused-function + update whisper.js
2 years ago
Georgi Gerganov
87dd4a3081
talk.wasm : bump memory usage + update whisper.js
2 years ago
Georgi Gerganov
4a214d2f07
cmake : add CMAKE_RUNTIME_OUTPUT_DIRECTORY
...
Currently needed by the wasm examples
2 years ago
Georgi Gerganov
f309f97df6
Node.js package ( #260 )
...
* npm : preparing infra for node package
* npm : package infra ready
* npm : initial version ready
* npm : change name to whisper.cpp
whisper.js is taken
2 years ago
Georgi Gerganov
fcf515de60
bench.wasm : same as "bench" but runs in the browser ( #89 )
2 years ago
Georgi Gerganov
be16dfa038
whisper.wasm : do not block page while processing ( close #86 )
2 years ago
Georgi Gerganov
abce28ea99
talk.wasm : move to https://whisper.ggerganov.com/talk
...
This way, we can share the same models across different WASM examples
and not have to download them for each page
2 years ago
Georgi Gerganov
be3b720f96
talk.wasm : refactoring + update README.md
2 years ago
Georgi Gerganov
9aea96f774
talk.wasm : polishing + adding many AI personalities
2 years ago
Georgi Gerganov
a4dfbeecf9
talk.wasm : GPT-2 meets Whisper in WebAssembly ( #155 )
...
* talk : initial real-time transcription in the browser
* talk : polishing the UI
* talk : ready for beta testing
* talk.wasm : rename example
2 years ago
Georgi Gerganov
b21213c23e
js : update whipser.js to latest
2 years ago
Georgi Gerganov
69bdb6624a
minor : update whisper.js
2 years ago
Georgi Gerganov
12fb303d9d
whisper.wasm : update system info print
2 years ago
Georgi Gerganov
491ecd7056
wip : polishing WASM example
2 years ago
Georgi Gerganov
db460b78ff
wip : WASM 128-bit SIMD support
2 years ago
Georgi Gerganov
e905c6f827
wip : initial WASM port
...
Works but it is very slow because no SIMD is used.
For example, jfk.wav is processed in ~23 seconds using "tiny.en" model
2 years ago