Commit Graph

115 Commits (32d1a7b65286cf0db582181040bed7adcdc7eba6)

Author SHA1 Message Date
Roland Rabien e70d47baab
Remove C++20 requirement (#257)
3 years ago
bert hubert d1da35de06 fix potential bug reading model data into a small size optimized string which could lead to memory corruption. In an SSO string, you can't write data to &str[0] and expect it to work well.
3 years ago
Georgi Gerganov 603f97ba11
whisper : minor improvemnt in decoding strategy (#244)
3 years ago
Georgi Gerganov f8ec718b76
ggml : add F16C CPU flag check
3 years ago
Georgi Gerganov 78d13257be
Try to improve the token sampling strategy (#193)
3 years ago
Georgi Gerganov 4698dcdb52 whisper : add mechanism for aborting the whisper_full() computation
3 years ago
Georgi Gerganov e266cb0723
whisper.objc : add real-time processing (#97)
3 years ago
Georgi Gerganov c207eed431
whisper.objc : fix build warnings
3 years ago
Georgi Gerganov be16dfa038
whisper.wasm : do not block page while processing (close #86)
3 years ago
Georgi Gerganov b8ce25dec1
refactoring : more readable code
3 years ago
Georgi Gerganov 128aaadb93
whisper : improve printfs
3 years ago
katsu560 83456076f0 add AVX support
3 years ago
Georgi Gerganov 49706a658a
minor : updates few prints + fix buttons in whisper.wasm
3 years ago
Georgi Gerganov 385236d1d3
stream : "-kc" now enables context keeping from previous segment (#90)
3 years ago
M. Eren Akbiyik 63ae03b8e0
Prompt previous tokens for streaming (#163)
3 years ago
Georgi Gerganov a4dfbeecf9
talk.wasm : GPT-2 meets Whisper in WebAssembly (#155)
3 years ago
Georgi Gerganov fb8d77f760 stream : add "audio_ctx" parameter
3 years ago
Georgi Gerganov 62b5ff875c stream : add "max_tokens" parameter
3 years ago
Georgi Gerganov d351771a4b stream : add "single_segment" option
3 years ago
Georgi Gerganov c058aaf22e stream : partial encoder experiments
3 years ago
greeshmay 2ba66360c9
fix: free ggml_context (close #149) (#150)
3 years ago
Georgi Gerganov 83c742f1a7 whisper : add option to speed up the audio tempo by x2
3 years ago
Georgi Gerganov c30bffc8a5
ref #22 : add "duration" option
3 years ago
Georgi Gerganov d5afebd37c
whisper : token-level timestamp refactoring (#49, #120)
3 years ago
Georgi Gerganov 02dfd5b8c3
whisper : fix extra memory usage after recent processor changes
3 years ago
Georgi Gerganov 57fb46f307 main : add option for word-leve timestamps (very experimental)
3 years ago
Georgi Gerganov eba62e0fa1
close #113 : fix struct whisper_token_data
3 years ago
Georgi Gerganov 014a119052 minor : fix multiple definitions of to_timestamp()
3 years ago
Georgi Gerganov dec40be58f parallel : print time of audio boundaries + fix timings
3 years ago
Georgi Gerganov 0b2dc3c82c parallel : working
3 years ago
Georgi Gerganov 85d6e1e1e7 main : fix sampling time + add max_context parameter
3 years ago
Georgi Gerganov 72e9cdd6bf parallel : adding tool for parallel transformer inference
3 years ago
Borislav Stanimirov c565c569e7 Define WHISPER_BUILD so as to export symbols on Windows
3 years ago
Georgi Gerganov 34bb3ab0cf ggml : add system info functions
3 years ago
Georgi Gerganov 5f7e9fa2dc
ref #68, #79 : fix segment time output
3 years ago
Georgi Gerganov 7affd309d3 whisper : add new-segment callback
3 years ago
Georgi Gerganov 31ff0c6a1f wip : experimental color coding of tokens based on probabilities
3 years ago
Georgi Gerganov 8d15a1c635
ci : fix and re-enable tests (2nd try)
3 years ago
Georgi Gerganov 692aa0784f
Revert "ci : fix and re-enable tests"
3 years ago
Georgi Gerganov 80aefc9514
ci : fix and re-enable tests
3 years ago
Georgi Gerganov 7eeef0358a
ref #52 : improve greedy sampling strategy
3 years ago
Georgi Gerganov e30cf83158
ref #57, #62, #63 : remove unions in C-api + remove designated initializers
3 years ago
Georgi Gerganov d6b84b2a23
ref #62 : fix build for some compilers
3 years ago
Georgi Gerganov b4a3875b2c
Revert recent sampling change
3 years ago
Georgi Gerganov cf67bfffa0 Fix EOT token handling
3 years ago
Georgi Gerganov d14823582d Try to improve the sampling strategy a bit
3 years ago
Georgi Gerganov 20d8e7a309 Fix memory sizes
3 years ago
Georgi Gerganov 72d967bce4 Use Accelerate framework on Apple silicon
3 years ago
Georgi Gerganov 0ad085f5e8
ref #48 : clear results at the start of whisper_full
3 years ago
0/0 b799226973 check if spectogram length is <100 before doing anything else
3 years ago
Borislav Stanimirov 0b45d25151 Building with MSVC
3 years ago
Georgi Gerganov 63b6786767
Minor
3 years ago
lnyan 4bbb8a587b Add MinGW support
3 years ago
Georgi Gerganov 2ca8cc77b2 ref #17 : print whisper logs to stderr
3 years ago
Georgi Gerganov 8c7c018893 ref #17 : add options to output result to file
3 years ago
Georgi Gerganov b43b36e006 Update tests
3 years ago
Georgi Gerganov 2f069335ab Adding sanitizer tests
3 years ago
Georgi Gerganov 332c9d77fe whisper : fix bug in token sampling logic
3 years ago
Georgi Gerganov 481cd685d5
ref #10 : option to keep context in "stream" example
3 years ago
Georgi Gerganov 7787b878e1
ref #16, #22 : add "offset" argument
3 years ago
Georgi Gerganov 167324584b wip : rpi4 support
3 years ago
Georgi Gerganov ce1fe95902 wip : improve makefile
3 years ago
Georgi Gerganov 6814cc9b02 Improve result printing
3 years ago
Georgi Gerganov eba33adadd Extend C-style API with full inference methods
3 years ago
Georgi Gerganov 6b77124e01 Initial C-style interface for whisper.cpp
3 years ago