Commit Graph

117 Commits (master)

Author SHA1 Message Date
Georgi Gerganov 6a7c82501e
whisper : improve decoding strategy (#244)
1 year ago
Georgi Gerganov 124c718c73
whisper : fix UB when reading buffer of length 0 bytes (#265)
1 year ago
Roland Rabien e70d47baab
Remove C++20 requirement (#257)
1 year ago
bert hubert d1da35de06 fix potential bug reading model data into a small size optimized string which could lead to memory corruption. In an SSO string, you can't write data to &str[0] and expect it to work well.
1 year ago
Georgi Gerganov 603f97ba11
whisper : minor improvemnt in decoding strategy (#244)
1 year ago
Georgi Gerganov f8ec718b76
ggml : add F16C CPU flag check
1 year ago
Georgi Gerganov 78d13257be
Try to improve the token sampling strategy (#193)
1 year ago
Georgi Gerganov 4698dcdb52 whisper : add mechanism for aborting the whisper_full() computation
1 year ago
Georgi Gerganov e266cb0723
whisper.objc : add real-time processing (#97)
1 year ago
Georgi Gerganov c207eed431
whisper.objc : fix build warnings
1 year ago
Georgi Gerganov be16dfa038
whisper.wasm : do not block page while processing (close #86)
1 year ago
Georgi Gerganov b8ce25dec1
refactoring : more readable code
1 year ago
Georgi Gerganov 128aaadb93
whisper : improve printfs
1 year ago
katsu560 83456076f0 add AVX support
1 year ago
Georgi Gerganov 49706a658a
minor : updates few prints + fix buttons in whisper.wasm
1 year ago
Georgi Gerganov 385236d1d3
stream : "-kc" now enables context keeping from previous segment (#90)
1 year ago
M. Eren Akbiyik 63ae03b8e0
Prompt previous tokens for streaming (#163)
1 year ago
Georgi Gerganov a4dfbeecf9
talk.wasm : GPT-2 meets Whisper in WebAssembly (#155)
1 year ago
Georgi Gerganov fb8d77f760 stream : add "audio_ctx" parameter
1 year ago
Georgi Gerganov 62b5ff875c stream : add "max_tokens" parameter
1 year ago
Georgi Gerganov d351771a4b stream : add "single_segment" option
1 year ago
Georgi Gerganov c058aaf22e stream : partial encoder experiments
1 year ago
greeshmay 2ba66360c9
fix: free ggml_context (close #149) (#150)
1 year ago
Georgi Gerganov 83c742f1a7 whisper : add option to speed up the audio tempo by x2
2 years ago
Georgi Gerganov c30bffc8a5
ref #22 : add "duration" option
2 years ago
Georgi Gerganov d5afebd37c
whisper : token-level timestamp refactoring (#49, #120)
2 years ago
Georgi Gerganov 02dfd5b8c3
whisper : fix extra memory usage after recent processor changes
2 years ago
Georgi Gerganov 57fb46f307 main : add option for word-leve timestamps (very experimental)
2 years ago
Georgi Gerganov eba62e0fa1
close #113 : fix struct whisper_token_data
2 years ago
Georgi Gerganov 014a119052 minor : fix multiple definitions of to_timestamp()
2 years ago
Georgi Gerganov dec40be58f parallel : print time of audio boundaries + fix timings
2 years ago
Georgi Gerganov 0b2dc3c82c parallel : working
2 years ago
Georgi Gerganov 85d6e1e1e7 main : fix sampling time + add max_context parameter
2 years ago
Georgi Gerganov 72e9cdd6bf parallel : adding tool for parallel transformer inference
2 years ago
Borislav Stanimirov c565c569e7 Define WHISPER_BUILD so as to export symbols on Windows
2 years ago
Georgi Gerganov 34bb3ab0cf ggml : add system info functions
2 years ago
Georgi Gerganov 5f7e9fa2dc
ref #68, #79 : fix segment time output
2 years ago
Georgi Gerganov 7affd309d3 whisper : add new-segment callback
2 years ago
Georgi Gerganov 31ff0c6a1f wip : experimental color coding of tokens based on probabilities
2 years ago
Georgi Gerganov 8d15a1c635
ci : fix and re-enable tests (2nd try)
2 years ago
Georgi Gerganov 692aa0784f
Revert "ci : fix and re-enable tests"
2 years ago
Georgi Gerganov 80aefc9514
ci : fix and re-enable tests
2 years ago
Georgi Gerganov 7eeef0358a
ref #52 : improve greedy sampling strategy
2 years ago
Georgi Gerganov e30cf83158
ref #57, #62, #63 : remove unions in C-api + remove designated initializers
2 years ago
Georgi Gerganov d6b84b2a23
ref #62 : fix build for some compilers
2 years ago
Georgi Gerganov b4a3875b2c
Revert recent sampling change
2 years ago
Georgi Gerganov cf67bfffa0 Fix EOT token handling
2 years ago
Georgi Gerganov d14823582d Try to improve the sampling strategy a bit
2 years ago
Georgi Gerganov 20d8e7a309 Fix memory sizes
2 years ago
Georgi Gerganov 72d967bce4 Use Accelerate framework on Apple silicon
2 years ago
Georgi Gerganov 0ad085f5e8
ref #48 : clear results at the start of whisper_full
2 years ago
0/0 b799226973 check if spectogram length is <100 before doing anything else
2 years ago
Borislav Stanimirov 0b45d25151 Building with MSVC
2 years ago
Georgi Gerganov 63b6786767
Minor
2 years ago
lnyan 4bbb8a587b Add MinGW support
2 years ago
Georgi Gerganov 2ca8cc77b2 ref #17 : print whisper logs to stderr
2 years ago
Georgi Gerganov 8c7c018893 ref #17 : add options to output result to file
2 years ago
Georgi Gerganov b43b36e006 Update tests
2 years ago
Georgi Gerganov 2f069335ab Adding sanitizer tests
2 years ago
Georgi Gerganov 332c9d77fe whisper : fix bug in token sampling logic
2 years ago
Georgi Gerganov 481cd685d5
ref #10 : option to keep context in "stream" example
2 years ago
Georgi Gerganov 7787b878e1
ref #16, #22 : add "offset" argument
2 years ago
Georgi Gerganov 167324584b wip : rpi4 support
2 years ago
Georgi Gerganov ce1fe95902 wip : improve makefile
2 years ago
Georgi Gerganov 6814cc9b02 Improve result printing
2 years ago
Georgi Gerganov eba33adadd Extend C-style API with full inference methods
2 years ago
Georgi Gerganov 6b77124e01 Initial C-style interface for whisper.cpp
2 years ago