Commit Graph

40 Commits (52a3e0c92a8be5150d2a59e492b4943ca8a623b0)

Author SHA1 Message Date
Georgi Gerganov d51c5eb906
ggml : define MIN / MAX only if not defined (minor)
2 years ago
Georgi Gerganov d97e6005e9
whisper : add whisper_n_audio_ctx and check for invalid audio_ctx
2 years ago
Matheus de Sousa 8e3f129b4d
minor : resolves some of warnings when compiling with clang/clang++ (#294)
2 years ago
Georgi Gerganov fba10a4c68 whisper : language auto-detect (#59)
2 years ago
Georgi Gerganov bf69b669a0 whisper : add whisper_tokenize()
2 years ago
Georgi Gerganov 78d13257be
Try to improve the token sampling strategy (#193)
2 years ago
Georgi Gerganov 4698dcdb52 whisper : add mechanism for aborting the whisper_full() computation
2 years ago
Georgi Gerganov e266cb0723
whisper.objc : add real-time processing (#97)
2 years ago
Georgi Gerganov c207eed431
whisper.objc : fix build warnings
2 years ago
Georgi Gerganov be16dfa038
whisper.wasm : do not block page while processing (close #86)
2 years ago
Georgi Gerganov b8ce25dec1
refactoring : more readable code
2 years ago
Georgi Gerganov 385236d1d3
stream : "-kc" now enables context keeping from previous segment (#90)
2 years ago
M. Eren Akbiyik 63ae03b8e0
Prompt previous tokens for streaming (#163)
2 years ago
Georgi Gerganov fb8d77f760 stream : add "audio_ctx" parameter
2 years ago
Georgi Gerganov 62b5ff875c stream : add "max_tokens" parameter
2 years ago
Georgi Gerganov d351771a4b stream : add "single_segment" option
2 years ago
Georgi Gerganov c058aaf22e stream : partial encoder experiments
2 years ago
Georgi Gerganov 83c742f1a7 whisper : add option to speed up the audio tempo by x2
2 years ago
Georgi Gerganov c30bffc8a5
ref #22 : add "duration" option
2 years ago
Georgi Gerganov d5afebd37c
whisper : token-level timestamp refactoring (#49, #120)
2 years ago
Georgi Gerganov 57fb46f307 main : add option for word-leve timestamps (very experimental)
2 years ago
Georgi Gerganov eba62e0fa1
close #113 : fix struct whisper_token_data
2 years ago
Georgi Gerganov dec40be58f parallel : print time of audio boundaries + fix timings
2 years ago
Georgi Gerganov 0b2dc3c82c parallel : working
2 years ago
Georgi Gerganov 85d6e1e1e7 main : fix sampling time + add max_context parameter
2 years ago
Georgi Gerganov 72e9cdd6bf parallel : adding tool for parallel transformer inference
2 years ago
Georgi Gerganov 34bb3ab0cf ggml : add system info functions
2 years ago
Georgi Gerganov 7affd309d3 whisper : add new-segment callback
2 years ago
Georgi Gerganov 31ff0c6a1f wip : experimental color coding of tokens based on probabilities
2 years ago
Georgi Gerganov 7eeef0358a
ref #52 : improve greedy sampling strategy
2 years ago
Georgi Gerganov 2d171ced32
close #32 : add comment about thread-safety of the C-style API
2 years ago
Georgi Gerganov e30cf83158
ref #57, #62, #63 : remove unions in C-api + remove designated initializers
2 years ago
Georgi Gerganov 9d5723435f
ref #35 : add <stdbool.h> to whisper.h
2 years ago
Georgi Gerganov 9bbca3110f ref #9 : add API documentation in whisper.h
2 years ago
Georgi Gerganov 2f069335ab Adding sanitizer tests
2 years ago
Georgi Gerganov 481cd685d5
ref #10 : option to keep context in "stream" example
2 years ago
Georgi Gerganov 7787b878e1
ref #16, #22 : add "offset" argument
2 years ago
Georgi Gerganov 6814cc9b02 Improve result printing
2 years ago
Georgi Gerganov eba33adadd Extend C-style API with full inference methods
2 years ago
Georgi Gerganov 6b77124e01 Initial C-style interface for whisper.cpp
2 years ago