Commit Graph

432 Commits (d83e47573b85732bba732840916f8a20a04aae9b)
 

Author SHA1 Message Date
Georgi Gerganov d83e47573b
whisper : add comments
3 years ago
Georgi Gerganov 6700cd57f7
whisper : bug fix when there no previous context
3 years ago
Georgi Gerganov 5548a1986f
whisper : add beam-search support
3 years ago
Georgi Gerganov 5e97f80fc5
whisper : minor fixes
3 years ago
Georgi Gerganov c301a7942b
whisper : add WHISPER_DEBUG + diagnostic prints + new main args
3 years ago
Georgi Gerganov c6a8a47039
whisper : handle empty sequence ranking
3 years ago
Georgi Gerganov 7ea1b736ec
command : fix example to use logits instead of obsolete probs
3 years ago
Georgi Gerganov c67716faba
whisper : support for compression_ration_threshold
3 years ago
Georgi Gerganov 34c5110f59
whisper : temperature + best_of support
3 years ago
Georgi Gerganov 712bc4b960
whisper : fix prompt_past update to not include prompt_init
3 years ago
Georgi Gerganov bd6e70b512
whisper : wip decoding parameters + strategies (part 4)
3 years ago
Georgi Gerganov 116dd67a15
whisper : wip decoding parameters + strategies (part 3)
3 years ago
Georgi Gerganov 3d723d0b82
whisper : wip decoding parameters + strategies (part 2)
3 years ago
Georgi Gerganov 9551d7fabd
whisper : wip decoding parameters + strategies
3 years ago
Georgi Gerganov 628843c60a
whisper : move self-attention kv cache to whisper_decoder
3 years ago
Georgi Gerganov ee58108df8
whisper : refactor kv cache into separate struct
3 years ago
Georgi Gerganov 1163f266d2
whisper : move probs_id buffer to whisper_context
3 years ago
Georgi Gerganov 523e0494a6
whisper : separate self and cross attention memory
3 years ago
Georgi Gerganov 21559537e2
whisper : add whisper_get_logits()
3 years ago
Georgi Gerganov 2d8d3724b8
whisper : apply logit filters and compute logprobs
3 years ago
Georgi Gerganov abe104b80b
whisper : prepare infra for new decoding strategies
3 years ago
Georgi Gerganov 4ef3398e8f
ggml : remove obsolete zeroing + comment fixes (#390)
3 years ago
Ian Bicking 5e9f33596f
readme : clarify main and stream usage (#391)
3 years ago
Abitofevrything 8d7b29cedd
ggml : correct behaviour of ggml_vec_sum_f32 (#390)
3 years ago
boolemancer 08dc705a69
whisper : fix sample_to_timestamp calculation with 64 bit precision to avoid overflow (#388)
3 years ago
Syahmi Azhar 1512545149
whisper : add loader class to allow loading from buffer and others (#353)
3 years ago
Georgi Gerganov 52a3e0c92a
ggml : improve vec_dot_f16 unrolling in flash_attn_f16
3 years ago
Georgi Gerganov d1ea1220ff
command : clean-up / refactoring / formatting (#383)
3 years ago
David 9c4a1522f6
command : always-prompt mode (#383)
3 years ago
David Thorpe f078a6f20e
go : adding features to the go-whisper example, go ci, etc (#384)
3 years ago
Georgi Gerganov f30b5d322c
ggml : fix bug in new soft max computation
3 years ago
Georgi Gerganov 44efbf7ff1
cmake : add -Wno-unused-function + update whisper.js
3 years ago
Georgi Gerganov d347a59a5f
ggml : when using BLAS start only 1 CPU thread
3 years ago
Georgi Gerganov 6394c906af
ggml : fix running tasks with variable number of threads
3 years ago
Georgi Gerganov 74ffa14e1d
ggml : unroll ggml_vec_dot_f16 in ggml_compute_forward_flash_attn_f16
3 years ago
Georgi Gerganov 65fdcbbbbb
whisper : revert accidental MB change
3 years ago
Georgi Gerganov d61d55cd4b
ggml : speed-up soft max via Accelerate + unroll
3 years ago
Georgi Gerganov d51fc3ee0a
ggml : use vDSP_sve and vDSP_maxv from Accelerate
3 years ago
Georgi Gerganov f82a7dd019
ggml : make gcc happy (minor)
3 years ago
Georgi Gerganov 87dd4a3081
talk.wasm : bump memory usage + update whisper.js
3 years ago
m.bell 41e05c6b1b
cmake : support AVX2 in Windows better (#381)
3 years ago
Georgi Gerganov fa379cb22a
Revert "tmp"
3 years ago
David Thorpe 322f4e6c4e
go : bindings updated so they can be used in third party packages. (#379)
3 years ago
Georgi Gerganov 1652965529
tmp
3 years ago
Georgi Gerganov 6042c7a3be
cmake : change min required version to 3.0 (#351)
3 years ago
Georgi Gerganov 6b351bb669
command : add "guided-mode" video demo in the README.md
3 years ago
Abitofevrything a62170c656
ggml : add SSE3 and fp16 conversion lookup table (#368)
3 years ago
Thomas Fitzsimmons 1944e7c33e whisper : document POWER VSX support
3 years ago
Thomas Fitzsimmons 49a8dd6732 ggml : reorganize POWER9 ppc64le SIMD code
3 years ago
Thomas Fitzsimmons 8c7f642286 ggml : change f16 load and store macro arguments
3 years ago