Georgi Gerganov
2c3f50a021
release : v1.1.1
2 years ago
Georgi Gerganov
9a65269a20
.gitignore : add arm_neon.h
2 years ago
Georgi Gerganov
78f166174f
whisper : fix condition for providing past prompt (critical)
...
This bug has been present since v1.1.0.
Effectively, the past transcribed text wasn't being used for following
transcriptions, which likely significantly reduces the transcription
quality.
Likely related to #419
2 years ago
Georgi Gerganov
21c569ba4a
whisper : extend information in whisper_print_timings()
2 years ago
Georgi Gerganov
1a91c19af9
whisper : perform entropy check only when we have at least 32 tokens ( #412 )
2 years ago
Georgi Gerganov
f583e2d2f5
main : we had accidentally disabled the temperature fallback .. ( #291 )
2 years ago
Georgi Gerganov
206fc93396
whisper.wasm : add small and small.en models
2 years ago
Georgi Gerganov
a6cf6f4c4a
bench : minor fixes
2 years ago
Chia-Hsiang Cheng
472a473fd1
main : add an option to accept optional output filenames ( #424 )
...
* Add an option to accept optional output filenames
* Format the file
Co-authored-by: Chia-Hsiang Cheng <gary.chiahsiang.cheng@gmail.com>
2 years ago
Georgi Gerganov
9ba66c2fad
stream : fix handling of --step == --length ( #416 )
2 years ago
Georgi Gerganov
1ccb8a46a5
bench : fix Windows linkage by moving ggml benches in whisper lib ..
2 years ago
Georgi Gerganov
1290fc6457
bench : add memcpy and ggml_mul_mat benchmarks
2 years ago
Digipom
49b529ba74
whisper.android : add support for loading directly from asset in C ( #415 )
2 years ago
Georgi Gerganov
8088a977af
whisper : fix possible uninitialized variables ( #291 )
2 years ago
Georgi Gerganov
c9aeb33676
stream : fix --keep_context argument to be used correctly ( #354 )
2 years ago
Damian Czaja
4a3f0d3fe9
go : remove sample_best and sample_timestamp bindings ( #409 )
2 years ago
Georgi Gerganov
874bde887e
Update README.md
2 years ago
Georgi Gerganov
8738427dd6
cmake : bump version to 1.1.0
2 years ago
Georgi Gerganov
c3991bbb24
Update README.md
2 years ago
Georgi Gerganov
00ea21668b
whisper : account speed_up flag for short audio ( close #405 )
2 years ago
Georgi Gerganov
0b85e8c401
Update README.md
2 years ago
Georgi Gerganov
fafd78945d
bench.wasm : print system info
2 years ago
Georgi Gerganov
8de452c18b
Improve decoding ( #291 )
...
* whisper : prepare infra for new decoding strategies
* whisper : apply logit filters and compute logprobs
* whisper : add whisper_get_logits()
* whisper : separate self and cross attention memory
Initial step needed for supporting parallel decoders
* whisper : move probs_id buffer to whisper_context
* whisper : refactor kv cache into separate struct
* whisper : move self-attention kv cache to whisper_decoder
* whisper : wip decoding parameters + strategies
* whisper : wip decoding parameters + strategies (part 2)
* whisper : wip decoding parameters + strategies (part 3)
* whisper : wip decoding parameters + strategies (part 4)
* whisper : fix prompt_past update to not include prompt_init
* whisper : temperature + best_of support
* whisper : support for compression_ration_threshold
We actually use entropy, but it is similar
* command : fix example to use logits instead of obsolete probs
* whisper : handle empty sequence ranking
* whisper : add WHISPER_DEBUG + diagnostic prints + new main args
* whisper : minor fixes
* whisper : add beam-search support
* whisper : bug fix when there no previous context
* whisper : add comments
* stream : disable temperature fallback
For real-time processing, we always want a single decoder running at T=0
* whisper.swiftui : update example - fix paths + add empty folders
2 years ago
Georgi Gerganov
a6dbd9188b
stream : fix a bug that inserted a lot of empty audio at the start
...
The quality was terrible due to this
2 years ago
Georgi Gerganov
4ef3398e8f
ggml : remove obsolete zeroing + comment fixes ( #390 )
2 years ago
Ian Bicking
5e9f33596f
readme : clarify main and stream usage ( #391 )
...
Give an example of ./main that uses a sample file that's already there, and make the stream example clarify you need `make stream`
2 years ago
Abitofevrything
8d7b29cedd
ggml : correct behaviour of ggml_vec_sum_f32 ( #390 )
2 years ago
boolemancer
08dc705a69
whisper : fix sample_to_timestamp calculation with 64 bit precision to avoid overflow ( #388 )
...
* Do calculation with 64 bit precision to avoid overflow
* Update whisper.cpp
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
Syahmi Azhar
1512545149
whisper : add loader class to allow loading from buffer and others ( #353 )
...
* whisper : add loader to allow loading from other than file
* whisper : rename whisper_init to whisper_init_from_file
* whisper : add whisper_init_from_buffer
* android : Delete local.properties
* android : load models directly from assets
* whisper : adding <stddef.h> needed for size_t + code style
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
Georgi Gerganov
52a3e0c92a
ggml : improve vec_dot_f16 unrolling in flash_attn_f16
2 years ago
Georgi Gerganov
d1ea1220ff
command : clean-up / refactoring / formatting ( #383 )
2 years ago
David
9c4a1522f6
command : always-prompt mode ( #383 )
2 years ago
David Thorpe
f078a6f20e
go : adding features to the go-whisper example, go ci, etc ( #384 )
...
* Updated bindings so they can be used in third pary packages.
* Updated makefiles to set FMA flag on optionally, for xeon E5 on Darwin
* Added test script
* Changes for examples
* Reverted
* Made the NewContext method private
2 years ago
Georgi Gerganov
f30b5d322c
ggml : fix bug in new soft max computation
2 years ago
Georgi Gerganov
44efbf7ff1
cmake : add -Wno-unused-function + update whisper.js
2 years ago
Georgi Gerganov
d347a59a5f
ggml : when using BLAS start only 1 CPU thread
2 years ago
Georgi Gerganov
6394c906af
ggml : fix running tasks with variable number of threads
2 years ago
Georgi Gerganov
74ffa14e1d
ggml : unroll ggml_vec_dot_f16 in ggml_compute_forward_flash_attn_f16
2 years ago
Georgi Gerganov
65fdcbbbbb
whisper : revert accidental MB change
2 years ago
Georgi Gerganov
d61d55cd4b
ggml : speed-up soft max via Accelerate + unroll
2 years ago
Georgi Gerganov
d51fc3ee0a
ggml : use vDSP_sve and vDSP_maxv from Accelerate
2 years ago
Georgi Gerganov
f82a7dd019
ggml : make gcc happy (minor)
2 years ago
Georgi Gerganov
87dd4a3081
talk.wasm : bump memory usage + update whisper.js
2 years ago
m.bell
41e05c6b1b
cmake : support AVX2 in Windows better ( #381 )
2 years ago
Georgi Gerganov
fa379cb22a
Revert "tmp"
...
This reverts commit 1652965529
.
2 years ago
David Thorpe
322f4e6c4e
go : bindings updated so they can be used in third party packages. ( #379 )
...
* Updated bindings so they can be used in third pary packages.
* Updated makefiles to set FMA flag on optionally, for xeon E5 on Darwin
2 years ago
Georgi Gerganov
1652965529
tmp
2 years ago
Georgi Gerganov
6042c7a3be
cmake : change min required version to 3.0 ( #351 )
...
We increase the min version only when want to use particular
functionality that is available in the newer version
2 years ago
Georgi Gerganov
6b351bb669
command : add "guided-mode" video demo in the README.md
2 years ago
Abitofevrything
a62170c656
ggml : add SSE3 and fp16 conversion lookup table ( #368 )
...
* Improves WASM performance:
On MacBook M1 Pro, I observe 25% faster using Firefox and 35% faster using Chrome
* Add support for SSE3 SIMD
* Add SSE3 to system information
* Add Imath support for fp16-fp32 conversions
* Add Imath to system information
* Wrap Imath calls to avoid static function warnings
* Drop Imath; Add lookup table for f16 -> f32 conversions
* Remove TODO comments
* Update SSE3 to new macro arguments
* Correct updated macro definitions
* Prefer static inline where possible
* ggml : static inlines + add public f16 <-> f32 conversions
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago