genevera (she/her)
459753342d
yt-wsp.sh : add unique filename generation ( #495 )
Co-authored-by: genevera <genevera@noreply.users.github.com>
2 years ago
Georgi Gerganov
9764782bd9
readme : add another .NET repo ( #303 )
2 years ago
Georgi Gerganov
3b010f9bed
readme : add .NET repo ( #303 )
2 years ago
Avik Sengupta
113fcec513
cmake : install whisper.h header ( #485 )
Including the header file in the install bundle helps projects that ship binaries.
2 years ago
shibukazu
cfc06bf8df
whisper : suppress non-speech-related token outputs ( #473 )
* add non-speech-token suppression
* add suppress non-speech_tokens param
2 years ago
sandrohanea
2bfe0ebc0f
whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder ( #474 )
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com>
2 years ago
boolemancer
4dd7119deb
whisper : only trim if split_on_word is true ( #476 )
2 years ago
Qianhe Chen
ab1916fc59
ci : add node addon test and optimize compilation configuration ( #468 )
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
* ci: add addon.node test ci
* addon: remove build WHISPER_BUILD_TESTS
* addon: update build command
* addon: add test
* addon: add test file
* addon: adapt to compile on Windows
* addon: fix typo
* addon: reuse jfk.wav
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
kamranjon
a1c1583cc7
whisper : add whisper_full_lang_id() for getting the context lang ( #461 )
2 years ago
Matija Pevec
d012b5c7e4
whisper : add "split_on_word" flag when using "max_len" option ( #455 )
* Update whisper.cpp
* fix: trim function
* feat: added flag to split on word
* fix: arguments for main
2 years ago
Georgi Gerganov
b2083c5d02
release : v1.2.0
2 years ago
Georgi Gerganov
f3ee4a9673
whisper : reduce memory usage during inference ( #431 )
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
2 years ago
Qianhe Chen
c306a7fd89
addon.node : using whisper as a Node.js addon ( #443 )
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
2 years ago
polarmoon
b2fc4c7010
go : support "auto" as an option when set language ( #462 )
Co-authored-by: Ming <ming@localhost>
2 years ago
Georgi Gerganov
291980369c
whisper : suppress task tokens ( #442 )
2 years ago
Taisei Mima
86ef64a855
wasm : fix typo in helper.js ( #459 )
2 years ago
Alex Bacart
3b1960520a
main : CSV format export trimmed spaces fix ( #444 )
* Update main.cpp
Removed string trimming
* Update main.cpp
* Update main.cpp
* Revert "Update main.cpp"
This reverts commit d8924fdcfe.
* Revert "Update main.cpp"
This reverts commit 252e508d85.
2 years ago
Lukas Rist
2bee2650c6
go : add wrapper for system info ( #456 )
2 years ago
Robin
beb9512be3
go : add WhisperLangAutoDetect method to go binding ( #451 )
2 years ago
Eric Tendian
47737b2e82
livestream.sh : run main with model arg instead of default ( #453 )
Actually utilizes the $model var when calling ./main.
2 years ago
Georgi Gerganov
b992f3709e
whisper : do not provide past prompt when n_max_text_ctx == 0
2 years ago
Georgi Gerganov
60337f5306
wasm : check if navigator.storage.estimate() is available
Safari does not support it
2 years ago
Lukas Rist
02c7516c57
go : added wrappers to reset and print timings ( #436 )
2 years ago
Georgi Gerganov
411ea9b833
ci : run workflows on pull requests + bindings depend on .h ( #446 )
2 years ago
Ondrej Kokes
11f61cecd6
whisper.wasm : add labels for easier radio selection ( #435 )
2 years ago
Georgi Gerganov
b5ddb16ec7
whisper : condition timestamps to be monotonically increasing ( #425 )
2 years ago
fitzsim
ae16c21e9c
whisper : PPC64 big-endian support ( #398 )
* ggml : set cache line size to 128 on POWER9
* whisper : add PPC64 big endian support
2 years ago
Georgi Gerganov
2c3f50a021
release : v1.1.1
2 years ago
Georgi Gerganov
9a65269a20
.gitignore : add arm_neon.h
2 years ago
Georgi Gerganov
78f166174f
whisper : fix condition for providing past prompt (critical)
This bug has been present since v1.1.0.
Effectively, the past transcribed text wasn't being used for following
transcriptions, which likely significantly reduces the transcription
quality.
Likely related to #419
2 years ago
Georgi Gerganov
21c569ba4a
whisper : extend information in whisper_print_timings()
2 years ago
Georgi Gerganov
1a91c19af9
whisper : perform entropy check only when we have at least 32 tokens ( #412 )
2 years ago
Georgi Gerganov
f583e2d2f5
main : we had accidentally disabled the temperature fallback .. ( #291 )
2 years ago
Georgi Gerganov
206fc93396
whisper.wasm : add small and small.en models
2 years ago
Georgi Gerganov
a6cf6f4c4a
bench : minor fixes
2 years ago
Chia-Hsiang Cheng
472a473fd1
main : add an option to accept optional output filenames ( #424 )
* Add an option to accept optional output filenames
* Format the file
Co-authored-by: Chia-Hsiang Cheng <gary.chiahsiang.cheng@gmail.com>
2 years ago
Georgi Gerganov
9ba66c2fad
stream : fix handling of --step == --length ( #416 )
2 years ago
Georgi Gerganov
1ccb8a46a5
bench : fix Windows linkage by moving ggml benches in whisper lib ..
2 years ago
Georgi Gerganov
1290fc6457
bench : add memcpy and ggml_mul_mat benchmarks
2 years ago
Digipom
49b529ba74
whisper.android : add support for loading directly from asset in C ( #415 )
2 years ago
Georgi Gerganov
8088a977af
whisper : fix possible uninitialized variables ( #291 )
2 years ago
Georgi Gerganov
c9aeb33676
stream : fix --keep_context argument to be used correctly ( #354 )
2 years ago
Damian Czaja
4a3f0d3fe9
go : remove sample_best and sample_timestamp bindings ( #409 )
2 years ago
Georgi Gerganov
874bde887e
Update README.md
2 years ago
Georgi Gerganov
8738427dd6
cmake : bump version to 1.1.0
2 years ago
Georgi Gerganov
c3991bbb24
Update README.md
2 years ago
Georgi Gerganov
00ea21668b
whisper : account speed_up flag for short audio ( close #405 )
2 years ago
Georgi Gerganov
0b85e8c401
Update README.md
2 years ago
Georgi Gerganov
fafd78945d
bench.wasm : print system info
2 years ago
Georgi Gerganov
8de452c18b
Improve decoding ( #291 )
* whisper : prepare infra for new decoding strategies
* whisper : apply logit filters and compute logprobs
* whisper : add whisper_get_logits()
* whisper : separate self and cross attention memory
Initial step needed for supporting parallel decoders
* whisper : move probs_id buffer to whisper_context
* whisper : refactor kv cache into separate struct
* whisper : move self-attention kv cache to whisper_decoder
* whisper : wip decoding parameters + strategies
* whisper : wip decoding parameters + strategies (part 2)
* whisper : wip decoding parameters + strategies (part 3)
* whisper : wip decoding parameters + strategies (part 4)
* whisper : fix prompt_past update to not include prompt_init
* whisper : temperature + best_of support
* whisper : support for compression_ratio_threshold
We actually use entropy, but it is similar
* command : fix example to use logits instead of obsolete probs
* whisper : handle empty sequence ranking
* whisper : add WHISPER_DEBUG + diagnostic prints + new main args
* whisper : minor fixes
* whisper : add beam-search support
* whisper : bug fix when there is no previous context
* whisper : add comments
* stream : disable temperature fallback
For real-time processing, we always want a single decoder running at T=0
* whisper.swiftui : update example - fix paths + add empty folders
2 years ago