Finn Voorhees
fb4d0d470f
whisper : fix uninitialized exp_n_audio_ctx
2 years ago
Georgi Gerganov
0d229163bb
whisper : add API for applying custom logits filters during decoding
2 years ago
Georgi Gerganov
f254e78737
yt-wsp.sh : print help on empty args
2 years ago
Georgi Gerganov
a94897bcde
whisper : by default disable non-speech tokens suppression ( #473 )
...
This seems to be causing hallucinations in the end of the audio, e.g.:
"Thank you for listening"
"Amen"
..
2 years ago
Georgi Gerganov
2407ae8ef0
readme : add Ruby discussion + update .NET discussion
2 years ago
Todd
b623ca43b1
bindings : add Ruby ( #500 )
...
* adding ruby bindings
* avoid adding these they are copied in via extconf.rb
* ignore these files here
* add definitions for boolean params
* initial transcribe for ruby
* use en model and transcribe jfk with assertion
* possibly this works for building ruby binding
* ci : try to add ruby workflow
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
conradg
69e6e4644a
main : fix std in input ( #503 )
...
if we don't add this as an explicit check, then we get an "error: unknown argument: -" later on
2 years ago
Georgi Gerganov
09d7d2b68e
examples : refactor in order to reuse code and reduce duplication ( #482 )
...
* examples : refactor common code into a library
* examples : refactor common SDL code into a library
* make : update Makefile to use common libs
* common : fix MSVC M_PI ..
* addon.node : link common lib
2 years ago
shikokuchuo
0336161b7d
whisper : fix signedness compiler warning ( #506 )
2 years ago
genevera (she/her)
459753342d
yt-wsp.sh : add unique filename generation ( #495 )
...
Co-authored-by: genevera <genevera@noreply.users.github.com>
2 years ago
Georgi Gerganov
9764782bd9
readme : add another .NET repo ( #303 )
2 years ago
Georgi Gerganov
3b010f9bed
readme : add .NET repo ( #303 )
2 years ago
Avik Sengupta
113fcec513
cmake : install whisper.h header ( #485 )
...
Including the header file in the install bundle helps projects that ship binaries.
2 years ago
shibukazu
cfc06bf8df
whisper : suppress non-speech-related token outputs ( #473 )
...
* add non-speech-token suppression
* add suppress non-speech_tokens param
2 years ago
sandrohanea
2bfe0ebc0f
whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder ( #474 )
...
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com>
2 years ago
boolemancer
4dd7119deb
whisper : only trim if split_on_word is true ( #476 )
2 years ago
Qianhe Chen
ab1916fc59
ci : add node addon test and optimize compilation configuration ( #468 )
...
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
* ci: add addon.node test ci
* addon: remove build WHISPER_BUILD_TESTS
* addon: update build command
* addon: add test
* addon: add test file
* addon: adapt to compile on Windows
* addon: fix typo
* addon: reuse jfk.wav
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* addon: reuse jfk.wav
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
kamranjon
a1c1583cc7
whisper : add whisper_full_lang_id() for getting the context lang ( #461 )
2 years ago
Matija Pevec
d012b5c7e4
whisper : add "split_on_word" flag when using using "max_len" option ( #455 )
...
* Update whisper.cpp
* fix: trim function
* feat: added flag to split on word
* fix: arguments for main
2 years ago
Georgi Gerganov
b2083c5d02
release : v1.2.0
2 years ago
Georgi Gerganov
f3ee4a9673
whisper : reduce memory usage during inference ( #431 )
...
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
2 years ago
Qianhe Chen
c306a7fd89
addon.node : using whisper as a Node.js addon ( #443 )
...
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
2 years ago
polarmoon
b2fc4c7010
go : support "auto" as an option when set language ( #462 )
...
Co-authored-by: Ming <ming@localhost>
2 years ago
Georgi Gerganov
291980369c
whisper : suppress task tokens ( #442 )
2 years ago
Taisei Mima
86ef64a855
wasm : fix typo in helper.js ( #459 )
2 years ago
Alex Bacart
3b1960520a
main : CSV format export trimmed spaces fix ( #444 )
...
* Update main.cpp
Removed string trimming
* Update main.cpp
* Update main.cpp
* Revert "Update main.cpp"
This reverts commit d8924fdcfe
.
* Revert "Update main.cpp"
This reverts commit 252e508d85
.
2 years ago
Lukas Rist
2bee2650c6
go : add wrapper for system info ( #456 )
2 years ago
Robin
beb9512be3
go : add WhisperLangAutoDetect method to go binding ( #451 )
2 years ago
Eric Tendian
47737b2e82
livestream.sh : run main with model arg instead of default ( #453 )
...
Actually utilizes the $model var when calling ./main.
2 years ago
Georgi Gerganov
b992f3709e
whisper : do not provide past prompt when n_max_text_ctx == 0
2 years ago
Georgi Gerganov
60337f5306
wasm : check if navigator.storage.estimate() is available
...
Safari does not support it
2 years ago
Lukas Rist
02c7516c57
go : added wrappers to reset and print timings ( #436 )
2 years ago
Georgi Gerganov
411ea9b833
ci : run workflows on pull requests + bindings depend on .h ( #446 )
2 years ago
Ondrej Kokes
11f61cecd6
whisper.wasm : add labels for easier radio selection ( #435 )
2 years ago
Georgi Gerganov
b5ddb16ec7
whisper : condition timestamps to be monotonically increasing ( #425 )
2 years ago
fitzsim
ae16c21e9c
whisper : PPC64 big-endian support ( #398 )
...
* ggml : set cache line size to 128 on POWER9
* whisper : add PPC64 big endian support
2 years ago
Georgi Gerganov
2c3f50a021
release : v1.1.1
2 years ago
Georgi Gerganov
9a65269a20
.gitignore : add arm_neon.h
2 years ago
Georgi Gerganov
78f166174f
whisper : fix condition for providing past prompt (critical)
...
This bug has been present since v1.1.0.
Effectively, the past transcribed text wasn't being used for following
transcriptions, which likely significantly reduces the transcription
quality.
Likely related to #419
2 years ago
Georgi Gerganov
21c569ba4a
whisper : extend information in whisper_print_timings()
2 years ago
Georgi Gerganov
1a91c19af9
whisper : perform entropy check only when we have at least 32 tokens ( #412 )
2 years ago
Georgi Gerganov
f583e2d2f5
main : we had accidentally disabled the temperature fallback .. ( #291 )
2 years ago
Georgi Gerganov
206fc93396
whisper.wasm : add small and small.en models
2 years ago
Georgi Gerganov
a6cf6f4c4a
bench : minor fixes
2 years ago
Chia-Hsiang Cheng
472a473fd1
main : add an option to accept optional output filenames ( #424 )
...
* Add an option to accept optional output filenames
* Format the file
Co-authored-by: Chia-Hsiang Cheng <gary.chiahsiang.cheng@gmail.com>
2 years ago
Georgi Gerganov
9ba66c2fad
stream : fix handling of --step == --length ( #416 )
2 years ago
Georgi Gerganov
1ccb8a46a5
bench : fix Windows linkage by moving ggml benches in whisper lib ..
2 years ago
Georgi Gerganov
1290fc6457
bench : add memcpy and ggml_mul_mat benchmarks
2 years ago
Digipom
49b529ba74
whisper.android : add support for loading directly from asset in C ( #415 )
2 years ago
Georgi Gerganov
8088a977af
whisper : fix possible uninitialized variables ( #291 )
2 years ago