sandrohanea
59fdcd19c8
whisper : add whisper_state + default state on the whisper_context ( #523 )
...
* Added whisper state + default state on the whisper_context
* Fixed some examples and bindings
* Fixed whisper_n_len (which was used in some binding) and added whisper_n_len_from_state
* Fixed comments
* whisper : reuse kv_cache_free() and fix compiler warnings
* whisper : clean-up the API comments
---------
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
1 year ago
Georgi Gerganov
478289a4b3
whisper : set no_context == true by default ( #537 )
1 year ago
polarmoon
5e94129cb2
go : NewContext now returns a clean context ( #537 )
...
Co-authored-by: Ming <ming@localhost>
1 year ago
HY. Kelvin Lee
72af0f5697
main : add csv header ( #552 )
1 year ago
Georgi Gerganov
af005d573f
make : add -DNDEBUG compile flag
1 year ago
Georgi Gerganov
ad1389003d
release : v1.2.1
1 year ago
FlippFuzz
f420de1322
make : add "-mcpu=native" when building for aarch64 ( #532 )
1 year ago
Aaron Pham
d176160f6f
readme : add pybind11 bindings ( #538 )
1 year ago
Georgi Gerganov
ca21f7ab16
readme : add cython bindings ( #9 )
1 year ago
Georgi Gerganov
373043cabe
whisper : zero-initialize some more context variables
...
Just in case
1 year ago
Finn Voorhees
fb4d0d470f
whisper : fix uninitialized exp_n_audio_ctx
1 year ago
Georgi Gerganov
0d229163bb
whisper : add API for applying custom logits filters during decoding
1 year ago
Georgi Gerganov
f254e78737
yt-wsp.sh : print help on empty args
1 year ago
Georgi Gerganov
a94897bcde
whisper : by default disable non-speech tokens suppression ( #473 )
...
This seems to be causing hallucinations in the end of the audio, e.g.:
"Thank you for listening"
"Amen"
..
1 year ago
Georgi Gerganov
2407ae8ef0
readme : add Ruby discussion + update .NET discussion
1 year ago
Todd
b623ca43b1
bindings : add Ruby ( #500 )
...
* adding ruby bindings
* avoid adding these they are copied in via extconf.rb
* ignore these files here
* add definitions for boolean params
* initial transcribe for ruby
* use en model and transcribe jfk with assertion
* possibly this works for building ruby binding
* ci : try to add ruby workflow
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
1 year ago
conradg
69e6e4644a
main : fix std in input ( #503 )
...
if we don't add this as an explicit check, then we get an "error: unknown argument: -" later on
1 year ago
Georgi Gerganov
09d7d2b68e
examples : refactor in order to reuse code and reduce duplication ( #482 )
...
* examples : refactor common code into a library
* examples : refactor common SDL code into a library
* make : update Makefile to use common libs
* common : fix MSVC M_PI ..
* addon.node : link common lib
1 year ago
shikokuchuo
0336161b7d
whisper : fix signedness compiler warning ( #506 )
1 year ago
genevera (she/her)
459753342d
yt-wsp.sh : add unique filename generation ( #495 )
...
Co-authored-by: genevera <genevera@noreply.users.github.com>
1 year ago
Georgi Gerganov
9764782bd9
readme : add another .NET repo ( #303 )
1 year ago
Georgi Gerganov
3b010f9bed
readme : add .NET repo ( #303 )
1 year ago
Avik Sengupta
113fcec513
cmake : install whisper.h header ( #485 )
...
Including the header file in the install bundle helps projects that ship binaries.
1 year ago
shibukazu
cfc06bf8df
whisper : suppress non-speech-related token outputs ( #473 )
...
* add non-speech-token suppression
* add suppress non-speech_tokens param
1 year ago
sandrohanea
2bfe0ebc0f
whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder ( #474 )
...
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com>
1 year ago
boolemancer
4dd7119deb
whisper : only trim if split_on_word is true ( #476 )
1 year ago
Qianhe Chen
ab1916fc59
ci : add node addon test and optimize compilation configuration ( #468 )
...
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
* ci: add addon.node test ci
* addon: remove build WHISPER_BUILD_TESTS
* addon: update build command
* addon: add test
* addon: add test file
* addon: adapt to compile on Windows
* addon: fix typo
* addon: reuse jfk.wav
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* addon: reuse jfk.wav
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
1 year ago
kamranjon
a1c1583cc7
whisper : add whisper_full_lang_id() for getting the context lang ( #461 )
1 year ago
Matija Pevec
d012b5c7e4
whisper : add "split_on_word" flag when using using "max_len" option ( #455 )
...
* Update whisper.cpp
* fix: trim function
* feat: added flag to split on word
* fix: arguments for main
1 year ago
Georgi Gerganov
b2083c5d02
release : v1.2.0
1 year ago
Georgi Gerganov
f3ee4a9673
whisper : reduce memory usage during inference ( #431 )
...
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
1 year ago
Qianhe Chen
c306a7fd89
addon.node : using whisper as a Node.js addon ( #443 )
...
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
1 year ago
polarmoon
b2fc4c7010
go : support "auto" as an option when set language ( #462 )
...
Co-authored-by: Ming <ming@localhost>
1 year ago
Georgi Gerganov
291980369c
whisper : suppress task tokens ( #442 )
1 year ago
Taisei Mima
86ef64a855
wasm : fix typo in helper.js ( #459 )
1 year ago
Alex Bacart
3b1960520a
main : CSV format export trimmed spaces fix ( #444 )
...
* Update main.cpp
Removed string trimming
* Update main.cpp
* Update main.cpp
* Revert "Update main.cpp"
This reverts commit d8924fdcfe
.
* Revert "Update main.cpp"
This reverts commit 252e508d85
.
1 year ago
Lukas Rist
2bee2650c6
go : add wrapper for system info ( #456 )
1 year ago
Robin
beb9512be3
go : add WhisperLangAutoDetect method to go binding ( #451 )
1 year ago
Eric Tendian
47737b2e82
livestream.sh : run main with model arg instead of default ( #453 )
...
Actually utilizes the $model var when calling ./main.
1 year ago
Georgi Gerganov
b992f3709e
whisper : do not provide past prompt when n_max_text_ctx == 0
1 year ago
Georgi Gerganov
60337f5306
wasm : check if navigator.storage.estimate() is available
...
Safari does not support it
1 year ago
Lukas Rist
02c7516c57
go : added wrappers to reset and print timings ( #436 )
1 year ago
Georgi Gerganov
411ea9b833
ci : run workflows on pull requests + bindings depend on .h ( #446 )
1 year ago
Ondrej Kokes
11f61cecd6
whisper.wasm : add labels for easier radio selection ( #435 )
1 year ago
Georgi Gerganov
b5ddb16ec7
whisper : condition timestamps to be monotonically increasing ( #425 )
1 year ago
fitzsim
ae16c21e9c
whisper : PPC64 big-endian support ( #398 )
...
* ggml : set cache line size to 128 on POWER9
* whisper : add PPC64 big endian support
1 year ago
Georgi Gerganov
2c3f50a021
release : v1.1.1
1 year ago
Georgi Gerganov
9a65269a20
.gitignore : add arm_neon.h
1 year ago
Georgi Gerganov
78f166174f
whisper : fix condition for providing past prompt (critical)
...
This bug has been present since v1.1.0.
Effectively, the past transcribed text wasn't being used for following
transcriptions, which likely significantly reduces the transcription
quality.
Likely related to #419
1 year ago
Georgi Gerganov
21c569ba4a
whisper : extend information in whisper_print_timings()
1 year ago