Georgi Gerganov
fa9d43181f
readme : add bench-wts.sh demo
2 years ago
Georgi Gerganov
bb6b54a03d
bench-wts.sh : rename script + add execute permission
2 years ago
venkr
b597c5a779
qual-bench.sh : add quality comparison tool, and update main.cpp to allow using a font file ( #569 )
2 years ago
Takeshi Inoue
a3fb6c507f
whisper.android : enable fp16 intrinsics (FP16_VA), which are supported by ARMv8.2 or later ( #572 )
2 years ago
sandrohanea
59fdcd19c8
whisper : add whisper_state + default state on the whisper_context ( #523 )
* Added whisper state + default state on the whisper_context
* Fixed some examples and bindings
* Fixed whisper_n_len (which was used in some binding) and added whisper_n_len_from_state
* Fixed comments
* whisper : reuse kv_cache_free() and fix compiler warnings
* whisper : clean-up the API comments
---------
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
Georgi Gerganov
478289a4b3
whisper : set no_context == true by default ( #537 )
2 years ago
polarmoon
5e94129cb2
go : NewContext now returns a clean context ( #537 )
Co-authored-by: Ming <ming@localhost>
2 years ago
HY. Kelvin Lee
72af0f5697
main : add csv header ( #552 )
2 years ago
Georgi Gerganov
af005d573f
make : add -DNDEBUG compile flag
2 years ago
Georgi Gerganov
ad1389003d
release : v1.2.1
2 years ago
FlippFuzz
f420de1322
make : add "-mcpu=native" when building for aarch64 ( #532 )
2 years ago
Aaron Pham
d176160f6f
readme : add pybind11 bindings ( #538 )
2 years ago
Georgi Gerganov
ca21f7ab16
readme : add cython bindings ( #9 )
2 years ago
Georgi Gerganov
373043cabe
whisper : zero-initialize some more context variables
Just in case
2 years ago
Finn Voorhees
fb4d0d470f
whisper : fix uninitialized exp_n_audio_ctx
2 years ago
Georgi Gerganov
0d229163bb
whisper : add API for applying custom logits filters during decoding
2 years ago
Georgi Gerganov
f254e78737
yt-wsp.sh : print help on empty args
2 years ago
Georgi Gerganov
a94897bcde
whisper : by default disable non-speech tokens suppression ( #473 )
This seems to be causing hallucinations at the end of the audio, e.g.:
"Thank you for listening"
"Amen"
..
2 years ago
Georgi Gerganov
2407ae8ef0
readme : add Ruby discussion + update .NET discussion
2 years ago
Todd
b623ca43b1
bindings : add Ruby ( #500 )
* adding ruby bindings
* avoid adding these they are copied in via extconf.rb
* ignore these files here
* add definitions for boolean params
* initial transcribe for ruby
* use en model and transcribe jfk with assertion
* possibly this works for building ruby binding
* ci : try to add ruby workflow
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
conradg
69e6e4644a
main : fix std in input ( #503 )
if we don't add this as an explicit check, then we get an "error: unknown argument: -" later on
2 years ago
Georgi Gerganov
09d7d2b68e
examples : refactor in order to reuse code and reduce duplication ( #482 )
* examples : refactor common code into a library
* examples : refactor common SDL code into a library
* make : update Makefile to use common libs
* common : fix MSVC M_PI ..
* addon.node : link common lib
2 years ago
shikokuchuo
0336161b7d
whisper : fix signedness compiler warning ( #506 )
2 years ago
genevera (she/her)
459753342d
yt-wsp.sh : add unique filename generation ( #495 )
Co-authored-by: genevera <genevera@noreply.users.github.com>
2 years ago
Georgi Gerganov
9764782bd9
readme : add another .NET repo ( #303 )
2 years ago
Georgi Gerganov
3b010f9bed
readme : add .NET repo ( #303 )
2 years ago
Avik Sengupta
113fcec513
cmake : install whisper.h header ( #485 )
Including the header file in the install bundle helps projects that ship binaries.
2 years ago
shibukazu
cfc06bf8df
whisper : suppress non-speech-related token outputs ( #473 )
* add non-speech-token suppression
* add suppress non-speech_tokens param
2 years ago
sandrohanea
2bfe0ebc0f
whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder ( #474 )
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com>
2 years ago
boolemancer
4dd7119deb
whisper : only trim if split_on_word is true ( #476 )
2 years ago
Qianhe Chen
ab1916fc59
ci : add node addon test and optimize compilation configuration ( #468 )
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
* ci: add addon.node test ci
* addon: remove build WHISPER_BUILD_TESTS
* addon: update build command
* addon: add test
* addon: add test file
* addon: adapt to compile on Windows
* addon: fix typo
* addon: reuse jfk.wav
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* addon: reuse jfk.wav
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
kamranjon
a1c1583cc7
whisper : add whisper_full_lang_id() for getting the context lang ( #461 )
2 years ago
Matija Pevec
d012b5c7e4
whisper : add "split_on_word" flag when using "max_len" option ( #455 )
* Update whisper.cpp
* fix: trim function
* feat: added flag to split on word
* fix: arguments for main
2 years ago
Georgi Gerganov
b2083c5d02
release : v1.2.0
2 years ago
Georgi Gerganov
f3ee4a9673
whisper : reduce memory usage during inference ( #431 )
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
2 years ago
Qianhe Chen
c306a7fd89
addon.node : using whisper as a Node.js addon ( #443 )
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
2 years ago
polarmoon
b2fc4c7010
go : support "auto" as an option when set language ( #462 )
Co-authored-by: Ming <ming@localhost>
2 years ago
Georgi Gerganov
291980369c
whisper : suppress task tokens ( #442 )
2 years ago
Taisei Mima
86ef64a855
wasm : fix typo in helper.js ( #459 )
2 years ago
Alex Bacart
3b1960520a
main : CSV format export trimmed spaces fix ( #444 )
* Update main.cpp
Removed string trimming
* Update main.cpp
* Update main.cpp
* Revert "Update main.cpp"
This reverts commit d8924fdcfe.
* Revert "Update main.cpp"
This reverts commit 252e508d85.
2 years ago
Lukas Rist
2bee2650c6
go : add wrapper for system info ( #456 )
2 years ago
Robin
beb9512be3
go : add WhisperLangAutoDetect method to go binding ( #451 )
2 years ago
Eric Tendian
47737b2e82
livestream.sh : run main with model arg instead of default ( #453 )
Actually utilizes the $model var when calling ./main.
2 years ago
Georgi Gerganov
b992f3709e
whisper : do not provide past prompt when n_max_text_ctx == 0
2 years ago
Georgi Gerganov
60337f5306
wasm : check if navigator.storage.estimate() is available
Safari does not support it
2 years ago
Lukas Rist
02c7516c57
go : added wrappers to reset and print timings ( #436 )
2 years ago
Georgi Gerganov
411ea9b833
ci : run workflows on pull requests + bindings depend on .h ( #446 )
2 years ago
Ondrej Kokes
11f61cecd6
whisper.wasm : add labels for easier radio selection ( #435 )
2 years ago
Georgi Gerganov
b5ddb16ec7
whisper : condition timestamps to be monotonically increasing ( #425 )
2 years ago
fitzsim
ae16c21e9c
whisper : PPC64 big-endian support ( #398 )
* ggml : set cache line size to 128 on POWER9
* whisper : add PPC64 big endian support
2 years ago