Georgi Gerganov
78d13257be
Try to improve the token sampling strategy ( #193 )
...
* whisper : try to improve the token sampling strategy
- Add the "max_initial_timestaamp" token logic from OpenAI
- Disallow sampling timestamps that are in the past
* whisper : fix the max initial timestamp logic + fallback decoding
2 years ago
Georgi Gerganov
9b7df68753
tests : adding transcription tests
2 years ago
Georgi Gerganov
061fc81bd6
ggml : remove inline specifier from fp16 <-> fp32 converters
2 years ago
Georgi Gerganov
57e0e6b700
livestream : handle ffmpeg errors gracefully and stabilize transcript
2 years ago
Georgi Gerganov
4f7363077f
livestream : minor changes
2 years ago
semiformal-net
093c840dee
livestream : fix losing words across audio chunk ( #195 )
...
* improve livestream script
* Update examples/livestream.sh
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: Paul Edwards <paul.edwards@semiformal.net>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
Tienshiao Ma
e7f09a0a61
Fix Darwin flags - was incorrectly always using the Linux else clause
2 years ago
Georgi Gerganov
4698dcdb52
whisper : add mechanism for aborting the whisper_full() computation
2 years ago
Georgi Gerganov
6fd5358dd0
Update README.md
2 years ago
Georgi Gerganov
164df0d447
whisper.objc : fix context + broken readme links
2 years ago
Georgi Gerganov
e266cb0723
whisper.objc : add real-time processing ( #97 )
...
Similar to the "stream" app
2 years ago
Georgi Gerganov
c207eed431
whisper.objc : fix build warnings
2 years ago
Georgi Gerganov
67e819baf4
minor : remove "examples/" prefix from the README
2 years ago
Georgi Gerganov
a425365b82
yt-wsp.sh : script to easily transcribe VODs
...
Thanks to @DaniruKun
ref: https://gist.github.com/DaniruKun/96f763ec1a037cc92fe1a059b643b818
Usage:
cd whisper.cpp
make
./examples/yt-wsp.sh <video-url>
2 years ago
Georgi Gerganov
e0e864d9ca
Update README.md
2 years ago
Georgi Gerganov
68ecadbbc9
command.wasm : add voice assistant example for the Web ( #171 )
...
Same as the command-line tool "command", but runs in the browser
Also, added helper script "extra/deploy-wasm.sh" and fixed some timing
constants for the WASM examples.
2 years ago
Georgi Gerganov
c536ff4005
minor : add comment for using "generate_karaoke.sh"
2 years ago
Georgi Gerganov
cb70b07db5
livestream.sh : simple tool to transcribe audio livestreams ( #185 )
2 years ago
Georgi Gerganov
3c390ffe38
stream.wasm : add web-based real-time transcription ( #112 )
2 years ago
Georgi Gerganov
be16dfa038
whisper.wasm : do not block page while processing ( close #86 )
2 years ago
Georgi Gerganov
0f619b52ce
main : add stereo-channel-based diarization ( #64 )
...
Not tested - I don't have stereo dialog audio
2 years ago
Georgi Gerganov
1246dd023e
command : add demonstration video
2 years ago
Georgi Gerganov
0be27bbd92
command : fix build + fix README + add bold printing
2 years ago
Georgi Gerganov
bc88eb13c6
examples : add "command" tool ( #171 )
2 years ago
Georgi Gerganov
b8ce25dec1
refactoring : more readable code
2 years ago
vicalloy
fd113687aa
correct model name display on running samples
2 years ago
Georgi Gerganov
e4805d9601
wasm : refactor wasm example + reuse fetch mechanism
2 years ago
Georgi Gerganov
ff36415a86
talk.wasm : update video link + some minor fixes
2 years ago
Georgi Gerganov
025ff465b6
Update README.md
...
Use a less cringy video to demo talk.wasm lol
2 years ago
Georgi Gerganov
2c0501b38a
Update README.md
2 years ago
Georgi Gerganov
abce28ea99
talk.wasm : move to https://whisper.ggerganov.com/talk
...
This way, we can share the same models across different WASM examples
and not have to download them for each page
2 years ago
Georgi Gerganov
a2ecd54455
models : add instructions for using HF fine-tuned models
2 years ago
Georgi Gerganov
128aaadb93
whisper : improve printfs
2 years ago
Georgi Gerganov
454b91de16
main : fix dangling pointer when using stdin for input ( #65 )
2 years ago
Georgi Gerganov
d7024cf9dc
main, stream : remove --verbose flag ( #178 )
2 years ago
Georgi Gerganov
37422ed733
talk.wasm : add audio pre-processing + bump memory
2 years ago
Georgi Gerganov
be3b720f96
talk.wasm : refactoring + update README.md
2 years ago
Georgi Gerganov
00f46dbc1d
models : add usage comments to the HF convert script ( #157 )
2 years ago
Georgi Gerganov
5698bddbc9
models : fix HF fine-tuned model conversion script ( #157 )
...
It works now
2 years ago
Georgi Gerganov
388e9f79ad
ggml : fix the fix
2 years ago
Georgi Gerganov
35cd29ce1f
ggml : fix cross-compile Linux -> Window with mingw ( #168 )
2 years ago
Georgi Gerganov
a156a358ca
Revert "update README.md"
...
This reverts commit 6a84147113
.
2 years ago
katsu560
6a84147113
update README.md
2 years ago
katsu560
804f36aa2c
ggml: change inline ggml_fp16_to_fp32, ggml_fp16_t ggml_fp32_to_fp16
2 years ago
katsu560
4b2f51b479
add gprof option
2 years ago
katsu560
800ae5b808
fix AVX,AVX2,FMA,F16C detection on Linux and add flags for OpenBLAS
2 years ago
katsu560
83456076f0
add AVX support
2 years ago
Tamotsu Takahashi
3df6c14fca
Build with OpenBLAS and SDL2 on windows
2 years ago
Georgi Gerganov
d64d6ca3fd
models : minor changes to the HF convert script ( #157 )
2 years ago
Georgi Gerganov
93482d0373
models : add "convert-h5-to-ggml.py" script ( #157 )
...
Converts transformers models to ggml.
Although the conversion is successful, it does not work for some reason.
Not sure why
2 years ago