Commit Graph

288 Commits (9e5f3ddc166ab9354abe12498ef54bb49a30bbe6)
 

Author SHA1 Message Date
keyehzy 9e5f3ddc16 Allow for Twitch.tv live transcription
2 years ago
Kartik Saranathan d91c001120 Fix paths echoed after the download
2 years ago
Al Hoang 04a16bbf11 fix compilation on haiku
2 years ago
Georgi Gerganov 47afb93c3c
yt-wsp.sh : improve usage instructions
2 years ago
Georgi Gerganov 575c53dc41
yt-wsp.sh : fix usage instruction + comment
2 years ago
Georgi Gerganov 3996ecc156
Update README.md
2 years ago
Georgi Gerganov faa85f9840 livestream.sh : remove obsolete comment
2 years ago
Georgi Gerganov b6597539f9
ggml : fix typo in previous commit
2 years ago
Georgi Gerganov 9a4b7a916e
ggml : use macros to inline FP16 <-> FP32 conversions
2 years ago
Georgi Gerganov f8ec718b76
ggml : add F16C CPU flag check
2 years ago
katsu560 35b40a93b9 add fp16/fp32 convert intrinsics
2 years ago
Georgi Gerganov 9fe7306f4b
models : add the new "large" model release by OpenAI
2 years ago
Georgi Gerganov 13e8eb2346
bench : add commit hash to bench-all.sh results
2 years ago
Georgi Gerganov 78d13257be
Try to improve the token sampling strategy (#193)
2 years ago
Georgi Gerganov 9b7df68753
tests : adding transcription tests
2 years ago
Georgi Gerganov 061fc81bd6
ggml : remove inline specifier from fp16 <-> fp32 converters
2 years ago
Georgi Gerganov 57e0e6b700
livestream : handle ffmpeg errors gracefully and stabilize transcript
2 years ago
Georgi Gerganov 4f7363077f
livestream : minor changes
2 years ago
semiformal-net 093c840dee
livestream : fix losing words across audio chunk (#195)
2 years ago
Tienshiao Ma e7f09a0a61 Fix Darwin flags - was incorrectly always using the Linux else clause
2 years ago
Georgi Gerganov 4698dcdb52 whisper : add mechanism for aborting the whisper_full() computation
2 years ago
Georgi Gerganov 6fd5358dd0
Update README.md
2 years ago
Georgi Gerganov 164df0d447
whisper.objc : fix context + broken readme links
2 years ago
Georgi Gerganov e266cb0723
whisper.objc : add real-time processing (#97)
2 years ago
Georgi Gerganov c207eed431
whisper.objc : fix build warnings
2 years ago
Georgi Gerganov 67e819baf4
minor : remove "examples/" prefix from the README
2 years ago
Georgi Gerganov a425365b82
yt-wsp.sh : script to easily transcribe VODs
2 years ago
Georgi Gerganov e0e864d9ca
Update README.md
2 years ago
Georgi Gerganov 68ecadbbc9
command.wasm : add voice assistant example for the Web (#171)
2 years ago
Georgi Gerganov c536ff4005
minor : add comment for using "generate_karaoke.sh"
2 years ago
Georgi Gerganov cb70b07db5
livestream.sh : simple tool to transcribe audio livestreams (#185)
2 years ago
Georgi Gerganov 3c390ffe38
stream.wasm : add web-based real-time transcription (#112)
2 years ago
Georgi Gerganov be16dfa038
whisper.wasm : do not block page while processing (close #86)
2 years ago
Georgi Gerganov 0f619b52ce
main : add stereo-channel-based diarization (#64)
2 years ago
Georgi Gerganov 1246dd023e
command : add demonstration video
2 years ago
Georgi Gerganov 0be27bbd92
command : fix build + fix README + add bold printing
2 years ago
Georgi Gerganov bc88eb13c6
examples : add "command" tool (#171)
2 years ago
Georgi Gerganov b8ce25dec1
refactoring : more readable code
2 years ago
vicalloy fd113687aa correct model name display on running samples
2 years ago
Georgi Gerganov e4805d9601
wasm : refactor wasm example + reuse fetch mechanism
2 years ago
Georgi Gerganov ff36415a86
talk.wasm : update video link + some minor fixes
2 years ago
Georgi Gerganov 025ff465b6
Update README.md
2 years ago
Georgi Gerganov 2c0501b38a
Update README.md
2 years ago
Georgi Gerganov abce28ea99
talk.wasm : move to https://whisper.ggerganov.com/talk
2 years ago
Georgi Gerganov a2ecd54455
models : add instructions for using HF fine-tuned models
2 years ago
Georgi Gerganov 128aaadb93
whisper : improve printfs
2 years ago
Georgi Gerganov 454b91de16
main : fix dangling pointer when using stdin for input (#65)
2 years ago
Georgi Gerganov d7024cf9dc
main, stream : remove --verbose flag (#178)
2 years ago
Georgi Gerganov 37422ed733
talk.wasm : add audio pre-processing + bump memory
2 years ago
Georgi Gerganov be3b720f96
talk.wasm : refactoring + update README.md
2 years ago