Commit Graph

82 Commits (683f111088f27763443256294cebf29c2636064c)

Author SHA1 Message Date
Georgi Gerganov faa85f9840
livestream.sh : remove obsolete comment
2 years ago
Georgi Gerganov 9fe7306f4b
models : add the new "large" model release by OpenAI
2 years ago
Georgi Gerganov 57e0e6b700
livestream : handle ffmpeg errors gracefully and stabilize transcript
2 years ago
Georgi Gerganov 4f7363077f
livestream : minor changes
2 years ago
semiformal-net 093c840dee
livestream : fix losing words across audio chunk (#195)
2 years ago
Georgi Gerganov 4698dcdb52
whisper : add mechanism for aborting the whisper_full() computation
2 years ago
Georgi Gerganov 164df0d447
whisper.objc : fix context + broken readme links
2 years ago
Georgi Gerganov e266cb0723
whisper.objc : add real-time processing (#97)
2 years ago
Georgi Gerganov c207eed431
whisper.objc : fix build warnings
2 years ago
Georgi Gerganov a425365b82
yt-wsp.sh : script to easily transcribe VODs
2 years ago
Georgi Gerganov 68ecadbbc9
command.wasm : add voice assistant example for the Web (#171)
2 years ago
Georgi Gerganov c536ff4005
minor : add comment for using "generate_karaoke.sh"
2 years ago
Georgi Gerganov cb70b07db5
livestream.sh : simple tool to transcribe audio livestreams (#185)
2 years ago
Georgi Gerganov 3c390ffe38
stream.wasm : add web-based real-time transcription (#112)
2 years ago
Georgi Gerganov be16dfa038
whisper.wasm : do not block page while processing (close #86)
2 years ago
Georgi Gerganov 0f619b52ce
main : add stereo-channel-based diarization (#64)
2 years ago
Georgi Gerganov 1246dd023e
command : add demonstration video
2 years ago
Georgi Gerganov 0be27bbd92
command : fix build + fix README + add bold printing
2 years ago
Georgi Gerganov bc88eb13c6
examples : add "command" tool (#171)
2 years ago
Georgi Gerganov b8ce25dec1
refactoring : more readable code
2 years ago
Georgi Gerganov e4805d9601
wasm : refactor wasm example + reuse fetch mechanism
2 years ago
Georgi Gerganov ff36415a86
talk.wasm : update video link + some minor fixes
2 years ago
Georgi Gerganov 025ff465b6
Update README.md
2 years ago
Georgi Gerganov abce28ea99
talk.wasm : move to https://whisper.ggerganov.com/talk
2 years ago
Georgi Gerganov 454b91de16
main : fix dangling pointer when using stdin for input (#65)
2 years ago
Georgi Gerganov d7024cf9dc
main, stream : remove --verbose flag (#178)
2 years ago
Georgi Gerganov 37422ed733
talk.wasm : add audio pre-processing + bump memory
2 years ago
Georgi Gerganov be3b720f96
talk.wasm : refactoring + update README.md
2 years ago
Georgi Gerganov 49706a658a
minor : updates few prints + fix buttons in whisper.wasm
2 years ago
Georgi Gerganov e5dcdabbb8
unicode : fix character replacement (thanks to @tamo)
2 years ago
Georgi Gerganov dad109c3f1
close #109 : add fetching of the model over HTTP (whisper.wasm)
2 years ago
Georgi Gerganov 326573de9a
talk.wasm : final touches
2 years ago
Georgi Gerganov 9aea96f774
talk.wasm : polishing + adding many AI personalities
2 years ago
Georgi Gerganov 385236d1d3
stream : "-kc" now enables context keeping from previous segment (#90)
2 years ago
M. Eren Akbiyik 63ae03b8e0
Prompt previous tokens for streaming (#163)
2 years ago
Georgi Gerganov 78116f8eda
talk.wasm : update README.md
2 years ago
Georgi Gerganov a4dfbeecf9
talk.wasm : GPT-2 meets Whisper in WebAssembly (#155)
2 years ago
Georgi Gerganov f2df9bd768
stream : add "max_tokens" cli arg
2 years ago
Georgi Gerganov fb8d77f760
stream : add "audio_ctx" parameter
2 years ago
Georgi Gerganov 62b5ff875c
stream : add "max_tokens" parameter
2 years ago
Georgi Gerganov d351771a4b
stream : add "single_segment" option
2 years ago
Georgi Gerganov c058aaf22e
stream : partial encoder experiments
2 years ago
Georgi Gerganov 83c742f1a7
whisper : add option to speed up the audio tempo by x2
2 years ago
Alan 7519eabf65
Adds support for stdin wav input
2 years ago
Georgi Gerganov c30bffc8a5
ref #22 : add "duration" option
2 years ago
Georgi Gerganov c71363f14c
examples : add simple script for generating Karaoke video
2 years ago
Georgi Gerganov d42cf6d0df
Update README.md
2 years ago
Georgi Gerganov ef47d77492
main : fix generated bash script
2 years ago
Georgi Gerganov d5afebd37c
whisper : token-level timestamp refactoring (#49, #120)
2 years ago
Georgi Gerganov 6fb98370ba
main : add some comments for the word-level timestamp algorithm
2 years ago