157 Commits (59fdcd19c8b24ec6d0bdfab9847ca66c805ed831)

Author SHA1 Message Date
Georgi Gerganov fba10a4c68 whisper : language auto-detect (#59)
2 years ago
Georgi Gerganov 32fbc8cd04 main : add option to print the progress (#276)
2 years ago
Georgi Gerganov b8065d90f5 main : add "--prompt" command line argument (#90)
2 years ago
Georgi Gerganov 4312995974 command : better indentation
2 years ago
Georgi Gerganov 5eeeb3412d command : update README, show how to use guided mode
2 years ago
Georgi Gerganov 6a69e3ae27 command : adding guided mode
2 years ago
Georgi Gerganov ea19ed33f1 Update README.md (#46)
2 years ago
Digipom 675e787171 Add Android sample (#277)
2 years ago
Georgi Gerganov a82d331034 stream : update README.md + comments
2 years ago
Georgi Gerganov 5a5c5ddcca Update README.md
2 years ago
Georgi Gerganov 34e0b4b9ef stream : fix build
2 years ago
Georgi Gerganov b0f8013eb9 stream : add sliding window mode
2 years ago
Georgi Gerganov a613f16aec talk : improve prompting
2 years ago
Georgi Gerganov f309f97df6 Node.js package (#260)
2 years ago
Georgi Gerganov aa6adda26e talk : make compatible with c++11 (part 2)
2 years ago
Georgi Gerganov 444349f4ec talk : make compatible with c++11
2 years ago
Lexevolution 6ed786957e Add newline per segment for text output (#254)
2 years ago
Georgi Gerganov fcf515de60 bench.wasm : same as "bench" but runs in the browser (#89)
2 years ago
Georgi Gerganov 85c9ac18b5 Update README.md
2 years ago
Georgi Gerganov b7c85d1ea6 talk : fix build for MSVC
2 years ago
Georgi Gerganov 3b1aacbe6d talk : talk with AI in the terminal
2 years ago
Georgi Gerganov 56822621a8 twitch.sh : various fixes and polishing
2 years ago
keyehzy 9e5f3ddc16 Allow for Twitch.tv live transcription
2 years ago
Georgi Gerganov 47afb93c3c yt-wsp.sh : improve usage instructions
2 years ago
Georgi Gerganov 575c53dc41 yt-wsp.sh : fix usage instruction + comment
2 years ago
Georgi Gerganov faa85f9840 livestream.sh : remove obsolete comment
2 years ago
Georgi Gerganov 9fe7306f4b models : add the new "large" model release by OpenAI
2 years ago
Georgi Gerganov 57e0e6b700 livestream : handle ffmpeg errors gracefully and stabilize transcript
2 years ago
Georgi Gerganov 4f7363077f livestream : minor changes
2 years ago
semiformal-net 093c840dee livestream : fix losing words across audio chunk (#195)
2 years ago
Georgi Gerganov 4698dcdb52 whisper : add mechanism for aborting the whisper_full() computation
2 years ago
Georgi Gerganov 164df0d447 whisper.objc : fix context + broken readme links
2 years ago
Georgi Gerganov e266cb0723 whisper.objc : add real-time processing (#97)
2 years ago
Georgi Gerganov c207eed431 whisper.objc : fix build warnings
2 years ago
Georgi Gerganov a425365b82 yt-wsp.sh : script to easily transcribe VODs
2 years ago
Georgi Gerganov 68ecadbbc9 command.wasm : add voice assistant example for the Web (#171)
2 years ago
Georgi Gerganov c536ff4005 minor : add comment for using "generate_karaoke.sh"
2 years ago
Georgi Gerganov cb70b07db5 livestream.sh : simple tool to transcribe audio livestreams (#185)
2 years ago
Georgi Gerganov 3c390ffe38 stream.wasm : add web-based real-time transcription (#112)
2 years ago
Georgi Gerganov be16dfa038 whisper.wasm : do not block page while processing (close #86)
2 years ago
Georgi Gerganov 0f619b52ce main : add stereo-channel-based diarization (#64)
2 years ago
Georgi Gerganov 1246dd023e command : add demonstration video
2 years ago
Georgi Gerganov 0be27bbd92 command : fix build + fix README + add bold printing
2 years ago
Georgi Gerganov bc88eb13c6 examples : add "command" tool (#171)
2 years ago
Georgi Gerganov b8ce25dec1 refactoring : more readable code
2 years ago
Georgi Gerganov e4805d9601 wasm : refactor wasm example + reuse fetch mechanism
2 years ago
Georgi Gerganov ff36415a86 talk.wasm : update video link + some minor fixes
2 years ago
Georgi Gerganov 025ff465b6 Update README.md
2 years ago
Georgi Gerganov abce28ea99 talk.wasm : move to https://whisper.ggerganov.com/talk
2 years ago
Georgi Gerganov 454b91de16 main : fix dangling pointer when using stdin for input (#65)
2 years ago
Georgi Gerganov d7024cf9dc main, stream : remove --verbose flag (#178)
2 years ago
Georgi Gerganov 37422ed733 talk.wasm : add audio pre-processing + bump memory
2 years ago
Georgi Gerganov be3b720f96 talk.wasm : refactoring + update README.md
2 years ago
Georgi Gerganov 49706a658a minor : updates few prints + fix buttons in whisper.wasm
2 years ago
Georgi Gerganov e5dcdabbb8 unicode : fix character replacement (thanks to @tamo)
2 years ago
Georgi Gerganov dad109c3f1 close #109 : add fetching of the model over HTTP (whisper.wasm)
2 years ago
Georgi Gerganov 326573de9a talk.wasm : final touches
2 years ago
Georgi Gerganov 9aea96f774 talk.wasm : polishing + adding many AI personalities
2 years ago
Georgi Gerganov 385236d1d3 stream : "-kc" now enables context keeping from previous segment (#90)
2 years ago
M. Eren Akbiyik 63ae03b8e0 Prompt previous tokens for streaming (#163)
2 years ago
Georgi Gerganov 78116f8eda talk.wasm : update README.md
2 years ago
Georgi Gerganov a4dfbeecf9 talk.wasm : GPT-2 meets Whisper in WebAssembly (#155)
2 years ago
Georgi Gerganov f2df9bd768 stream : add "max_tokens" cli arg
2 years ago
Georgi Gerganov fb8d77f760 stream : add "audio_ctx" parameter
2 years ago
Georgi Gerganov 62b5ff875c stream : add "max_tokens" parameter
2 years ago
Georgi Gerganov d351771a4b stream : add "single_segment" option
2 years ago
Georgi Gerganov c058aaf22e stream : partial encoder experiments
2 years ago
Georgi Gerganov 83c742f1a7 whisper : add option to speed up the audio tempo by x2
2 years ago
Alan 7519eabf65 Adds support for stdin wav input
2 years ago
Georgi Gerganov c30bffc8a5 ref #22 : add "duration" option
2 years ago
Georgi Gerganov c71363f14c examples : add simple script for generating Karaoke video
2 years ago
Georgi Gerganov d42cf6d0df Update README.md
2 years ago
Georgi Gerganov ef47d77492 main : fix generated bash script
2 years ago
Georgi Gerganov d5afebd37c whisper : token-level timestamp refactoring (#49, #120)
2 years ago
Georgi Gerganov 6fb98370ba main : add some comments for the word-level timestamp algorithm
2 years ago
Georgi Gerganov 0729da9a3b main : fix some edge cases for word-level timestamps
2 years ago
Georgi Gerganov 5dc74e3aff Update README.md
2 years ago
Georgi Gerganov ac8ef34039 Update README.md
2 years ago
Georgi Gerganov dc12994603 Update README.md
2 years ago
Georgi Gerganov 57fb46f307 main : add option for word-leve timestamps (very experimental)
2 years ago
Georgi Gerganov 5a9e4260a6 stream : add "--capture" option to select capture device (ref #10)
2 years ago
Georgi Gerganov 12fb303d9d whisper.wasm : update system info print
2 years ago
Georgi Gerganov 2827cbbbe8 main : merge parallel example in main
2 years ago
Georgi Gerganov 0b2dc3c82c parallel : working
2 years ago
Georgi Gerganov 85d6e1e1e7 main : fix sampling time + add max_context parameter
2 years ago
Georgi Gerganov 72e9cdd6bf parallel : adding tool for parallel transformer inference
2 years ago
Georgi Gerganov b89f8960ca Update README.md
2 years ago
Georgi Gerganov 6f82320b05 Create README.md
2 years ago
Georgi Gerganov 2298310dd8 whisper.nvim : add helper script for the Neovim integration
2 years ago
Georgi Gerganov 8347a7bb6a stream : few updates to make it compatible for Vim usage (#99)
2 years ago
Georgi Gerganov ebb01b9e33 Print system info at start of program
2 years ago
Georgi Gerganov 2400660f3f Print system info in main
2 years ago
Georgi Gerganov a6c786d5dc Update README.md
2 years ago
Georgi Gerganov 91dcf5f35b Update README.md
2 years ago
Georgi Gerganov 113a4f06d8 Update README.md
2 years ago
Georgi Gerganov 47e78b7288 Update README.md
2 years ago
Georgi Gerganov 34bb3ab0cf ggml : add system info functions
2 years ago
Georgi Gerganov c6710efde2 refactoring : move main + stream in examples + other stuff
2 years ago
Georgi Gerganov d4f94ce427 Update README.md
2 years ago
Georgi Gerganov a52ee08c1e objc : polishing the sample application
2 years ago