Commit Graph

125 Commits (d61d55cd4b9fe77511c8eea28d0220ce552f7008)

Author SHA1 Message Date
Georgi Gerganov e266cb0723
whisper.objc : add real-time processing (#97)
2 years ago
Georgi Gerganov c207eed431
whisper.objc : fix build warnings
2 years ago
Georgi Gerganov a425365b82
yt-wsp.sh : script to easily transcribe VODs
2 years ago
Georgi Gerganov 68ecadbbc9
command.wasm : add voice assistant example for the Web (#171)
2 years ago
Georgi Gerganov c536ff4005
minor : add comment for using "generate_karaoke.sh"
2 years ago
Georgi Gerganov cb70b07db5
livestream.sh : simple tool to transcribe audio livestreams (#185)
2 years ago
Georgi Gerganov 3c390ffe38
stream.wasm : add web-based real-time transcription (#112)
2 years ago
Georgi Gerganov be16dfa038
whisper.wasm : do not block page while processing (close #86)
2 years ago
Georgi Gerganov 0f619b52ce
main : add stereo-channel-based diarization (#64)
2 years ago
Georgi Gerganov 1246dd023e
command : add demonstration video
2 years ago
Georgi Gerganov 0be27bbd92
command : fix build + fix README + add bold printing
2 years ago
Georgi Gerganov bc88eb13c6
examples : add "command" tool (#171)
2 years ago
Georgi Gerganov b8ce25dec1
refactoring : more readable code
2 years ago
Georgi Gerganov e4805d9601
wasm : refactor wasm example + reuse fetch mechanism
2 years ago
Georgi Gerganov ff36415a86
talk.wasm : update video link + some minor fixes
2 years ago
Georgi Gerganov 025ff465b6
Update README.md
2 years ago
Georgi Gerganov abce28ea99
talk.wasm : move to https://whisper.ggerganov.com/talk
2 years ago
Georgi Gerganov 454b91de16
main : fix dangling pointer when using stdin for input (#65)
2 years ago
Georgi Gerganov d7024cf9dc
main, stream : remove --verbose flag (#178)
2 years ago
Georgi Gerganov 37422ed733
talk.wasm : add audio pre-processing + bump memory
2 years ago
Georgi Gerganov be3b720f96
talk.wasm : refactoring + update README.md
2 years ago
Georgi Gerganov 49706a658a
minor : updates few prints + fix buttons in whisper.wasm
2 years ago
Georgi Gerganov e5dcdabbb8
unicode : fix character replacement (thanks to @tamo)
2 years ago
Georgi Gerganov dad109c3f1
close #109 : add fetching of the model over HTTP (whisper.wasm)
2 years ago
Georgi Gerganov 326573de9a
talk.wasm : final touches
2 years ago
Georgi Gerganov 9aea96f774
talk.wasm : polishing + adding many AI personalities
2 years ago
Georgi Gerganov 385236d1d3
stream : "-kc" now enables context keeping from previous segment (#90)
2 years ago
M. Eren Akbiyik 63ae03b8e0
Prompt previous tokens for streaming (#163)
2 years ago
Georgi Gerganov 78116f8eda
talk.wasm : update README.md
2 years ago
Georgi Gerganov a4dfbeecf9
talk.wasm : GPT-2 meets Whisper in WebAssembly (#155)
2 years ago
Georgi Gerganov f2df9bd768 stream : add "max_tokens" cli arg
2 years ago
Georgi Gerganov fb8d77f760 stream : add "audio_ctx" parameter
2 years ago
Georgi Gerganov 62b5ff875c stream : add "max_tokens" parameter
2 years ago
Georgi Gerganov d351771a4b stream : add "single_segment" option
2 years ago
Georgi Gerganov c058aaf22e stream : partial encoder experiments
2 years ago
Georgi Gerganov 83c742f1a7 whisper : add option to speed up the audio tempo by x2
2 years ago
Alan 7519eabf65 Adds support for stdin wav input
2 years ago
Georgi Gerganov c30bffc8a5
ref #22 : add "duration" option
2 years ago
Georgi Gerganov c71363f14c
examples : add simple script for generating Karaoke video
2 years ago
Georgi Gerganov d42cf6d0df
Update README.md
2 years ago
Georgi Gerganov ef47d77492
main : fix generated bash script
2 years ago
Georgi Gerganov d5afebd37c
whisper : token-level timestamp refactoring (#49, #120)
2 years ago
Georgi Gerganov 6fb98370ba
main : add some comments for the word-level timestamp algorithm
2 years ago
Georgi Gerganov 0729da9a3b
main : fix some edge cases for word-level timestamps
2 years ago
Georgi Gerganov 5dc74e3aff
Update README.md
2 years ago
Georgi Gerganov ac8ef34039
Update README.md
2 years ago
Georgi Gerganov dc12994603
Update README.md
2 years ago
Georgi Gerganov 57fb46f307 main : add option for word-leve timestamps (very experimental)
2 years ago
Georgi Gerganov 5a9e4260a6
stream : add "--capture" option to select capture device (ref #10)
2 years ago
Georgi Gerganov 12fb303d9d
whisper.wasm : update system info print
2 years ago
Georgi Gerganov 2827cbbbe8 main : merge parallel example in main
2 years ago
Georgi Gerganov 0b2dc3c82c parallel : working
2 years ago
Georgi Gerganov 85d6e1e1e7 main : fix sampling time + add max_context parameter
2 years ago
Georgi Gerganov 72e9cdd6bf parallel : adding tool for parallel transformer inference
2 years ago
Georgi Gerganov b89f8960ca
Update README.md
2 years ago
Georgi Gerganov 6f82320b05 Create README.md
2 years ago
Georgi Gerganov 2298310dd8 whisper.nvim : add helper script for the Neovim integration
2 years ago
Georgi Gerganov 8347a7bb6a
stream : few updates to make it compatible for Vim usage (#99)
2 years ago
Georgi Gerganov ebb01b9e33
Print system info at start of program
2 years ago
Georgi Gerganov 2400660f3f Print system info in main
2 years ago
Georgi Gerganov a6c786d5dc Update README.md
2 years ago
Georgi Gerganov 91dcf5f35b Update README.md
2 years ago
Georgi Gerganov 113a4f06d8 Update README.md
2 years ago
Georgi Gerganov 47e78b7288 Update README.md
2 years ago
Georgi Gerganov 34bb3ab0cf ggml : add system info functions
2 years ago
Georgi Gerganov c6710efde2 refactoring : move main + stream in examples + other stuff
2 years ago
Georgi Gerganov d4f94ce427 Update README.md
2 years ago
Georgi Gerganov a52ee08c1e objc : polishing the sample application
2 years ago
Georgi Gerganov b41f4a90eb Create README.md
2 years ago
Georgi Gerganov bb1ee266d2 ios : whisper.objc example
2 years ago
Georgi Gerganov 3e69a6071d
Update README.md
2 years ago
Georgi Gerganov f4aa01c2f8
Update README.md
2 years ago
Georgi Gerganov 6b45e37b2b Update README.md and finalize the whisper.wasm example
2 years ago
Georgi Gerganov 491ecd7056 wip : polishing WASM example
2 years ago
Georgi Gerganov e905c6f827 wip : initial WASM port
2 years ago