whisper.cpp

Commit Graph

Author	SHA1	Message	Date
Georgi Gerganov	78d13257be	Try to improve the token sampling strategy (#193 ) * whisper : try to improve the token sampling strategy - Add the "max_initial_timestaamp" token logic from OpenAI - Disallow sampling timestamps that are in the past * whisper : fix the max initial timestamp logic + fallback decoding	3 years ago
Georgi Gerganov	9b7df68753	tests : adding transcription tests	3 years ago
Georgi Gerganov	061fc81bd6	ggml : remove inline specifier from fp16 <-> fp32 converters	3 years ago
Georgi Gerganov	57e0e6b700	livestream : handle ffmpeg errors gracefully and stabilize transcript	3 years ago
Georgi Gerganov	4f7363077f	livestream : minor changes	3 years ago
semiformal-net	093c840dee	livestream : fix losing words across audio chunk (#195 ) * improve livestream script * Update examples/livestream.sh Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> Co-authored-by: Paul Edwards <paul.edwards@semiformal.net> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	3 years ago
Tienshiao Ma	e7f09a0a61	Fix Darwin flags - was incorrectly always using the Linux else clause	3 years ago
Georgi Gerganov	4698dcdb52	whisper : add mechanism for aborting the whisper_full() computation	3 years ago
Georgi Gerganov	6fd5358dd0	Update README.md	3 years ago
Georgi Gerganov	164df0d447	whisper.objc : fix context + broken readme links	3 years ago
Georgi Gerganov	e266cb0723	whisper.objc : add real-time processing (#97 ) Similar to the "stream" app	3 years ago
Georgi Gerganov	c207eed431	whisper.objc : fix build warnings	3 years ago
Georgi Gerganov	67e819baf4	minor : remove "examples/" prefix from the README	3 years ago
Georgi Gerganov	a425365b82	yt-wsp.sh : script to easily transcribe VODs Thanks to @DaniruKun ref: https://gist.github.com/DaniruKun/96f763ec1a037cc92fe1a059b643b818 Usage: cd whisper.cpp make ./examples/yt-wsp.sh <video-url>	3 years ago
Georgi Gerganov	e0e864d9ca	Update README.md	3 years ago
Georgi Gerganov	68ecadbbc9	command.wasm : add voice assistant example for the Web (#171 ) Same as the command-line tool "command", but runs in the browser Also, added helper script "extra/deploy-wasm.sh" and fixed some timing constants for the WASM examples.	3 years ago
Georgi Gerganov	c536ff4005	minor : add comment for using "generate_karaoke.sh"	3 years ago
Georgi Gerganov	cb70b07db5	livestream.sh : simple tool to transcribe audio livestreams (#185 )	3 years ago
Georgi Gerganov	3c390ffe38	stream.wasm : add web-based real-time transcription (#112 )	3 years ago
Georgi Gerganov	be16dfa038	whisper.wasm : do not block page while processing (close #86 )	3 years ago
Georgi Gerganov	0f619b52ce	main : add stereo-channel-based diarization (#64 ) Not tested - I don't have stereo dialog audio	3 years ago
Georgi Gerganov	1246dd023e	command : add demonstration video	3 years ago
Georgi Gerganov	0be27bbd92	command : fix build + fix README + add bold printing	3 years ago
Georgi Gerganov	bc88eb13c6	examples : add "command" tool (#171 )	3 years ago
Georgi Gerganov	b8ce25dec1	refactoring : more readable code	3 years ago
vicalloy	fd113687aa	correct model name display on running samples	3 years ago
Georgi Gerganov	e4805d9601	wasm : refactor wasm example + reuse fetch mechanism	3 years ago
Georgi Gerganov	ff36415a86	talk.wasm : update video link + some minor fixes	3 years ago
Georgi Gerganov	025ff465b6	Update README.md Use a less cringy video to demo talk.wasm lol	3 years ago
Georgi Gerganov	2c0501b38a	Update README.md	3 years ago
Georgi Gerganov	abce28ea99	talk.wasm : move to https://whisper.ggerganov.com/talk This way, we can share the same models across different WASM examples and not have to download them for each page	3 years ago
Georgi Gerganov	a2ecd54455	models : add instructions for using HF fine-tuned models	3 years ago
Georgi Gerganov	128aaadb93	whisper : improve printfs	3 years ago
Georgi Gerganov	454b91de16	main : fix dangling pointer when using stdin for input (#65 )	3 years ago
Georgi Gerganov	d7024cf9dc	main, stream : remove --verbose flag (#178 )	3 years ago
Georgi Gerganov	37422ed733	talk.wasm : add audio pre-processing + bump memory	3 years ago
Georgi Gerganov	be3b720f96	talk.wasm : refactoring + update README.md	3 years ago
Georgi Gerganov	00f46dbc1d	models : add usage comments to the HF convert script (#157 )	3 years ago
Georgi Gerganov	5698bddbc9	models : fix HF fine-tuned model conversion script (#157 ) It works now	3 years ago
Georgi Gerganov	388e9f79ad	ggml : fix the fix	3 years ago
Georgi Gerganov	35cd29ce1f	ggml : fix cross-compile Linux -> Window with mingw (#168 )	3 years ago
Georgi Gerganov	a156a358ca	Revert "update README.md" This reverts commit `6a84147113`.	3 years ago
katsu560	6a84147113	update README.md	3 years ago
katsu560	804f36aa2c	ggml: change inline ggml_fp16_to_fp32, ggml_fp16_t ggml_fp32_to_fp16	3 years ago
katsu560	4b2f51b479	add gprof option	3 years ago
katsu560	800ae5b808	fix AVX,AVX2,FMA,F16C detection on Linux and add flags for OpenBLAS	3 years ago
katsu560	83456076f0	add AVX support	3 years ago
Tamotsu Takahashi	3df6c14fca	Build with OpenBLAS and SDL2 on windows	3 years ago
Georgi Gerganov	d64d6ca3fd	models : minor changes to the HF convert script (#157 )	3 years ago
Georgi Gerganov	93482d0373	models : add "convert-h5-to-ggml.py" script (#157 ) Converts transformers models to ggml. Although the conversion is successful, it does not work for some reason. Not sure why	3 years ago

275 Commits (78d13257be8094a71b65af401d4753281af2205a)
