whisper.cpp

Commit Graph

Author	SHA1	Message	Date
Georgi Gerganov	5c2176e314	ci : add Windows build	3 years ago
Georgi Gerganov	f2df9bd768	stream : add "max_tokens" cli arg Controls the max tokens per segment for the stream example	3 years ago
Georgi Gerganov	fb8d77f760	stream : add "audio_ctx" parameter Used to overwrite the audio context size of the Encoder. For example, setting "audio_ctx = 512" will make it run about 3 times faster, processing about 10s of audio, instead of 30s. The transcription quality drops, but this can be used for real-time streaming purposes where performance is important.	3 years ago
Georgi Gerganov	62b5ff875c	stream : add "max_tokens" parameter Used to limit the number of tokens in a segment. Useful to battle with word repetition when using partial encoder context	3 years ago
Georgi Gerganov	d351771a4b	stream : add "single_segment" option Force the entire audio chunk to be transcribed into a single segment	3 years ago
Georgi Gerganov	c058aaf22e	stream : partial encoder experiments	3 years ago
greeshmay	2ba66360c9	fix: free ggml_context (close #149) (#150) * fix: free ggml_context * ggml : free the model's contexts in whisper_free() Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	3 years ago
Georgi Gerganov	e70e5c8b53	models : simplify the conversion script "transformers" dependency is not actually needed	3 years ago
Dody Suria Wijaya	55a0e1a64e	Update download-ggml-model.sh follow curl redirect to new hosting site	3 years ago
Georgi Gerganov	864a78a8d0	models : change default hosting to Hugging Face My Linode is running out of monthly bandwidth due to the big interest in the project	3 years ago
Georgi Gerganov	83c742f1a7	whisper : add option to speed up the audio tempo by x2 Using a Phase Vocoder for speeding up the audio tempo by scaling down the frequencies in the frequency domain. This reduces the computation in the Encoder by a factor of 2. The transcription accuracy is degraded, but for slow to normal speech - it seems to be still very good. I think this can find application for real-time transcription - i.e. the "stream" example.	3 years ago
Georgi Gerganov	41b48ab7f1	make : add libwhisper.so target (#144)	3 years ago
Chidi Williams	a728be9cdb	Add WHISPER_NO_AVX and WHISPER_NO_AVX2 to CMakeLists (#136) * Check for AVX and AVX2 on Darwin * Add AVX options to CMakeLists	3 years ago
Georgi Gerganov	46a68fb9b5	minor : remove one more redundant line	3 years ago
Georgi Gerganov	ccd56a9c5b	minor : fix double float32 conversion in python script	3 years ago
Georgi Gerganov	3500ce8727	ref #40 : start working on the documentation	3 years ago
Alan	7519eabf65	Adds support for stdin wav input	3 years ago
Georgi Gerganov	b21213c23e	js : update whisper.js to latest	3 years ago
Chidi Williams	9e700e1821	Check for AVX and AVX2 on Darwin	3 years ago
boolemancer	0bfe728b84	Fix the Windows pthread_create shim The current implementation doesn't actually set the out parameter, and it returns 0 on failure instead of on success.	3 years ago
Georgi Gerganov	4e5674a5d5	sync : submodule whisper.spm	3 years ago
Georgi Gerganov	4c66b6a828	cmake : add submodule whisper.spm	3 years ago
Georgi Gerganov	c30bffc8a5	ref #22 : add "duration" option Can be used to partially process a recording	3 years ago
Georgi Gerganov	8fdfb0ba92	Update README.md	3 years ago
Georgi Gerganov	c71363f14c	examples : add simple script for generating Karaoke video	3 years ago
Georgi Gerganov	a09e9123ca	Update README.md	3 years ago
Georgi Gerganov	d42cf6d0df	Update README.md	3 years ago
Georgi Gerganov	ef47d77492	main : fix generated bash script	3 years ago
Georgi Gerganov	75171c2b79	ggml : multi-thread the ggml_add operator	3 years ago
Georgi Gerganov	a2eeb941f6	cmake : fix passing GGML_PERF compile option	3 years ago
Georgi Gerganov	0e689f83d8	Update README.md	3 years ago
Georgi Gerganov	d5afebd37c	whisper : token-level timestamp refactoring (#49, #120) This turned out pretty good overall. The algorithm has been moved from main.cpp to whisper.cpp and can be reused for all subtitles types. This means that now you can specify the maximum length of the generated lines. Simply provide the "-ml" argument specifying the max length in number of characters	3 years ago
Georgi Gerganov	4b1c32e8ea	Update README.md	3 years ago
Georgi Gerganov	b5dde365e9	extra : compute SHA of all models files	3 years ago
Georgi Gerganov	02dfd5b8c3	whisper : fix extra memory usage after recent processor changes Had increased the memory buffer to the size of the model and forgot to bring it down.	3 years ago
Syed Jafri	c63ce24834	Allow building with Accelerate for x86_64 Macs (#123) * Cross compile windows * set env properly * rm log * fix review * Add back space * Don't force architecture * Allow building x86_64 with accelerate	3 years ago
Georgi Gerganov	137321915f	ggml : fix the check for NEON support (#7) Was using the wrong preprocessor macro	3 years ago
Syed Jafri	24cd12f647	Cross compilation (#121) * Cross compile windows * set env properly * rm log * fix review * Add back space	3 years ago
Georgi Gerganov	e46bc56e71	Update README.md	3 years ago
Georgi Gerganov	6fb98370ba	main : add some comments for the word-level timestamp algorithm	3 years ago
Georgi Gerganov	0729da9a3b	main : fix some edge cases for word-level timestamps	3 years ago
Georgi Gerganov	5dc74e3aff	Update README.md	3 years ago
Georgi Gerganov	ac8ef34039	Update README.md	3 years ago
Mikhail Grigorev	b26345cc7b	Added download-ggml-model.cmd script for Windows	3 years ago
Mikhail Grigorev	8dac3c6e10	Fixed sched_yield	3 years ago
Mikhail Grigorev	6417e59aad	Implemented sched_yield function for Windows	3 years ago
Georgi Gerganov	dc12994603	Update README.md	3 years ago
Georgi Gerganov	b0f2aa0ea6	Update README.md	3 years ago
Georgi Gerganov	57fb46f307	main : add option for word-level timestamps (very experimental)	3 years ago
Georgi Gerganov	5a9e4260a6	stream : add "--capture" option to select capture device (ref #10)	3 years ago
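
The figures quoted in commit fb8d77f760 ("audio_ctx = 512" covering about 10s of audio instead of 30s, at roughly 3 times the speed) can be checked with a little arithmetic. The sketch below is illustrative only and is not code from the repository. It assumes the full Encoder context is 1500 positions for a 30s window, which is the standard Whisper configuration but is not stated in the commit message; the function names are hypothetical. The ratio of context sizes is only a rough proxy for the speed-up the commit reports.

```python
# Assumption: a full 30 s window corresponds to 1500 Encoder positions.
FULL_AUDIO_CTX = 1500
FULL_WINDOW_SECONDS = 30.0


def audio_seconds(audio_ctx: int) -> float:
    """Seconds of audio covered by a reduced Encoder context (hypothetical helper)."""
    return FULL_WINDOW_SECONDS * audio_ctx / FULL_AUDIO_CTX


def context_ratio(audio_ctx: int) -> float:
    """How many times smaller the reduced context is than the full one."""
    return FULL_AUDIO_CTX / audio_ctx


if __name__ == "__main__":
    ctx = 512
    print(f"audio_ctx={ctx}: {audio_seconds(ctx):.2f} s of audio, "
          f"context is {context_ratio(ctx):.2f}x smaller")
```

With audio_ctx = 512 this gives 10.24s of audio and a context about 2.93 times smaller, consistent with the "about 10s" and "about 3 times faster" wording in the commit message.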


311 Commits (930c693989230cbae27519ed7141b67f49b1ca38)
