whisper.cpp

Commit Graph

Author	SHA1	Message	Date
Tamotsu Takahashi	2f596f5b33	Find libopenblas.dll.a on windows "lib" is needed for windows. With this change, you can build whisper.cpp with OpenBLAS's prebuilt DLL. 1. extract a zip from https://github.com/xianyi/OpenBLAS/releases 2. copy the headers in (openblas)/include to the root directory of whisper.cpp 3. invoke cmake with -DCMAKE_LIBRARY_PATH=(openblas)\lib -DWHISPER_SUPPORT_OPENBLAS=ON 4. copy (openblas)/bin/libopenblas.dll to the same directory of whisper.dll after msbuild https://github.com/ggerganov/whisper.cpp/issues/89#issuecomment-1324391258	3 years ago
Georgi Gerganov	e5dcdabbb8	unicode : fix character replacement (thanks to @tamo)	3 years ago
Georgi Gerganov	dad109c3f1	close #109 : add fetching of the model over HTTP (whisper.wasm)	3 years ago
Georgi Gerganov	326573de9a	talk.wasm : final touches	3 years ago
Georgi Gerganov	9aea96f774	talk.wasm : polishing + adding many AI personalities	3 years ago
Georgi Gerganov	385236d1d3	stream : "-kc" now enables context keeping from previous segment (#90 ) By default, the context keeping is disabled	3 years ago
M. Eren Akbiyik	63ae03b8e0	Prompt previous tokens for streaming (#163 ) * feat: prompt previous tokens for streaming I used a vector pointer instead of vector itself because it gave weird errors, and why not * convert vector to use with C api * feat: remove old refs, check for prompt size * feat: use better way of getting the pointer	3 years ago
Georgi Gerganov	78116f8eda	talk.wasm : update README.md	3 years ago
Georgi Gerganov	a4dfbeecf9	talk.wasm : GPT-2 meets Whisper in WebAssembly (#155 ) * talk : initial real-time transcription in the browser * talk : polishing the UI * talk : ready for beta testing * talk.wasm : rename example	3 years ago
Georgi Gerganov	2e311a2917	Update README.md	3 years ago
Georgi Gerganov	2065572a11	ggml : fix Windows build	3 years ago
Georgi Gerganov	5c2176e314	ci : add Windows build	3 years ago
Georgi Gerganov	f2df9bd768	stream : add "max_tokens" cli arg Controls the max tokens per segment for the stream example	3 years ago
Georgi Gerganov	fb8d77f760	stream : add "audio_ctx" parameter Used to overwrite the audio context size of the Encoder. For example, setting "audio_ctx = 512" will make it run about 3 times faster, processing about 10s of audio, instead of 30s. The transcription quality drops, but this can be used for real-time streaming purposes where performance is important.	3 years ago
Georgi Gerganov	62b5ff875c	stream : add "max_tokens" parameter Used to limit the number of tokens in a segment. Useful to battle with word repetition when using partial encoder context	3 years ago
Georgi Gerganov	d351771a4b	stream : add "single_segment" option Force the entire audio chunk to be transcribed into a single segment	3 years ago
Georgi Gerganov	c058aaf22e	stream : partial encoder experiments	3 years ago
greeshmay	2ba66360c9	fix: free ggml_context (close #149 ) (#150 ) * fix: free ggml_context * ggml : free the model's contexts in whisper_free() Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	3 years ago
Georgi Gerganov	e70e5c8b53	models : simplify the conversion script "transformers" dependency is not actually needed	3 years ago
Dody Suria Wijaya	55a0e1a64e	Update download-ggml-model.sh follow curl redirect to new hosting site	3 years ago
Georgi Gerganov	864a78a8d0	models : change default hosting to Hugging Face My Linode is running out of monthly bandwidth due to the big interest in the project	3 years ago
Georgi Gerganov	83c742f1a7	whisper : add option to speed up the audio tempo by x2 Using a Phase Vocoder for speeding up the audio tempo by scaling down the frequencies in the frequency domain. This reduces the computation in the Encoder by a factor of 2. The transcription accuracy is degraded, but for slow to normal speech - it seems to be still very good. I think this can find application for real-time transcription - i.e. the "stream" example.	3 years ago
Georgi Gerganov	41b48ab7f1	make : add libwhisper.so target (#144 )	3 years ago
Chidi Williams	a728be9cdb	Add WHISPER_NO_AVX and WHISPER_NO_AVX2 to CMakeLists (#136 ) * Check for AVX and AVX2 on Darwin * Add AVX options to CMakeLists	3 years ago
Georgi Gerganov	46a68fb9b5	minor : remove one more redundant line	3 years ago
Georgi Gerganov	ccd56a9c5b	minor : fix double float32 conversion in python script	3 years ago
Georgi Gerganov	3500ce8727	ref #40 : start working on the documentation	3 years ago
Alan	7519eabf65	Adds support for stdin wav input	3 years ago
Georgi Gerganov	b21213c23e	js : update whipser.js to latest	3 years ago
Chidi Williams	9e700e1821	Check for AVX and AVX2 on Darwin	3 years ago
boolemancer	0bfe728b84	Fix the Windows pthread_create shim The current implementation doesn't actually set the out parameter, and it returns 0 on failure instead of on success.	3 years ago
Georgi Gerganov	4e5674a5d5	sync : submodule whisper.spm	3 years ago
Georgi Gerganov	4c66b6a828	cmake : add submodule whisper.spm	3 years ago
Georgi Gerganov	c30bffc8a5	ref #22 : add "duration" option Can be used to partially process a recording	3 years ago
Georgi Gerganov	8fdfb0ba92	Update README.md	3 years ago
Georgi Gerganov	c71363f14c	examples : add simple script for generating Karaoke video	3 years ago
Georgi Gerganov	a09e9123ca	Update README.md	3 years ago
Georgi Gerganov	d42cf6d0df	Update README.md	3 years ago
Georgi Gerganov	ef47d77492	main : fix generated bash script	3 years ago
Georgi Gerganov	75171c2b79	ggml : multi-thread the ggml_add operator	3 years ago
Georgi Gerganov	a2eeb941f6	cmake : fix passing GGML_PERF compile option	3 years ago
Georgi Gerganov	0e689f83d8	Update README.md	3 years ago
Georgi Gerganov	d5afebd37c	whisper : token-level timestamp refactoring (#49 , #120 ) This turned out pretty good overall. The algorithm has been moved from main.cpp to whisper.cpp and can be reused for all subtitles types. This means that now you can specify the maximum length of the generated lines. Simply provide the "-ml" argument specifying the max length in number of characters	3 years ago
Georgi Gerganov	4b1c32e8ea	Update README.md	3 years ago
Georgi Gerganov	b5dde365e9	extra : compute SHA of all models files	3 years ago
Georgi Gerganov	02dfd5b8c3	whisper : fix extra memory usage after recent processor changes Had increased the memory buffer to the size of the model and forgot to bring it down.	3 years ago
Syed Jafri	c63ce24834	Allow building with Accelerate for x86_64 Macs (#123 ) * Cross compile windows * set env properly * rm log * fix review * Add back space * Don't force architecture * Allow building x86_64 with accelerate	3 years ago
Georgi Gerganov	137321915f	ggml : fix the check for NEON support (#7 ) Was using the wrong preprocessor macro	3 years ago
Syed Jafri	24cd12f647	Cross compilation (#121 ) * Cross compile windows * set env properly * rm log * fix review * Add back space	3 years ago
Georgi Gerganov	e46bc56e71	Update README.md	3 years ago

1 2 3 4 5 ...

272 Commits (57e0e6b7004d53a9b0abd327965e4da312a31408) All Branches Search

272 Commits (57e0e6b7004d53a9b0abd327965e4da312a31408)

All Branches