whisper.cpp

Commit Graph

Author	SHA1	Message	Date
Georgi Gerganov	37422ed733	talk.wasm : add audio pre-processing + bump memory	3 years ago
Georgi Gerganov	be3b720f96	talk.wasm : refactoring + update README.md	3 years ago
Georgi Gerganov	00f46dbc1d	models : add usage comments to the HF convert script (#157)	3 years ago
Georgi Gerganov	5698bddbc9	models : fix HF fine-tuned model conversion script (#157) It works now	3 years ago
Georgi Gerganov	388e9f79ad	ggml : fix the fix	3 years ago
Georgi Gerganov	35cd29ce1f	ggml : fix cross-compile Linux -> Window with mingw (#168)	3 years ago
Georgi Gerganov	a156a358ca	Revert "update README.md" This reverts commit `6a84147113`.	3 years ago
katsu560	6a84147113	update README.md	3 years ago
katsu560	804f36aa2c	ggml: change inline ggml_fp16_to_fp32, ggml_fp16_t ggml_fp32_to_fp16	3 years ago
katsu560	4b2f51b479	add gprof option	3 years ago
katsu560	800ae5b808	fix AVX,AVX2,FMA,F16C detection on Linux and add flags for OpenBLAS	3 years ago
katsu560	83456076f0	add AVX support	3 years ago
Tamotsu Takahashi	3df6c14fca	Build with OpenBLAS and SDL2 on windows	3 years ago
Georgi Gerganov	d64d6ca3fd	models : minor changes to the HF convert script (#157)	3 years ago
Georgi Gerganov	93482d0373	models : add "convert-h5-to-ggml.py" script (#157) Converts transformers models to ggml. Although the conversion is successful, it does not work for some reason. Not sure why	3 years ago
Georgi Gerganov	49706a658a	minor : updates few prints + fix buttons in whisper.wasm	3 years ago
Georgi Gerganov	363a2dadec	Update README.md	3 years ago
Georgi Gerganov	623a486056	Update README.md	3 years ago
Tamotsu Takahashi	2f596f5b33	Find libopenblas.dll.a on windows "lib" is needed for windows. With this change, you can build whisper.cpp with OpenBLAS's prebuilt DLL. 1. extract a zip from https://github.com/xianyi/OpenBLAS/releases 2. copy the headers in (openblas)/include to the root directory of whisper.cpp 3. invoke cmake with -DCMAKE_LIBRARY_PATH=(openblas)\lib -DWHISPER_SUPPORT_OPENBLAS=ON 4. copy (openblas)/bin/libopenblas.dll to the same directory of whisper.dll after msbuild https://github.com/ggerganov/whisper.cpp/issues/89#issuecomment-1324391258	3 years ago
Georgi Gerganov	e5dcdabbb8	unicode : fix character replacement (thanks to @tamo)	3 years ago
Georgi Gerganov	dad109c3f1	close #109 : add fetching of the model over HTTP (whisper.wasm)	3 years ago
Georgi Gerganov	326573de9a	talk.wasm : final touches	3 years ago
Georgi Gerganov	9aea96f774	talk.wasm : polishing + adding many AI personalities	3 years ago
Georgi Gerganov	385236d1d3	stream : "-kc" now enables context keeping from previous segment (#90) By default, the context keeping is disabled	3 years ago
M. Eren Akbiyik	63ae03b8e0	Prompt previous tokens for streaming (#163) * feat: prompt previous tokens for streaming I used a vector pointer instead of vector itself because it gave weird errors, and why not * convert vector to use with C api * feat: remove old refs, check for prompt size * feat: use better way of getting the pointer	3 years ago
Georgi Gerganov	78116f8eda	talk.wasm : update README.md	3 years ago
Georgi Gerganov	a4dfbeecf9	talk.wasm : GPT-2 meets Whisper in WebAssembly (#155) * talk : initial real-time transcription in the browser * talk : polishing the UI * talk : ready for beta testing * talk.wasm : rename example	3 years ago
Georgi Gerganov	2e311a2917	Update README.md	3 years ago
Georgi Gerganov	2065572a11	ggml : fix Windows build	3 years ago
Georgi Gerganov	5c2176e314	ci : add Windows build	3 years ago
Georgi Gerganov	f2df9bd768	stream : add "max_tokens" cli arg Controls the max tokens per segment for the stream example	3 years ago
Georgi Gerganov	fb8d77f760	stream : add "audio_ctx" parameter Used to overwrite the audio context size of the Encoder. For example, setting "audio_ctx = 512" will make it run about 3 times faster, processing about 10s of audio, instead of 30s. The transcription quality drops, but this can be used for real-time streaming purposes where performance is important.	3 years ago
Georgi Gerganov	62b5ff875c	stream : add "max_tokens" parameter Used to limit the number of tokens in a segment. Useful to battle with word repetition when using partial encoder context	3 years ago
Georgi Gerganov	d351771a4b	stream : add "single_segment" option Force the entire audio chunk to be transcribed into a single segment	3 years ago
Georgi Gerganov	c058aaf22e	stream : partial encoder experiments	3 years ago
greeshmay	2ba66360c9	fix: free ggml_context (close #149) (#150) * fix: free ggml_context * ggml : free the model's contexts in whisper_free() Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	3 years ago
Georgi Gerganov	e70e5c8b53	models : simplify the conversion script "transformers" dependency is not actually needed	3 years ago
Dody Suria Wijaya	55a0e1a64e	Update download-ggml-model.sh follow curl redirect to new hosting site	3 years ago
Georgi Gerganov	864a78a8d0	models : change default hosting to Hugging Face My Linode is running out of monthly bandwidth due to the big interest in the project	3 years ago
Georgi Gerganov	83c742f1a7	whisper : add option to speed up the audio tempo by x2 Using a Phase Vocoder for speeding up the audio tempo by scaling down the frequencies in the frequency domain. This reduces the computation in the Encoder by a factor of 2. The transcription accuracy is degraded, but for slow to normal speech - it seems to be still very good. I think this can find application for real-time transcription - i.e. the "stream" example.	3 years ago
Georgi Gerganov	41b48ab7f1	make : add libwhisper.so target (#144)	3 years ago
Chidi Williams	a728be9cdb	Add WHISPER_NO_AVX and WHISPER_NO_AVX2 to CMakeLists (#136) * Check for AVX and AVX2 on Darwin * Add AVX options to CMakeLists	3 years ago
Georgi Gerganov	46a68fb9b5	minor : remove one more redundant line	3 years ago
Georgi Gerganov	ccd56a9c5b	minor : fix double float32 conversion in python script	3 years ago
Georgi Gerganov	3500ce8727	ref #40 : start working on the documentation	3 years ago
Alan	7519eabf65	Adds support for stdin wav input	3 years ago
Georgi Gerganov	b21213c23e	js : update whipser.js to latest	3 years ago
Chidi Williams	9e700e1821	Check for AVX and AVX2 on Darwin	3 years ago
boolemancer	0bfe728b84	Fix the Windows pthread_create shim The current implementation doesn't actually set the out parameter, and it returns 0 on failure instead of on success.	3 years ago
Georgi Gerganov	4e5674a5d5	sync : submodule whisper.spm	3 years ago

340 Commits (90564f85f97d0a5d6054fd6b70ca1bd7ba42bd70)