whisper.cpp

Commit Graph

Author	SHA1	Message	Date
Georgi Gerganov	6a7c82501e	whisper : improve decoding strategy (#244 ) - Clear past prompt when there is very short audio left for processing. My observation is that in these cases the decoding tends to repeat and hallucinate stuff and I think this is induced by the existing prompt - When we fail to sample timestamp token, retry by clearing the past prompt. If it fails again, then we advance the window by 1 second	3 years ago
Georgi Gerganov	a82d331034	stream : update README.md + comments	3 years ago
Georgi Gerganov	c37c2443c1	Update README.md (#56 )	3 years ago
Georgi Gerganov	0f11759406	ggml : make more compatible with c99 (#262 )	3 years ago
Georgi Gerganov	5a5c5ddcca	Update README.md	3 years ago
Georgi Gerganov	34e0b4b9ef	stream : fix build	3 years ago
Georgi Gerganov	b0f8013eb9	stream : add sliding window mode	3 years ago
Georgi Gerganov	124c718c73	whisper : fix UB when reading buffer of length 0 bytes (#265 )	3 years ago
Georgi Gerganov	f66ac6dc4f	ggml : fix indentation	3 years ago
Georgi Gerganov	9955fa4ed7	ggml : make compatible with c99 (#262 )	3 years ago
Georgi Gerganov	a613f16aec	talk : improve prompting	3 years ago
Georgi Gerganov	930c693989	release : v1.0.3 Fixed whisper.spm tests	3 years ago
Georgi Gerganov	d8a0dde31a	Update README.md	3 years ago
Georgi Gerganov	9e3e6f253a	release : v1.0.2	3 years ago
Georgi Gerganov	57ccd7cc4f	Update README.md	3 years ago
Georgi Gerganov	812ae3ffbd	Update README.md	3 years ago
Georgi Gerganov	f309f97df6	Node.js package (#260 ) * npm : preparing infra for node package * npm : package infra ready * npm : initial version ready * npm : change name to whisper.cpp whisper.js is taken	3 years ago
Georgi Gerganov	aa6adda26e	talk : make compatible with c++11 (part 2)	3 years ago
Georgi Gerganov	444349f4ec	talk : make compatible with c++11	3 years ago
Georgi Gerganov	37a93d2459	cmake : require c++11 instead of c++20	3 years ago
Roland Rabien	e70d47baab	Remove C++20 requirement (#257 ) * Remove C++20 requirement * Roll back C features not supported in VS2017	3 years ago
Lexevolution	6ed786957e	Add newline per segment for text output (#254 )	3 years ago
Georgi Gerganov	ea38ad6e70	bench : more concise representation of the results (#89 )	3 years ago
Georgi Gerganov	054940e1f6	minor : fix .gitignore to not ignore examples	3 years ago
Georgi Gerganov	fcf515de60	bench.wasm : same as "bench" but runs in the browser (#89 )	3 years ago
Georgi Gerganov	85c9ac18b5	Update README.md	3 years ago
Georgi Gerganov	b7c85d1ea6	talk : fix build for MSVC	3 years ago
Georgi Gerganov	3b1aacbe6d	talk : talk with AI in the terminal	3 years ago
bert hubert	d1da35de06	fix potential bug reading model data into a small size optimized string which could lead to memory corruption. In an SSO string, you can't write data to &str[0] and expect it to work well. Also added a small wrapper function to more safely read model data without having to get the sizeof right. I tested this on tiny, base and large models, there was no change in behaviour.	3 years ago
Georgi Gerganov	603f97ba11	whisper : minor improvemnt in decoding strategy (#244 ) Do not allow for text segments to go beyond end of audio. This partially mitigates some issues when the last audio window is 1-2 seconds just before the end of the audio file and the decoding spirals into a repetition of the last transcribed phrase.	3 years ago
Georgi Gerganov	50a061b313	ggml : add alternative cblas_sgemm call	3 years ago
Georgi Gerganov	832b4f34c9	make : indentation + .gitignore	3 years ago
Reinis Muiznieks	0f98755fc5	Flag for Position Independent Code	3 years ago
Georgi Gerganov	56822621a8	twitch.sh : various fixes and polishing - check if streamlink is installed - fix audio chunking - change default threads to 4	3 years ago
keyehzy	9e5f3ddc16	Allow for Twitch.tv live transcription We rely on streamlink library to give us a stream, then we proceed similarly to the radio livestream example.	3 years ago
Kartik Saranathan	d91c001120	Fix paths echoed after the download Was using models path instead of root path	3 years ago
Al Hoang	04a16bbf11	fix compilation on haiku	3 years ago
Georgi Gerganov	47afb93c3c	yt-wsp.sh : improve usage instructions	3 years ago
Georgi Gerganov	575c53dc41	yt-wsp.sh : fix usage instruction + comment	3 years ago
Georgi Gerganov	3996ecc156	Update README.md	3 years ago
Georgi Gerganov	faa85f9840	livestream.sh : remove obsolete comment	3 years ago
Georgi Gerganov	b6597539f9	ggml : fix typo in previous commit	3 years ago
Georgi Gerganov	9a4b7a916e	ggml : use macros to inline FP16 <-> FP32 conversions	3 years ago
Georgi Gerganov	f8ec718b76	ggml : add F16C CPU flag check	3 years ago
katsu560	35b40a93b9	add fp16/fp32 convert intrinsics	3 years ago
Georgi Gerganov	9fe7306f4b	models : add the new "large" model release by OpenAI The old "large" model is now renamed "large-v1". If you have been using it, make sure to rename it and download the new "large" model for best results.	3 years ago
Georgi Gerganov	13e8eb2346	bench : add commit hash to bench-all.sh results	3 years ago
Georgi Gerganov	78d13257be	Try to improve the token sampling strategy (#193 ) * whisper : try to improve the token sampling strategy - Add the "max_initial_timestaamp" token logic from OpenAI - Disallow sampling timestamps that are in the past * whisper : fix the max initial timestamp logic + fallback decoding	3 years ago
Georgi Gerganov	9b7df68753	tests : adding transcription tests	3 years ago
Georgi Gerganov	061fc81bd6	ggml : remove inline specifier from fp16 <-> fp32 converters	3 years ago

1 2 3 4 5 ...

322 Commits (6a7c82501e3794724ba80bfb9a983810af036803) All Branches Search

322 Commits (6a7c82501e3794724ba80bfb9a983810af036803)

All Branches