whisper.cpp

Commit Graph

Author	SHA1	Message	Date
Georgi Gerganov	b3c865083e	ci : add emscripten build	3 years ago
Georgi Gerganov	a0d4f8e65c	main : make whisper_print_segment_callback() more readable (close #371 )	3 years ago
Georgi Gerganov	196d738974	minor : close #370 + Makefile build info print change	3 years ago
Andy Maloney	84c6b42e65	cmake : update to 3.19 (#351 ) - update from 3.0 (from 2014) to 3.19 (from 2020) - move some global setting onto the targets (through a cmake include)	3 years ago
Niels Mayer	a593b932e4	main : add -ocsv, aka --output-csv to output a CSV file Adds -ocsv, aka --output-csv feature to examples/main, which outputs a CSV file containing lines formatted as follows <startTime-in-integer-milliseconds>, <endTime-in-integer-milliseconds>, "<transcript-line-including-commas>".	3 years ago
Andy Maloney	dc90efd504	examples : small code cleanups (#322 ) - remove unnecessary initialization of string to "" - use empty() instead of checking size() - use emplace_back instead of push_back - use nullptr instead of NULL - remove unnecessary call to .data() on string - use character overload of find_first_of() instead of passing a string	3 years ago
Georgi Gerganov	99da1e5cc8	cmake : enable and fix -Wall -Wextra -Wpedantic C++ warnings	3 years ago
Matheus de Sousa	8e3f129b4d	minor : resolves some of warnings when compiling with clang/clang++ (#294 ) * Resolves some of warnings when compiling with clang/clang++ Mostly nit stuff that clang catches when compiling with -Wall -Wextra -pedantic. - Fix comparison between sign/unsigned integers. - Passes a constant reference (const&) instead of copying each time. * minor : normalize coding style * minor : fix warning Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	3 years ago
Georgi Gerganov	fba10a4c68	whisper : language auto-detect (#59 )	3 years ago
Georgi Gerganov	32fbc8cd04	main : add option to print the progress (#276 )	3 years ago
Georgi Gerganov	b8065d90f5	main : add "--prompt" command line argument (#90 ) This allows to provide an initial prompt to be used at the start of the processing.	3 years ago
Lexevolution	6ed786957e	Add newline per segment for text output (#254 )	3 years ago
Georgi Gerganov	4698dcdb52	whisper : add mechanism for aborting the whisper_full() computation	3 years ago
Georgi Gerganov	0f619b52ce	main : add stereo-channel-based diarization (#64 ) Not tested - I don't have stereo dialog audio	3 years ago
Georgi Gerganov	bc88eb13c6	examples : add "command" tool (#171 )	3 years ago
Georgi Gerganov	b8ce25dec1	refactoring : more readable code	3 years ago
Georgi Gerganov	454b91de16	main : fix dangling pointer when using stdin for input (#65 )	3 years ago
Georgi Gerganov	d7024cf9dc	main, stream : remove --verbose flag (#178 )	3 years ago
Georgi Gerganov	e5dcdabbb8	unicode : fix character replacement (thanks to @tamo)	3 years ago
Georgi Gerganov	83c742f1a7	whisper : add option to speed up the audio tempo by x2 Using a Phase Vocoder for speeding up the audio tempo by scaling down the frequencies in the frequency domain. This reduces the computation in the Encoder by a factor of 2. The transcription accuracy is degraded, but for slow to normal speech - it seems to be still very good. I think this can find application for real-time transcription - i.e. the "stream" example.	3 years ago
Alan	7519eabf65	Adds support for stdin wav input	3 years ago
Georgi Gerganov	c30bffc8a5	ref #22 : add "duration" option Can be used to partially process a recording	3 years ago
Georgi Gerganov	ef47d77492	main : fix generated bash script	3 years ago
Georgi Gerganov	d5afebd37c	whisper : token-level timestamp refactoring (#49 , #120 ) This turned out pretty good overall. The algorithm has been moved from main.cpp to whisper.cpp and can be reused for all subtitles types. This means that now you can specify the maximum length of the generated lines. Simply provide the "-ml" argument specifying the max length in number of characters	3 years ago
Georgi Gerganov	6fb98370ba	main : add some comments for the word-level timestamp algorithm	3 years ago
Georgi Gerganov	0729da9a3b	main : fix some edge cases for word-level timestamps	3 years ago
Georgi Gerganov	dc12994603	Update README.md	3 years ago
Georgi Gerganov	57fb46f307	main : add option for word-leve timestamps (very experimental)	3 years ago
Georgi Gerganov	2827cbbbe8	main : merge parallel example in main	3 years ago
Georgi Gerganov	0b2dc3c82c	parallel : working	3 years ago
Georgi Gerganov	85d6e1e1e7	main : fix sampling time + add max_context parameter	3 years ago
Georgi Gerganov	ebb01b9e33	Print system info at start of program	3 years ago
Georgi Gerganov	2400660f3f	Print system info in main	3 years ago
Georgi Gerganov	47e78b7288	Update README.md	3 years ago
Georgi Gerganov	c6710efde2	refactoring : move main + stream in examples + other stuff	3 years ago

35 Commits (52a3e0c92a8be5150d2a59e492b4943ca8a623b0)