Georgi Gerganov
1eb81f863f
make : revert accidental change of optimization flags
2 years ago
Georgi Gerganov
fba10a4c68
whisper : language auto-detect ( #59 )
2 years ago
Georgi Gerganov
afe2db0fe2
Add Roadmap
2 years ago
Georgi Gerganov
a7047b2a28
ggml : implement ggml_compute_forward_dup_f16() special cases
2 years ago
Georgi Gerganov
32fbc8cd04
main : add option to print the progress ( #276 )
2 years ago
Georgi Gerganov
b8065d90f5
main : add "--prompt" command line argument ( #90 )
...
This allows to provide an initial prompt to be used at the start of the
processing.
2 years ago
Georgi Gerganov
4312995974
command : better indentation
2 years ago
Georgi Gerganov
5eeeb3412d
command : update README, show how to use guided mode
2 years ago
Georgi Gerganov
6a69e3ae27
command : adding guided mode
2 years ago
Georgi Gerganov
bf69b669a0
whisper : add whisper_tokenize()
...
Tokenizes a string into a list of vocabulary tokens
2 years ago
Georgi Gerganov
ea19ed33f1
Update README.md ( #46 )
...
Add references to the new Android app
2 years ago
Digipom
675e787171
Add Android sample ( #277 )
...
* Add Android sample
* Use main project C files
* Stop existing playback before starting new playback
* Make text scrollable
* Stop playback when starting to record
* Remove extra var
2 years ago
Georgi Gerganov
c6c3ad5a98
ci : add Windows build without OpenBLAS + change to Release ( #85 ) ( #282 )
2 years ago
Georgi Gerganov
6a7c82501e
whisper : improve decoding strategy ( #244 )
...
- Clear past prompt when there is very short audio left for processing.
My observation is that in these cases the decoding tends to repeat and
hallucinate stuff and I think this is induced by the existing prompt
- When we fail to sample timestamp token, retry by clearing the past
prompt. If it fails again, then we advance the window by 1 second
2 years ago
Georgi Gerganov
a82d331034
stream : update README.md + comments
2 years ago
Georgi Gerganov
c37c2443c1
Update README.md ( #56 )
2 years ago
Georgi Gerganov
0f11759406
ggml : make more compatible with c99 ( #262 )
2 years ago
Georgi Gerganov
5a5c5ddcca
Update README.md
2 years ago
Georgi Gerganov
34e0b4b9ef
stream : fix build
2 years ago
Georgi Gerganov
b0f8013eb9
stream : add sliding window mode
2 years ago
Georgi Gerganov
124c718c73
whisper : fix UB when reading buffer of length 0 bytes ( #265 )
2 years ago
Georgi Gerganov
f66ac6dc4f
ggml : fix indentation
2 years ago
Georgi Gerganov
9955fa4ed7
ggml : make compatible with c99 ( #262 )
2 years ago
Georgi Gerganov
a613f16aec
talk : improve prompting
2 years ago
Georgi Gerganov
930c693989
release : v1.0.3
...
Fixed whisper.spm tests
2 years ago
Georgi Gerganov
d8a0dde31a
Update README.md
2 years ago
Georgi Gerganov
9e3e6f253a
release : v1.0.2
2 years ago
Georgi Gerganov
57ccd7cc4f
Update README.md
2 years ago
Georgi Gerganov
812ae3ffbd
Update README.md
2 years ago
Georgi Gerganov
f309f97df6
Node.js package ( #260 )
...
* npm : preparing infra for node package
* npm : package infra ready
* npm : initial version ready
* npm : change name to whisper.cpp
whisper.js is taken
2 years ago
Georgi Gerganov
aa6adda26e
talk : make compatible with c++11 (part 2)
2 years ago
Georgi Gerganov
444349f4ec
talk : make compatible with c++11
2 years ago
Georgi Gerganov
37a93d2459
cmake : require c++11 instead of c++20
2 years ago
Roland Rabien
e70d47baab
Remove C++20 requirement ( #257 )
...
* Remove C++20 requirement
* Roll back C features not supported in VS2017
2 years ago
Lexevolution
6ed786957e
Add newline per segment for text output ( #254 )
2 years ago
Georgi Gerganov
ea38ad6e70
bench : more concise representation of the results ( #89 )
2 years ago
Georgi Gerganov
054940e1f6
minor : fix .gitignore to not ignore examples
2 years ago
Georgi Gerganov
fcf515de60
bench.wasm : same as "bench" but runs in the browser ( #89 )
2 years ago
Georgi Gerganov
85c9ac18b5
Update README.md
2 years ago
Georgi Gerganov
b7c85d1ea6
talk : fix build for MSVC
2 years ago
Georgi Gerganov
3b1aacbe6d
talk : talk with AI in the terminal
2 years ago
bert hubert
d1da35de06
fix potential bug reading model data into a small size optimized string which could lead to memory corruption. In an SSO string, you can't write data to &str[0] and expect it to work well.
...
Also added a small wrapper function to more safely read model data without having to get the sizeof right. I tested this on tiny, base and large models, there was no change in behaviour.
2 years ago
Georgi Gerganov
603f97ba11
whisper : minor improvemnt in decoding strategy ( #244 )
...
Do not allow for text segments to go beyond end of audio.
This partially mitigates some issues when the last audio window is 1-2
seconds just before the end of the audio file and the decoding spirals
into a repetition of the last transcribed phrase.
2 years ago
Georgi Gerganov
50a061b313
ggml : add alternative cblas_sgemm call
2 years ago
Georgi Gerganov
832b4f34c9
make : indentation + .gitignore
2 years ago
Reinis Muiznieks
0f98755fc5
Flag for Position Independent Code
2 years ago
Georgi Gerganov
56822621a8
twitch.sh : various fixes and polishing
...
- check if streamlink is installed
- fix audio chunking
- change default threads to 4
2 years ago
keyehzy
9e5f3ddc16
Allow for Twitch.tv live transcription
...
We rely on streamlink library to give us a stream, then we proceed similarly to
the radio livestream example.
2 years ago
Kartik Saranathan
d91c001120
Fix paths echoed after the download
...
Was using models path instead of root path
2 years ago
Al Hoang
04a16bbf11
fix compilation on haiku
2 years ago