Georgi Gerganov
68f5962be6
ci : add cmake builds
2 years ago
Georgi Gerganov
332c9d77fe
whisper : fix bug in token sampling logic
Could overflow buffer
2 years ago
Georgi Gerganov
877c058179
Add CMake support
2 years ago
Georgi Gerganov
481cd685d5
ref #10 : option to keep context in "stream" example
Results seem to get worse when the context is kept, so this is
disabled by default
2 years ago
Georgi Gerganov
3f15bb8a08
ref #10 : add "step" argument for "stream" example
Controls how often we run the inference.
By default, we run it every 3 seconds.
2 years ago
Georgi Gerganov
7787b878e1
ref #16, #22 : add "offset" argument
Allows processing of the input audio to start at some offset from the
beginning. Useful for splitting a long job into multiple tasks.
2 years ago
Georgi Gerganov
e29a5dacc6
ref #11, #18, #26 : fix CACHE_LINE_SIZE constant
2 years ago
Georgi Gerganov
844d60b284
Add CI using GitHub Actions
2 years ago
Georgi Gerganov
700898e6ed
ref #22 : add option to provide multiple input .wav files
2 years ago
Georgi Gerganov
6b1c3cc198
Update README.md
2 years ago
Georgi Gerganov
b8f713482e
Minor updates
2 years ago
Georgi Gerganov
167324584b
wip : rpi4 support
2 years ago
Georgi Gerganov
ce1fe95902
wip : improve makefile
2 years ago
Georgi Gerganov
74197ffc11
Merge pull request #20 from ArtyomZemlyak/master
Fix: main get language from cli args
2 years ago
Артём Земляк
495b81b367
Fix: main get n_threads from cli
2 years ago
Артём Земляк
f007e186fe
Fix: main get language from cli args
2 years ago
Georgi Gerganov
e7a15876f8
Update README.md
2 years ago
Georgi Gerganov
6814cc9b02
Improve result printing
2 years ago
Georgi Gerganov
eba33adadd
Extend C-style API with full inference methods
2 years ago
Georgi Gerganov
6b77124e01
Initial C-style interface for whisper.cpp
2 years ago
Georgi Gerganov
be8ba034f6
ref #10 : handle Ctrl+C in "stream" app
2 years ago
Georgi Gerganov
d71e567656
Update README.md
2 years ago
Georgi Gerganov
b6bf906730
ref #10 : quick-and-dirty attempt for real-time audio transcription
- Processes input in chunks of 3 seconds.
- Pads audio with silence
- Uses 1 second audio from previous pass
- No text context
2 years ago
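The chunking scheme listed in the commit above (3-second chunks, 1 second carried over from the previous pass, silence padding) can be sketched roughly as follows. This is only an illustration, not the code from the "stream" example: the helper `make_window`, the constants, the 16 kHz sample rate, and the choice to pad up to a 30-second window are all assumptions made for the sketch.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Assumed values for this sketch; the real example may use different ones.
constexpr std::size_t kSampleRate    = 16000;             // samples per second
constexpr std::size_t kStepSamples   = 3 * kSampleRate;   // new audio per pass
constexpr std::size_t kKeepSamples   = 1 * kSampleRate;   // audio reused from previous pass
constexpr std::size_t kWindowSamples = 30 * kSampleRate;  // padded window length

// Hypothetical helper: builds the buffer for one inference pass.
// Layout: [last 1 s of `prev`] [new chunk] [zeros up to kWindowSamples].
std::vector<float> make_window(const std::vector<float> & prev,
                               const std::vector<float> & chunk) {
    // Reuse at most kKeepSamples from the end of the previous chunk.
    const std::size_t n_keep = std::min(prev.size(), kKeepSamples);

    std::vector<float> window;
    window.reserve(kWindowSamples);
    window.insert(window.end(),
                  prev.end() - static_cast<std::ptrdiff_t>(n_keep),
                  prev.end());
    window.insert(window.end(), chunk.begin(), chunk.end());

    // Pad with silence (zeros) if the window is still too short.
    if (window.size() < kWindowSamples) {
        window.resize(kWindowSamples, 0.0f);
    }
    return window;
}
```

A caller would pass each newly captured chunk of `kStepSamples` samples together with the previous chunk, and feed the returned buffer to the model. No text context is carried between passes in this sketch, matching the last bullet of the commit.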
Georgi Gerganov
77d929f603
Fix bug in FFT
The FFT routine does not work for odd N.
The fix adds a plain DFT and uses it when N is odd.
2 years ago
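The idea behind the FFT fix above can be illustrated with a small sketch: a recursive radix-2 FFT can only split its input in half while the length is even, so any odd-length input (or sub-problem) is handed to a direct DFT. This is a minimal illustration under assumed names (`dft`, `fft`, `cvec`) using `std::complex`, and is not the routine from the repository.

```cpp
#include <cmath>
#include <complex>
#include <cstddef>
#include <vector>

using cvec = std::vector<std::complex<double>>;

// Naive O(N^2) discrete Fourier transform; valid for any length N.
cvec dft(const cvec & in) {
    const std::size_t n = in.size();
    const double pi = std::acos(-1.0);
    cvec out(n);
    for (std::size_t k = 0; k < n; ++k) {
        std::complex<double> sum(0.0, 0.0);
        for (std::size_t t = 0; t < n; ++t) {
            const double angle = -2.0 * pi * double(k) * double(t) / double(n);
            sum += in[t] * std::polar(1.0, angle);
        }
        out[k] = sum;
    }
    return out;
}

// Recursive radix-2 FFT with a DFT fallback for odd lengths.
cvec fft(const cvec & in) {
    const std::size_t n = in.size();
    if (n <= 1) {
        return in;
    }
    if (n % 2 == 1) {
        return dft(in);  // cannot split an odd length into two equal halves
    }

    // Split into even-indexed and odd-indexed samples.
    cvec even(n / 2), odd(n / 2);
    for (std::size_t i = 0; i < n / 2; ++i) {
        even[i] = in[2 * i];
        odd[i]  = in[2 * i + 1];
    }
    const cvec e = fft(even);
    const cvec o = fft(odd);

    // Combine the two half-size transforms with the twiddle factors.
    const double pi = std::acos(-1.0);
    cvec out(n);
    for (std::size_t k = 0; k < n / 2; ++k) {
        const std::complex<double> tw =
            std::polar(1.0, -2.0 * pi * double(k) / double(n)) * o[k];
        out[k]         = e[k] + tw;
        out[k + n / 2] = e[k] - tw;
    }
    return out;
}
```

With this structure a length such as 6 is split once into two length-3 sub-problems, each of which falls back to the direct DFT.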
Georgi Gerganov
6d654d192a
Fix reading of stereo WAV files
2 years ago
Georgi Gerganov
62897e8ae6
Update README.md
2 years ago
Georgi Gerganov
15b49e8baf
Bug fix
Longer prompts could cause out-of-bounds access
2 years ago
Georgi Gerganov
3bcdbdfc32
Reduce memory usage even more + better sampling
- The encode/decode memory buffers are now reused
- If the 30-sec segment goes for too long without a timestamp token, we
force one. Improves transcription with the large model
- Stereo support
- Add "micro-machines.wav" sample
2 years ago
Georgi Gerganov
310f4883d1
Update README.md
2 years ago
Georgi Gerganov
fd3f3d748f
Update README.md
2 years ago
Georgi Gerganov
5877c3578e
ref #4 : added transcription timestamps
Can be turned off with "-nt" argument.
Performance has also improved.
2 years ago
Georgi Gerganov
8d4041c31f
Merge pull request #3 from cdosoftei/master
Pass -pthread to linker
2 years ago
cdosoftei
d4fcfa47b0
Pass -pthread to linker
2 years ago
Georgi Gerganov
4352a6018b
Update README.md
2 years ago
Georgi Gerganov
f888c2373d
Flash + language support (ref #2)
- Achieved big performance improvement + memory usage reduction
- Can now translate / transcribe different languages
2 years ago
Georgi Gerganov
154fa796dd
ref #1 : add -pthread to compilation flags
2 years ago
Georgi Gerganov
476182e439
Update README.md and simplify usage
2 years ago
Georgi Gerganov
f2456f8d93
Create README.md
2 years ago
Georgi Gerganov
28802c4dae
Create LICENSE
2 years ago
Georgi Gerganov
b0a11594ae
Initial release
2 years ago