Georgi Gerganov
83c742f1a7
whisper : add option to speed up the audio tempo by x2
...
Using a Phase Vocoder for speeding up the audio tempo by scaling down
the frequencies in the frequency domain.
This reduces the computation in the Encoder by a factor of 2.
The transcription accuracy is degraded, but for slow to normal speech -
it seems to be still very good.
I think this can find application for real-time transcription - i.e. the
"stream" example.
2 years ago
Georgi Gerganov
41b48ab7f1
make : add libwhisper.so target ( #144 )
2 years ago
Chidi Williams
a728be9cdb
Add WHISPER_NO_AVX and WHISPER_NO_AVX2 to CMakeLists ( #136 )
...
* Check for AVX and AVX2 on Darwin
* Add AVX options to CMakeLists
2 years ago
Georgi Gerganov
46a68fb9b5
minor : remove one more redundant line
2 years ago
Georgi Gerganov
ccd56a9c5b
minor : fix double float32 conversion in python script
2 years ago
Georgi Gerganov
3500ce8727
ref #40 : start working on the documentation
2 years ago
Alan
7519eabf65
Adds support for stdin wav input
2 years ago
Georgi Gerganov
b21213c23e
js : update whipser.js to latest
2 years ago
Chidi Williams
9e700e1821
Check for AVX and AVX2 on Darwin
2 years ago
boolemancer
0bfe728b84
Fix the Windows pthread_create shim
...
The current implementation doesn't actually set the out parameter,
and it returns 0 on failure instead of on success.
2 years ago
Georgi Gerganov
4e5674a5d5
sync : submodule whisper.spm
2 years ago
Georgi Gerganov
4c66b6a828
cmake : add submodule whisper.spm
2 years ago
Georgi Gerganov
c30bffc8a5
ref #22 : add "duration" option
...
Can be used to partially process a recording
2 years ago
Georgi Gerganov
8fdfb0ba92
Update README.md
2 years ago
Georgi Gerganov
c71363f14c
examples : add simple script for generating Karaoke video
2 years ago
Georgi Gerganov
a09e9123ca
Update README.md
2 years ago
Georgi Gerganov
d42cf6d0df
Update README.md
2 years ago
Georgi Gerganov
ef47d77492
main : fix generated bash script
2 years ago
Georgi Gerganov
75171c2b79
ggml : multi-thread the ggml_add operator
2 years ago
Georgi Gerganov
a2eeb941f6
cmake : fix passing GGML_PERF compile option
2 years ago
Georgi Gerganov
0e689f83d8
Update README.md
2 years ago
Georgi Gerganov
d5afebd37c
whisper : token-level timestamp refactoring ( #49 , #120 )
...
This turned out pretty good overall. The algorithm has been moved from
main.cpp to whisper.cpp and can be reused for all subtitles types. This
means that now you can specify the maximum length of the generated
lines. Simply provide the "-ml" argument specifying the max length in
number of characters
2 years ago
Georgi Gerganov
4b1c32e8ea
Update README.md
2 years ago
Georgi Gerganov
b5dde365e9
extra : compute SHA of all models files
2 years ago
Georgi Gerganov
02dfd5b8c3
whisper : fix extra memory usage after recent processor changes
...
Had increased the memory buffer to the size of the model and forgot to
bring it down.
2 years ago
Syed Jafri
c63ce24834
Allow building with Accelerate for x86_64 Macs ( #123 )
...
* Cross compile windows
* set env properly
* rm log
* fix review
* Add back space
* Don't force architecture
* Allow building x86_64 with accelerate
2 years ago
Georgi Gerganov
137321915f
ggml : fix the check for NEON support ( #7 )
...
Was using the wrong preprocessor macro
2 years ago
Syed Jafri
24cd12f647
Cross compilation ( #121 )
...
* Cross compile windows
* set env properly
* rm log
* fix review
* Add back space
2 years ago
Georgi Gerganov
e46bc56e71
Update README.md
2 years ago
Georgi Gerganov
6fb98370ba
main : add some comments for the word-level timestamp algorithm
2 years ago
Georgi Gerganov
0729da9a3b
main : fix some edge cases for word-level timestamps
2 years ago
Georgi Gerganov
5dc74e3aff
Update README.md
2 years ago
Georgi Gerganov
ac8ef34039
Update README.md
2 years ago
Mikhail Grigorev
b26345cc7b
Added for Windows implemenated script download-ggml-model.cmd
2 years ago
Mikhail Grigorev
8dac3c6e10
Fixed sched_yield
2 years ago
Mikhail Grigorev
6417e59aad
Implemenated sched_yield function for Windows
2 years ago
Georgi Gerganov
dc12994603
Update README.md
2 years ago
Georgi Gerganov
b0f2aa0ea6
Update README.md
2 years ago
Georgi Gerganov
57fb46f307
main : add option for word-leve timestamps (very experimental)
2 years ago
Georgi Gerganov
5a9e4260a6
stream : add "--capture" option to select capture device (ref #10 )
2 years ago
Georgi Gerganov
eba62e0fa1
close #113 : fix struct whisper_token_data
2 years ago
Georgi Gerganov
69bdb6624a
minor : update whisper.js
2 years ago
Georgi Gerganov
12fb303d9d
whisper.wasm : update system info print
2 years ago
Georgi Gerganov
234f414652
ref #5 : update CMake for Windows build
...
- __AVX2__ should already be defined due to /arch:AVX2
- _CRT_SECURE_NO_WARNINGS should be defined both for shared and static lib
2 years ago
Georgi Gerganov
014a119052
minor : fix multiple definitions of to_timestamp()
2 years ago
Georgi Gerganov
dec40be58f
parallel : print time of audio boundaries + fix timings
2 years ago
Georgi Gerganov
e5044f87d9
ggml : fix barrier
2 years ago
Georgi Gerganov
2827cbbbe8
main : merge parallel example in main
2 years ago
Georgi Gerganov
0b2dc3c82c
parallel : working
2 years ago
Georgi Gerganov
a272f10b2e
ggml : fix thread-safety of ggml_init and ggml_free
2 years ago