Georgi Gerganov
c30bffc8a5
ref #22 : add "duration" option
...
Can be used to partially process a recording
2 years ago
Georgi Gerganov
c71363f14c
examples : add simple script for generating Karaoke video
2 years ago
Georgi Gerganov
d42cf6d0df
Update README.md
2 years ago
Georgi Gerganov
ef47d77492
main : fix generated bash script
2 years ago
Georgi Gerganov
d5afebd37c
whisper : token-level timestamp refactoring ( #49 , #120 )
...
This turned out pretty good overall. The algorithm has been moved from
main.cpp to whisper.cpp and can be reused for all subtitles types. This
means that now you can specify the maximum length of the generated
lines. Simply provide the "-ml" argument specifying the max length in
number of characters
2 years ago
Georgi Gerganov
6fb98370ba
main : add some comments for the word-level timestamp algorithm
2 years ago
Georgi Gerganov
0729da9a3b
main : fix some edge cases for word-level timestamps
2 years ago
Georgi Gerganov
5dc74e3aff
Update README.md
2 years ago
Georgi Gerganov
ac8ef34039
Update README.md
2 years ago
Georgi Gerganov
dc12994603
Update README.md
2 years ago
Georgi Gerganov
57fb46f307
main : add option for word-leve timestamps (very experimental)
2 years ago
Georgi Gerganov
5a9e4260a6
stream : add "--capture" option to select capture device (ref #10 )
2 years ago
Georgi Gerganov
12fb303d9d
whisper.wasm : update system info print
2 years ago
Georgi Gerganov
2827cbbbe8
main : merge parallel example in main
2 years ago
Georgi Gerganov
0b2dc3c82c
parallel : working
2 years ago
Georgi Gerganov
85d6e1e1e7
main : fix sampling time + add max_context parameter
2 years ago
Georgi Gerganov
72e9cdd6bf
parallel : adding tool for parallel transformer inference
2 years ago
Georgi Gerganov
b89f8960ca
Update README.md
2 years ago
Georgi Gerganov
6f82320b05
Create README.md
2 years ago
Georgi Gerganov
2298310dd8
whisper.nvim : add helper script for the Neovim integration
2 years ago
Georgi Gerganov
8347a7bb6a
stream : few updates to make it compatible for Vim usage ( #99 )
2 years ago
Georgi Gerganov
ebb01b9e33
Print system info at start of program
2 years ago
Georgi Gerganov
2400660f3f
Print system info in main
2 years ago
Georgi Gerganov
a6c786d5dc
Update README.md
2 years ago
Georgi Gerganov
91dcf5f35b
Update README.md
2 years ago
Georgi Gerganov
113a4f06d8
Update README.md
2 years ago
Georgi Gerganov
47e78b7288
Update README.md
2 years ago
Georgi Gerganov
34bb3ab0cf
ggml : add system info functions
2 years ago
Georgi Gerganov
c6710efde2
refactoring : move main + stream in examples + other stuff
2 years ago
Georgi Gerganov
d4f94ce427
Update README.md
2 years ago
Georgi Gerganov
a52ee08c1e
objc : polishing the sample application
2 years ago
Georgi Gerganov
b41f4a90eb
Create README.md
2 years ago
Georgi Gerganov
bb1ee266d2
ios : whisper.objc example
2 years ago
Georgi Gerganov
3e69a6071d
Update README.md
2 years ago
Georgi Gerganov
f4aa01c2f8
Update README.md
2 years ago
Georgi Gerganov
6b45e37b2b
Update README.md and finalize the whisper.wasm example
2 years ago
Georgi Gerganov
491ecd7056
wip : polishing WASM example
2 years ago
Georgi Gerganov
e905c6f827
wip : initial WASM port
...
Works but it is very slow because no SIMD is used.
For example, jfk.wav is processed in ~23 seconds using "tiny.en" model
2 years ago