You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
27 lines
973 B
27 lines
973 B
# whisper.objc
|
|
|
|
Minimal Obj-C application for automatic offline speech recognition.
|
|
The inference runs locally, on-device.
|
|
|
|
https://user-images.githubusercontent.com/1991296/197385372-962a6dea-bca1-4d50-bf96-1d8c27b98c81.mp4
|
|
|
|
Real-time transcription demo:
|
|
|
|
https://user-images.githubusercontent.com/1991296/204126266-ce4177c6-6eca-4bd9-bca8-0e46d9da2364.mp4
|
|
|
|
## Usage
|
|
|
|
```java
|
|
git clone https://github.com/ggerganov/whisper.cpp
|
|
open whisper.cpp/examples/whisper.objc/whisper.objc.xcodeproj/
|
|
```
|
|
|
|
Make sure to build the project in `Release`:
|
|
|
|
<img width="947" alt="image" src="https://user-images.githubusercontent.com/1991296/197382607-9e1e6d1b-79fa-496f-9d16-b71dc1535701.png">
|
|
|
|
Also, don't forget to add the `-DGGML_USE_ACCELERATE` compiler flag in Build Phases.
|
|
This can significantly improve the performance of the transcription:
|
|
|
|
<img width="1072" alt="image" src="https://user-images.githubusercontent.com/1991296/208511239-8d7cdbd1-aa48-41b5-becd-ca288d53cc07.png">
|