@@ -6,7 +6,8 @@ It can be used as a reference for using the `whisper.cpp` library in other proje
  ```
  ./build/bin/whisper-cli -h

- usage: ./build-pkg/bin/whisper-cli [options] file0.wav file1.wav ...
+ usage: ./build/bin/whisper-cli [options] file0 file1 ...
+ supported audio formats: flac, mp3, ogg, wav

  options:
  -h, --help [default] show this help message and exit
@@ -24,6 +25,7 @@ options:
  -wt N, --word-thold N [0.01 ] word timestamp probability threshold
  -et N, --entropy-thold N [2.40 ] entropy threshold for decoder fail
  -lpt N, --logprob-thold N [-1.00 ] log probability threshold for decoder fail
+ -nth N, --no-speech-thold N [0.60 ] no speech threshold
  -tp, --temperature N [0.00 ] The sampling temperature, between 0 and 1
  -tpi, --temperature-inc N [0.20 ] The increment of temperature, between 0 and 1
  -debug, --debug-mode [false ] enable debug mode (eg. dump log_mel)
@@ -50,12 +52,13 @@ options:
  -dl, --detect-language [false ] exit after automatically detecting language
  --prompt PROMPT [ ] initial prompt (max n_text_ctx/2 tokens)
  -m FNAME, --model FNAME [models/ggml-base.en.bin] model path
- -f FNAME, --file FNAME [ ] input WAV file path
+ -f FNAME, --file FNAME [ ] input audio file path
  -oved D, --ov-e-device DNAME [CPU ] the OpenVINO device used for encode inference
  -dtw MODEL --dtw MODEL [ ] compute token-level timestamps
  -ls, --log-score [false ] log best decoder scores of tokens
  -ng, --no-gpu [false ] disable GPU
  -fa, --flash-attn [false ] flash attention
+ -sns, --suppress-nst [false ] suppress non-speech tokens
  --suppress-regex REGEX [ ] regular expression matching tokens to suppress
  --grammar GRAMMAR [ ] GBNF grammar to guide decoding
  --grammar-rule RULE [ ] top-level GBNF grammar rule name
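
For anyone scripting against the updated help text, here is a minimal sketch of how the changed options might be assembled into a command line. It only builds an argument list and does not run anything. The helper name `build_whisper_cmd` and the sample path `samples/jfk.mp3` are hypothetical and not part of `whisper.cpp`. The extension check is an assumption of this sketch, based on the "supported audio formats" line; the CLI itself may well detect formats differently. The flags (`-m`, `-f`, `-nth`, `-sns`), the default model path, and the binary path are taken from the help output in this diff.

```python
from pathlib import Path

# Extensions taken from the "supported audio formats" line in the help text.
# Checking by extension is an assumption of this sketch, not documented CLI behavior.
SUPPORTED_EXTENSIONS = {".flac", ".mp3", ".ogg", ".wav"}


def build_whisper_cmd(audio_path, model="models/ggml-base.en.bin",
                      no_speech_thold=None, suppress_nst=False,
                      binary="./build/bin/whisper-cli"):
    """Return an argv list for whisper-cli (hypothetical helper)."""
    ext = Path(audio_path).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported audio format: {ext or '(none)'}")

    cmd = [binary, "-m", model, "-f", str(audio_path)]
    if no_speech_thold is not None:
        # -nth N, --no-speech-thold N: listed in the help with a default of 0.60
        cmd += ["-nth", str(no_speech_thold)]
    if suppress_nst:
        # -sns, --suppress-nst: suppress non-speech tokens
        cmd.append("-sns")
    return cmd


if __name__ == "__main__":
    # Placeholder input path; substitute a real audio file.
    print(" ".join(build_whisper_cmd("samples/jfk.mp3",
                                     no_speech_thold=0.6,
                                     suppress_nst=True)))
```

The resulting list can be passed to `subprocess.run` as-is. Keeping it as a list rather than a single string avoids shell quoting problems with file names that contain spaces.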