Commit fe61423

update s2t benchmarks
1 parent 3b098d7 commit fe61423

1 file changed

Lines changed: 2 additions & 2 deletions

docs/docs/02-benchmarks/inference-time.md

@@ -139,15 +139,15 @@ Average time for encoding audio of given length over 10 runs. For `Whisper` mode
 
 | Model | iPhone 17 Pro (XNNPACK) [ms] | iPhone 16 Pro (XNNPACK) [ms] | iPhone SE 3 (XNNPACK) [ms] | Samsung Galaxy S24 (XNNPACK) [ms] | OnePlus 12 (XNNPACK) [ms] |
 | ------------------ | :--------------------------: | :--------------------------: | :------------------------: | :-------------------------------: | :-----------------------: |
-| Whisper-tiny (30s) | 248 | 254 | 1145 | 435 | 526 |
+| Whisper-tiny (30s) | 89 | 93 | 403 | 277 | 260 |
 
 ### Decoding
 
 Average time for decoding one token in sequence of approximately 100 tokens, with encoding context is obtained from audio of noted length.
 
 | Model | iPhone 17 Pro (XNNPACK) [ms] | iPhone 16 Pro (XNNPACK) [ms] | iPhone SE 3 (XNNPACK) [ms] | Samsung Galaxy S24 (XNNPACK) [ms] | OnePlus 12 (XNNPACK) [ms] |
 | ------------------ | :--------------------------: | :--------------------------: | :------------------------: | :-------------------------------: | :-----------------------: |
-| Whisper-tiny (30s) | 23 | 25 | 121 | 92 | 115 |
+| Whisper-tiny (30s) | 6 | 6 | 40 | 28 | 25 |
 
 ## Text to Speech
 
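
To gauge the size of the change, the removed and added `Whisper-tiny (30s)` rows can be compared per device. The following is only a reviewer's sketch, not part of the commit: the names `DEVICES`, `ENCODING_MS`, `DECODING_MS`, and `speedups` are hypothetical, and the numbers are copied by hand from the two changed rows in each table of the diff.

```python
# Hypothetical helper for reviewing the diff; not part of the repository.
# Latencies (ms) are copied from the removed (-) and added (+) table rows.
DEVICES = [
    "iPhone 17 Pro",
    "iPhone 16 Pro",
    "iPhone SE 3",
    "Samsung Galaxy S24",
    "OnePlus 12",
]

ENCODING_MS = {"old": [248, 254, 1145, 435, 526], "new": [89, 93, 403, 277, 260]}
DECODING_MS = {"old": [23, 25, 121, 92, 115], "new": [6, 6, 40, 28, 25]}


def speedups(table):
    """Return the old/new latency ratio per device (above 1 means the new value is lower)."""
    return {
        device: old / new
        for device, old, new in zip(DEVICES, table["old"], table["new"])
    }


if __name__ == "__main__":
    for name, table in (("encoding", ENCODING_MS), ("decoding", DECODING_MS)):
        for device, ratio in speedups(table).items():
            print(f"{name:8s} {device:20s} {ratio:.2f}x")
```

A ratio is used rather than an absolute difference so that devices with very different baselines (for example iPhone SE 3 versus iPhone 17 Pro) can be compared on the same scale.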
