Latency
The ANE path is consistently faster than the prior GPU path on these clips, with p50 results around 92-152 ms.
Parakey keeps its latency claims tied to a reproducible benchmark in experiments/swift-bench/. Numbers below are p50 across five trials per backend, first inference excluded.
| Clip | Duration | FluidAudio on ANE | Prior GPU path | Speedup |
|---|---|---|---|---|
| short-clean | 2.50 s | | | 1.57× |
| medium-clean | 3.99 s | | | 1.83× |
| disfluent | 5.31 s | | | 1.97× |
| longer-technical | 9.49 s | | | 1.97× |
Machine-readable results: benchmarks/results.json. Reproduction docs: experiments/swift-bench.
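
As a rough illustration of the measurement described above, the following Python sketch times a backend and reports a p50. It is not the code in experiments/swift-bench: the function names `p50_latency_ms` and `speedup` are hypothetical, and it assumes "first inference excluded" means one untimed warm-up call followed by five timed trials.

```python
import statistics
import time
from typing import Callable


def p50_latency_ms(run_inference: Callable[[], object], trials: int = 5) -> float:
    """Median wall-clock latency in ms over `trials` timed runs.

    Assumption: one untimed warm-up call stands in for the excluded
    first inference (model load, compilation, cache warm-up).
    """
    run_inference()  # warm-up, not timed
    samples = []
    for _ in range(trials):
        start = time.perf_counter()
        run_inference()
        samples.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(samples)


def speedup(baseline_ms: float, candidate_ms: float) -> float:
    """How many times faster the candidate is than the baseline."""
    return baseline_ms / candidate_ms
```

Under this reading, the Speedup column would correspond to `speedup(gpu_p50, ane_p50)` for each clip, where `gpu_p50` and `ane_p50` are the per-backend results of `p50_latency_ms`. Using the median rather than the mean keeps a single slow trial from skewing the reported number.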
Both backends produced essentially identical transcripts on the synthetic test audio, including the same TTS artifacts.
The benchmark measures compute latency, not energy. Power measurement is left as a future improvement to the benchmark.