- “Audio Minutes” refers to the total duration of audio data streamed through the API, measured in minutes. Audio Minutes are calculated from the actual playback duration of the audio content at standard playback speed (1x, normal speed), regardless of the speed at which the audio data is transmitted. If audio data is streamed at an accelerated rate, up to the maximum speed allowed in the Documentation, Audio Minutes are still calculated from the standard playback duration.
- Unlike the DeepL API Pro for translating or improving text, the DeepL API Pro for speech to text charges Customer based on the total Audio Minutes streamed, irrespective of the connection duration or the speed of transmission. Any fractional Audio Minutes will be rounded up to the nearest whole minute for billing purposes.
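
The rounding rule above can be illustrated with a minimal sketch. The function name `billable_audio_minutes` is hypothetical and not part of any DeepL API, and the sketch assumes that rounding is applied once to the total playback duration passed in; whether rounding applies per stream or per billing period is not stated here.

```python
import math


def billable_audio_minutes(playback_seconds: float) -> int:
    """Return whole Audio Minutes for a playback duration measured at 1x speed.

    Hypothetical helper for illustration only; assumes rounding is applied
    once to the total duration given.
    """
    if playback_seconds < 0:
        raise ValueError("playback_seconds must be non-negative")
    # Fractional minutes are rounded up to the next whole minute.
    return math.ceil(playback_seconds / 60)


# 150 seconds of audio at 1x counts the same whether it is streamed
# in real time or at an accelerated rate.
print(billable_audio_minutes(150))  # 3
```

Under these assumptions, 150 seconds of audio is 2.5 minutes of playback and counts as 3 Audio Minutes, even if the data was transmitted in less time.
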
| Audio Stream | Realtime audio stream to be translated into text in up to 5 languages. |
| Input Languages (Audio) | The realtime audio stream you want to have translated can be in one of the following languages: |
| Target language (Text) | The language in which your translations are provided can be one of the following: |
| Language | The language detected in your audio. |
| Text | The translated text(s) and the transcription of the source audio as text. |