Skip to content

ElevenLabs, recognized as a top-tier AI voice company, recently introduced its speech recognition model, scribe_v1, capable of transcribing audio into text across 99 languages.

The free tier offers a generous allowance, supporting single uploads of audio or video files up to 1GB.

Using in Video Translation Software

  1. Update your software to version v3.59+

  2. Create an API key on this page: https://elevenlabs.io/app/settings/api-keys

  3. In the video translation software, go to Menu--TTS Settings--Elevenlabs.io and enter the copied API key, then save.

  4. Select Elevenlabs.io in the speech recognition channel to start using it.

Using on the Web

  1. Go to https://elevenlabs.io/app/speech-to-text. If you don't have an account, register with your email. No phone verification, card binding, or top-up is required.
  2. After logging in, click Speech to text on the left, as shown below.

  1. After the transcription is complete, click on the displayed name to enter the transcription results page.