Using the deepgram.com Speech Recognition API
Deepgram.com speech recognition API support was added after v2.92. This is a foreign AI service that gives you $200 upon registration, which is enough to use for a while.
- Open the website https://deepgram.com/, register, and log in to enter the console https://console.deepgram.com/
- After logging in, click the large green "Create API Key" in the console
The following figure pops up after clicking
Write a few English letters in the first text box, and then click "" at the bottom. The SK will then be displayed, remember to copy it, as shown in the figure below
- Open Menu--Speech Recognition Settings--Deepgram Window
API Key: Enter the key copied in the previous step in the API Key.
Silence Duration: You can keep the default 200, that is, 200ms. If the video to be recognized has a fast speech rate, you can appropriately reduce it to 150. If it is slow and there is a lot of silence, you can appropriately increase it to 500 or 800.
- Note: The Deepgram platform does not support Chinese well. Whether it is the subtitles directly returned after being processed by Deepgram, or the re-segmentation according to the word-level timestamps, there is a lack of punctuation marks, which leads to unsatisfactory subtitle segmentation. In order to optimize, the Ali Chinese punctuation recovery model is used to re-segment. Please select "Chinese Re-segmentation" in the software interface.