Common Errors FAQ | pyVideoTrans-Open Source Video Translation Tool -pyvideotrans.com github.com/jianchang512/pyvideotrans

There are many useful links in the Menu bar -- Help/About, such as model download addresses, CUDA configuration guides, etc. Check them out when you encounter problems.

1. Cannot Open After Double-Clicking sp.exe

The software is developed based on pyside6. The main interface uses a lot of Qt components, so loading may be slow, ranging from 5 seconds to 2 minutes. Please be patient.

If it still doesn't display completely after a few minutes, and doesn't show a splash screen, only a black window, it may be a program error. Check the console for errors. If it's a pre-packaged version, check if you only downloaded the update package. If so, please download the full package.

If you have tried all methods and waited a long time but still cannot open it, open the latest log file in the logs folder, check the error message, or submit the file to GitHub Issue or bbs.pyvideotrans.com to ask questions and upload the file.

2. Reduced Video Clarity

Some operations in the translation process involve transcoding. Transcoding will inevitably result in quality loss. If you want to minimize the loss, you can do the following:

The original video uses libx264 encoded mp4 video.
Set as shown in Menu--Tools--Advanced Settings.

3. Error During Translation Stage

A red error occurs during the translation process after the subtitles are recognized. This is generally a "network connection error" or an "account problem" with the translation channel used.

If you are using translation channels such as "Google", "Microsoft", "Gemini", etc., then it is likely a network connection problem. You need to enable a proxy and fill in the network proxy address provided in the proxy tool in the network proxy address text box.

If you are already using a proxy, but still report a network connection error, your proxy may be unavailable. Please correctly fill in the HTTP proxy IP and port number provided in the proxy tool.

If you are sure that the proxy is available, but still report an error, it is likely that the account is unavailable. For example, Gemini is not available in all countries. You can try switching the proxy node to another country.

Gemini, ChatGPT, and AzureGPT channels generally have request frequency limits. Excluding network problems and account problems, it may be that the request frequency exceeds the limit. At this time, you can open Menu--Tools--Advanced Settings and set "Pause After Translation/s" to 30 or a larger number.

3. Speech Recognition Accuracy is Too Low

In faster mode and OpenAI mode, using a larger model can improve accuracy. tiny is a small model with poor recognition effect. large-v3 is the largest model with the best recognition effect. All model download addresses: https://pyvideotrans.com/model
If the original video is in Chinese, you can try using zh_recogn for better results. Instructions: https://pyvideotrans.com/zh_recogn.html
Select "Keep Background Sound" to denoise in advance, which will improve the recognition effect. But note that if the video is very large, do not select it.

4. Model Download Address

Model download address: https://pyvideotrans.com/model

5. Is it Available on Win7?

Win7 system is not supported.

6. Prompt Missing python310.dll

You may have only downloaded the update patch package. The patch package cannot be used alone. Please download the complete package of 1.9G first, and then download the patch package to overwrite it after decompression.

7. Error During Merging Stage `ffprobe {}`

It is likely caused by spaces or single/double quotes in the original video name. Try renaming the original video. For example, the original name is D:/UNSW/2024 T2/BIOS 2061/Week 5 Amphibians, reptiles, and birds/_video_out/BIOS2061-5246_00069- Lecture 13 - Birds 1 'Origin of Birds' - Prof. Richard Kingsford- Part 2 - UNSW##BIOS2061-5246_00069- Lecture 13 - Birds 1 'Origin of Birds' - Prof. Richard Kingsford- Part 2 - UNSW.mp4. You can see that the video name is very complex and contains spaces, single quotes, etc. This kind of name is very prone to errors during processing. Please delete spaces and single quotes.

Is Docker Deployment Supported?

Not supported.

Can it Recognize Subtitle Text in the Video, i.e., OCR Recognition?

The principle of this software is to recognize human speech in the video and convert it into text subtitles. OCR subtitle recognition is not supported.

Can it be Called Through HTTP API Interface?

Not currently available. This feature may be added later.

Can New Languages be Added?

Not possible, because speech recognition for subtitles depends on the Whisper model, and the languages supported by this model are limited. Unsupported languages cannot be recognized.

Where to Download the Software

https://pyvideotrans.com/downpackage.html

Where to Download the Model

https://pyvideotrans.com/model.html

CUDA is Installed, but Still Cannot Use it

Possible reasons:

1: Built-in CUDA support requires version 11.8 or above. Check if your CUDA version is too low. 2: The graphics card driver is too old and needs to be updated. 3: cudnn is not installed. 4: The graphics card is not an NVIDIA card or is incompatible.

Does it Support Multi-Role Recognition and Dubbing?

Not supported. The recognized subtitles do not distinguish between speakers and roles. This function can be manually implemented through "Set Row Role".

CLI Command Line Mode Always Has Problems

CLI mode updates are delayed. Please use the old version.

Error During Translation Stage

Please change the translation channel or fill in the network proxy.

The Software Freezes After Double-Clicking and Cannot Start, Stuck on the Startup Screen

The software is large, please be patient. If it still cannot be opened after a long time, please try:

Close anti-virus software, security software, etc.
Confirm that the path and directory where the software is located are composed of English or numbers, and do not contain spaces, Chinese, special symbols, etc.

If it cannot be started after overwriting the upgrade package, please download the complete package.

If it is already a complete package, please be patient. If it still does not start after 2 minutes, try forcing it to close and reopen.

Which Translations are Supported?

Currently supports Microsoft Translator, Google Translate, Baidu Translate, Tencent Translate, DeepL Translate, ChatGPT Translate, AzureGPT Translate, Gemini Pro Translate, DeepLx Translate, OTT Offline Translation, FreeGoogle Translate, FreeChatGPT Translate.

Connection Error

The error "Connection error" indicates that the network connection failed. If you have not filled in the proxy in the software interface, please fill it in. It is impossible to directly connect to ChatGPT/Gemini/Google API in China, and you need to fill in the proxy. Note that being able to open the corresponding website in the browser does not mean that it can be used in the software. Please fill in the correct proxy address in the network proxy input box.

Whole all out of memory

The error "Whole all out of memory" indicates that the video memory is insufficient. Please use a smaller model, such as tiny small, etc.

Requested float16 compute type, but the target device or backend do not support efficient float 16 computation

The error shows that the current graphics card does not support this data type. The solution is to open Menu--Tools--Advanced Settings and find:

CUDA Data Type

Change the content to int8_float16

Then restart the software and execute. If the error still occurs, change it to float32.

How to Install

No installation is required. After downloading the complete package and decompressing it, double-click sp.exe to use it.

Why is it Reported as a Virus or Blocked?

This is software packaged using pyinstaller. It has not been digitally signed or certified by anti-virus software, so it may be misreported. Please add it to the trusted whitelist or close the security software. Or use source code deployment.

Which TTS Voiceovers are Supported?

edgeTTS / Azure AI / GPT-SoVITS / clone-voice / elevenlabs

Source Code Deployment Problem Description

The default uses ctranslate2 version 4.x, which only supports CUDA 12.x. If your CUDA version is lower than 12 and you cannot upgrade CUDA to 12.x, please execute the command to uninstall ctranslate2 and then reinstall it.

pip uninstall -y ctranslate2

pip install ctranslate2==3.24.0

You may encounter errors such as xx module not found. Please open requirements.txt, search for the xx module, and then remove == and the version number after it.

Is There Any Customer Service?

No, this is free software, without income or profit, and cannot be equipped with customer service.

Is it Free?

This is a free and open-source project. No fees are charged and use is free. The translation and TTS interfaces are charged by the respective API merchants and are not related to this project.

Can it be Used Commercially?

Individuals or companies can use it casually. However, if you want to integrate it into a commercial project, please follow the GPL-v3 open-source license.

1. Cannot Open After Double-Clicking sp.exe ​

2. Reduced Video Clarity ​

3. Error During Translation Stage ​

3. Speech Recognition Accuracy is Too Low ​

4. Model Download Address ​

5. Is it Available on Win7? ​

6. Prompt Missing python310.dll ​

7. Error During Merging Stage ffprobe {} ​

Is Docker Deployment Supported? ​

Can it Recognize Subtitle Text in the Video, i.e., OCR Recognition? ​

Can it be Called Through HTTP API Interface? ​

Can New Languages be Added? ​

Where to Download the Software ​

Where to Download the Model ​

CUDA is Installed, but Still Cannot Use it ​

Does it Support Multi-Role Recognition and Dubbing? ​

CLI Command Line Mode Always Has Problems ​

Error During Translation Stage ​

The Software Freezes After Double-Clicking and Cannot Start, Stuck on the Startup Screen ​

Which Translations are Supported? ​

Connection Error ​

Whole all out of memory ​

Requested float16 compute type, but the target device or backend do not support efficient float 16 computation ​

How to Install ​

Why is it Reported as a Virus or Blocked? ​

Which TTS Voiceovers are Supported? ​

Source Code Deployment Problem Description ​

Is There Any Customer Service? ​

Is it Free? ​

Can it be Used Commercially? ​