Overview
This node processes Douyin (Chinese TikTok) videos by either extracting basic video information or converting the video's audio into text transcripts. The "提取文本" (Extract Text) operation specifically converts the speech in a Douyin video into text using selectable speech recognition models.
Common scenarios for this node include:
- Automatically generating subtitles or captions for Douyin videos.
- Extracting spoken content from videos for analysis, indexing, or search.
- Archiving or repurposing video audio content as text.
Practical example: Given a Douyin video URL or a text containing a share link, the node downloads the video, performs speech-to-text transcription with a chosen model, and outputs the recognized text for further processing or storage.
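
A share text usually wraps the link in other words, so it can help to see which URL it actually contains before handing it to the node. The following is a minimal sketch of such a pre-check, not the node's own parsing logic; the function name `extract_first_url` and the sample share text are hypothetical.

```python
import re
from typing import Optional

# Matches http(s) followed by printable, non-space ASCII characters,
# so the match stops at whitespace and at surrounding Chinese text.
URL_PATTERN = re.compile(r"https?://[\x21-\x7e]+")


def extract_first_url(share_text: str) -> Optional[str]:
    """Return the first URL found in a share text, or None if there is none."""
    match = URL_PATTERN.search(share_text)
    if match is None:
        return None
    # Drop trailing ASCII punctuation that often follows a pasted link.
    return match.group(0).rstrip(".,;!?)")


# Hypothetical share text, for illustration only.
sample = "看看这个视频 https://v.douyin.com/abc123/ 复制此链接"
print(extract_first_url(sample))
```

If the function returns `None`, the input most likely does not contain a usable link and the node can be expected to fail on it.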
Properties
| Name | Meaning |
|---|---|
| 视频URL或分享文本 (Video URL or Share Text) | The full URL of the Douyin video, or a text containing the share link to the video. |
| 语音识别模型 (Speech Recognition Model) | The speech recognition model used for transcription. Options: Paraformer-V1, Paraformer-V2, Fun-ASR. |
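
When the model name comes from an upstream step rather than the dropdown, it may be worth normalizing it first. The sketch below assumes the option labels listed above are also the accepted values, which the source does not confirm; `normalize_model` is a hypothetical helper.

```python
# Assumption: the displayed option labels are the values the node accepts.
SUPPORTED_MODELS = ("Paraformer-V1", "Paraformer-V2", "Fun-ASR")


def normalize_model(name: str) -> str:
    """Map a loosely formatted model name onto a supported option label."""
    wanted = name.strip().casefold()
    for model in SUPPORTED_MODELS:
        if model.casefold() == wanted:
            return model
    raise ValueError(
        f"Unsupported model {name!r}; expected one of: {', '.join(SUPPORTED_MODELS)}"
    )


print(normalize_model(" paraformer-v2 "))
```

Comparing case-insensitively and trimming whitespace catches the most common copy-and-paste mistakes while still rejecting names that are not in the list.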
Output
The node outputs JSON data containing the transcription of the video's audio. The exact structure depends on the underlying implementation, but it generally includes the text recognized from the video's speech.
No binary data output is indicated.
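
Because the exact output structure is not documented here, a downstream step may need to search for the recognized text rather than read a fixed field. The sketch below does a depth-first search over a parsed JSON item; the key names passed in (`text` in the example) and the sample item are hypothetical and should be replaced with whatever the node actually returns.

```python
from typing import Any, Optional, Sequence


def find_text(item: Any, candidate_keys: Sequence[str]) -> Optional[str]:
    """Return the first non-empty string stored under one of candidate_keys."""
    if isinstance(item, dict):
        # Prefer a direct hit at this level before descending further.
        for key in candidate_keys:
            value = item.get(key)
            if isinstance(value, str) and value.strip():
                return value
        for value in item.values():
            found = find_text(value, candidate_keys)
            if found is not None:
                return found
    elif isinstance(item, list):
        for element in item:
            found = find_text(element, candidate_keys)
            if found is not None:
                return found
    return None


# Hypothetical output item, for illustration only.
sample_item = {"result": {"text": "你好，世界"}}
print(find_text(sample_item, ["text"]))
```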
Dependencies
- Requires an API key credential for accessing the Douyin transcript service.
- Uses external speech recognition models referenced at: https://help.aliyun.com/zh/model-studio/recording-file-recognition
- Requires network access to download Douyin videos from the provided URLs or share texts.
Troubleshooting
Common issues:
- Invalid or malformed video URLs or share texts may cause failures in downloading or parsing the video.
- Network connectivity problems can prevent video retrieval or API calls.
- Using an unsupported or incorrect speech recognition model value may lead to errors.
- Missing or invalid API credentials will block access to the transcription service.
Error messages:
- "未知操作: <operation>" (Unknown operation) indicates that an unsupported operation was requested; ensure "提取文本" is selected.
- Errors related to video parsing or transcription usually stem from invalid input URLs or network issues.
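
In an error-handling branch, the two cases above can be told apart by the message prefix. This is a minimal sketch that assumes the error message begins with the "未知操作:" text exactly as shown; `explain_error` is a hypothetical helper, not part of the node.

```python
# Assumption: the node's message starts with this exact prefix.
UNKNOWN_OPERATION_PREFIX = "未知操作:"


def explain_error(message: str) -> str:
    """Turn a raw error message into a short troubleshooting hint."""
    if message.startswith(UNKNOWN_OPERATION_PREFIX):
        operation = message[len(UNKNOWN_OPERATION_PREFIX):].strip()
        return f'Unsupported operation "{operation}"; select "提取文本" (Extract Text).'
    # Everything else is treated as a parsing, transcription, or network problem.
    return "Check the input URL or share text and the network connection."


print(explain_error("未知操作: foo"))
```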
Resolutions:
- Verify the video URL or share text format.
- Confirm API credentials are correctly configured.
- Check network connectivity.
- Select a valid speech recognition model option.
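
The first two checks can be run before the node executes. The sketch below collects problems instead of failing on the first one; the host list is an assumption about what Douyin links typically look like, and `preflight_problems` is a hypothetical helper. It expects a plain URL, not a share text.

```python
from typing import List
from urllib.parse import urlparse

# Assumption: hosts commonly seen in Douyin links; extend as needed.
ASSUMED_DOUYIN_HOSTS = ("douyin.com", "iesdouyin.com")


def preflight_problems(video_url: str, api_key: str) -> List[str]:
    """Return a list of human-readable problems; an empty list means no issue was found."""
    problems = []
    parsed = urlparse(video_url.strip())
    host = (parsed.hostname or "").lower()
    if parsed.scheme not in ("http", "https"):
        problems.append("URL must start with http:// or https://")
    elif not any(host == h or host.endswith("." + h) for h in ASSUMED_DOUYIN_HOSTS):
        problems.append(f"unexpected host: {host or '(none)'}")
    if not api_key.strip():
        problems.append("API key is empty")
    return problems


print(preflight_problems("https://v.douyin.com/abc123/", "my-key"))
```

An empty list only means the inputs look plausible; it does not prove that the video exists or that the credential is accepted by the service.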
Links and References
- Speech recognition models documentation: https://help.aliyun.com/zh/model-studio/recording-file-recognition