Overview
This node retrieves the transcript of a YouTube video given its video ID or URL. It is useful for scenarios where you want to extract spoken content from videos for further processing, such as generating subtitles, analyzing speech content, or creating searchable text archives. For example, you can input a YouTube video URL and get back the full transcript text along with a subtitle file in SRT format.
Properties
| Name | Meaning |
|---|---|
| Youtube URL | The ID or full URL of the YouTube video to transcribe. This can be either the video ID itself or a complete YouTube URL. |
Output
The node outputs JSON data with the following structure:
videoId: The extracted YouTube video ID.transcript: A plain text string containing the full transcript of the video, concatenated from all transcript segments.srt: A string representing the transcript formatted as an SRT subtitle file, including timestamps for each segment.
No binary data output is produced by this node.
Dependencies
- This node depends on an external library that fetches YouTube transcripts.
- No explicit API keys or authentication tokens are required since it uses publicly available transcript data.
- Requires internet access to fetch transcript data from YouTube.
Troubleshooting
- Common issues:
- Invalid or malformed YouTube URL or video ID will cause the node to fail to fetch the transcript.
- Videos without available transcripts (e.g., no captions or restricted content) will result in errors or empty results.
- Error messages:
- Errors related to fetching the transcript typically indicate network issues or unavailable transcripts.
- To handle failures gracefully, enable the "Continue On Fail" option in the node settings to avoid stopping the workflow on errors.