Overview
This node integrates with the Dify API to perform several AI-powered operations, including converting audio files to text. The "Audio to Text" operation allows users to upload an audio file in binary form and receive a transcription of its content. This is useful for scenarios such as transcribing interviews, meetings, podcasts, or any audio recordings into editable text.
Practical examples:
- Automatically transcribe customer support calls for analysis.
- Convert voice notes into text for easier documentation.
- Generate subtitles or captions from video/audio content.
Properties
| Name | Meaning |
|---|---|
| User ID | Unique identifier for the user making the request. |
| Binary Property | Name of the binary property that contains the audio file to be transcribed (e.g., "data"). |
Output
The node outputs a JSON object containing the transcription result returned by the Dify API. The exact structure depends on the API response but generally includes the recognized text and possibly metadata about the transcription.
If the node is used with other operations (not requested here), it may output different structures, but for "Audio to Text," the main output is the transcription JSON.
The node does not output binary data for this operation; it only consumes binary input and returns JSON output.
Dependencies
- Requires an active connection to the Dify API via an API key credential.
- The node expects the Dify API base URL and API key to be configured in the credentials.
- The audio file must be provided as binary data within the specified binary property of the input item.
Troubleshooting
Error: No binary data property "X" on item!
This error occurs if the specified binary property does not exist on the input item. Ensure that the binary data is correctly attached to the input under the given property name.API Request Failures
If the API key is invalid or missing, or if there are network issues, the node will fail to connect to the Dify API. Verify that the API key credential is correctly set up and that the network connection is stable.Unsupported Audio Format
If the audio file format is not supported by the Dify API, the transcription may fail or return errors. Confirm that the audio file is in a supported format like MP3, M4A, WAV, WEBM, or AMR.
Links and References
- Dify API Documentation (Assumed official docs for reference)
- n8n Documentation on Working with Binary Data