Pollinations.AI Audio

Generate audio (text-to-speech) or transcribe audio (speech-to-text) using Pollinations.AI

Overview

This node integrates with Pollinations.AI to perform two main operations: converting text to speech (Text to Speech) and transcribing audio to text (Speech to Text). It is useful for automating audio content generation from text or extracting text from audio files. For example, it can generate spoken audio for accessibility or content creation, or transcribe recorded meetings or voice notes into text for documentation.

Use Case Examples

  1. Convert a blog post or article text into an audio podcast using the Text to Speech operation.
  2. Transcribe customer support calls or interviews from audio files to text for analysis using the Speech to Text operation.

Properties

  • Audio Input - Specifies the source of the audio for transcription, either from binary data of a previous node or from a URL.
  • Audio URL - The URL of the audio file to transcribe, used when Audio Input is set to URL.
  • Binary Property - The name of the binary property containing the audio file, used when Audio Input is set to binary.
  • Audio Format - The format of the audio file to be transcribed, such as WAV, MP3, M4A, or OGG.
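The way these properties depend on each other can be sketched in a few lines. This is only an illustration, not the node's actual implementation: the field names (audio_input, audio_url, binary_property, audio_format) and the function resolve_audio_source are hypothetical stand-ins for the display names above, and the format set contains only the formats this page names, so the node may accept others.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Formats named under Audio Format; the real node may accept more.
SUPPORTED_FORMATS = {"wav", "mp3", "m4a", "ogg"}


@dataclass
class SpeechToTextParams:
    # Hypothetical field names mirroring the property display names.
    audio_input: str                       # "binary" or "url"
    audio_format: str                      # e.g. "mp3"
    audio_url: Optional[str] = None        # used when audio_input == "url"
    binary_property: Optional[str] = None  # used when audio_input == "binary"


def resolve_audio_source(params: SpeechToTextParams) -> Tuple[str, str]:
    """Return (kind, value) for the audio source, or raise ValueError."""
    if params.audio_format.lower() not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported audio format: {params.audio_format}")
    if params.audio_input == "url":
        if not params.audio_url:
            raise ValueError("Audio URL is required when Audio Input is URL")
        return ("url", params.audio_url)
    if params.audio_input == "binary":
        if not params.binary_property:
            raise ValueError("Binary Property is required when Audio Input is binary")
        return ("binary", params.binary_property)
    raise ValueError(f"unknown Audio Input: {params.audio_input}")
```

Only one of Audio URL or Binary Property matters for a given run, which is why the sketch checks the one selected by Audio Input and ignores the other.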

Output

Binary

Contains the generated audio file in MP3 format when using the Text to Speech operation.

JSON

  • text - The original text input for the Text to Speech operation.
  • voice - The voice used for speech synthesis in the Text to Speech operation.
  • audioUrl - The URL of the generated audio file in the Text to Speech operation.
  • transcription - The transcribed text from the audio input in the Speech to Text operation.
  • fullResponse - The full API response from Pollinations.AI for the transcription request.
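A downstream step might branch on which of these fields is present. The sketch below assumes that an output item carries only the fields of the operation that produced it, which this page does not state explicitly; summarize_item is a hypothetical helper, not part of the node.

```python
from typing import Any, Dict


def summarize_item(item_json: Dict[str, Any]) -> str:
    """Describe one output item using the JSON fields listed above."""
    # Speech to Text items are assumed to carry a "transcription" field.
    if "transcription" in item_json:
        text = (item_json.get("transcription") or "").strip()
        return f"transcription: {text}" if text else "transcription: <empty>"
    # Text to Speech items are assumed to carry "audioUrl" (and usually "voice").
    if "audioUrl" in item_json:
        voice = item_json.get("voice", "unknown voice")
        return f"speech ({voice}): {item_json['audioUrl']}"
    return "unrecognized item"
```

Treating an empty transcription as its own case makes it easier to spot the empty-result situation described under Troubleshooting.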

Dependencies

  • Requires a Pollinations.AI API token for authentication.

Troubleshooting

  • The error 'API token is required' indicates missing or invalid API credentials; ensure the Pollinations.AI API token is set correctly in the node's credentials.
  • Timeout errors may occur if the audio file is too large or the network is slow; try a smaller audio file or check network connectivity.
  • If transcription results are empty or incorrect, verify the audio format and quality, and ensure the audio data is provided correctly as either binary data or a URL.
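Several of these problems can be caught before the request is sent. The following is a rough pre-flight check under stated assumptions: preflight_issues is a hypothetical helper, the extension set covers only the formats this page names, and MAX_BYTES is an arbitrary threshold chosen for illustration, not a documented Pollinations.AI limit.

```python
import os
from typing import List

# Extensions for the formats this page names; the service may accept others.
KNOWN_EXTENSIONS = {".wav", ".mp3", ".m4a", ".ogg"}
# Assumed size threshold for illustration only; not a documented limit.
MAX_BYTES = 25 * 1024 * 1024


def preflight_issues(filename: str, size_bytes: int, api_token: str) -> List[str]:
    """Return a list of likely problems; an empty list means none were found."""
    issues: List[str] = []
    if not api_token.strip():
        issues.append("API token is missing")
    ext = os.path.splitext(filename)[1].lower()
    if ext not in KNOWN_EXTENSIONS:
        issues.append(f"unrecognized audio extension: {ext or '<none>'}")
    if size_bytes == 0:
        issues.append("audio file is empty")
    elif size_bytes > MAX_BYTES:
        issues.append("audio file may be large enough to time out")
    return issues
```

A check like this cannot judge audio quality or confirm that a token is valid; it only rules out the cheapest failure causes first.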

Links

  • Pollinations.AI - Official website for Pollinations.AI, the service used for text-to-speech and speech-to-text operations.
