groq

N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

Package Information

Downloads: 43 weekly / 149 monthly

Latest Version: 0.2.0

Author: [Seu Nome]

Available Nodes

Groq Speech to Text

Interact with Groq API for AI operations

Documentation

n8n-nodes-groq

This is an n8n community node package that provides integration with the Groq API for Speech-to-Text functionality.

Features

Speech to Text: Convert audio files to text using Groq's Whisper models
Multiple input formats: Support for binary data and audio URLs
Advanced models: Whisper Large V3, Whisper Large V3 Turbo, and Distil Whisper Large V3 EN
Flexible response formats: Plain text, JSON, and detailed JSON with timestamps
Advanced settings: Temperature control, language, prompts, and timestamp granularity

Installation

To use this community node, you need to install it in your n8n instance:

npm install n8n-nodes-groq

Configuration

Groq API Key: Get your API key from console.groq.com/keys
Credentials: Configure Groq API credentials in n8n with your API key

Usage

Speech to Text

The node allows you to convert audio to text using the following parameters:

Audio Input:
- Binary data (from previous nodes)
- Audio file URL
Model: Choose between Whisper Large V3 Turbo (fastest), Whisper Large V3 (high accuracy), or Distil Whisper Large V3 EN (optimized for English)
Response Format: Text, JSON, or detailed JSON
Language: Specify audio language (optional)
Prompt: Text to guide model style (optional)
Temperature: Control output randomness (0-1, optional)

Supported Audio Formats

The node supports the following audio formats:

WAV
MP3
M4A
FLAC
WEBM
OGG
OPUS

Example Usage

Use an "HTTP Request" or "Read Binary File" node to get an audio file
Connect to the Groq Speech to Text node
Configure the "Speech to Text" operation
Select "Binary Data" as audio input
Configure the model and other parameters as needed
Execute the workflow to get the transcription

Perfect for WhatsApp Integration

This node works seamlessly with WhatsApp voice messages:

Receive WhatsApp webhook with audio
Convert base64 audio to file using "Convert to File" node
Process with Groq Speech to Text
Send transcription back to WhatsApp

License

MIT

Support

For issues and suggestions, please open an issue in the project repository.

groqInstall