Actions16
Overview
The "Voice changer" operation within the Speech resource of this ElevenLabs node allows users to transform an existing audio file by changing its voice characteristics. This is useful for applications such as creating voiceovers with different voices, anonymizing speakers, or generating creative audio effects by modifying the original voice in an audio clip.
Typical use cases include:
- Altering a recorded podcast or narration to sound like a different speaker.
- Creating character voices for games or animations.
- Enhancing privacy by changing the voice in sensitive recordings.
- Experimenting with audio content for entertainment or marketing purposes.
Users provide an input audio file (binary data), select a target voice, and configure optional parameters to control the voice transformation's style, stability, and output format.
Properties
| Name | Meaning |
|---|---|
| Binary Input Field | The name of the binary property containing the audio file to be transformed. This is required to specify which input binary data will be processed. |
| Voice ID | The identifier of the voice to apply to the input audio. Can be selected from a searchable list or entered manually by ID. |
| Additional Fields | A collection of optional settings: |
| - Binary Name | Change the output binary property's name (default: "data"). |
| - File Name | Change the output file name (default: "voice"). |
| - Streaming Latency | Optimize streaming latency at some cost to quality. Values range from 0 (no optimization) to 4 (max optimization with text normalizer off). |
| - Output Format | Format of the generated audio output. Options include various MP3 and PCM formats, and μ-law encoding. Default is MP3 44.1kHz 128kbps. |
| - Model Name or ID | Identifier of the model used for voice transformation. Selectable from a list or specified by ID. |
| - Stability | Controls voice stability; a number between 0 and 1. Higher values mean more stable voice characteristics. |
| - Similarity Boost | Controls how closely the output voice resembles the target voice; a number between 0 and 1. |
| - Style | Exaggerates the voice style; a number between 0 and 1. |
| - Speaker Boost | Boolean flag to activate speaker boost feature. |
| - Seed | Numeric seed for deterministic output. Using the same seed with the same input produces identical audio. Range: 0 to 4294967295. |
| - Enable Logging | Whether to enable logging for the request. Disabling logging results in zero retention mode (history features unavailable). |
| - Remove Background Noise | Boolean flag to remove background noise from the input audio before processing. |
Output
The node outputs the transformed audio as binary data under a configurable binary property (default named "data"). The binary data contains the audio file encoded in the selected output format (e.g., MP3 or PCM). Additionally, the output JSON may contain metadata about the response, but the primary output is the modified audio file ready for further use or download.
Dependencies
- Requires an API key credential for ElevenLabs API access.
- The node makes HTTP POST requests to the ElevenLabs API endpoint
https://api.elevenlabs.io/v1. - Proper configuration of the API key credential in n8n is necessary.
- The user must supply valid input audio data in binary form.
Troubleshooting
- Invalid Voice ID: If the selected voice ID does not exist or is invalid, the API will likely return an error. Verify the voice ID by selecting from the provided list or checking the correct ID.
- Missing Binary Input: The node requires a binary input field containing audio data. Ensure that the input binary property name matches the configured "Binary Input Field".
- Unsupported Audio Format: Input audio should be in a supported format. If the API rejects the input, convert the audio to a compatible format before processing.
- API Authentication Errors: Ensure the API key credential is correctly set up and has sufficient permissions.
- Latency Optimization Quality Tradeoff: Enabling higher streaming latency optimizations may degrade audio quality. Adjust the "Streaming Latency" setting accordingly.
- Logging Disabled: Disabling logging disables history features; enable it if you need to track usage or debug issues.