ElevenLabs icon

ElevenLabs

WIP

Overview

The "Generate Voice" operation of the Voice resource in this node allows users to create a synthetic voice sample based on specified speaker characteristics such as gender, accent, and age. It generates a temporary voice ID along with an audio sample that can be previewed. If the generated voice is satisfactory, users can then call a separate "Create Voice" operation using the returned voice ID to save or use the voice permanently.

This node is useful for applications involving text-to-speech synthesis where customized voices are needed, such as:

  • Creating personalized voice assistants.
  • Generating audio content with specific voice traits.
  • Testing different voice styles before committing to a final voice creation.

Example: Generate a female British-accented young adult voice sample reading a custom text snippet, then decide whether to save it.

Properties

Name Meaning
Gender Name or ID The gender of the speaker. Choose from a predefined list or specify an ID dynamically via expressions.
Accent Name or ID The accent of the speaker. Choose from a predefined list or specify an ID dynamically via expressions.
Age Name or ID The age group of the speaker. Choose from a predefined list or specify an ID dynamically via expressions.
Additional Fields A collection of optional fields:
Binary Name: Change the output binary data property name (default: "data").
File Name: Change the output file name (default: "voice").
Sample Text: Text used for voice generation.
Accent Strength: Numeric value controlling how strong the accent should be (range 0.3 to 2, default 1).

Additionally, there are two notice properties shown only for this operation:

  • A general beta notice about the node's development status and links for more info/support.
  • A description explaining that this operation generates a voice sample and returns a generated voice ID.

Output

The node outputs JSON data containing at least:

  • generated_voice_id: An identifier for the generated voice sample.
  • Audio sample data representing the synthesized speech of the provided text with the selected voice parameters.

If configured, the node also outputs binary data containing the audio file of the generated voice sample. The binary data's property name and file name can be customized via input properties.

Dependencies

  • Requires an API key credential for authentication with the ElevenLabs API service.
  • Network access to the ElevenLabs API endpoint (https://api.elevenlabs.io/v1).
  • No other external dependencies are indicated.

Troubleshooting

  • Common issues:

    • Invalid or missing API key will cause authentication failures.
    • Providing unsupported or incorrect gender/accent/age IDs may result in errors or no voice generated.
    • Accent strength values outside the allowed range (0.3 to 2) might cause validation errors.
    • Empty or excessively long sample text could lead to unexpected results or API rejections.
  • Error messages:

    • Authentication errors: Check that the API key credential is correctly set up.
    • Validation errors on input parameters: Verify that gender, accent, age, and accent strength values are valid and within allowed ranges.
    • Network or API errors: Ensure stable internet connection and that the ElevenLabs API service is operational.

Links and References

Discussion