Smithery Logo
MCPsSkillsDocsPricing
Login
Smithery Logo

Accelerating the Agent Economy

Resources

DocumentationPrivacy PolicySystem Status

Company

PricingAboutBlog

Connect

© 2026 Smithery. All rights reserved.

    nuva-lab

    voice-clone

    nuva-lab/voice-clone
    AI & ML

    About

    SKILL.md

    Install

    Install via Skills CLI

    or add to your agent
    • Claude Code
      Claude Code
    • Codex
      Codex
    • OpenClaw
      OpenClaw
    • Cursor
      Cursor
    • Amp
      Amp
    • GitHub Copilot
      GitHub Copilot
    • Gemini CLI
      Gemini CLI
    • Kilo Code
      Kilo Code
    • Junie
      Junie
    • Replit
      Replit
    • Windsurf
      Windsurf
    • Cline
      Cline
    • Continue
      Continue
    • OpenCode
      OpenCode
    • OpenHands
      OpenHands
    • Roo Code
      Roo Code
    • Augment
      Augment
    • Goose
      Goose
    • Trae
      Trae
    • Zencoder
      Zencoder
    • Antigravity
      Antigravity
    ├─
    ├─
    └─

    About

    Clone a voice using qwen3-tts and generate speech from text

    SKILL.md

    Voice Clone Skill

    Use this skill to clone a speaker's voice and generate text-to-speech audio.

    Two-Step Process

    Step 1: Clone Voice (one-time)

    python skills/voice-clone/clone.py <audio_sample.wav> [--transcript "text"]
    

    Creates a speaker embedding file that can be reused.

    Step 2: Generate Speech

    python skills/voice-clone/speak.py <embedding.safetensors> "Text to speak"
    

    Generates audio using the cloned voice.

    Requirements

    • FAL_KEY in .env (fal.ai API key)
    • Voice sample: 10-30 seconds of clear speech (WAV/MP3)
    • Optional: Transcript of the sample for better quality

    Output

    • assets/outputs/voice_embeddings/<name>_embedding.safetensors - Reusable voice model
    • assets/outputs/audio/<name>_speech.wav - Generated audio

    Notes

    • qwen3-tts works best with Chinese speech samples
    • Cross-lingual cloning (Chinese voice → English speech) may have quality variations
    • Provide reference transcript for best quality
    Repository
    nuva-lab/vibecut
    Files