Qwen3 Studio

Feature Walkthrough

See It In Action

A full tour of every engine, the Batch Studio workflow, and the plugin system — from first launch to finished audio.

Three Creative Engines

Every Voice, Every Style, Every Scene

Three purpose-built synthesis engines for every production scenario — from scripted narration to zero-shot character creation.

🟢 Custom Voice

Command Pre-Trained Personas

Drive a library of professionally tuned vocal characters using plain-English style instructions. Output stays consistent and high-quality across takes, which suits narration, audiobooks, and dependable character voice-over.

Nine built-in personas, including Ryan, Aiden, Vivian, Serena, Eric, Dylan, and Sohee
Style Injection: "Speak softly", "Conspiratorial whisper", "Old radio announcer"
Style & Profile Manager to create, save, and toggle custom styles inline
Seed control for perfectly reproducible takes

🔵 Voice Design

Create Voices That Never Existed

Generate entirely new vocal identities from text descriptions alone. Define the body and the performance. The model constructs a unique vocal fingerprint from scratch — no reference audio required.

Two-field formula: Voice Description (the body) + Style Instruction (the performance)
"A 60-year-old gravelly smoker with a Southern drawl" — just describe it
Seed control to lock and reproduce exact vocal fingerprints
Export to Voice Clone for consistent cross-session rendering

🟣 Voice Clone

Precision Digital Replicas

Capture any voice from just 3–10 seconds of reference audio. Feed it any script and the model delivers that same voice, performing your direction. The integrated Prep Station handles reference transcription automatically.

Only 3–10 seconds of clean reference audio required
Integrated Whisper AI transcription for reference preparation
Speaker prompt caching — computed once per batch, not per block
|| delimiter for multi-segment long-form content rendering
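As a rough illustration of the last two points, the sketch below splits a long script on the || delimiter and builds the speaker prompt once for the whole batch. It is not Qwen3 Studio's actual code: the function names and the `build_prompt` / `synthesize` callables are hypothetical stand-ins for whatever the Voice Clone engine does internally.

```python
from typing import Any, Callable, List


def split_segments(script: str, delimiter: str = "||") -> List[str]:
    """Split a long-form script on the delimiter and drop empty segments."""
    return [part.strip() for part in script.split(delimiter) if part.strip()]


def render_batch(
    script: str,
    reference_audio: str,
    build_prompt: Callable[[str], Any],    # hypothetical: reference audio -> speaker prompt
    synthesize: Callable[[Any, str], Any], # hypothetical: (prompt, text) -> audio
) -> List[Any]:
    """Render every segment while computing the speaker prompt only once."""
    prompt = build_prompt(reference_audio)  # cached for the whole batch
    return [synthesize(prompt, segment) for segment in split_segments(script)]
```

The point of the design is that the expensive step (deriving the speaker prompt from the reference clip) sits outside the loop, so adding more segments only adds synthesis calls.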

Batch Studio

The Full Production Pipeline

A non-linear audio director for multi-voice scripts. Produce entire podcast episodes, game dialogue trees, or audiobook chapters in a single run.

Director-Level Control Over Every Block

Each script block carries its own speaker, engine, style, language, seed, temperature, and Top P. Mix any combination of engines and voices in a single scene; Auto-Switch handles the model transitions for you.
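One way to picture a block is as a small record holding the per-block settings listed above, which also makes a scene easy to round-trip through JSON. The sketch below is an assumption for illustration only: the `ScriptBlock` class, its field names, the `text` field, and every default value are hypothetical and do not describe the app's real scene file format.

```python
import json
from dataclasses import asdict, dataclass
from typing import List, Optional


@dataclass
class ScriptBlock:
    """Hypothetical per-block settings; names and defaults are assumptions."""
    speaker: str
    engine: str                 # e.g. "custom_voice", "voice_design", "voice_clone"
    text: str
    style: str = ""
    language: str = "English"
    seed: Optional[int] = None  # None means "pick a random seed"
    temperature: float = 0.9
    top_p: float = 1.0


def scene_to_json(blocks: List[ScriptBlock]) -> str:
    """Serialize a scene (an ordered list of blocks) to a JSON string."""
    return json.dumps([asdict(block) for block in blocks], ensure_ascii=False, indent=2)


def scene_from_json(payload: str) -> List[ScriptBlock]:
    """Rebuild the scene from a JSON string produced by scene_to_json."""
    return [ScriptBlock(**item) for item in json.loads(payload)]
```

Because each block carries its own engine and seed, regenerating one block only needs that block's record, not the rest of the scene.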

⚡ Per-Block Gen — Regenerate a single block without touching the rest of the scene
🎲 Multi-Take (×3) — Generate 3 variations silently, then pick the best; winning seed saved automatically
Seed Control — Pin an integer for deterministic, reproducible takes every time
Status Ledger — Grey → Blue (busy) → Yellow (review) → Green / Red approval workflow
🔍 Auto-Verify — Post-generation Whisper audit: silence scan + transcription fuzzy-match
Collapsible Blocks — Compact headers for long scripts; Save / Load entire scenes to JSON
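The Auto-Verify idea (silence scan plus transcription fuzzy-match) can be sketched with the standard library alone. This is not the app's implementation: the function names, the thresholds, and the use of `difflib` for the fuzzy match are assumptions, and the Whisper transcription step is represented only by a plain string passed in by the caller.

```python
import difflib
import re
from typing import Sequence


def normalize(text: str) -> str:
    """Lowercase, strip punctuation, and collapse whitespace before comparing."""
    return " ".join(re.sub(r"[^\w\s]", "", text.lower()).split())


def transcript_similarity(expected: str, transcribed: str) -> float:
    """Return a 0..1 similarity ratio between the script and the transcription."""
    return difflib.SequenceMatcher(None, normalize(expected), normalize(transcribed)).ratio()


def longest_silence(samples: Sequence[float], threshold: float = 0.01) -> int:
    """Return the longest run of consecutive samples quieter than the threshold."""
    longest = current = 0
    for sample in samples:
        if abs(sample) < threshold:
            current += 1
            longest = max(longest, current)
        else:
            current = 0
    return longest


def verify(expected: str, transcribed: str, samples: Sequence[float], sample_rate: int,
           min_similarity: float = 0.85, max_silence_seconds: float = 1.5) -> bool:
    """Pass only if the transcript matches closely and no silent gap is too long."""
    silence_seconds = longest_silence(samples) / sample_rate
    return (transcript_similarity(expected, transcribed) >= min_similarity
            and silence_seconds <= max_silence_seconds)
```

A block that fails either check would be a natural candidate for the review state rather than automatic approval.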

More Features

Everything Built In

From production utilities to developer tooling, Qwen3 Studio ships with a complete ecosystem for serious voice work.

🗂 Style & Profile Manager

Enable, disable, create, and inline-edit all custom styles and Voice Design profiles from a dedicated tab. Changes sync live to every dropdown instantly — no restart needed.

🔌 Modules Manager

GitHub-synced plugin hub with SHA-256 verification. Toggle features on/off without restarting. Pull the latest official plugins in one click, or ship your own headless extensions.
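For readers unfamiliar with SHA-256 verification, the general technique looks like the sketch below: hash the downloaded file and compare the result with a published digest before enabling the plugin. The function names are hypothetical, and where the expected digest comes from (for example a manifest in the plugin repository) is an assumption, not something this page specifies.

```python
import hashlib
import hmac


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Hash a file in chunks so large plugins never load fully into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_verified(path: str, expected_hex: str) -> bool:
    """Compare the file's digest with the expected one in constant time."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())
```

A plugin whose digest does not match would simply be refused rather than loaded.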

📝 Text Parser

Pre-process documents and scripts into clean, segment-ready input. Strips timestamps, normalises formatting, and splits at natural sentence boundaries for consistent rendering.
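A minimal sketch of that kind of pre-processing is shown below, assuming timestamps look like `[00:01:15]` or `(0:42)` and that sentences end in `.`, `!`, or `?`. The regular expressions and function name are illustrative assumptions, not the Text Parser's actual rules, and the naive sentence split will mis-handle abbreviations such as "Dr.".

```python
import re
from typing import List

# Assumed timestamp shapes: 0:42, 00:01:15, 00:01:15.250, optionally in [] or ().
TIMESTAMP = re.compile(r"[\[(]?\b\d{1,2}:\d{2}(?::\d{2})?(?:[.,]\d{1,3})?[\])]?")
SENTENCE_BREAK = re.compile(r"(?<=[.!?])\s+")


def parse_script(raw: str) -> List[str]:
    """Strip timestamps, normalise whitespace, and split into sentences."""
    without_timestamps = TIMESTAMP.sub(" ", raw)
    normalised = " ".join(without_timestamps.split())
    if not normalised:
        return []
    return SENTENCE_BREAK.split(normalised)
```

Each returned sentence can then be fed to an engine as its own segment, which keeps individual generations short.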

🎓 Interactive Tutorials

Built-in guided tutorials walk through each engine, the batch workflow, and advanced voice design techniques — right inside the app, without leaving the UI.

💡 Contextual Help

Every tab has an inline help panel with practical tips, tone recipes, and action tag references. Always one click away — no separate documentation window needed.

⚡ VRAM & Stability

Aggressive VRAM flush between every generation and take, real-time GPU memory indicator, meta-tensor safety guard, and an emergency Reset button that never hangs or crashes.

Audio Demo

Hear It For Yourself

These samples were generated locally using the Voice Clone engine on the 12Hz High-Fidelity architecture. No cloud. No API call.

Public Figure

English

"Look, people ask me all the time — they say 'Sir, how is your voice so clear?' And I tell them, it's Qwen Studio..."

Public Figure

Spanish

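"Y déjenme decirles algo más. Hablo español perfectamente. Nadie habla español mejor que yo..." (English: "And let me tell you something else. I speak Spanish perfectly. Nobody speaks Spanish better than I do...")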

Sir David Attenborough

English

"Here we observe the modern content creator in their natural habitat... utilizing the new high-fidelity architecture..."

Sir David Attenborough

Spanish

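"Y observen la facilidad con la que cambia de piel. Ahora habla en la lengua de Cervantes, conservando su elegancia natural..." (English: "And observe how easily it sheds its skin. Now it speaks in the language of Cervantes, retaining its natural elegance...")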

Humphrey Bogart

English

"Of all the GitHub repos in all the towns in all the world... she walks into mine. This is the one."

Humphrey Bogart

Spanish

"Escúchame bien, muñeca. Esto no es un juego. Esto es calidad de estudio local." (English: "Listen to me carefully, doll. This is not a game. This is local studio quality.")

Rosalía

Spanish

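"Yo me paso años perfeccionando mi voz... y esta IA local la clona en cuatro segundos. Cuatro. Tengo sentimientos encontrados." (English: "I spend years perfecting my voice... and this local AI clones it in four seconds. Four. I have mixed feelings.")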

Rosalía

Spanish

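"Mi manager me llamó muy alterado. Le dije que se tranquilizara... Luego le pregunté si sonaba mejor que yo en directo. Me colgó." (English: "My manager called me, very upset. I told him to calm down... Then I asked him whether it sounded better than me live. He hung up on me.")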

Gérard Depardieu

French

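"J'ai d'abord refusé — je suis un artiste, pas une machine. Puis on m'a dit: sans abonnement, sans nuage. J'ai ouvert un Bordeaux... et j'ai dit oui." (English: "At first I refused: I am an artist, not a machine. Then they told me: no subscription, no cloud. I opened a Bordeaux... and I said yes.")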

Sophia Loren

Italian

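"Hanno copiato la mia voce senza internet, senza pagare ogni mese — solo la GPU che lavora come un pazzo. Amico mio, questo è genio puro." (English: "They copied my voice without the internet, without paying every month, with only the GPU working like crazy. My friend, this is pure genius.")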

João Gilberto

Portuguese

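"Bossa Nova não é sobre gritar. É sobre o silêncio. Esta inteligência artificial entende isso... sussurra a minha voz. Mas onde está o meu violão?" (English: "Bossa nova is not about shouting. It is about silence. This artificial intelligence understands that... it whispers my voice. But where is my guitar?")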

The Godfather

English

"You come to me... into my browser... and you ask me to clone a voice. You don't even offer me a GPU. I will make you an audio file you cannot refuse."

Documentation

Everything You Need to Know

From quick start to advanced plugin development — fully documented and kept up to date with every release.

About the Developer

Hi, I'm Blues.

To me, good development is simply finding the best solution to a problem. Imagine you are in charge of finding the best way to get from your home to work. As a developer, I have to know all the options — whether that is walking, digging a tunnel, or taking a helicopter.

My task is to find the way that makes the journey as smooth as possible for the person traveling. I don't need to know how to build the helicopter to solve the problem; I just need to know exactly when to use it. Everything I learn from life helps me find a better way to get people where they need to go. And I am always open to listening and learning from anyone who thinks we should take a different path.

Feature Specification

Director's Guide

Plugin SDK
