Process + guide

How written work becomes finished audio.

MoonLion’s Audio Lab turns books, scripts, lessons, business content, and creator material into directed audio through a repeatable quality workflow: source intake, performance-score preparation, voice testing, controlled generation, human QA, mastering, and delivery.

Start simple

The request form only needs name, email address, project title, what you are interested in, and a short message. Details can come after the first conversation.

Hear before scale

A Bronze or Silver starter sample can test voice fit, pacing, tone, and production style before larger production.

Protect before work

Rights, AI narration authorization, voice consent, payment, revisions, and delivery terms are confirmed in writing before production begins.

Terminology

MoonLion’s Audio Lab terms you may hear during production.

Performance score

The recording-ready version of the text. It defines voice identity, speaker clarity, pacing, tags, segmentation, and QA notes.

Identity anchor

A short voice description placed at the start of a production block so the voice model knows the accent, range, persona, and quality target.

Scene director note

A brief note that names the point of view, scene type, emotional beats, stability mode, and guidance target.

Speaker key

A clear map of narrator and character voices so dialogue turns are not confusing during production.

Natural / Creative mode

Natural supports clean narration. Creative supports vulnerability, conflict, intimacy, suspense, action, and emotional shifts. Robust is avoided for MoonLion’s Audio Lab fiction.

Tag leakage

When an emotional tag accidentally carries into a neutral line. Good segmentation and restrained tags prevent this.

Production path

The full quality workflow.

01

Request

The short request form starts the conversation. Optional details help with scoping but are not required.

02

Scope

MoonLion’s Audio Lab clarifies format, audience, word count, voices, content rating, timeline, SFX needs, and platform goals.

03

Source intake

The stable text, pronunciation notes, tone references, speaker needs, and usage notes are gathered after the next step is clear.

04

Starter sample

A Bronze or Silver starter can test the hardest performance shift before larger generation work.

05

Agreement

Rights, AI narration approval, scope, payment, revisions, voice consent, and delivery terms are confirmed.

06

Performance score

The text is prepared for the ear with identity anchors, scene notes, speaker keys, segmentation, tags, and SFX cues where useful.

07

Directed generation

Projects are generated in controlled sections with improvement passes where needed.

08

QA + mastering

Human listening checks identity drift, tag leakage, artifacts, breath realism, pacing, speaker clarity, and listener fatigue before delivery.

What the client provides
  • Source textBook, excerpt, article, newsletter, or script.
  • Project contextGenre, audience, tone, content rating, series/backlist needs, and final goal.
  • Pronunciation notesNames, places, fantasy terms, brand words, accents, and invented language.
  • Approval checkpointsFeedback on the audition, pilot, voice direction, and final delivery scope.
What MoonLion’s Audio Lab handles
  • Voice directionVoice path, narrator identity, speaker map, and performance feel.
  • Script shapingNormalization for the ear, pacing, tags, SFX cues, and clean segmentation.
  • ProductionDirected generation, regeneration, editing, QA listening, mastering, and file preparation.
  • Delivery supportFinal files, launch assets by scope, and platform-conscious notes where applicable.
Script preparation

Highest-quality audio starts before generation.

Normalize for the earNumbers, abbreviations, symbols, times, money, and visual formatting are prepared so the narrator does not stumble.
Use audible tags onlyTags must describe something the voice can perform: breath, pacing, emotion, vocal quality, hesitation, urgency, warmth, or silence.
Use the two-word runwayPerformance tags are placed before the moment they affect so the voice has time to prepare the breath and mouth-feel.
Segment by emotional densityNeutral narration can run longer. Intimacy, tension, action, and emotional spikes stay shorter to prevent drift and artifacts.
Keep SFX intentionalSound effects support scene openings, transitions, environment, or dramatic impact. They should not turn every movement into a cue.
Quality controls

What MoonLion’s Audio Lab ready means.

Identity lock

The narrator still sounds like the same person across narration, dialogue, intimacy, urgency, and recovery.

Subtle differentiation

Characters shift rhythm, vocal weight, and mouth-feel without becoming cartoon voices.

Breath realism

Breaths and micro-sounds feel physical, not digital or pasted on.

No tag leakage

An emotional direction does not contaminate unrelated neutral narration.

No synthetic artifacts

Metallic buzz, robotic jumps, digital tearing, and strained artifacts are fixed before delivery.

Long-form listenability

The final question is simple: would a listener keep their headphones on for four more hours?

Next step

Start with the project shape.

Once the source material, rights, voice direction, and delivery path are clear, the project can move into Bronze, Silver, Gold, Platinum, or custom scope with fewer surprises.