AI generation is temporarily down. We're working to bring it back, sorry for the inconvenience. Your existing content is unaffected.

Docs Home /Scene Mode /Overview

⌘K

Scene Mode

Scene Mode is the script-to-video surface. Paste a script, click Generate, and Fawna chunks it into scenes, generates an image for each, adds narration, and produces a finished video. When the output is close, you can hand it to the Editor for fine control. Minimum effort, maximum pacing.

When to use Scene Mode

You already have a script and want a video with a narrator reading it.
You care about pacing and narrative flow more than shot-by-shot camera work.
You want a first-draft video in minutes, not hours.
You are producing educational, explainer, or voice-driven content where each beat is a scene.

The flow at a glance

Create a storyboard. Paste your script, pick an aspect ratio, pick a voice (or a cast of voices). See Creating a storyboard.
Review the scene split. Fawna breaks the script into scenes based on pacing. Each scene gets its own card in a grid.
Generate assets. Bulk-generate images, narration audio, and motion video for every scene. Re-roll individual scenes until each one lands.
Play it back. The storyboard player stitches every scene with its audio. Preview from start to finish.
Import to the editor if you want to cut, add transitions, layer audio, or add overlays. See Importing to the editor.

The workspace

Scene Mode opens with a preview player on top and a scene grid underneath. The sidebar on the right toggles between two contexts:

Storyboard panel: global settings, script, character list, model defaults.
Scene panel: edits for the scene you have currently selected. Text, image tier, motion model, speaker, placement.

Click any scene card in the grid to focus it. The preview scrubs to that scene and the sidebar switches to the scene panel.

Scenes are the unit of work

A scene is one beat of the script. It has a text chunk, an image, an audio clip of the narrator reading the chunk, and optionally a motion video generated from the image. Every scene is independent: regenerating one does not affect the others.

Fawna picks scene length based on scenes-per-minute, a pacing dial. Default is 10, which gives roughly 6 seconds of finished video per scene. You can adjust this at storyboard creation.

Scene Mode vs Editor

Need	Use
Script to finished video fast	Scene Mode
Multiple overlapping audio tracks	Editor
Cross-fades and transitions	Editor
Text overlays, lower thirds	Editor
Specific trim down to frame	Editor
Different aspect per scene	Neither (one aspect per storyboard)
Fine shot-by-shot control	Editor
30-scene narrative in 20 min	Scene Mode, then import to Editor for polish

Typical workflow: draft in Scene Mode, import to the Editor, polish there. Scene Mode handles the coarse shape, Editor handles the detail. The two are decoupled: edits you make in the Editor do not push back to Scene Mode.

Where to go next

Creating a storyboard: fields, reference images, pacing.
Narrators and voices: single vs multi-narrator, cast rows, voice presets.
Editing scenes: per-scene text, image, motion, and audio controls.

← Previous

Fawna Compose

Creating a storyboard