Generator · 100% free, no signup · Updated May 2026

Outline + script for your next YouTube video in 30 seconds.

A free YouTube script generator that produces a structured 5-section script: hook (first 8s), context, body (3-5 beats), mid-video re-hook at 50%, and CTA. Calibrated to YouTube long-form pacing (~150 words/min) and the 50% retention cliff.

Type a topic + pick a video length (Short / 5 min / 10 min / 20+ min) + style (talking head / tutorial / story / contrarian). The generator outputs a 5-beat structure with retention triggers placed where they actually matter.

Your video

Length
Style

Your script outline

═══ COLD OPEN (0:00-0:08) — Hook ═══
[BLUNT CLAIM about the topic, 1 sentence]

═══ CONTEXT (0:08-0:25) — Why this matters ═══
Quick context on Why I dropped 4 features from my SaaS this week — why someone watching should care. Reference a stat, a recent event, or a pain point. Land the value promise: "how to focus your roadmap by killing what doesn't move the needle".

═══ BEAT 1 (1min-2min) — First point ═══
Make ONE clear point. Open with a concrete example or stat. Tell the lesson in plain language. 

═══ BEAT 2 (2min-3min) — Second point ═══
Make ONE clear point. Open with a concrete example or stat. Tell the lesson in plain language. [RE-ENGAGEMENT TRIGGER — pattern interrupt, B-roll cut, or question to viewer]

═══ BEAT 3 (3min-4min) — Third point ═══
Make ONE clear point. Open with a concrete example or stat. Tell the lesson in plain language. 

═══ MID-ROLL CTA (~halfway) ═══
Subscribe / newsletter / product mention. Keep it ≤8 seconds. Tie it to the value the viewer is already getting.

═══ CLOSE — Tie-up + next action ═══
Recap the 3 points in one sentence each. End with a specific next action: "Try this next [WEEK/MONTH] and tell me how it goes." Or: "Watch [LINKED VIDEO] to go deeper on [SUBTOPIC]."

Who this is for

YouTube creators stuck staring at a blank doc who need the skeleton of a video before they start filming.

The problem this solves

Most "AI YouTube scripts" output a wall of text that ignores how YouTube retention actually works. The 5-beat structure is what every retention-optimized YouTube channel actually uses: cold open hook → context → 3 body beats with re-engagement → mid-roll CTA → close. The generator gives you the skeleton; you fill in the lines.

YouTube long-form videos with a structured 5-section script and a mid-video re-hook at the 50% mark retain on average 18-28% more viewers through the end vs improvised single-hook videos of the same length.

Source: YouTube Creator Insider retention research 2024 + TubeBuddy structure-impact analysis

How to use it

  1. 01

    Pick video length

    Short (<60s), 5-minute, 10-minute, 20+ minute. The length shapes the number of body beats and where re-engagement triggers land.

  2. 02

    Pick style

    Talking head, tutorial, story, contrarian. The style shapes the cold-open pattern (hook style) and the rhythm of body beats.

  3. 03

    Type your topic + key claim

    One sentence topic + one sentence "what the viewer will leave knowing". These two anchor the entire script structure.

  4. 04

    Get the outline + copy

    Full 5-beat outline with retention triggers marked. Copy into your teleprompter or filming notes.

What you get

  • Cold-open hook + 3-5 body beats + CTA + close
  • Retention triggers placed at video-specific timestamps
  • Style-tuned rhythm (talking head ≠ tutorial pacing)
  • Plain-text export — drop into teleprompter, Notion, doc

Frequently asked

How long should a YouTube script be?

~150 words per minute of video. A 10-minute video script: ~1,500 words. The generator auto-scales body-beat count based on target duration.

Why does YouTube need a mid-video re-hook?

The "50% cliff" — viewers drop off heavily around the midpoint. A pattern-interrupt (surprising claim, callback, story restart) at 50% prevents this drop and lifts end-of-video retention by 18-28%.

Is this different from the video script formatter?

The formatter outputs word-count targets per section. The generator outputs actual script copy templates. Use them together: formatter sets structure, generator fills it.

Should I script word-for-word or use bullets?

Hook + CTA: word-for-word (read off-camera or memorize). Body: bullets you improvise from. Scripted body sounds robotic; scripted hooks earn the open.

Can I use this for YouTube Shorts?

Use the Video Script Formatter instead — Shorts have completely different pacing (90-140 words for 60 seconds) and no mid-video re-hook is needed below 4 minutes.

Related tools

Author + maintainer

— Founder of Clipflow. Indie SaaS builder shipping creator tools full-time since 2024. All tools and benchmarks on this page are reviewed quarterly. Last review: May 2026.

@clipflow on X · LinkedIn · hi@clipflow.to

Want this baked into your workflow?

Clipflow runs the whole repurposing pipeline — clip-finder, brand-voice captions, scheduler, multi-platform publish — for free on the Starter plan.

Try Clipflow free

Alle kostenlosen Tools