Generator · 100% free, no signup · Updated May 2026
Outline + script for your next YouTube video in 30 seconds.
A free YouTube script generator that produces a structured 5-section script: hook (first 8s), context, body (3-5 beats), mid-video re-hook at 50%, and CTA. Calibrated to YouTube long-form pacing (~150 words/min) and the 50% retention cliff.
Type a topic, pick a video length (Short / 5 min / 10 min / 20+ min), and pick a style (talking head / tutorial / story / contrarian). The generator outputs a 5-beat structure with retention triggers placed at the points where viewers tend to drop off, such as the midpoint.
Your script outline
═══ COLD OPEN (0:00-0:08) — Hook ═══
[BLUNT CLAIM about the topic, 1 sentence]

═══ CONTEXT (0:08-0:25) — Why this matters ═══
Quick context on Why I dropped 4 features from my SaaS this week — why someone watching should care. Reference a stat, a recent event, or a pain point. Land the value promise: "how to focus your roadmap by killing what doesn't move the needle".

═══ BEAT 1 (1min-2min) — First point ═══
Make ONE clear point. Open with a concrete example or stat. Tell the lesson in plain language.

═══ BEAT 2 (2min-3min) — Second point ═══
Make ONE clear point. Open with a concrete example or stat. Tell the lesson in plain language.
[RE-ENGAGEMENT TRIGGER — pattern interrupt, B-roll cut, or question to viewer]

═══ BEAT 3 (3min-4min) — Third point ═══
Make ONE clear point. Open with a concrete example or stat. Tell the lesson in plain language.

═══ MID-ROLL CTA (~halfway) ═══
Subscribe / newsletter / product mention. Keep it ≤8 seconds. Tie it to the value the viewer is already getting.

═══ CLOSE — Tie-up + next action ═══
Recap the 3 points in one sentence each. End with a specific next action: "Try this next [WEEK/MONTH] and tell me how it goes." Or: "Watch [LINKED VIDEO] to go deeper on [SUBTOPIC]."
Who this is for
YouTube creators stuck staring at a blank doc who need the skeleton of a video before they start filming.
The problem this solves
Most AI-generated YouTube scripts are a wall of text that ignores how YouTube retention works. The 5-beat structure follows the pattern that retention-focused channels commonly use: cold open hook → context → 3 body beats with re-engagement → mid-roll CTA → close. The generator gives you the skeleton; you fill in the lines.
YouTube long-form videos with a structured 5-section script and a mid-video re-hook at the 50% mark retain, on average, 18-28% more viewers through to the end than improvised single-hook videos of the same length.
Source: YouTube Creator Insider retention research 2024 + TubeBuddy structure-impact analysis
How to use it
- 01 Pick video length
  Short (<60s), 5-minute, 10-minute, 20+ minute. The length shapes the number of body beats and where re-engagement triggers land.
- 02 Pick style
  Talking head, tutorial, story, contrarian. The style shapes the cold-open pattern (hook style) and the rhythm of body beats.
- 03 Type your topic + key claim
  A one-sentence topic plus a one-sentence "what the viewer will leave knowing". These two anchor the entire script structure.
- 04 Get the outline + copy
  Full 5-beat outline with retention triggers marked. Copy into your teleprompter or filming notes.
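For readers who want to see how length could shape the outline, here is a minimal sketch in Python. It is not the generator's actual code. The 0:08 hook and 0:25 context boundaries come from the sample outline above; the 30-second close, the `beat_count` thresholds, and every function and class name are assumptions made for illustration.

```python
from dataclasses import dataclass

HOOK_END_S = 8      # cold open runs 0:00-0:08 (from the sample outline)
CONTEXT_END_S = 25  # context runs 0:08-0:25 (from the sample outline)
CLOSE_S = 30        # assumed length of the close (hypothetical)


@dataclass
class Section:
    name: str
    start_s: float
    end_s: float


def beat_count(minutes: float) -> int:
    """Assumed mapping from length to body beats; the page only says 3-5."""
    if minutes <= 5:
        return 3
    if minutes <= 10:
        return 4
    return 5


def fmt(seconds: float) -> str:
    """Format seconds as m:ss."""
    s = int(round(seconds))
    return f"{s // 60}:{s % 60:02d}"


def build_outline(minutes: float) -> list[Section]:
    """Split a video into hook, context, evenly sized body beats, and close."""
    total = minutes * 60
    body_start, body_end = CONTEXT_END_S, total - CLOSE_S
    if body_end <= body_start:
        raise ValueError("video too short for this long-form structure")
    n = beat_count(minutes)
    beat_len = (body_end - body_start) / n
    sections = [
        Section("Cold open", 0, HOOK_END_S),
        Section("Context", HOOK_END_S, CONTEXT_END_S),
    ]
    for i in range(n):
        start = body_start + i * beat_len
        sections.append(Section(f"Beat {i + 1}", start, start + beat_len))
    sections.append(Section("Close", body_end, total))
    return sections


if __name__ == "__main__":
    for sec in build_outline(5):
        print(f"{sec.name:<10} {fmt(sec.start_s)}-{fmt(sec.end_s)}")
```

Splitting the body evenly is the simplest choice; a style-aware version could weight beats differently for a tutorial than for a story.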
What you get
- ✓ Cold-open hook + 3-5 body beats + CTA + close
- ✓ Retention triggers placed at video-specific timestamps
- ✓ Style-tuned rhythm (talking head ≠ tutorial pacing)
- ✓ Plain-text export — drop into teleprompter, Notion, doc
Frequently asked
How long should a YouTube script be?
Plan for about 150 words per minute of video, so a 10-minute script runs to about 1,500 words. The generator auto-scales body-beat count based on target duration.
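The arithmetic behind that answer can be sketched in a few lines of Python. The 150 words-per-minute figure is the one quoted on this page; the function names are hypothetical and this is not the generator's own code.

```python
WORDS_PER_MINUTE = 150  # long-form pacing figure quoted on this page


def word_budget(minutes: float, wpm: int = WORDS_PER_MINUTE) -> int:
    """Approximate script length in words for a video of the given duration."""
    return round(minutes * wpm)


def words_for_seconds(seconds: float, wpm: int = WORDS_PER_MINUTE) -> int:
    """Approximate word count that fits in a section of the given length."""
    return round(seconds / 60 * wpm)


if __name__ == "__main__":
    print(word_budget(10))       # 1500 words for a 10-minute video
    print(words_for_seconds(8))  # 20 words for an 8-second cold open
```

The second helper is handy for short sections: at this pace an 8-second hook leaves room for roughly 20 words.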
Why does YouTube need a mid-video re-hook?
The "50% cliff": viewers drop off heavily around the midpoint. A pattern interrupt (surprising claim, callback, story restart) at 50% is meant to counter this drop; structured scripts with a mid-video re-hook retain, on average, 18-28% more viewers through to the end.
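To find where the re-hook belongs in a draft, you can convert the 50% mark into a timestamp and an approximate word position. This is a minimal sketch assuming the ~150 words-per-minute pacing mentioned on this page; the function name and its parameters are hypothetical.

```python
def rehook_position(minutes: float, wpm: int = 150, fraction: float = 0.5):
    """Return (timestamp, word offset) for a re-hook placed at `fraction` of the video."""
    at_seconds = int(minutes * 60 * fraction)
    at_word = round(minutes * wpm * fraction)
    timestamp = f"{at_seconds // 60}:{at_seconds % 60:02d}"
    return timestamp, at_word


if __name__ == "__main__":
    print(rehook_position(10))  # ('5:00', 750)
```

For a 10-minute video, the re-hook lands at 5:00, around word 750 of a 1,500-word script.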
Is this different from the video script formatter?
The formatter outputs word-count targets per section. The generator outputs actual script copy templates. Use them together: formatter sets structure, generator fills it.
Should I script word-for-word or use bullets?
Hook + CTA: word-for-word (read off-camera or memorize). Body: bullets you improvise from. Scripted body sounds robotic; scripted hooks earn the open.
Can I use this for YouTube Shorts?
For Shorts, use the Video Script Formatter instead. Shorts have completely different pacing (90-140 words for 60 seconds), and videos under 4 minutes need no mid-video re-hook.
Author + maintainer
Adrian Berisha — Founder of Clipflow. Indie SaaS builder shipping creator tools full-time since 2024. All tools and benchmarks on this page are reviewed quarterly. Last review: May 2026.
@clipflow on X · LinkedIn · hi@clipflow.to
Want this baked into your workflow?
Clipflow runs the whole repurposing pipeline — clip-finder, brand-voice captions, scheduler, multi-platform publish — for free on the Starter plan.
Try Clipflow free