YouTube AI Recaps
YouTube AI Recaps is a smart summarization feature that helps users catch up on long-form videos by generating timestamped key takeaways in seconds.
Role
UI UX Designer
TimeLine
April 2025 – May 2025
Tools
Figma, FigJam, Notion
Project Type
Bootcamp project, UX Academy
The Problem
Users often struggle to watch long videos start to finish because:
• 🧱 Traditional search tools feel clunky and inefficient
• ⏳ Users waste time skipping through to find relevant moments
• 😵💫 Lack of structure in long videos makes retention difficult
• 🤯 Passive watching leads to low engagement and knowledge loss
The Solution
YouTube AI Recaps simplifies video engagement by:
• 🎯 Summarizing videos with one clean, AI-generated recap
• ⚡ Giving users timestamped takeaways for quick browsing
• 🔁 Helping users review content without rewatching entirely
• 🧠 Supporting retention through structured, digestible info
The Research
For the YouTube AI Feature, I conducted targeted user research to understand how people interact with long-form video content and what prevents them from getting value quickly. This included a light competitor review of tools like Glasp and Eightify. I also conducted user interviews to uncover common pain points around content skimming, retention, and daily YouTube habits. From these insights, I created a focused user persona to shape our design direction. I then translated our findings into an affinity map to group insights by motivation, frustration, and behavior patterns — ensuring every design choice solved a real user need.
To learn how users consume and retain video content, I interviewed 8 participants. Key findings:
90% said they skip around videos instead of watching fully
Most users rely on guesswork when looking for useful moments
Many users said they forget what they watched shortly after
User Interviews
Competitive Analysis
I analyzed tools like Eightify, Glasp, and YouTube’s own transcript feature to identify gaps. Findings:
Eightify offers fast AI summaries directly on YouTube
Glasp enables highlight saving and some topic filtering
YouTube’s built-in transcript is native and easily accessible
Strengths
Weaknesses
Summaries often lack context or nuance from the video
Most interfaces feel text-heavy and unintuitive
No clear path to organize or review important takeaways
Eightify
Instantly generates AI-powered video summaries on YouTube
Provides time-stamped bullet points
Easy to use via a browser extension
Glasp
Lets users highlight, save, and tag video notes
Topic filtering and personalized highlight feeds
Integrates with other learning tools (Notion, Readwise)
YouTube Transcript
Automatically generates transcripts for most videos
Allows word-level search and navigation
Native and always available within the YouTube UI
Core Features
After analyzing competitors and gathering insights from interviews, I created two user personas to represent the primary mindsets and behaviors uncovered during research.
User Personas
Age: 35
Occupation: Project Manager
Location: Detroit, MI
Family: Married No kids
Marcus Green
Goals:
Stay sharp by consuming high-quality, relevant information
Maximize productivity without burning out
Advance personal and professional growth
Pain Points:
Gets frustrated when content feels bloated or overly padded
Feels like digital tools often try to do too much
Doesn’t have patience for features that require too much onboarding
Age: 20
Occupation: Full-time college student
Location: PHX, AZ
Family: Lives on campus with two roommates
Talia Nguyen
Goals:
Discover new ideas, content, or tools that spark personal growth
Make the most of her limited time between responsibilities
Stay informed without feeling behind in classes or news
Pain Points:
Struggles to focus when information is presented slowly
Feels overwhelmed by rigid or multi-step routines
Finds it difficult to stick to long-term morning habits
To make sense of the interview data, I created an affinity map — a method for synthesizing user quotes by grouping related thoughts, pain points, and behaviors. This allowed me to organize raw feedback into clear, actionable patterns.
Affinity Map
I organized the observations into clusters like benefits, common findings, pain points, and possible solutions. This helped highlight what mattered most to users and revealed key opportunities for the product.
Insights
Just One is grounded in key research insights, including:
Users want a lightweight and flexible routine that doesn’t feel overwhelming
Positive reinforcement and a sense of progress are key motivators
Switching between multiple apps (alarms, journals, lists) creates friction
A consistent morning routine helps users feel in control and focused
After completing the affinity map, I moved beyond research and into the next phase — Prioritization & Roadmapping — where I translated insights into actionable design decisions.
Project goal & Roadmapping
To guide the direction of this feature, I started by clearly defining the project goals and outlining a realistic task flow. The goal was to give users instant clarity on whether a video was worth their time. From selecting a YouTube link to generating a concise, timestamped summary, each step in the flow was designed to feel effortless and focused. This foundation helped me prioritize what mattered most and ensure the experience was both functional and intuitive.
To guide the design of this feature, I established a set of clear project goals focused on clarity, speed, and user control:
Allow users to quickly decide if a video is worth watching
Give control over summary length with an intuitive slider
Display clear, timestamped bullet summaries
Keep the experience fast, focused, and clutter-free
Help users retain and revisit key moments from the video
Project Goals
After defining these project goals, I mapped out a task flow to visualize how users would move through the YouTube AI Summarizer experience — from landing on the feature to receiving their final summary. This helped ensure every interaction supported the core goals while maintaining simplicity and speed.
Task Flows
Summarize video
After mapping out the task flow, I had a clear understanding of how users would move through the feature. This allowed me to begin shaping the wireframes with intention—focusing on simplicity, hierarchy, and clarity. I wanted every screen to feel purposeful, ensuring that key actions like adjusting the summary length or accessing timestamps were both intuitive and visually accessible.
Wireframes
With the core experience defined and the task flow locked in, I shifted my focus to translating structure into layout. I began by sketching low-fidelity wireframes to quickly explore how users would move through the feature. These initial drafts helped me clarify hierarchy, prioritize essential actions, and ensure the layout supported intuitive interaction. Once the flow felt right, I translated these sketches into high-fidelity wireframes to visualize the polished experience and guide the next phase of testing.
Before diving into high-fidelity design, I reflected on how the interface should feel—focused, easy to use, and grounded in simplicity. I wanted the wireframes to visually reinforce the tool’s purpose: making AI summaries accessible and clear. This meant using minimal distractions, intuitive layout patterns, and clear touchpoints that aligned with user expectations and feedback.
Lofi Sketches
After creating low-fidelity wireframes to explore layout ideas and user interaction, I moved straight into high-fidelity wireframes—refining spacing, typography, and visual hierarchy to better simulate the final product and prepare for usability testing.
Hifi Wireframes
These high-fidelity wireframes brought the feature to life—uniting functionality and flow into a clean, focused experience. Every detail reinforced the goal of making summaries feel accessible, clear, and time-saving. The interaction model now felt streamlined and purposeful, aligning well with users' needs for efficiency and clarity.
With the experience fully structured, it was time to test how real users interacted with the summarizer. I used these sessions to fine-tune layout, catch minor friction points, and ensure the feature felt seamless and helpful in actual usage—just as it did in planning.
Iterations and Testing:
Once the high-fidelity designs were complete, I moved into usability testing by interviewing and observing real users as they interacted with the prototype. The goal was to uncover friction points, see what felt intuitive, and hear honest reactions to the flow and features. I combined task-based testing with quick interviews to get both behavioral insight and verbal feedback. These sessions revealed what was working and what still needed refinement — helping me prioritize meaningful improvements backed by real usage.
To clearly capture insights from these sessions, I created a scorecard to track task success, timing, and direct user comments.
Usability Test Scorecard
From this testing, it became clear that while users understood the feature’s purpose, there were key opportunities to improve usability and interaction clarity. I focused my iterations on enhancing feedback and accessibility: adding a visible “Set Summary Length” label to eliminate confusion, and introducing a speaker icon to let users listen to summaries aloud. These updates made the feature feel more intuitive, inclusive, and polished—bringing the experience closer to real-world usage needs.
Iterations
1. Summary Interface Enhancements
Users were unclear on how the summary length slider functioned and missed that audio playback was available. To improve clarity and engagement, I added a label above the slider to explain its purpose and introduced a speaker icon that lets users listen to the summary out loud — making the experience more accessible and intuitive.
Conclusion
Designing the YouTube AI Summarizer wasn’t just about speeding up video consumption — it was about giving users back control of their time. By focusing on clarity, precision, and personalization, I crafted an experience that transforms long-form content into digestible, high-impact insights. Every detail — from layout to interactions — was shaped by real user needs and tested with intention. The result is a seamless feature that empowers users to get the value of a 15-minute video in seconds, without losing context.
Next steps
Explore more granular summary controls, such as topic segmentation or chapter tagging
Add a “Save Summary” feature for quick reference or note-taking
Introduce AI personalization so summaries reflect individual preferences or history
Continue testing with diverse user groups to refine accessibility and mobile usability
Key Takeaways
Users value speed and clarity — summarization must feel instant and frictionless
Control matters — sliders, toggles, and customization help users feel in charge of their experience
Summaries must maintain context — users want condensed content, but not at the cost of meaning
AI is only as good as its UX — clean design and smart defaults turn complex tech into an intuitive tool