Captions

Product Design, UI/UX Design, Secondary Research, User Interviews, Art Direction, Design Workshops

About Product

Captions started off as the one-to-go video captions editor for beginners. Powered by AI, the appcan automatically split the frames to catch blank spots and remove all the filling words like  "Mmh", "Euh", etc. Just press the button and ta-da, it's done!

When they first reached me the goal was to redesign the app and enrich the product withdifferent basic video editor functionalities such as split, trim, pic in pic, transitions,  etc. Weobviously also built on captions functionalities and enabled more options such as highlightingwords, editing style, matching stickers to words, and so on.

Opportunity

Soon the company started surfing on the wave of AI's new generation language models andpivoting towards Artificial Intelligence as the center of their offering. From voice assistants to text-to-image generators, AI opens a universe of possibilities and through the power of thistechnology, Captions decided to turn speech recognition to the next level.

Lip Dub Feature

Lip dub was the first feature we designed, a powerful one where you can basically change whatyou're saying. Use cases examples:

  • You recorded a new video but made a mistake, you can overdub a word or an entire part and the AI will make your lip move to the new script.
  • You want to reach a whole new audience in another part of the world, just press the button and the AI will translate the entire video into another language.

User engagement and response were so positive that we decided to bring it to the next leveland create Captions' very own script-writing assistant. Imagine, you just describe what youwant to talk about and the app will write the entire script for you.

Process

Ideas Generator V1

The first iteration was a simple ideas generator that can give daily ideas for users to generatecontent faster. It can scan the user project content, see what they talk about, and suggest ideasrelated to their interest. We imagined an experience where the user needs at least 5 projects to unlock the idea generator. That would encourage users to engage with the app in order to unlock the feature. In this experience, the user gets one idea (or a few) at a time and can shuffle, like, or control how random it is. It was already a step forward in injecting more AI power into the product, but why not push it even further?

Script-writer Assistant V2

What if the AI could take that idea and literally write that script for your next TikTok video or techreview you want to record? Meet Geniio, your script-writer assistant.

The Flows:

  • Tap on an idea a generate a script from it
  • Go through the settings first to adjust that idea and narrow down the result: Title, description, tone of voice, keywords, and length
  • Get a script with a layout that's made for a video (includes a hook, intro, and paragraph for each outline item and a conclusion)
  • On the script, see the word count, save it for later, and go back to settings to adjust the input

Optimization

We tested the feature and got some feedback from users that some aspects of the AI were notclear. We also put priority on the scrip writer AI and moved the ideas generator to the next phase.

Templates for Customization

People record videos for different purposes and the format is very different from making an ad fortheir local hair saloon or an influencer talking about tech so we introduced a template section inthe editor

Talk to Human

We made it clearer that they could describe exactly what they want like they would describe it to a real human, so we added explicit hint text and real examples.

Editor to Script

We wanted to teach the users that filling in the inputs enables them to 'Generate' the scriptoutput. Once on the output screen (the script), the user can easily 'Edit' and go back to editormode. In fact, the input from users and the output from the AI are like a conversation.

Walk through

We set a first-time-use event (scroll after 5 seconds) to trigger a bottom drawer that encourages

Navigation

In the higher level of navigation, we placed the 'Version history' to enable users to access previousversions. From the script mode, the user can 'Record', 'Shuffle', or 'Delete'.

Telempropter

From that script, the user should be able to record it and go to teleprompter mode.

Dynamic Island

While in another app, the user should be able to see his script and record a video​​​​​​.

Iteration

That feature is the result of a continuous conversation with the client to follow his vision andoptimised from user feedback. From ideas-only, ideas-to-script, to editor-script mode, it took afew tries to get the flow and the layout right while keeping the design smooth and the vibe right.

What’s a Rich Text element?

The rich text element allows you to create and format headings, paragraphs, blockquotes, images, and video all in one place instead of having to add and format them individually. Just double-click and easily create content.

Static and dynamic content editing

A rich text element can be used with static or dynamic content. For static content, just drop it into any page and begin editing. For dynamic content, add a rich text field to any collection and then connect a rich text element to that field in the settings panel. Voila!

How to customize formatting for each rich text

Headings, paragraphs, blockquotes, figures, images, and figure captions can all be styled after a class is added to the rich text element using the "When inside of" nested selector system

image caption