DMC SC

Why caption? Why subtitle?

Because a large chunk of your audience either needs or prefers to watch video reading text without sound. Subtitles and captions are an essential part of business for any media organization who wants to reach the widest possible audience, for a multitude of reasons: legal requirements, international distribution, accessibility…. Captions provide a textual representation of dialogue and other important audio for people with hearing loss. They also give the viewer additional information about the video, such as context for a news clip. Subtitles provide a translation of the dialogue, essential to reach international markets.

So, we all agree captions and subtitles are key, but… creating them is hard. 

First, you need solid speech-to-text transcription so that the right words are created from the audio stream. As many of us who have been working with this technology for a while quickly learnt, this is not enough. There is the complicated business of correctly segmenting, laying the text out, punctuation, timing and so on, often creating much manual work for captioning and subtitling teams. 

What if you could be smart about captioning and subtitling?

What if you had a technology solution that automates most of these editorial decisions applying artificial intelligence to the output of a good speech-to-text engine? Enter Dalet Media Cortex.

Over the last few months, our teams have worked tirelessly to improve the quality of our Smart Captions, part of the Dalet Media Cortex Speech service, providing users with high-quality automatic captions and subtitles for their video content.

If you are using the Dalet Media Cortex API, Smart Captions will be delivered to you in the form of SRT or TTML files. If you are using Dalet Media Cortex integrated with Dalet Galaxy five or the Ooyala Flex Media Platform, captions will also be displayed as timecoded locators so that users can search and navigate through subtitles and captions easily.

What is so special about Smart Captions? We have developed algorithms based on speech density, natural language processing and speaker diarization (the process of partitioning an input audio stream into homogeneous segments according to the speaker identity) to generate captions and subtitles that are as close as possible to the BBC Subtitle Guidelines. This takes care of essential elements in captioning and subtitling: such as text that flows well and is synchronized with the speakers’ voice and cadence, lines that split at natural points based on sentence structure and text length that is properly adjusted to screen size. 

DMC

How does this help you?

Beyond the improved quality of these AI-generated subtitles and captions, there are immediate and tangible benefits. You will see the time you spent in adjusting subtitles or captions cut in half, when compared to traditional speech-to-text based solutions. You don’t have an automatic captioning/subtitling system today? Good news: you will save over 80% of your time in generating quality captioning and subtitles for your video content.

Besides, having great captions and subtitles will increase the value of your media, and bring you new business opportunities, such as expanding to new markets or increase your online and social media engagement. 

No doubt Smart Captions is one of the multiple differentiators that makes Dalet Media Cortex an “IBC Best of Show Award” winner. 
 

Want to know more? Get in touch with the Dalet team here!

AIWIP

Share this post:

Subscribe

Sign up for Dalet updates and our quarterly newsletter. Expect #MediaTech insights!

Leave a Reply

Your email address will not be published. Required fields are marked *

Recommended Articles

AI technologies progressed drastically in the last few years. Speech-to-text and face recognition are prime examples of use cases where AI-driven solutions that have existed for many years have now reached an acceptable level of maturity and commercial viability.

Read More
AmberFin_hero

Leveraging Premium Media Processing for Business Success

In today's fiercely competitive media business environment, every company is looking for means to stay ahead of the pack. Smart, highly efficient media processing can be a game-changer. Discover how Dalet AmberFin delivers high-quality content that grows your audience.

Read More

How Cloud-Native MAM is Adopting PAM Features for Seamless Media Workflows 

Recently, the lines between Media Asset Management (MAM) and Production Asset Management (PAM) have become increasingly blurred. This convergence reflects the quickly evolving needs of the industry. In recent years a huge leap forward in technological capability has coincided with rising creative demands and shifting media consumption trends. This has all had a significant effect...

Read More

AI technologies progressed drastically in the last few years. Speech-to-text and face recognition are prime examples of use cases where AI-driven solutions that have existed for many years have now reached an acceptable level of maturity and commercial viability.

Read More
AmberFin_hero

Leveraging Premium Media Processing for Business Success

In today's fiercely competitive media business environment, every company is looking for means to stay ahead of the pack. Smart, highly efficient media processing can be a game-changer. Discover how Dalet AmberFin delivers high-quality content that grows your audience.

Read More

How Cloud-Native MAM is Adopting PAM Features for Seamless Media Workflows 

Recently, the lines between Media Asset Management (MAM) and Production Asset Management (PAM) have become increasingly blurred. This convergence reflects the quickly evolving needs of the industry. In recent years a huge leap forward in technological capability has coincided with rising creative demands and shifting media consumption trends. This has all had a significant effect...

Read More