YouTube’s automatic captioning system can now describe sound effects

#YouTube has long had an automatic captioning system that, thanks to Google’s machine learning advances in recent years, has gotten pretty good at automatically transcribing spoken words in a video. As the company announced today, its technology is now able to take this a step further by also … [Read More]

Summary… * Machine-Made


#YouTube has long had an automatic captioning system that, thanks to Google’s machine learning advances in recent years, has gotten pretty good at automatically transcribing spoken words in a video.
For now, the automatic effects captioning is actually restricted to those exactly these three sounds.
Now that Google has the systems in place to caption those sounds, though, it should be relatively easy to also caption other sounds.
In the backend, #YouTube’s sound captioning system is based on a Deep Neural Network model the team trained on a set of weakly labeled data.
Whenever a new video is now uploaded to #YouTube, the new system runs and tries to identify these sounds.

Opinion… * Man-Made


The #AI captioning system of #YouTube can now describe sounds; starting with laughter, applause and music. #Trend

Source: Techcrunch