FIR Podcast Network

For Immediate Release: Podcasts for Communicators

  • Home
  • Shows
  • Episodes
  • People
  • Blog
  • Subscribe
  • Sponsors
  • Contact Us
    • Join the FIR Podcast Network
  • Archives
You are here: Home / YouTubular Conversations / YouTube’s Automatic Caption Service Has A Potty Mouth

YouTube’s Automatic Caption Service Has A Potty Mouth

January 16, 2017 by Harry Hawk Leave a Comment

http://traffic.libsyn.com/fir/-a8-au.mp3

Podcast: Play in new window | Download (Duration: 8:52 — 4.2MB) | Embed

Subscribe: Google Podcasts | Email | RSS

One of the really great and amazing things about YouTube is the free transcription or closed captioning system they have. It works remarkably well. It uses the same Google Cloud Speech API you can access through other services like Auphonic.com (details about Auphonic’s transcription features).

Warning:  If you are uploading video to YouTube you should probably check out what the automatic transcription service is serving up. Most of the time you will be happily surprised, but you may also be shocked when you learn it has a bit of a potty mouth.

Transcripts and closed captions are not identical, but are very similar. Within YouTube the automatic system produces closed captions. For use in show-notes, I prefer a format known as a Production Transcript. If you are uploading to YouTube you can use nearly any text based file that contains time codes and their system will sync the text to the video; you can also upload special CC formatted files such as WebVTT.

Google (via YouTube) now also offers to broker transcription and translation services. For a 1.5 minute video I uploaded for one of my students, (as part of the research for this episode) I paid YouTube $6 for a transcript from Tomedes – Smart Human Translations.es; other offers were priced at $10; there were several offers for me to pick from but they all had a three day turn-around.

In comparison I get a typical 72 hour turn-around from CLK Transcription, Inc., at a price more affordable then those offered by Google. However, it is nice that Google offers both transcription and translation services. CLK charges “$1.02 – $1.50 per minute for non-medical transcriptions.” In this case, the 1.5 min file transcribed by CLK cost $1.75 vs the $6 Tomedes charged.

I ordered the transcript from CLK at 8:32 Pm Sunday and I had the completed transcript 11 hours later. A turn around time that fast is not typical of larger jobs. Here is the file they produced. The file is in the production transcript format (e.g., it’s not a CC file), but when I have saved this format as a txt file and uploaded it to YouTube, YouTube’s in-take system has always done a great job of sync’ing it to my video.

The file from Tomedes was completed at 4:52 Pm or a turn around time of about 21 hours; faster than the promised three-day turn-around. However, it cost 3.4 times more, and took nearly 50% longer that CLK (my regular vendor). Tomedes did not send me a file; they directly installed the transcript as an English CC into my video (an added perk, but probably not worth the extra $5.25). If you view the video, you can see their work as the currently installed English closed captioning.

I was concerned during the process because while I ordered a transcription, their messaging to me referred to it as a translation. Their final message referred to it as English captions. The machine translation that I edited on the 15th (see below), was rendered “un-published” but wasn’t deleted.

Here are some edited screen shots of how I edited the automated transcription and ordered the $6 transcription from Tomedes.

Ordering a transcription for closed captioning within YouTube.

Ordering a transcription for closed captioning within YouTube.

Message indicating that the process to purchase a transcript from one YouTube's captive partners was completed.

Message indicating that the process to purchase a transcript from one YouTube’s captive partners was completed.

A message that my payment was successful.

A message that my payment was successful.

Editing the machine translation "keeping queen" to the correct, "keeping clean."

Editing the machine translation “keeping queen” to the correct, “keeping clean.”

before fixing the word icebox wrongly spelled as iceboxx

before fixing the word icebox wrongly spelled as iceboxx.

After fixing the word icebox wrongly spelled as iceboxx.

After fixing the word icebox wrongly spelled as iceboxx.

Example of the user interface for editing a machine generated transcription within YouTube.

Example of the user interface for editing a machine generated transcription within YouTube.

The user interface to unpublish or save an installed transcription within YouTube; in this case a machine generated English transcript.

The user interface to unpublish or save an installed transcription within YouTube; in this case a machine generated English transcript.

Multiple closed caption files for a single video, one published, the rest, un-published. The published one has a green "dot" vs. a "grey" dot.

Multiple closed caption files for a single video, one published, the rest, un-published.

I hope everyone has a great week! Bye Bye.

PS. the video I created for my class was made using Adobe Spark.

 

 

Share this:

  • Click to share on Facebook (Opens in new window)
  • Click to share on LinkedIn (Opens in new window)
  • Click to share on Twitter (Opens in new window)
  • Click to email a link to a friend (Opens in new window)

Related Posts

  • Material Design for YouTube
  • YouTube Lost Major Advertisers Over Content
  • CREATIVITY & FACEBOOK IQ ANNUAL REPORT !
    053 : Meet The YouTube Champions in India

About Harry Hawk

Harry Hawk is a marketing consultant focused on cross channel attribution, paid social, community building and content strategy. As an Adj. Prof (CUNY + SWCCD) he has a focus in Hospitality, Business, eCommerce & Event Management/Planning.

Harry has a diverse range of clients in Robotics, Logistics and Baking. Harry has worked with many companies including HubSpot Labs, Momentum Machines, Phillip Morris, Paramount Home Pictures, NY Water Taxi, and many other startups.

Filed Under: YouTubular Conversations Tagged With: @hhawk, Auphonic, Automated Transcription, Harry Hawk, Transcription Service, YouTube

Share Your Comments on YouTubular Conversations

FIR Community on Google+Share your comments or questions about this podcast, or suggestions for future podcasts, in the online FIR Podcast Community on Google+.

You can also send us instant voicemail via SpeakPipe, right from the FIR website. Or, call the Comment Line at +1 415 895 2971 (North America), +44 20 3239 9082 (Europe), or Skype: fircomments. You can tweet us: @FIRpodcast. And you can email us at .(JavaScript must be enabled to view this email address). If you wish, you can email your comments, questions and suggestions as MP3 file attachments (max. 3 minutes / 5Mb attachment, please!). We’ll be happy to see how we can include your audio contribution in a show.

Leave a Reply Cancel reply

Social connect:
Login Login with facebook
Login Login with twitter
Login Login with google

Your email address will not be published. Required fields are marked *

Connect with FIR

Support FIR

Become a Patron

Pledge as little as $1 per month to help support the existing stable of shows, make new shows possible, and be part of the growth of the FIR Podcast Network.

FIR Podcast Network Shows

FIR SPONSOR

The FIR Podcast Network is made possible by the generous support of our sponsors, of which we currently have none. Please reach out to us if you are interested in sponsoring FIR. Contact us at fircomments@gmail.com.

Looking for Older Episodes?

Visit the FIR Podcast Network Archives at forimmediaterelease.biz.

Join Our Facebook Community

Facebook Group Icon

FIR Podcast Network on Twitter

My Tweets

We Want Your Feedback

Email comments with or without audio attachments (5 MB/3 minute limit) to fircomments at gmail dot com.

North America Comment Line:
+1.415.895.2971

Europe Comment Line:
+44.20.3239.9082

Skype: fircomments

Promote FIR!

Let your readers and colleagues know about FIR by adding a logo to your blog or web page. Get it here!

FIR Podcast Network Logo

Help Spread the Word about FIR!

GaggleAMP: spread the word about FIR!

License

Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

FIR Podcast Network website © 2023 Shel Holtz and Neville Hobson | Community Guidelines | Privacy Policy

Website development by WP Fangirl. Album art and design by Effective Edge Communications.