Blog

Malcolm X: A Life of Reinvention – Manning Marable

 On April 4, 2011 Malcolm X: A Life of Reinvention was published, and the magnum opus of Manning Marable’s life’s research was finally in print.  Sadly, for those who do not know, Dr. Marable passed away 3 days before the book hit the shelves.  Thankfully, he was able to see the book in print before he passed away. 


Analog vs. Digital: Pay Now or Pay More Later

Some of you have asked me why we still have information on our website about “going digital,” but clearly the fact that we still receive newly recorded audio on “old-fashioned” cassette tapes  tells me that some people just don’t understand the importance of upgrading technology (on a lot of levels).  After 44 years in business, we finally took the “tape” out of our name, because it’s all about the audio!

Today I’m writing about more than “going digital,” but I will also touch upon recording habits in general.  Remember, just because you’re recording digitally does NOT mean that you will automatically have broadcast quality audio.  (WHAT?! You’re thinking, ‘it’s digital, so it has to be better quality.’)  There’s a lot involved in recording, and as the person conducting the recording, you need to stop and think about the details of recording for more than a couple of seconds.  That’s right, we know that some of you already know these things, but do you truly take the time to learn your device before using it?  I know that’s a very personal question, so think about it for a moment.  You don’t have to share.

The quick points to remember:

First and foremost, it’s now 2011, so use a digital recorder!  You can walk into any electronics store, or jump online and find one.  Just do some research first.  Remember, in 2004, 90% of our clients used analog equipment to record their interviews.  Now in 2011, 95% of our clients use digital equipment to record their interviews.  You’ll have immediate access to your audio recording.  Volume too low? There’s software that can give the file a quick boost to improve the sound quality.  Is your transcriptionist next door or across the country?  It doesn’t matter where they are located, because you can upload your audio to them, and still have access to listen to your audio.  Imagine never having to spend shipping dollars again!!

Clearly the facts demonstrate there’s been a near-total reversal in the analog vs. digital battle.  Remember, your transcripts are only as good as the audio your transcriptionist receives, and better quality audio will save time and save those all-important dollars in your budget.  Again though, just remember, it’s more than just “going digital”!

You’ve purchased that device, but you really don’t want to delve into the box with the paperwork and all sorts of wires that are tucked neatly inside.  Read the paperwork, and use the wires.  Of all the wires in the box, use an A/C power-supply – it might be 2011, but batteries die quickly, so plug in when you can.  For those times that you forgot it at home, bring plenty of backup batteries!!   Seriously, go buy stock in the major brands, because you will always want to have an ample supply of batteries quickly within reach!  You never know when you’ll have to record those unexpected longer interviews.  Think of it as practicing “safe recording”!

Now you’re sitting there ready to hit the record button, but stop and check recording volume regularly.  I can’t tell you how many interviews we get where the recording levels are so low you can barely hear the person, so don’t forget to check those recording levels beforehand.  If your recording device has meters, refer to them, but also be sure to listen to the audio levels with headphones at the start of the interview session.

Another important piece of equipment to use is an external microphone.  Different situations require different types of microphones, so you’ll need to do a little studying up on what your recording environment needs.  If you’re able, try more than one external microphone among the group, to be sure you have properly mic’d all of your speakers.  This is especially important for any group larger than 3 individuals, and be sure to place these microphones as close as possible to the people who are speaking.  Sitting at a long table with people at both ends of the table? Think about how the person at the end of the table will sound if there is only one microphone in the middle of the table.  Murphy’s law also says that person will be the most verbal in the group.  Conducting a one-on-one interview?   Drop into Radio Shack beforehand, and grab a lapel mic.  The difference in recording quality is remarkable, and you’ll thank yourself later (as will your transcriptionist).

Don’t forget about the longevity of your recording for your archives!  Your transcriptionists do not require large archival files for transcribing; they just require some good audio to hear those words clearly.  On that note, if you’re going to be storing these recordings for archival posterity, make sure you do your research on the latest technological advances in formats for saving your audio files.  .wav? b-.wav? .mp3? Spend the time, do your research, and know the facts on digital audio longevity.  (See our previous blog on thinking beyond the shoebox.)

For a more detailed read, look over our recording tips page, and check out some of the other service providers we recommend as well.

Always remember your ultimate goals when you’re recording.  If you’re going to have your audio transcribed, you want the best recording possible, so give your transcriptionists audio that they can transcribe both quickly and accurately!  If you can believe it, we’re telling you to spend a little more up front, which will save you money on a service we provide.  Go figure…

Reality Check: Transcription Vs. Speech Recognition Software – The Showdown

If anyone reading is a fan of the game show Jeopardy!, you already know that this week, IBM super-computer Watson is taking on legendary past Jeopardy! champions (and human beings) Ken Jennings and Brad Rutter in a Human vs. Human vs. Machine grudge match, and we now know Machine has won!
Congratulations to Watson.
We don’t have a super-computer, or a fancy game-show soundstage, but we are bringing you the results of our Human vs. Machine faceoff. Can human transcriptionists from the Audio Transcription Center (ATC) slay the Dragon? Read on and find out!
(Full disclosure: we’re a transcription company that has been in business since 1966. Successful speech recognition software could put us out of business. Just so you know.)
Championships have been won in Boston: the Red Sox have won World Series, the Celtics NBA Championships, and the Bruins Stanley Cups, all just five minutes from our very offices. So it is fitting that our office be the site of this titanic Human vs. Machine bout!
First of all, I will introduce the Machine… wearing a green cardboard box, from Nuance Software, Dragon Naturally Speaking 10, Home Edition, or as we prefer to call it “Team Dragon”. (Version 11 has been released since we began testing; and we will put it to the test at a later date.)
And in the other corner, wearing headphones, torn jeans and flexing their fingers… the human transcriptionists of the Audio Transcription Center (ATC), specifically four randomly-selected competitors from our staff of dozens of versatile, multi-talented transcriptionists. All four, collectively known as “Team ATC”, were eager to take on the challenge.
“But wait,” you exclaim! “Dragon only works with one voice at a time, this is an unfair fight!” Correct. But rather than automatically claim victory, we decided to level the playing field by having both competitors work with only one voice, who would be speaking on a variety of subjects.
Dragon Naturally Speaking (or “Team Dragon”), as well as our team of terrific transcriptionists (or “Team ATC”), would be transcribing the voice of… me. Your humble blogger, formerly heard on college radio and occasionally behind a karaoke machine, would be the voice that would take both competitors to their limits!
Let’s begin the match, shall we?
First of all: speed of delivery
Team Dragon: walk to the store, purchase the software, come back to the office.
Team ATC: walk to the subway, purchase subway ticket, come to the office.
Advantage: We’ll call this one a tie.
Speed of installation
Team Dragon: 32 minutes for “complete installation”. The DVD-ROM was a very bright shade of orange.
Team ATC: less than 10 minutes for installation, and that includes pouring themselves a cup of coffee while the computer boots up. Occasionally wears bright colors as well.
Advantage: Team ATC.
Speed of training for first-time use
Team Dragon: 39 minutes, from first launch until the program was ready for prime-time, including entering the serial number at least 4 times.
Team ATC: About two hours, including filling out at least 4 pieces of paperwork. We’re thorough that way.
Advantage: Team Dragon.
So far, before we’ve introduced actual transcription into the contest, we’re tied at 1-1. It’s a close match in the early going…
Now, let’s bring in some actual audio. Specifically, about 1,135 words, spoken over about 7 minutes, on a variety of subjects, by yours truly.
“But wait,” you exclaim. Again. “’Team Dragon’ has to be trained to recognize your voice! It’s designed to improve as you use it more!” Correct. Whereas ‘Team ATC’, none of whom have ever heard my voice on a recording, can hit the ground running immediately. Advantage: Team ATC.
Back to the audio: our four transcriptionists each took one pass at it, transcribing it verbatim (with ums and ahs). Once done, the audio was given a real-time review, and time needed to perform corrections was noted.
Transcription time for “Team ATC” for seven minutes of audio, spoken in a quiet room, clearly and methodically: averaged out to 20 minutes.
But how did it look, you ask? There was an average of two errors in the 7 minute file. Out of 1,135 words, that’s over 99.8% accuracy before review. Review time averaged out to eight minutes, for a total score of 28 minutes.
Now, for the first round with “Team Dragon”. For the first round, I once again spoke slow-ly and meth-od-ic-al-ly. I also spoke punctuation and carriage returns in their appropriate places, as per instructions.
Dictation time for “Team Dragon”, first round? 16 minutes. Which sounds fast, until you realize that reading the audio into a recorder at ‘normal’ pace took less than half that time.
But how did it look, you ask? Not so good. Review time took 18 minutes; with over 60 errors (versus two!), for a total score of 34 minutes, and around 94% accuracy or roughly 15 errors per page. Which sounds good, until you remember that this is one voice, speaking slow-ly and meth-od-ic-al-ly. Which most of us don’t do in our daily lives.
 
Advantage for round one: “Team ATC”.
Before the competition, and in between rounds, while “Team ATC” was eating lunch or going for walks, “Team Dragon” was in training, as I read and corrected material from various sources into the software. Song lyrics, blurbs from dust jackets, chocolate bar wrappers… “Team Dragon” was being further trained to recognize my dulcet tones.
For round two with “Team Dragon”, I changed a setting to speed up the process; Dragon has a setting which inserts commas and periods in logical places. That indeed shaved a few minutes from the dictation time: dictation now took 11 minutes.
But how did it look, you ask?  Still not so good. There were over 40 errors; review time took 13 minutes (which was, again, longer than the dictation itself), so over 96% accuracy or roughly 10 errors per page. Which, again, sounds impressive, until you compare it to 99% accuracy.
Total time for round 2, including review time: 24 minutes. Which means…
Advantage for round two: “Team Dragon”.
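For anyone who wants to check our scorekeeping, here is a minimal sketch of the arithmetic in Python. The word count and the minutes come straight from the numbers above. The error counts for “Team Dragon” are assumptions: we only said “over 60” and “over 40,” so the sketch uses 60 and 40 as lower bounds, and the real accuracy figures would be slightly lower.

```python
# Word count from the test recording described in this post.
WORDS = 1135


def accuracy(errors, words=WORDS):
    """Fraction of words transcribed correctly."""
    return 1 - errors / words


# (label, transcription or dictation minutes, review minutes, errors)
# The Dragon error counts are lower bounds ("over 60", "over 40"), not exact figures.
rounds = [
    ("Team ATC", 20, 8, 2),
    ("Team Dragon, round 1", 16, 18, 60),
    ("Team Dragon, round 2", 11, 13, 40),
]


def summarize(rounds):
    """Return (label, total minutes, accuracy in percent) for each entry."""
    return [
        (label, work + review, round(100 * accuracy(errors), 1))
        for label, work, review, errors in rounds
    ]


for label, total, acc in summarize(rounds):
    print(f"{label}: {total} min total, {acc}% accuracy")
```

The total is simply working time plus review time, which is why a faster dictation can still lose if the review takes longer.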
So what have we learned? That speech recognition software can, with repeated training, be accurate enough that your dictation time, plus your review time, can be faster than a human transcriptionist.
So “Team Dragon” wins? The robots are taking over?
Uh, no.
If your audio input consists of one voice, and only one voice, and you have enough access to that one voice to allow Dragon to become further accustomed to that one voice, then by all means, stop reading now, and become a proud supporter of “Team Dragon”.
For everyone else, “Team ATC” is still miles ahead. “Team ATC” can transcribe your all-hands meeting, with its 27 participants from the CEO to the intern. “Team Dragon” can’t.
“Team ATC” can transcribe your interview with your Nana where she talks about the old country; and because the Audio Transcription Center (ATC) can match your interview subject matter up with the right member of “Team ATC”, you can get a transcript with 99% accuracy or higher, even though we’ve never heard your voice.
 
“Team Dragon” can transcribe you or your Nana, at lower than 99% accuracy, and only knows what it’s been programmed about the old country.
And most importantly, the human beings at the Audio Transcription Center (ATC) can consult with you before your project even begins, and work with you to help you get the most out of your limited transcription budget.
When and if “Team Dragon” catches up to us, and is able to transcribe the material our talented, smart human beings are able to transcribe, quickly and accurately, we will be the first to jump on the bandwagon. Until “Team Dragon” puts us out of business.
But for now, if you call the Audio Transcription Center (ATC), there are no machines to train, no dragons to slay, just friendly, helpful customer service, a second-to-none transcription staff and a 100% satisfaction guarantee.
Next in line for us is a white paper that will help you find your best transcription solution, even if it is (gasp) not us!
by Patrick Emond

Reality Check: Transcription vs. Speech Recognition Software

Here at ATC, we occasionally get the tough questions. One in particular that briefly stops us in our tracks: “Why can’t I just use speech recognition software?”

Nobody likes being replaced by a computer, or a robot, and we are no exception. Our short answer to that question is this: “we are more accurate and more versatile than the software available today.”

Still don’t believe us? Well, we’re going to introduce you to our competition.

Speech recognition has been around since 1952: that early device could recognize single spoken digits. (We, on the other hand, have been around since 1966, and were able to recognize whole spoken sentences immediately.)

The next large leap forward came in 1982: Dragon Software, who still release speech recognition software today, released software for industrial use. By 1985, that software had a vocabulary of 1,000 words – spoken one at a time. (That is comparable to a four-year-old child. We don’t recommend having a four-year-old, even a precocious one, transcribe your audio.)

Dragon itself even admits this today: “Most of us develop the ability to recognize speech when we’re very young. We’re already experts at speech recognition by the age of three or so.” Our college-educated transcriptionists had vocabularies in the 17,000-word (and up) range. Even in 1985. And they still do.

By 1993, a computer could recognize over 20,000 spoken words, which put it on a par with human beings. Except for the accuracy, which was only 10% in 1993 (in other words, a 90% error rate). By 1995, the error rate had dropped to 50%, which is quite a leap in a short time. (Our transcriptionists test at 98% accuracy.)

In 1997, Dragon released “Naturally Speaking”, its first consumer speech-recognition product. By 1997, we already had a 31-year head start on transcription for consumers at large.

We know, we know…

“That was back then. How about now?”

We’re glad you asked. 

Since 1985, the National Institute of Standards and Technology have been benchmarking speech recognition software. The graph below illustrates some key data points highlighting several of their relevant benchmark tests.
 
(source: National Institute of Standards and Technology, http://www.itl.nist.gov/iad/mig/publications/ASRhistory/index.html)

There are a lot of data points up there, so let me highlight the important features:

    • Take a look at the error rates (WER means Word Error Rate) for Conversational Speech (in red) and Meeting Speech (in pink). They aren’t even close to what human beings can deliver.
    • That 2% to 4% range is human error. As in, the error rate you would get from our human beings. And we aim for even lower than that.
    • The only tests that match up with human accuracy are air travel planning kiosk tests (bright green). Also known as “People Who Speak Very Deliberately and Slowly in Airports.”
    • Very few people speak deliberately and slowly in real life.
    • The error rate for broadcast news readers (blue), i.e., people who are very well-paid to speak clearly, is around 10%.

Software has to be trained to recognize your voice. And re-trained to recognize anyone else’s. Our transcriptionists can handle a meeting full of speakers and accurately differentiate them.
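If you are wondering how a Word Error Rate is worked out, here is a rough sketch in Python. It uses the common textbook definition (substitutions plus deletions plus insertions, divided by the number of words in the reference transcript), found with a word-level edit distance. This is an illustration only, not NIST's scoring software, and the function name and the sample sentences are made up for the example.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word dropped (deletion)
                curr[j - 1] + 1,     # extra hypothesis word (insertion)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: two reference words collapse into one wrong word,
# which counts as one substitution plus one deletion out of six words.
wer = word_error_rate(
    "take the train to far rockaway",
    "take the train to farakaveh",
)
print(f"WER: {wer:.1%}")
```

Note that a WER can exceed 100% when the software inserts many extra words, which is one reason it is reported as an error rate rather than as accuracy.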

A 98% accuracy rate means you will spend much less time reviewing your audio, correcting errors and inaccuracies, and much more time growing your business.

The bottom line is this: computers are getting smaller, and more powerful, all the time. They can do many things better than human beings can.

But not, as you can see, transcription. And looking at the graph, they won’t catch up anytime soon.

Your audio wasn’t recorded in a lab, it was recorded in the real world, where we live. We transcribe conversations and meetings every day, from all over the world. Not to mention webcasts, dictation, presentations, and conferences.

Again, Dragon says it themselves: “People can filter out noise fairly easily, which lets us talk to each other almost anywhere. We have conversations in busy train stations, across the dance floor, and in crowded restaurants. It would be very dull if we had to sit in a quiet room every time we wanted to talk to each other! Unlike people, computers need help separating speech sounds from other sounds.”

Our transcriptionists and production staff are highly educated, well-trained, and are constantly learning, whether that means going to graduate school, reading magazines, or watching the newest viral videos.

We like computers, and we think we can co-exist. So, by all means, speak your destination into your cell phone’s GPS, or say “tech support” to speak to technical support. Those are two versions of speech-recognition software that many of us use almost every day.

But if your audio is any more complicated than that, call us. We’re versatile, we’re accurate, and if you pour us enough coffee, we won’t crash.

We have run full tests on the entire Dragon experience, from opening the box all the way to the proof of the pudding, which is in the crust… er, the transcript. We will publish those results on or before February 17, so keep an eye on your inbox and this blog for the results!

Powderhouse Productions – Client Spotlight September 2010

Here at the Audio Transcription Center we’re always amazed at the diversity of our clients’ audio.  One day we may be transcribing a high-brow legal hearing, and the next we’re creating a transcript about the world’s shortest cat.  And truly, everything that you can imagine in between is heard by our team of transcriptionists.  With the mix of clients we have, the content we transcribe truly is, “soup to nuts.”  
 
But, back to that world’s shortest cat, and the client that sent that audio our way: Powderhouse Productions.

Headquartered in Somerville, MA, Powderhouse Productions has been producing a wide range of television shows since 1994 for channels such as PBS, National Geographic, and TLC.  The programs we’ve most recently been transcribing are “Dogs 101,” “Cats 101,” and “Pets 101” for the Animal Planet network.

So you truly want to know the answers to these questions?  Well, you could just ask my team of transcriptionists, but then you wouldn’t be watching the premiere this Saturday night at 8 p.m. on Animal Planet.

“Powderhouse relies on the Audio Transcription Center for high quality, accurate transcripts delivered on time and on budget.   They understand the demands of television production – their turnaround time is fast and their customer service is excellent.  We depend on them to meet our tight deadlines and they always deliver!”   

– Dan Miller, VP, Production

Archiving – Thinking Beyond the Shoebox!

In our inimitable fashion here at ATC (www.audiotranscriptioncenter.com) we’re constantly reading through all those emails we’re receiving from different listservs about any number of things.  The latest one that caught our eyes was about how rapidly technology is changing, and it got us thinking on many levels.  WWOCD?  What Would Our Clients Do?  The article, from the latest issue of ComputerWorld.com, is Lamont Wood’s “Fending off the digital dark ages: The archival storage issue.”  This is where transcription of those audio/video collections is key to the longevity of your archives.

When was the last time you tried to play a 33 rpm record?  When did you find an old floppy disk with information that you couldn’t access?  How about that interview of Aunt Lucy and Uncle Joe in the shoebox that was recorded in 1972 on any sort of media that is now outdated?  Point being, anything you record today will be outdated in 5 years, 10 years, 20 years.  Do you have a plan?  Does your customer have a plan?  We don’t have a plan either, but hey, we got you thinking about it. 

As far as I know no company is currently transcribing on sheepskin, but most everyone who receives their transcripts is storing them digitally.  These digital transcripts are now searchable documents, and then they are usually printed and stored for archival purposes as needed. 

The question again is, how often is digital media changing? 

Plainly, your audio archives will someday be obsolete, and you’ll have to look at ways to convert these collections to a new, functional, usable format. (How many of you are already doing this every 5, 10, 15 years or so?)  These transcripts of the media content provide the essence of what researchers need!

What will you do to make sure this scenario doesn’t happen to you or your client?  Or will you be retired by that point, and leave the “legacy” to someone else?

The Story of “Farakaveh”

Excuse me, which way is “Farakaveh”?

Not too long ago, a member of our production team was reviewing a transcript of an oral history interview before sending the completed work back to the client.

While the work was top-notch as usual, there was one word that just didn’t sit quite right with our eagle-eyed (or nitpicky, however you want to phrase it) production-er and he couldn’t bring himself to press “send.”

Instead, he took a few minutes to listen and re-listen to that little blip of audio but kept hearing the same thing the transcriber had: “Farakaveh.”

Eh, good enough… 

While putting the word in brackets with a question mark to indicate it as a guess and sending off the transcript might have been the next acceptable step, he just couldn’t let it go.

So he brought in some outside expertise, someone with a background that might help decipher the accent of the interviewee — a Jewish, rather Russian and very New Yawk elderly woman.  In this case, that “outside expert” happened to be ATC founder and president, Sandy Poritzky.

Bringing in the “Big Kahuna”

While we couldn’t quite get our top exec to sit down and listen to the audio on a pair of headphones (though we admit it is fun to picture that scenario in our minds), we did the next best thing by putting the printed transcript on his desk.  One quick read and Sandy recognized “Farakaveh” pretty much immediately.  It’s a little neighborhood in Queens, NY.  Probably better known as “Far Rockaway.”

The moral of the story?  

Well, it could be that we go the extra mile (yay!).  Or, it could be that we employ fiendishly detail-oriented and extremely cautious people (they’re our heart and soul!).  Or, it could be that sometimes the big boss actually does have all the answers (shudder).  Mostly, we like to think it illustrates another point: Transcription isn’t always just what you *think* you hear.

 

Archive of American Television – Client Spotlight April 2010

Oral History Meets TV = Transcription Bliss @ ATC

We here at the Audio Transcription Center have probably all watched more television in our lives than we care to admit.  Still, we like to think of ourselves less as gluttons for the tube and more as refined connoisseurs of the medium.  But, though we may try to act all cool, nothing sets us a-twitter like receiving a new transcription assignment from our friends at the Archive of American Television (you might know them better as “The Emmys”).

Wait, TV has an Archive?

Yup.  Founded in 1997, the Television Academy Foundation’s Archive of American Television is a treasure trove of one-on-one interviews with TV luminaries — from the early pioneers who shaped the medium, to beloved personalities of TV’s golden age; from the actors, actresses, news anchors, and hosts who’ve worked in front of the camera, to the directors, writers, composers and producers who’ve worked behind the scenes.

From the BoobTube to YouTube

And, as part of its vision “to chronicle electronic media history as it evolves… and make the interviews available worldwide,” the Archive has digitized over 2,000 hours of its original content, making hundreds of these interviews accessible online.  For FREE.

A few of the most recently posted interviews include:

  • Beloved children’s television performer (and avid sweater collector) Fred Rogers, of Mister Rogers’ Neighborhood.
  • Famously irreverent and ever insightful comedian George Carlin (ironically, the man who gave the world the “Seven Dirty Words You Can Never Say on Television,” which he discusses here).
  • Former Golden Girl who’s having a late career revival with roles in hit movies and TV commercials and an upcoming gig hosting Saturday Night Live, Betty White.

And of course our first love, Transcripts

In addition to having hundreds of interviews available to watch online, complete transcripts of most of the interviews are available from the Archive (for a fee, and for research purposes only).  For additional info, contact the Archive’s Digital Projects Manager, Jenni Matz.

 

We’re not your mother’s transcription service

As we talked about earlier this month, the cassette tapes and reels are just about gone.  Typewriters, of course, have all but completely disappeared (although we do keep one in the backroom just in case of all-out digital failure/crash; call us prepared/paranoid).  And don’t even get Sandy, our very old president, started about the ol’ cylinder Dictaphones.

 

SO WHAT’S THE DIFF?

But it’s not just the difference in technology that separates the Audio Transcription Center from the transcription services of yore (mother’s days).  It’s our attitude towards the business and our firm belief that we are only as good as the people we employ.


We strive to be more than just an office where an army of anonymous typists sit and click out transcripts day after day in stuffy cubicles.

Sure, we have a large staff (100 plus!) who all can type a minimum of 75 WPM, and who seem to work tirelessly at our 15 workstations, 24/7/365.  And, okay, there might be some cubicle-esque work areas here… but that’s where the similarities end.


A PLACE FOR BRAINIACS

But what we continually take pride in is that all of our staff members have so much more to offer than just speed, accuracy, and efficiency.  They’re brainiacs, to be honest.  At any given time, we employ some of the best and brightest transcriptionists with degrees ranging from BAs, to JDs, to PhDs.  To use a classic Sandy-ism, “Since when do Boston’s PhDs have to give up eating?”

Not only do our transcriptionists come to us well-educated (and usually hungry), they also come from a myriad of social and ethnic backgrounds with knowledge sets ranging from science and tech, to popular culture, to art history, to finance, to law, and more.

Working with this large and diverse pool of knowledge and talent allows us to custom match transcription projects to just the right person (or people) for the job.

Add that all together with our ability to handle pretty much any audio file, our streamlined work flow and digital workstations, and that’s what gives us the confidence to offer:

  • Incredibly fast (like blazing) project turnaround
  • 100% Quality Guarantee (or your money back)
  • Rush service at no extra charge (ever!)
 

WHAT ARE YOU, LIKE SUPERHEROES OR SOMETHING? 

Well, when it comes to transcription at least. And we certainly think we’d all look pretty sharp in capes…