Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
Discussion Groups
General
GeneralPortable MacsHardwareNetworking
Applications
Mac ApplicationsEudoraFirefox / MozillaInternet ExplorerOutlook ExpressMS OfficeEntourageExcelPowerPointWordVirtual PCMedia PlayerOther MS Products
Programming
Mac ProgrammingCodeWarriorPerl
Country Specific
Australian Mac GroupUK Mac Group

Mac Forum / Applications / Mac Applications / January 2005



Tip: Looking for answers? Try searching our database.

hardware and software for recording and transcribing interviews

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
MaryMc - 31 Dec 2004 06:51 GMT
I've just started evaluating the technology I'm going to need to help
me do the interviews for my doctoral dissertation.  I expect to be
doing a dozen or more one-on-one interviews, some in person and some by
telephone.  They will probably last in the neighborhood of 1-4 hours
each.  I'm planning to record them in some form and transcribe them
into Word documents.  I use a PowerBook G4 1.5 GHz,  running OS 10.3.7,
with 1 GB of memory.

As I see it, I have two options:

--Get a digital voice recorder and speech recognition software, and
have my computer do the transcribing (realizing that, if this is even
do-able, I'll have to do some significant editing and cleanup of the
transcripts it produces); or

--Get a dictation machine with foot pedals and type in the suckers
myself.

Obviously, the first approach has the potential to save me a lot of
work--*IF* it works as planned.  Has anybody out there done this?  I'd
really like to hear about your experiences.  In particular, can you
tell me:

--Did you use MacSpeech, Inc.'s iListen, IBM's Via Voice, or some other
speech recognition/transcription software?  What did you think of it?

--How did it work for transcribing interviews with a variety of people?
(from what I've read, to get the best results, you're supposed to
"train" the software to work with your voice and speech patterns--is
this going to be a big problem when I'm recording a number of different
voices?)

--Did you try having the software transcribe recordings made over the
phone?  How well did that work?  (Olympus makes a "Telephone Recording
Device" to go with their Mac-compatible digital recorders, but it
doesn't appear to plug directly into the phone line--it records the
output from the telephone receiver.  Has anybody worked with a setup
like that?)

--Do you have any experience with using speech recognition software
with recordings made on a lower-end, consumer-quality digital recorder
(the ones I'm seeing are in the $150-$200 range) vs. a higher-end,
professional-quality one (probably $400 and up)?  Do I really need to
spend the money to get the recording quality I need for decent speech
recognition results?

--Any other advice you can share?

And, if I go the lower-tech, type-by-hand route, do you have any advice
or suggestions about that?  In particular, any recommendations on
cassette vs. digital recorders, in terms of sound quality, ease of use,
and any other relevant features?

Thanks much!

Signature

MaryMc
marymc11@SPAM.BEGONE.mindspring.com
                                )
                               (
"Espresso est, ergo cogito."  C[_]

Alice Faber - 31 Dec 2004 14:49 GMT
> I've just started evaluating the technology I'm going to need to help
> me do the interviews for my doctoral dissertation.  I expect to be
[quoted text clipped - 3 lines]
> into Word documents.  I use a PowerBook G4 1.5 GHz,  running OS 10.3.7,
> with 1 GB of memory.

There's a third possibility. Linguists have a variety of transcription
tools, but they all involve simplifying the process of listening,
sometimes repeatedly, to recordings. If you make a digital recording,
you can use a variety of waveform display programs that let you
highlight subsections, play them, and write your transcription to a
time-aligned tier. Praat, for instance, while it does so much more (most
of which is probably unnecessary for your purposes), would let you
delineate utterances on one tier, transcribe them on another, and then,
via a macro, export the transcription of each utterance to a text file,
together with a time-stamp (how many seconds into your file the
utterance starts). You'd also be able to define an additional tier, for
comments about whatever you want to comment on. There are undoubtedly
other tools that would let you do this, perhaps with built-ins replacing
the macro you'd have to define, but this is the program that I'm
familiar with.

Several of our researchers have had good luck making recordings with a
Marantz digital recorder, in WAV format. This recorder writes the files
to one of the flash cards, which can be read by an off-the-shelf card
reader. As a result, getting the files on to a laptop is a piece of
cake, just a regular copying operation in the finder. Another researcher
has made decent recordings with a mini-disc. But, unless you get a
"professional" level recorder, you lose one of the major advantages of
using mini-disc. The consumer recorders only have analog output, so
getting the files to your computer involves digital-to-analog *and*
analog-to-digital conversions. Aside from the loss of recording quality,
this takes a lot of time.

Signature

AF
"Non Sequitur U has a really, really lousy debate team."
             --artyw raises the bar on rec.sport.baseball

matt neuburg - 31 Dec 2004 15:42 GMT
> I've just started evaluating the technology I'm going to need to help
> me do the interviews for my doctoral dissertation.  I expect to be
[quoted text clipped - 10 lines]
> do-able, I'll have to do some significant editing and cleanup of the
> transcripts it produces)

Not possible. No speech recognition software can transcribe a stranger's
voice - that's not how it works. See my reviews of ViaVoice and iListen
in TidBITS to learn how it does work. The only speech that speech
recognition software can transcribe is yours, because you've trained it
extensively, and you must speak directly to it.

> --Get a dictation machine with foot pedals and type in the suckers
> myself.

Uck. What I would do is something in between. First, I'd use dictation /
transcription *software*. This puts the business of repeating sections
of the recording into the computer, so you're using the mouse and
keyboard, not foot pedals. There are many interesting possibilities; you
might look at Listen & Type:

http://www.nattaworks.com/english.html

or at LoopyLoop:

http://www.loopyloop.com/pages/547871/index.htm

There are others, too - this why there is Google and versiontracker. Go
out and find one that suits you.

Anyhow, having gotten some dictation software of that sort, I would then
use it to transcribe the interviews. Here, a mixture of typing and
speech recognition could be used, e.g. listen to a stretch of what the
interviewee said, then say it yourself, to your speech recognition
software, as a way of entering it as text.

> --Did you use MacSpeech, Inc.'s iListen, IBM's Via Voice, or some other
> speech recognition/transcription software?  What did you think of it?

I believe that part of what you are asking is how accurate speech
recognition is these days. In general, I would have to say that its
accuracy is quite astonishing. Naturally, things become tricky as your
vocabulary becomes more specialized. And you must take the time to train
your speech recognition software, and you must be alert enough to permit
it to train you. If you are willing to do that, I think you will find
speech recognition software to be little short of miraculous. This
entire paragraph  was entered using only my voice, and as you can see,
no mistakes were made. Actually, I did have to make one correction: the
computer incorrectly capitalized one word. But it made far fewer
mistakes then I would have made typing the same material manually.

m.

Signature

matt neuburg, phd = matt@tidbits.com, http://www.tidbits.com/matt/
AppleScript: The Definitive Guide
http://www.amazon.com/exec/obidos/ASIN/0596005571/somethingsbymatt
Read TidBITS! It's free and smart. http://www.tidbits.com

Dave Hinz - 31 Dec 2004 16:04 GMT
> Not possible. No speech recognition software can transcribe a stranger's
> voice - that's not how it works. See my reviews of ViaVoice and iListen
> in TidBITS to learn how it does work. The only speech that speech
> recognition software can transcribe is yours, because you've trained it
> extensively, and you must speak directly to it.

I believe you, but can you tell me how the voice-recognition
systems that, for instance, my bank uses work?  Is it just because
the limited number of things it needs to check against make it work
with a variety of voices?  I've tried toying with it, and it accepts
and understands a wide variety of inflections and bad voicings.
matt neuburg - 31 Dec 2004 19:05 GMT
> > Not possible. No speech recognition software can transcribe a stranger's
> > voice - that's not how it works. See my reviews of ViaVoice and iListen
[quoted text clipped - 6 lines]
> the limited number of things it needs to check against make it work
> with a variety of voices?

Yes, exactly so. That's also how Mac's own "Apple Speakable Items"
works. No training needed, because we just compare against a very, very
limit repertoire. Plus it's a distinct short phrase definitively bounded
by silence - this avoids the whole "where are the pieces" problem, which
is very complex indeed. Continuous speed recognition is a completely
different technology (and, itself, has nothing whatever to do with how
your ear/brain works - that's *another* technology). See, as I said
before, my TidBITS articles on all of this. I get quite deep into the
technical details. m.

Signature

matt neuburg, phd = matt@tidbits.com, http://www.tidbits.com/matt/
AppleScript: The Definitive Guide
http://www.amazon.com/exec/obidos/ASIN/0596005571/somethingsbymatt
Read TidBITS! It's free and smart. http://www.tidbits.com

Wayne C. Morris - 31 Dec 2004 19:41 GMT
> > Not possible. No speech recognition software can transcribe a stranger's
> > voice - that's not how it works. See my reviews of ViaVoice and iListen
[quoted text clipped - 7 lines]
> with a variety of voices?  I've tried toying with it, and it accepts
> and understands a wide variety of inflections and bad voicings.

Speech recognition can be made to work very well with a wide range of
voices if it doesn't have to recognize continuous speech (i.e. you pause
between words), or if it only needs a limited vocabulary, or if it only
needs to recognize one person's voice.
pfw - 31 Dec 2004 17:46 GMT
>And, if I go the lower-tech, type-by-hand route, do you have any advice
>or suggestions about that?  In particular, any recommendations on
>cassette vs. digital recorders, in terms of sound quality, ease of use,
>and any other relevant features?

I record a lot of interviews and frequently have to  transcribe small
portions by hand.

I use an Olympus DS-330 ($130 at my local Best Buy electronics store).
It is a small handheld digital recorder. You can get an attachment to
use it for telephone interviews through the Olympus web site and
probably at a Radio Shack. But I've not tried one of these.

The recorder comes with software for the PC and the Mac so that you
can download your audio into the computer and listen to it through the
computer while you transcribe. This software also helps you organize
and store the audio on the computer.

The recorder holds 2.5 to 5 hours of audio, depending on the quality
setting you pick. If the subject is near the recorder, speaks loudly
and clearly and there are few extraneous sounds, you probably can use
the long-play, lower-quality setting with no problem. You should
experiment with the player to learn the best settings for your needs
before you take it into the field.

I've used it on my desktop WinXP machine, a laptop Win2000 machine,
and on my iBook G4 with OSX 10.3.x.

I found the recorder to be extremely helpful, one of the best tools
I've ever bought for my work. Note, however, that the playback
software is better on Windows than on the Mac.

On the Windows side, when you are listening to the audio, you can use
the function keys on the computer keyboard to control the playback
while you are transcribing it in a word processor.  It's very easy,
while, say, typing in Word, to just reach up and press a function key
to pause, rewind a little, fast-forward, slow down the playback, speed
up the playback, etc.

The function keys don't work on the Mac version.

On the Mac side, if you need to pause or rewind, etc., you have to
switch back-and-forth between the word processor and the audio
software. And you have to take a hand off the keyboard and use the
mouse or trackpad to press the on-screen buttons through the software.
Somewhere I saw some documentation that claimed you can use the
function keys on the Mac side, but I have yet to make this work. I've
tried. This flaw takes away about half the usefulness of the system on
a Mac.

The only alternative I've found is to buy an add-on footpedal from
Olympus. It would plug into the Mac and you can use that to control
playback. But I've not done that.
Joshua Steinberg - 01 Jan 2005 14:14 GMT
Yup, I'm afraid Matt's right in his reply to you.  Can't be done.
Speech Recognition (SR) software requires a lot of effort just to get it
to understand your single voice under perfect conditions.  Then ruin
that sound quality by running it through a recorder and things get
tougher.  But introduce another voice that the software isn't tuned for,
and forget it.

I doubt there's any way around recording and manually transcribing the
interviews.  Whether you record it digitally or analog, and whether you
type it on a computer or a typewriter, it's still gonna be the same
process of listen-type-listen-type-listen-type, catch your breath, drink
some coffee, stretch your aching back, listen-type-listen-type, etc.

I'm sure a zillion other doctoral students, especially in the social
sciences and humanities, have faced the same issue.  Talk to them.

-- Josh

> I've just started evaluating the technology I'm going to need to help
> me do the interviews for my doctoral dissertation.  I expect to be
[quoted text clipped - 50 lines]
>
> Thanks much!
Tim Murray - 01 Jan 2005 18:51 GMT
> --Get a dictation machine with foot pedals and type in the suckers
> myself.

For a heavy workload of manually transcribing tapes, well worth the cost.

> --Did you try having the software transcribe recordings made over the
> phone?  How well did that work?  (Olympus makes a "Telephone Recording
> Device" to go with their Mac-compatible digital recorders, but it
> doesn't appear to plug directly into the phone line--it records the
> output from the telephone receiver.  Has anybody worked with a setup
> like that?)

I use a TMP-636S from dynametric.com.  It hooks directly in line with your
telephone and sends output to the microphone jack of a computer or a tape
recorder.  But I've never used a software form of a transcriber; only manual.
The Tufted Nerble - 01 Jan 2005 20:37 GMT
Hi, Original Poster. I didn't save your inquiry, but here's a URL that
might be helpful....

http://www.next-wave-solutions.com/mailsoncd.html

I have neither experience nor connection with this supplier, but the
specs look promising for you.

Goof luck.

Signature

Tufty
--
The President's deep-space initiative:
Send a nucula rocket to the Horsehead Neblear.

MaryMc - 01 Jan 2005 20:41 GMT
> I use a TMP-636S from dynametric.com.  It hooks directly in line with your
> telephone and sends output to the microphone jack of a computer or a tape
> recorder.  But I've never used a software form of a transcriber; only manual.

Does this work with a Mac?  I can only find reference to Windows
software on the website.
Tim Murray - 02 Jan 2005 01:47 GMT
>> I use a TMP-636S from dynametric.com.  It hooks directly in line with your
>> telephone and sends output to the microphone jack of a computer or a tape
[quoted text clipped - 3 lines]
> Does this work with a Mac?  I can only find reference to Windows
> software on the website.

Yes, because the device doesn't care about what's on the other end. It just
pumps the signal to a plug, and it's the user's responsibility to capture it
using whatever can record from the Mic port.
MaryMc - 05 Jan 2005 05:58 GMT
THANK YOU to everyone who has responded with their advice and input.  I
can't tell you how helpful your suggestions have been.  You've probably
saved me a whole lot of time, money and frustration.

I posted this query in a few different places--here, plus a newsgroup
for medical transcriptionists, and a bulletin board for people doing
research similar to mine.  Here's a summary of the main points I
gathered from all the responses:

Using speech recognition software to transcribe interviews with
multiple interviewees sounds great, but it probably won't work in the
real world.  There were widely differing opinions on the speech
recognition software in general--some people think it's all rudimentary
and flawed at best, some say it can be surprisingly accurate and
useful--but several people pointed out that a lot depends on training
it to work with your own voice and speech patterns.  It's just not
likely to perform well with recordings of many different speakers--the
amount of time you'd spend correcting and cleaning up the output will
probably negate the time you'd save on the initial data entry.  Some
people suggested playing back the recordings of the interviews and
repeating them in chunks in my own voice into the computer's
microphone, but that doesn't really sound like a viable option to
me--it seems too time-consuming to be much of a time-saver, for one
thing.

Several people said they prefer digital recorders over cassettes
because you get better quality recordings.  One person said that
digital recorders can pick up lower-volume speech that cassette
recorders can't.  A few people said that even lower-end digital
recorders produce good results on the highest-quality recording mode
(several people recommended recording only in that mode).   But a major
limitation on the cheaper models is storage space--you may not be able
to fit all of a long interview in the high-quality mode in the storage
capacity they offer.  One person recommended a higher-end model for
this reason, preferably with removable storage media.  Several people
mentioned Olympus as a good brand (and I can attest that they have good
tech support--I called them the other day with some questions about
their products, and the person I talked with was extremely helpful and
knowledgeable).  One person did point out that it's easier to lose (or
accidentally delete) a digital recording file than it is to erase a
whole interview on tape.

Several people recommended investing in a really good microphone--not
the one that came with the recorder.  One person suggested getting a
two-channel recorder and separate microphones for you and your
interviewee, so you can separate voices when both of you speak at once
(she said she learned this from a journalist who does interviews for a
living).  If you're using just one mic, another person recommended
using a tabletop model, since a lapel mic on your interviewee may not
pick up your voice as well.  He also pointed out that you and your
interviewee aren't tethered to a tabletop mic, and can just walk away
from it if necessary without being unhooked.  He said that his tabletop
mic--a Sony ECM-R100 with "boundary effect" and a "noise reduction"
setting--worked very well.  Still another person said he thinks the
built-in microphones in current-model digital recorders are very good
and an add-on mic isn't necessary.

Several people emphasized the importance of taping the interviews in a
quiet setting, as free from background noise as possible (several
strongly recommended against coffee houses, bars, restaurants and other
public locations, and one warned about noise from wind, air
conditioning, etc. that you won't hear, but the microphone will).
Recordings with less junk in the background are easier to transcribe,
and if you're paying someone to do it for you, you'll end up paying
them less.

For recording a telephone conversation, one person recommended the
TMP-636S from Dynametric.com. It plugs directly into your phone line
and sends output to the microphone jack of a computer or a tape
recorder.  He said it works on Windows or Mac (although the company's
website only mentions PC).  NextWaveSolutions.com also makes a
Telephone Record Coupler (#TRX2035B) that does the same thing.

Lots of people recommended a playback device with foot pedal controls
(the professional transcriptionists think you'd be crazy not to use
one).  You can get these to use with both cassette and digital
recordings.  Many people suggested looking for a used one on eBay, and
selling it there when you're done.

Opinions among the researchers about paying a transcriptionist were
mixed.  Some said that doing it yourself lets you immerse yourself in
the data in a way that's invaluable to the analysis.  Others were glad
they'd found the money to invest a transcriptionist, and didn't think
they missed anything of value in letting go of that task.  Some said
that what they got from the transcriptionist was really a first draft,
and they spent a fair amount of time correcting and polishing it, so
they got plenty familiar with the data in that process.

A couple of the transcriptionists recommended word expansion software,
like SpellCatcher.  With this program, besides the spell checking
functions, you can create shortcuts that let you type in a few letters
or a short abbreviation, and it automatically expands this into a
longer word or phrase.  I can imagine this being very handy for some of
the cumbersome and repetitive terminology that a lot of us use in our
writing.  It's available for Mac and Windows from
http://www.rainmakerinc.com/.  They have a 15-day free trial download
on the website, and the full version is $40.  I'm going to try it.

And finally, one person pointed out that at least one qualitative data
analysis program--HyperResearch for the Mac--lets you code audio
recordings directly, without having to transcribe the entire interview.
You'd only have to type the utterances you were quoting directly in
your paper.

I hope all this is as helpful to others here as it has been to me!

Signature

MaryMc
marymc11@SPAM.BEGONE.mindspring.com
                                )
                               (
"Espresso est, ergo cogito."  C[_]

matt neuburg - 05 Jan 2005 07:28 GMT
Just a couple of further notes:

> For recording a telephone conversation, one person recommended the
> TMP-636S from Dynametric.com. It plugs directly into your phone line
> and sends output to the microphone jack of a computer or a tape
> recorder.  He said it works on Windows or Mac (although the company's
> website only mentions PC).  NextWaveSolutions.com also makes a
> Telephone Record Coupler (#TRX2035B) that does the same thing.

You can record a telephone conversation into your computer through your
modem port; there is no need to purchase special equipment.

> A couple of the transcriptionists recommended word expansion software,
> like SpellCatcher

You might look at other choices, as discussed in this thread:

 <http://db.tidbits.com/getbits.acgi?tlkthrd=2397>

m.

Signature

matt neuburg, phd = matt@tidbits.com, http://www.tidbits.com/matt/
AppleScript: The Definitive Guide
http://www.amazon.com/exec/obidos/ASIN/0596005571/somethingsbymatt
Read TidBITS! It's free and smart. http://www.tidbits.com

 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2009 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.