transcribtxt
Guide 9 min read2026-06-07

How to Transcribe a Phone Interview: A Step-by-Step Guide

Record and transcribe a phone interview accurately: call-recording tips, AI transcription steps, realistic accuracy on compressed audio, and consent rules.

Record the call with a phone call-recording app, then upload the audio file to an AI transcription tool to get text. Expect lower accuracy than studio audio because phone lines compress sound: budget for roughly 88 to 94 percent on a typical call, then proofread names and key quotes before you publish.

Phone interviews are everywhere in journalism, research, and recruiting, but the audio is harder to transcribe than a face-to-face conversation. The phone network strips out part of the frequency range your voice normally occupies, and that missing detail is exactly what a transcription engine relies on. The good news: with a clean recording and the right workflow, you can still get a usable, accurate transcript in minutes. Here is how to do it end to end.

Step 1: Record the call cleanly

Everything downstream depends on the recording. A muddy file is the single biggest cause of bad transcripts, so spend a minute getting this right.

  1. Pick a recording method. On iPhone, recent iOS versions can record calls natively, or you can use a dedicated call-recording app. On Android, several call-recording apps capture both sides of the line. If app-based recording is unreliable on your device, put the call on speakerphone and record with a second phone or a voice recorder placed close to the speaker.
  2. Prefer a direct line over speakerphone. A direct in-app recording captures each side cleanly. Speakerphone adds room echo, picks up background noise, and can blur the two voices together, which makes speaker separation harder later.
  3. Find a quiet room. Close windows, turn off fans and notifications, and avoid coffee shops. Background noise competes with speech and drags accuracy down.
  4. Test for 10 seconds first. Record a short clip, play it back, and confirm both voices are audible before the real interview starts.
  5. Start recording early. Hit record before the conversation begins so you never lose the opening question or an unexpected early answer.

Save the file in a common format. Most call-recording apps export MP3 or M4A, both of which are ready to upload.

Step 2: Get consent before you record

Recording laws vary. Some places allow recording when just one party consents; others require every party on the call to agree. Rules differ by state and country, and the location of each person on the line can matter.

The safe habit: ask for permission at the start of the recording and capture the "yes" on tape. That protects you and creates a clear record. If you are unsure what applies to your situation, check the rules for your jurisdiction before you record. This is general guidance, not legal advice.

Step 3: Upload and transcribe

Once you have a clean file, the transcription itself is fast.

  1. Upload your audio. On TranscribTxt, drag in the recording. Supported inputs include MP3, M4A, WAV, MP4, MOV, and WebM, plus YouTube and other URLs, so most call-recording exports work directly.
  2. Let the model detect the language. TranscribTxt runs on ElevenLabs Scribe and auto-detects across 99 languages, so you usually do not need to set anything for an English-language interview.
  3. Turn on speaker labels. For a two-person interview, speaker labels separate the interviewer from the source. Labels are available on the Pro and Business plans and make a phone interview far easier to read and quote.
  4. Review the draft. The transcript appears with timestamps. Skim it against the audio, focusing on the moments that matter most.
  5. Export in the format you need. Download as TXT for a clean reading copy, SRT if you need timed captions, or JSON with timestamps if you are feeding the data into another tool.

Your uploaded audio is deleted after transcription, which matters when the recording contains a private conversation.

If you want the full general workflow beyond phone calls, see our guide to transcribing any interview.

Step 4: Understand realistic accuracy on phone audio

Set expectations before you start editing. Clean, close-mic studio audio commonly transcribes at around 95 to 98 percent accuracy. Phone and other compressed audio typically lands closer to 88 to 94 percent. The drop is not the tool failing; it is physics.

Three things work against a phone recording:

  • Narrow frequency range. Phone lines transmit a limited slice of the audio spectrum, so consonants and subtle sounds the engine uses to distinguish words are simply missing.
  • Compression artifacts. Cellular and VoIP codecs squeeze the signal, introducing distortion that smears word boundaries.
  • Dropped packets and lag. A spotty connection produces gaps and glitches that read as garbled words in the transcript.

You can push toward the top of that range by recording over a direct line, using a quiet room, and asking your source to avoid talking over you. For a deeper breakdown of what drives the numbers, read our AI transcription accuracy guide.

Step 5: Fix names, quotes, and terms

The proofreading pass is where a phone transcript goes from "draft" to "publishable." Focus your time where errors are most likely and most costly.

  1. Verify proper nouns first. Names, companies, places, and product names are the most common errors because the model cannot guess spelling from compressed audio. Check each against your notes or a quick search.
  2. Re-listen to key quotes. Before you put anything in quotation marks, play that segment back and confirm the wording word for word. A single transcription slip can change the meaning of a quote.
  3. Use Find and Replace for repeat terms. If your source uses a recurring acronym or technical term the engine misheard, fix every instance in one pass.
  4. Mark uncertain spots. Where the audio was unintelligible, leave a clear marker like [inaudible] rather than guessing, so you can follow up if needed.
  5. Keep the timestamps. They let you jump straight back to any moment in the recording during fact-checking.

For reporters working under deadline, our notes on transcription for journalism cover quote accuracy and sourcing in more depth, and our roundup of the best interview transcription software for journalists compares options for newsroom workflows.

Quick checklist

  • Record over a direct line, in a quiet room, starting before the call begins.
  • Confirm consent on the recording and check your local rules.
  • Upload the MP3 or M4A, enable speaker labels, and review with timestamps.
  • Expect roughly 88 to 94 percent accuracy and proofread accordingly.
  • Verify every name and re-listen to every quote before publishing.

Try it on your next call

You can transcribe a phone interview for free on TranscribTxt: the Free plan includes 5 files per month with no card required. If you record interviews regularly, Pro is $12 per month for 1,200 minutes with speaker labels, and Business is $29 per month for 6,000 minutes. Record cleanly, upload, and let the AI handle the first draft so you can spend your time on the words that matter.

Frequently Asked Questions

How do I record a phone interview to transcribe it?

Use a dedicated call-recording app on iPhone or Android, or a recorder app on a second device placed near the speaker. For the cleanest audio, record over a direct line rather than speakerphone. Always start recording before the conversation begins, then save the file as MP3, M4A, or WAV to upload later.

How accurate is transcription of a phone call?

Phone and compressed audio typically transcribe at around 88 to 94 percent accuracy, lower than the 95 to 98 percent you get from clean studio recordings. The narrow phone-line frequency range, background noise, and dropped packets all reduce accuracy. Quiet rooms, a direct line, and clear speakers push results toward the top of that range.

Is it legal to record a phone interview?

It depends on where you and the other person are located. Some regions allow recording with one party's consent, while others require all parties to agree. Laws vary by state and country, so check the rules that apply to your call and ask permission on the recording when in doubt. This is general information, not legal advice.

Can AI add speaker labels to a phone interview?

Yes. AI transcription tools can separate and label speakers, which is useful for two-person interviews. On TranscribTxt, speaker labels are available on Pro and Business plans. Labels work best when each voice is distinct and the line is clean; heavy overlap or a single muffled speakerphone can make separation less reliable.

What file formats can I upload for phone interview transcription?

TranscribTxt accepts common audio and video formats, including MP3, M4A, WAV, MP4, MOV, and WebM, plus YouTube and other URLs. Most call-recording apps export MP3 or M4A by default, so you can usually upload the file directly. Exports are available as TXT, SRT, and JSON with timestamps.