How to Transcribe a Sales Call: A Step-by-Step Guide
Turn a sales call to text: record from your dialer or Zoom, upload, separate rep and prospect with speaker labels, and log objections and next steps to your CRM.
Record the call from your dialer, Zoom, or phone, then upload the audio file to an AI transcription tool to get text. Turn on speaker labels to separate the rep and prospect, read through for objections and next steps, and paste a short summary into your CRM. Expect lower accuracy on VoIP audio than studio recordings.
A clean transcript turns a sales call from a fading memory into a searchable record. You get exact quotes for handoffs, accurate notes for the CRM, and a way to review what actually worked. The catch is that sales calls are usually recorded over VoIP or a phone line, which is harder to transcribe than a face-to-face conversation. With the right workflow you can still get a usable, accurate transcript in minutes. Here is how to do it end to end. For the bigger picture on coaching, pipeline review, and team rollout, see our guide to transcription for sales teams.
Step 1: Record the call from your dialer, Zoom, or phone
Everything downstream depends on the recording, so spend a minute getting this right.
- Use your existing tool's recorder. Most dialers and VoIP platforms have a built-in record button, and Zoom, Teams, and Meet all record to a local or cloud file. A direct in-app recording captures each side of the line cleanly, which is what you want.
- Avoid speakerphone when you can. Speakerphone adds room echo, picks up background noise, and can blur the rep and prospect into one voice, which makes speaker separation harder later. If you must use a phone, a direct call-recording app beats holding a second device near a speaker.
- Find a quiet room. Close the door, mute notifications, and step away from open-office noise. Background sound competes with speech and drags accuracy down.
- Start recording early. Hit record before the conversation begins so you never lose the opening rapport or an early commitment.
Save the file in a common format. Most dialers and meeting tools export MP3, M4A, or MP4, all of which are ready to upload.
Step 2: Get consent before you record
Recording laws vary. Some places allow recording when just one party consents; others require every party on the call to agree. Because reps and prospects are frequently in different states or countries, you can have two different rules on the same call.
A few practical habits keep you on safe ground:
- Announce it. A simple line like "I record my calls for note-taking, is that okay with you?" sets a clear expectation and is often enough on its own.
- Know the difference. "One-party" consent means only one person on the call needs to agree; "all-party" consent means everyone does. Some US states and many countries lean toward all-party rules.
- Verify local law. Check the rules that apply to both your location and the prospect's before you hit record, and when in doubt, ask on the recording.
This is general information, not legal advice. For a fuller walkthrough, read do you need consent to record and transcribe a meeting.
Step 3: Upload the file and transcribe it
Once you have the recording, transcription is the easy part.
- Upload the audio or video. Drag the file into your transcription tool. TranscribTxt accepts common formats including MP3, M4A, WAV, MP4, MOV, and WebM, plus URLs.
- Pick the language. TranscribTxt runs on ElevenLabs Scribe and supports around 99 languages, so multilingual or accented calls are handled well.
- Wait for the result. A typical sales call transcribes in minutes rather than the hours manual transcription would take.
You can try TranscribTxt free with up to 5 files a month and no card required. Paid plans add capacity: Pro is $12 a month for around 1,200 minutes, and Business is $29 a month for roughly 6,000 minutes. After a file is transcribed, the audio is deleted.
Step 4: Turn on speaker labels to separate rep and prospect
A wall of unattributed text is hard to act on. Speaker labels group the transcript by voice so you can instantly see who raised an objection and who made a commitment.
On TranscribTxt, speaker labels are available on Pro and Business plans. They work best when each voice is distinct and the line is clean; a shared speakerphone or heavy crosstalk can make separation less reliable. If you want to understand how this technology decides who is speaking, speaker diarization explained covers it in plain terms. The technique is the same one used to transcribe a phone interview cleanly.
Step 5: Extract what matters and log it to your CRM
The transcript is the raw material; the value is in what you pull out of it. Read through with the deal in mind and capture:
- Objections. Pricing, timing, competitors, or internal blockers the prospect raised, in their own words.
- Next steps. The specific action agreed on and the date attached to it, so nothing slips.
- Commitments. What each side promised, from a follow-up demo to a budget sign-off.
- Signals. Budget, timeline, and the names of other stakeholders who need to be in the loop.
Export the transcript as TXT, SRT, or JSON. A short summary of those points pasted into the CRM record is usually more useful to your team than the full word-for-word text, though keeping the full transcript attached is handy for disputes or coaching reviews later.
A note on accuracy
Be realistic about VoIP and phone audio. It typically transcribes at around 88 to 94 percent accuracy, below the 95 to 98 percent you would expect from a clean studio recording. The phone network compresses the frequency range and occasionally drops packets, and that missing detail is exactly what the engine relies on. A direct dialer recording, a quiet room, and one person speaking at a time all push results toward the top of that range. Whatever you get, proofread names, numbers, and any quote before you forward it.
That is the whole loop: record cleanly, get consent, upload, label the speakers, and log the parts that move the deal. Do it consistently and every call becomes a searchable asset instead of a half-remembered conversation.
Frequently Asked Questions
How do I transcribe a sales call?
Record the call from your dialer, Zoom, or phone, then upload the audio or video file to an AI transcription tool to get text. Turn on speaker labels so the rep and prospect are separated, then read through to pull out objections, commitments, and next steps. Finally, paste the summary or full transcript into your CRM record.
Is it legal to record a sales call?
It depends on where you and the prospect are located. Some regions allow recording with one party's consent, while others require everyone on the call to agree. Because reps and buyers are often in different states or countries, the safest habit is to announce the recording at the start and confirm the rules that apply. This is general information, not legal advice.
How accurate is transcription of a VoIP or phone sales call?
VoIP and phone audio typically transcribe at around 88 to 94 percent accuracy, below the 95 to 98 percent of clean studio recordings. Phone lines compress the frequency range and drop packets, which costs detail the engine relies on. A direct dialer recording, a quiet room, and one speaker at a time push results toward the top of that range.
Can AI separate the sales rep from the prospect?
Yes. Speaker labels group the transcript by voice so you can see who said what. On TranscribTxt, speaker labels are available on Pro and Business plans, which is useful for two-party sales calls. Separation works best when each voice is distinct and the line is clean; heavy crosstalk or a shared speakerphone can make it less reliable.
What should I log from a sales call transcript?
Focus on what moves the deal: objections the prospect raised, commitments either side made, the agreed next step and date, budget or timeline signals, and the names of other stakeholders. A short summary of these points pasted into the CRM is usually more useful to your team than the full word-for-word transcript.