Guide

Qualitative research transcription

For researchers and grad students turning interviews and focus groups into coding-ready transcripts that hold up to a methods reviewer.

The short answer

Qualitative research transcription converts recorded interviews and focus groups into text for coding. Treat it as an interpretive first step of analysis, not clerical work. Choose a naturalized or denaturalized style to fit your research question. Document your conventions so transcripts stay consistent across a team. Anonymize direct identifiers to meet ethics and data-protection rules. Then export a speaker-labeled file your CAQDAS software can import.

Transcription is the first step of analysis, not a clerical task

Before it is a document, a transcript is a set of decisions. Bucholtz shows that transcribing involves both interpretive decisions (what gets represented) and representational decisions (how it is rendered on the page), and that an objective transcription is not possible (Bucholtz 2000). What you leave in and what you smooth out will steer the reading that follows.

Julia Bailey makes the same point for applied health research: putting speech into written form is an interpretive process and the first step in analysing data (Bailey 2008). Treat the transcript as analysis in progress. The person who transcribes is already coding, informally, by choosing where a sentence ends and whether a laugh belongs on the page.

That does not mean typing every word yourself. Manual transcription can run up to six hours for a single hour of audio (Haberl et al. 2023, citing Bell et al. 2018), which is time taken away from analysis. A practical route is the AI first pass, then a careful clean that experienced interviewers use: let a tool produce a speaker-labeled draft, then read it against the audio and make the interpretive calls yourself.

Naturalized and denaturalized transcripts answer different questions

The verbatim question has a standard vocabulary. Oliver, Serovich and Mason distinguish naturalism, where every utterance is captured in as much detail as possible, from denaturalism, where grammar is corrected and interview noise like stutters and pauses is removed (Oliver, Serovich & Mason 2005). Neither is more correct; they suit different analyses.

Match the style to your method. Conversation analysis and discourse work need a naturalized transcript, because pauses, overlaps and false starts are the data. Thematic or content analysis usually reads better denaturalized, where the focus is what participants mean across cases. Decide before you edit, since retrofitting one style onto the other means re-listening to everything.

Whatever you pick, apply it to every transcript in the study so cases stay comparable. If you are writing up a thesis, a dissertation methods chapter has to justify that choice and show an examiner how much transcription detail you kept.

Which transcription conventions should you document?

Rigor here is mostly about what you write down. Poland argued that transcription quality is an aspect of trustworthiness, and that methods sections should report the steps taken to ensure audiotape quality, the directions given to transcribers, and an assessment of the transcript's trustworthiness (Poland 1995). A reviewer cannot judge a transcript when the making of it is invisible.

For any project with more than one transcriber, write the rules down. McLellan, MacQueen and Neidig note there are no universal transcription formats adequate for all qualitative data, so teams need systematic, documented procedures for preparing text (McLellan, MacQueen & Neidig 2003). A one-page style sheet settles the recurring questions.

Cover the decisions that come up on every line: how you mark pauses and overlaps, whether you keep fillers, how you spell non-standard speech, what a laugh or an [inaudible] gap looks like, and how you label speakers. Store that sheet with the data so a second coder, or your future self, transcribes the next file the same way.

Anonymization and consent set the boundaries of what you keep

Most qualitative recordings are human-subjects data. Under the Common Rule, a human subject is a living individual an investigator studies, and identifiable private information is information for which the subject's identity may readily be ascertained (45 CFR 46.102). A transcript full of names, employers and places is exactly that kind of information.

If your data touch health information, the HIPAA Safe Harbor method lists 18 identifiers that must be removed, from names through any other unique identifying code (45 CFR 164.514). Even outside HIPAA, that list is a useful checklist for what to strip from a transcript before it circulates.

For EU participants, remember that pseudonymization is a weaker safeguard than anonymization. GDPR Recital 26 states that pseudonymized data still counts as personal data, and only truly anonymized data falls outside the rules (GDPR Recital 26). In practice, replace names with role labels as you clean the draft, keep an access-controlled master with the real identities, and use a tool that does not train on your files. Pepys never trains on your audio or transcripts, and anonymous jobs auto-delete about 12 hours after upload.

Preparing a qualitative research transcription for coding

Coding starts from a clean, speaker-attributed file. Get the interview or focus group down to a speaker-labeled first-pass draft with timestamps, correct whatever the tool misheard against the audio, then decide on your import format before you leave your editor.

Your CAQDAS software is particular about formats. NVivo imports plain text, CSV/TSV, rich text or Word, with an optional speaker-name column. ATLAS.ti accepts .doc, .docx, .rtf, .odt and .txt. Dedoose supports .doc, .docx, .rtf, .txt and .pdf. A DOCX export is the safe common denominator across all three, so a single file imports cleanly whichever package your team uses.

Get those four decisions right, style, conventions, anonymization and export format, and the transcript will hold up when a reviewer asks how it was made.

The steps, in order

01
Choose a verbatim style up front
Decide on naturalized or denaturalized transcription before you start, based on whether pauses and overlaps are your data or a distraction. Apply the same style to every recording in the study.
02
Get an AI first pass, then clean it
Upload the recording to get a speaker-labeled, timestamped draft in minutes, then read it against the audio and fix names, jargon and crosstalk yourself instead of typing from scratch.
03
Write and follow a transcription protocol
Document how you mark pauses, fillers, laughter and speakers in a one-page style sheet, and store it with the data so every transcriber works the same way.
04
Anonymize before the transcript circulates
Replace names, employers and places with role labels, using the HIPAA 18-identifier list as a checklist, and keep the real identities in an access-controlled master file.
05
Export a coding-ready file
Save to DOCX, which NVivo, ATLAS.ti and Dedoose all import, keeping speaker labels and any timestamps intact, then bring it into your CAQDAS project.

Tips from people who do this a lot

Transcribe your first two or three interviews yourself even if a tool does the rest; you will hear analytic themes early and calibrate your conventions before the study scales.
Keep the audio timestamp on every speaker turn. When a quote goes into your write-up, you can jump back and confirm it in seconds rather than scrubbing the file.
For focus groups, label the moderator distinctly from participants (M, P1, P2) so you can separate facilitation from data when you code.
Anonymize a copy and keep the master intact. If you redact your only file, you lose the ability to re-check who said what.
Write the style sheet before you transcribe the first file. Retrofitting conventions across a finished corpus is the slowest fix in the whole project.

Try it now

Drop in your recording or paste a link and get a clean, speaker-labeled transcript in minutes. Your first 60 minutes are free.

or paste a link

60 min free · no card required · we never train on your audio

Trusted by 100,000+ creators, podcasters, journalists & researchers

Qualitative research transcription – questions, answered

What is qualitative research transcription?

It is the process of turning recorded interviews, focus groups or observations into written text for coding and analysis. It is an interpretive first step of analysis, since choices about what to represent and how to render it shape the reading that follows, so researchers treat the transcript as data in its own right.

Should I use naturalized or denaturalized transcription?

Match the style to your method. Naturalized transcription captures every utterance in detail and suits conversation and discourse analysis, where pauses and overlaps are the data. Denaturalized transcription corrects grammar and removes interview noise, and reads better for thematic or content analysis focused on meaning across cases.

How do I anonymize a qualitative transcript?

Replace direct identifiers with role labels as you clean the draft. The HIPAA Safe Harbor list of 18 identifiers, from names to unique codes, is a practical checklist. Under GDPR, pseudonymized data is still personal data, so keep the key linking labels to identities in an access-controlled master file.

What file format should I use for NVivo, ATLAS.ti or Dedoose?

DOCX works across all three. NVivo also accepts plain text, CSV, TSV and rich text with an optional speaker column; ATLAS.ti takes .doc, .docx, .rtf, .odt and .txt; Dedoose supports .doc, .docx, .rtf, .txt and .pdf. Export a Word file with speaker labels intact for the simplest import.

How long does it take to transcribe qualitative data?

By hand, expect up to six hours of work per hour of audio, which is time taken from analysis. An AI first pass produces a speaker-labeled draft in minutes; you then spend your effort reading it against the recording and correcting names, jargon and crosstalk rather than typing every word.

References

1.Bucholtz (2000), The politics of transcription – Journal of Pragmatics 32(10):1439–1465 (via UC eScholarship)
2.Bailey (2008), First steps in qualitative data analysis: transcribing – Family Practice 25(2):127–131 (Oxford Academic)
3.Poland (1995), Transcription quality as an aspect of rigor in qualitative research – Qualitative Inquiry 1(3):290–310 (SAGE)
4.McLellan, MacQueen & Neidig (2003), Beyond the qualitative interview: data preparation and transcription – Field Methods 15(1):63–84 (SAGE)
5.Oliver, Serovich & Mason (2005), Constraints and opportunities with interview transcription – Social Forces 84(2):1273–1289 (Oxford Academic)
6.Haberl et al. (2023), Take the aTrain – transcription time cost, citing Bell et al. (2018) – arXiv:2310.11967
7.45 CFR 46.102 – definitions of human subject and identifiable private information – Cornell Legal Information Institute
8.45 CFR 164.514(b)(2) – HIPAA Safe Harbor 18 identifiers – Cornell Legal Information Institute
9.GDPR Recital 26 – pseudonymization vs anonymization – gdpr-info.eu
10.NVivo 12 Help – import audio/video transcripts (supported formats) – QSR International (NVivo)
11.ATLAS.ti Windows User Manual – supported file formats – ATLAS.ti
12.Dedoose Learning Center – supported file formats – Dedoose

Keep reading

Don't just take our word for it.

Ask ChatGPT, Claude, or Perplexity what Pepys is and who it's for. One click, and your favorite AI does the homework.

Ask ChatGPT Ask Claude Ask Perplexity

Get your transcript – free to start

Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.

Start free – 60 minutes or see pricing