Pepys

Guide

How to transcribe an interview for a dissertation

A methods-aware workflow for graduate researchers: choose a defensible transcription convention, format the transcript for your coding software, and handle consent and citation right.

The short answer

Transcribe your dissertation interviews in two passes: get a speaker-labeled AI draft, then correct it against the audio to a convention your methodology can defend – naturalized (Jeffersonian) verbatim for conversation or discourse analysis, denaturalized clean verbatim for thematic coding. Structure each turn by speaker so NVivo, ATLAS.ti, or MAXQDA can import it, keep timestamps for re-checking, and justify the choice in your methods chapter.

Transcription is a methodological choice, not clerical work

Transcription is an act of representation that shapes what your analysis can find (Oliver, Serovich & Mason, 2005). Every transcript reflects interpretive decisions about what you capture and representational decisions about how you render it (Bucholtz, 2000). Your committee will expect you to name the convention you used and defend it, not treat the file as a neutral copy of the audio.

So write the choice into your methodology chapter. State whether you transcribed naturalized or denaturalized verbatim, why that fits your analytic approach, and who did the transcribing. That paragraph is short, but examiners look for it. It shows you understood transcription as a research decision, not a typing task.

This guide covers the parts specific to a dissertation. For recording setup and the general AI-first-pass-then-clean workflow, start with the pillar on how to transcribe an interview, then come back here for convention, coding, and ethics.

Naturalized or denaturalized verbatim – which does your analysis need?

The core split is naturalism versus denaturalism. Naturalized verbatim keeps every utterance in as much detail as possible; denaturalized verbatim corrects grammar, removes stutters and pauses, and standardizes non-standard accents (Oliver, Serovich & Mason, 2005). Neither is more accurate. They answer different questions.

If you're doing conversation or discourse analysis, you need the detail. A standard orthographic transcript bleaches out how and when things are said, which is exactly what those methods analyze (Hoey & Kendrick, 2022). Jeffersonian notation captures it: overlaps, timed pauses, emphasis. For thematic or content analysis, denaturalized clean verbatim is usually enough and much faster to code.

Pick one before you edit a single line, because the choice changes every turn. Switching conventions halfway leaves an inconsistent corpus that's hard to defend and harder to code.

Get an AI first pass, then transcribe for your dissertation codebook

Doing it by hand is the slow way. Manual transcription of one hour of interview audio can take up to six hours of work (Haberl et al., 2023, citing Bell et al., 2018). Across a dozen interviews, that's weeks. An AI first pass turns each hour into minutes of processing plus focused correction against the audio.

Get a speaker-labeled first-pass draft, then correct it to your convention. Clean speaker separation matters more here than in journalism: coding software keys on who said what, so an interviewer's prompt shouldn't get coded as a participant's answer.

Read the draft against the recording and fix what AI misses: participant pseudonyms, domain terms, and numbers said quickly. Mark anything genuinely unclear as [inaudible] with its timestamp, so a coded excerpt stays honest rather than a confident guess.

Format your dissertation transcript for NVivo, ATLAS.ti, or MAXQDA

Your transcript has to import cleanly into your CAQDAS. NVivo reads plain text, rich text, or Word documents, optionally with a speaker column; ATLAS.ti accepts .doc, .docx, .rtf, .odt and .txt; MAXQDA takes the same formats. DOCX is the safe common denominator.

Structure matters as much as format. In MAXQDA, each contribution starts a new paragraph with the speaker's name followed by a colon, and the software auto-codes every turn to that speaker. Keep the pattern consistent – 'Interviewer:' and 'P07:' at each turn's start – and your speaker coding is done on import.

Export a coding-ready DOCX with those speaker labels intact for import, and keep a copy for your appendix if your program requires transcripts submitted with the thesis.

Consent, special-category data, and how to cite participants

Identifiable interview data carries obligations. Under the US Common Rule, private information for which a subject's identity may readily be ascertained is identifiable, and collecting it makes you a human-subjects researcher (45 CFR 46.102). That is what your IRB consent and data-handling plan exist to cover.

If your interviews reveal health, religion, ethnicity, political opinions, or sexual orientation, they may be special-category data whose processing is prohibited without an Article 9 condition under GDPR. Store identifiable audio access-controlled, and pseudonymize in the transcript copy while keeping an un-redacted master secure.

Cite participants correctly. In APA, you don't cite your own research participants as personal communications – you quote them directly from your data. Personal-communication citations are for outside interviews you reference, not for the interviews that are your dataset.

The steps, in order

  1. 01

    Choose your transcription convention first

    Decide naturalized (Jeffersonian) or denaturalized clean verbatim before you edit, based on your analytic method, and note the choice for your methodology chapter.

  2. 02

    Get a speaker-labeled AI first pass

    Upload each recording to get a speaker-labeled, timestamped draft in minutes instead of up to six hours of manual typing per audio hour.

  3. 03

    Correct against the audio to your convention

    Read the draft while listening. Fix pseudonyms, jargon, and numbers, mark unclear passages as [inaudible] with timestamps, and apply your chosen verbatim style consistently.

  4. 04

    Structure each turn by speaker

    Start every contribution on a new paragraph with the speaker name and a colon, so NVivo, ATLAS.ti, or MAXQDA can code by speaker on import.

  5. 05

    Export, cite, and store

    Export a DOCX for your coding software and appendix, quote participants directly rather than as personal communications, and keep identifiable audio access-controlled.

Tips from people who do this a lot

  • Write one paragraph in your methods chapter naming your transcription convention and why it fits your analysis. Examiners look for it, and it is easy marks.

  • Keep speaker labels identical at every turn ('Interviewer:', 'P07:'). Consistent turn structure lets MAXQDA and NVivo auto-code by speaker instead of you doing it by hand.

  • Denaturalized clean verbatim is enough for thematic coding. Save Jeffersonian detail for conversation or discourse analysis, where how something was said is the data.

  • Pseudonymize in a working copy and keep an un-redacted master in access-controlled storage, so you never lose true attribution when you need to verify a coded quote.

  • Keep timestamps on every transcript. When a coded excerpt gets challenged at your viva, you can jump to the audio and confirm the wording in seconds.

Try it now

Drop in your recording or paste a link and get a clean, speaker-labeled transcript in minutes. Your first 60 minutes are free.

or paste a link
InstagramTikTokYouTubeFacebookSpotifyApple Podcasts

60 min free · no card required · we never train on your audio

PodcasterJournalistContent creatorResearcherStudent
Trusted by 100,000+ creators, podcasters, journalists & researchers

How to transcribe an interview for a dissertation – questions, answered

Which transcription method should I use for my dissertation?

It depends on your analysis. Naturalized (Jeffersonian) verbatim, which keeps pauses, overlaps, and emphasis, suits conversation and discourse analysis. Denaturalized clean verbatim, which removes stutters and tidies grammar, is usually enough for thematic or content analysis. Pick one, apply it consistently, and justify the choice in your methods chapter.

How long does it take to transcribe an interview for a dissertation?

By hand, up to six hours per hour of audio, so a study with a dozen hour-long interviews can eat weeks. An AI first pass cuts each hour to minutes of processing plus focused correction against the recording, which is where your attention actually belongs.

How do I import my transcript into NVivo, ATLAS.ti, or MAXQDA?

All three read Word documents, so export to DOCX. NVivo also takes plain text and can use a speaker column; ATLAS.ti accepts .docx, .rtf, .odt, and .txt. Start each turn on a new paragraph with the speaker name and a colon so the software can code by speaker.

Do I need consent to transcribe research interviews?

Yes, through your ethics or IRB process. Under the US Common Rule, collecting private information that can identify a living person is human-subjects research. Interview content revealing health, religion, or ethnicity may also be special-category data under GDPR Article 9, so handle and store it accordingly.

How do I cite interview participants in APA?

Quote your own participants directly from your data. APA says not to cite people you interviewed for your own research as personal communications. Personal-communication citations are reserved for outside interviews you reference, not for the interviews that make up your dataset.

References

  1. 1.Oliver, Serovich & Mason (2005), Constraints and Opportunities with Interview TranscriptionSocial Forces (Oxford University Press)
  2. 2.Bucholtz (2000), The politics of transcriptionJournal of Pragmatics (Elsevier)
  3. 3.Hoey & Kendrick (2022), The Benefits of a Jeffersonian TranscriptFrontiers in Communication
  4. 4.Haberl et al. (2023), Take the aTrain – transcription time cost, citing Bell et al. (2018)arXiv / University of Graz
  5. 5.45 CFR 46.102 (Common Rule) – definition of identifiable private informationU.S. HHS / eCFR (Cornell Legal Information Institute)
  6. 6.GDPR Article 9 – processing of special categories of personal dataRegulation (EU) 2016/679 (official text CELEX:32016R0679)
  7. 7.APA Style – Personal communications and quoting your own participantsAPA Style
  8. 8.Import transcripts – supported formats and speaker columnNVivo Help (Lumivero / QSR International)
  9. 9.Supported text document file formatsATLAS.ti User Manual (Windows)
  10. 10.Importing transcripts – speaker turn structure and auto-codingMAXQDA Online Manual (VERBI Software)

Keep reading

Don't just take our word for it.

Ask ChatGPT, Claude, or Perplexity what Pepys is and who it's for. One click, and your favorite AI does the homework.

Get your transcript – free to start

Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.