Pepys

Guide

How to import a transcript into NVivo

A step-by-step guide for qualitative researchers: format a speaker-labeled transcript so NVivo imports it cleanly and autocodes who said what.

The short answer

To import a transcript into NVivo, export it as a .docx with each speaker's name at the start of its own line, then choose Import, Documents, and select the file. NVivo also accepts .doc, .rtf, .txt, and .pdf. After importing, run Autocode to create a case for each speaker. Attaching a transcript to an audio or video file is a separate workflow that maps timespan, speaker, and content columns.

Two ways to import a transcript into NVivo

NVivo handles a transcript two ways, and the choice decides how you code the rest of the project. Bring it in as a plain document and you get one source to code by hand or autocode by speaker. Attach it to an audio or video file instead and NVivo imports timed rows.

The timed-rows path is a different job. You map one field to Timespan and one to Content, with Speaker as an optional third. NVivo reads timespans written with a hyphen or a forward slash between the start and end. A transcript entry there is a start-to-end time paired with what was said.

For most coding work you want the plain-document path. It's simpler and it autocodes by speaker, without asking you to align text to media timings you may not need. This guide covers that route. The timed-transcript route is worth knowing only if you're analyzing the audio alongside the text.

Format the DOCX before you import it

The formatting you do before import decides whether autocoding works. NVivo's autocode-by-speaker needs each speaker's name at the very start of a line, with nothing in front of it, no tabs and no spaces. Each speaker needs a unique, consistent name. That is the rule the official help states.

The name can sit on the same line as the response or on its own line above it. A colon after the name is a readable convention many transcripts use, but NVivo's help doesn't require one. What it checks is the line-start position and a unique name. Keep 'Interviewer' as 'Interviewer' throughout, and don't switch to 'Q' halfway.

Start from a transcript that already carries speaker labels. If you have the recording, run it through speaker diarization to get who-said-what, then export a Word .docx with each name on its own line. Typing that hour by hand would cost up to six hours of manual work, so an AI first pass plus light cleanup is far quicker.

What NVivo imports, and who makes it now

NVivo imports five document formats: Word .docx, Word 97-2003 .doc, Rich Text .rtf, Plain Text .txt, and PDF .pdf. The current help lists exactly these. For a transcript you'll code, .docx is the safe default, because it keeps the paragraph structure autocoding reads.

Choose Word over PDF when you can. A PDF imports fine, but its text layer is less predictable, and autocode-by-speaker depends on clean line starts that a .docx preserves. Plain .txt works too if your transcript is simple and unformatted.

One naming note that trips up citations: NVivo is now a Lumivero product. QSR International became Lumivero, LLC effective July 1, 2023. The current help lives on Lumivero's community site, though the older QSR-hosted help pages still describe the same import behavior.

Autocode by speaker or by heading style

NVivo gives you two autocoding routes once the document is in, and they produce different things. Autocode by speaker creates a case for each unique speaker and codes their turns to that case. Autocode by heading style creates a code for each styled paragraph instead.

The style route suits a structured document. NVivo creates a code for every paragraph in a chosen style and codes the text beneath it, with the order of styles setting the nesting hierarchy. That's how you'd code a semi-structured interview by its standard questions.

Use speaker autocoding to separate participants, and use style autocoding to separate topics or questions. Many projects use both. For how the coding choice ties back to your method and verbatim style, see the qualitative-research transcription companion.

Will the same import work on NVivo for Mac?

Mostly, with one caveat around autocoding. The import formats match, so Word, RTF, TXT, and PDF open on both Mac and Windows. But NVivo's Mac help says not all autocoding methods are available to Mac users, or to Windows users on organization accounts.

An independent CAQDAS review from the University of Surrey reached the same read: the Mac version has most, but not all, of the Windows functionality. Autocode by speaker and by paragraph style both exist across platforms, but the exact options can differ.

If your team crosses platforms, confirm the autocoding step you rely on before you commit a project to one machine. Either way, the upstream work is the same: produce a clean, speaker-labeled interview transcript first, then import and code.

The steps, in order

  1. 01

    Format the transcript for autocoding

    In your Word file, put each speaker's name at the very start of a line with no tabs or spaces in front, and use one unique, consistent name per speaker.

  2. 02

    Save it as .docx

    Export the transcript as Microsoft Word .docx. NVivo also accepts .doc, .rtf, .txt, and .pdf, but .docx best preserves the line structure autocoding reads.

  3. 03

    Import it into NVivo

    In NVivo, go to Import, then Documents, and select your file. It comes in as a document source you can read, code, and query.

  4. 04

    Autocode by speaker

    Run Autocode on the document and choose the speaker option. NVivo creates a case for each unique speaker and codes their turns to it.

  5. 05

    Check and refine the coding

    Open the cases to confirm each speaker's turns landed correctly, fix any mislabeled names in the source, and re-run autocoding if needed.

Tips from people who do this a lot

  • Name speakers consistently across every file in a project. 'P01' and 'P02' beat 'Interviewer' and 'Me', which collide once you import several transcripts.

  • A colon after the speaker name is fine for readability, but it's the line-start position and unique name that autocoding actually checks.

  • Prefer .docx over .pdf for anything you'll autocode. A PDF's text layer can break the clean line starts NVivo looks for.

  • If you're analyzing the audio alongside the text, attach the transcript to the media file and map Timespan, Speaker, and Content instead of importing a plain document.

  • Keep heading styles in your Word file if you want to autocode by question or topic. NVivo turns each styled paragraph into its own code.

Try it now

Drop in your recording or paste a link and get a clean, speaker-labeled transcript in minutes. Your first 60 minutes are free.

or paste a link
InstagramTikTokYouTubeFacebookSpotifyApple Podcasts

60 min free · no card required · we never train on your audio

PodcasterJournalistContent creatorResearcherStudent
Trusted by 100,000+ creators, podcasters, journalists & researchers

Import transcript into nvivo – questions, answered

What file format should I use to import a transcript into NVivo?

Word .docx is the safest choice. NVivo also imports .doc, .rtf, .txt, and .pdf, but .docx best preserves the paragraph and line structure that autocode-by-speaker relies on. Use plain .txt only for simple, unformatted transcripts, and avoid PDF when you plan to autocode.

How does NVivo know who each speaker is?

Autocode-by-speaker reads the speaker's name at the start of each line. The name must sit at the very start with no tabs or spaces before it, and each speaker needs a unique name. NVivo then creates a case per speaker and codes their turns to it.

Do I need a colon after each speaker's name?

No. NVivo's autocoding checks that the name is at the start of a line and unique, not that it ends with a colon. A colon is a readable convention many transcripts use, but it isn't required for autocode-by-speaker to work.

Can I import a transcript that's tied to my audio or video?

Yes, but it's a separate workflow. Instead of importing a plain document, you attach the transcript to the media file and map columns: one to Timespan, one to Content, and optionally one to Speaker. Timespans use a hyphen or a forward slash between start and end.

Does importing work the same on NVivo for Mac?

Import formats match across Mac and Windows, so Word, RTF, TXT, and PDF open on both. Autocoding is the caveat: NVivo's own help notes not all autocoding methods are available to Mac users, so confirm the speaker or style option you need before you rely on it.

References

  1. 1.Files: documents you can import (docx, doc, rtf, txt, pdf)QSR International / Lumivero (official NVivo help)
  2. 2.Automatic coding in documents (speaker name at line start; heading-style coding)QSR International / Lumivero (official NVivo help)
  3. 3.Automatic coding techniques (paragraph styles vs a case per speaker)QSR International / Lumivero (official NVivo help)
  4. 4.Import audio and video transcripts (map Timespan, Content, Speaker)QSR International / Lumivero (official NVivo help)
  5. 5.Notice of legal entity change to Lumivero, LLC (effective July 1, 2023)Lumivero (official newsroom)
  6. 6.NVivo for Mac coding (not all autocoding methods available to Mac users)QSR International / Lumivero (official NVivo for Mac help)
  7. 7.NVivo R1 distinguishing features (Mac not yet fully equivalent to Windows)CAQDAS Networking Project, University of Surrey (Silver & Bulloch, 2021)
  8. 8.Haberl et al. (2024), Take the aTrain – manual transcription time cost, citing Bell et al. (2018)Journal of Behavioral and Experimental Finance (Elsevier)

Keep reading

Don't just take our word for it.

Ask ChatGPT, Claude, or Perplexity what Pepys is and who it's for. One click, and your favorite AI does the homework.

Get your transcript – free to start

Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.