Pepys

Guide

How to approach oral history transcription

A working guide for oral historians and archivists: how to capture, label, edit, and rights-clear a transcript that can stand next to the recording in a repository.

The short answer

To transcribe an oral history, start with an archival recording (WAV or BWF, at least 48 kHz, 24-bit), then get a speaker-labeled draft and audit-edit it line by line against the audio. Label the two voices narrator and interviewer, bracket editorial insertions and inaudible spots, and let the narrator review and correct the transcript before it's deposited with the recording.

The recording is the archival object – capture it to standard

Preservation standards set the floor before you press record. IASA-TC 04, the standard for sound and audiovisual archives, specifies uncompressed WAV or BWF at a sampling rate of at least 48 kHz, and advises 24-bit depth (IASA-TC 04 §5.7.2). MP3 and other data-reduced formats don't meet those standards, so record lossless from the start.

No tool transcribes what the microphone missed. Put the recorder close to the narrator, off any hard surface that booms, and away from HVAC vents and ticking clocks. A lavalier clipped near the collar beats a phone across the kitchen table. The same technique carries over from any interview transcription workflow: mic close, and check your levels before you start.

Open with a spoken slate: your name, the narrator's name, the date, and the place. It anchors which voice is which and timestamps the session before the first question.

Consent and rights: the deed of gift does the legal work

A deed of gift is the document that transfers rights. The Oral History Association treats it as one of the instruments that governs ownership of an interview, alongside deposit agreements and jurisdictional law. These agreements should clearly articulate the rights being transferred (OHA Archives Principles, p.15–17).

Copyright is shared by default. Under OHA guidance, an oral history is co-created, so copyright most often belongs to both the interviewer and the narrator until a signed agreement transfers it (OHA Archives Principles, p.9); the narrator owns the words they speak (OHA, For Participants). That's the general pattern, and your repository's counsel writes the actual forms.

A narrator and repository can agree to restrict an interview for a set period, even after it's been gifted. The repository must honor those access terms over the life of the interview (OHA Archives Principles, p.9, p.16). Transferring ownership and opening the interview to researchers are two separate decisions.

Label the two voices as narrator and interviewer

Oral history has its own naming convention. The Oral History Association uses narrator for the person interviewed and interviewer for the person asking, not 'Speaker 1' and 'Speaker 2' (OHA Archives Principles). Baylor's Institute for Oral History uses interviewee for that same role (Baylor Style Guide). This guide follows OHA and labels the two voices narrator and interviewer; whichever you pick, rename the raw labels once, up front.

Get the split before you get the names. A diarization pass that separates the two speakers gives you a draft with the turns already broken out. You then relabel 'Speaker 1' as the narrator and 'Speaker 2' as the interviewer. Per-channel recording, one track per person, makes that split far cleaner where your setup allows it.

Most oral histories are two voices, which keeps labeling simple. Where a spouse, a second narrator, or a family member joins, name each one and note the addition. A later researcher should be able to tell who is speaking without playing the tape.

Oral history transcription is an audit-edit against the recording

Baylor's Institute for Oral History runs an 'audit-check' pass. An audit-checker or final editor listens against the recording to catch wrong words, homophones, and mishearings (Baylor Style Guide, p.4, p.16). The corrected transcript is measured against the source audio, and the edit itself stays light by design.

Editing a labeled draft beats typing from a blank page. Manual transcription can run up to about six hours per hour of audio (Haberl et al., 2023). Start from a speaker-labeled first pass and most of that time turns into a focused audit-edit.

Reserve square brackets for material not on the recording. In Baylor's convention, brackets hold editorial insertions the editor or reviewer adds, never the narrator's spoken words (Baylor Style Guide, p.7). Settle your editing style up front, too: an archival audit-edit sits nearer the strict end of the verbatim spectrum than a cleaned-up reading copy.

For the hard spots, follow set marks rather than guessing silently. Where the audio is uncertain, type your best guess, underline it, and add '(??)'. Where no guess is possible, leave a blank line of about the right length, then '(??)' (Baylor Style Guide, p.21). Mark overlap as '(both talking at once)' or '(speaking at same time)', never '(interrupts)', which editorializes.

On false starts and repetition, Baylor steers a middle course. Leave in enough to show individual speech patterns, keep a repetition when it carries emphasis in the voice, and don't render stuttering unless it's intentional (Baylor Style Guide, p.11). The goal is to preserve how the narrator actually spoke.

The narrator reviews the transcript before it's deposited

The narrator gets the last read. OHA's participant guidance gives the narrator a chance to review the recording and transcript, make changes, restrict access, or withdraw the interview before it's shared (OHA, For Participants). Their corrections shape the copy that enters the archive.

Baylor formalizes this as a review phase. The narrator's notes and clarifications are bracketed into the final transcript, tagged like '[Smith note: ...]', so the addition reads as the narrator's own and not the editor's (Baylor Style Guide, p.7).

Keep the corrected transcript with the recording; the audio stays the primary object. OHA directs archives to keep audiovisual components alongside transcripts, metadata, and documentation of any restrictions (OHA Archives Principles, p.8). Export the finished transcript to DOCX for an access copy, then deposit it with the master audio and the signed deed of gift.

The steps, in order

  1. 01

    Record to archival spec

    Capture uncompressed WAV or BWF at a sampling rate of at least 48 kHz, 24-bit if you can. Mic the narrator close, cut background noise, and slate the names, date, and place.

  2. 02

    Clear consent and rights

    Have the narrator sign a legal release or deed of gift that articulates the rights transferred and any access restrictions or embargo period before the interview is used.

  3. 03

    Get a speaker-labeled first pass

    Upload the recording for a diarized, timestamped draft, then relabel the two voices as narrator and interviewer instead of 'Speaker 1' and 'Speaker 2'.

  4. 04

    Audit-edit against the recording

    Listen line by line, fix homophones, names, and mishearings, reserve square brackets for editorial insertions, and mark inaudible spots with a best guess and '(??)'.

  5. 05

    Send it to the narrator for review

    Let the narrator correct, annotate, or restrict the transcript. Bracket their notes and clarifications into the final version.

  6. 06

    Deposit the transcript with the recording

    Export an access copy, such as DOCX, and archive it alongside the master audio, the metadata, and the signed agreement.

Tips from people who do this a lot

  • Slate the recording out loud – your name, the narrator's name, the date, the place – so speaker labels and consent are timestamped before the first question.

  • Record lossless WAV or BWF, not MP3. You can always make an MP3 access copy later; you can't recover the detail a lossy codec discarded.

  • Keep the '(??)' marks in the deposited copy. A flagged uncertain word is honest; a confident wrong word becomes a misquote in someone's research.

  • Don't silently fix a narrator's factual slip. Bracket it or leave it for the review phase, where the narrator can correct the record themselves.

  • Store the signed deed of gift with the transcript and audio. Access rights are only as good as the paperwork a future archivist can actually find.

Try it now

Drop in your recording or paste a link and get a clean, speaker-labeled transcript in minutes. Your first 60 minutes are free.

or paste a link
InstagramTikTokYouTubeFacebookSpotifyApple Podcasts

60 min free · no card required · we never train on your audio

PodcasterJournalistContent creatorResearcherStudent
Trusted by 100,000+ creators, podcasters, journalists & researchers

Oral history transcription – questions, answered

What audio format should an oral history be recorded in?

Record uncompressed WAV or BWF at a sampling rate of at least 48 kHz, with 24-bit depth advisable, following the IASA-TC 04 archival standard. Avoid MP3 and other data-reduced formats for the master; they don't meet preservation standards. Export a smaller MP3 access copy from the lossless original when you need one.

What's the difference between a narrator and an interviewer?

In oral history the narrator is the person being interviewed, the source telling their story, and the interviewer is the person asking questions. That narrator/interviewer pairing is the Oral History Association's convention; Baylor's Institute for Oral History uses interviewee for the same role. Label the two voices clearly in the transcript, not 'Speaker 1' and 'Speaker 2'.

Who owns the copyright to an oral history interview?

Under Oral History Association guidance, copyright is co-created and most often belongs to both the interviewer and the narrator until a signed agreement transfers it; the narrator owns the words they speak. A deed of gift or legal release transfers those rights to a repository. That describes the general pattern, and specific advice comes from your repository's counsel.

How do you mark an inaudible passage in an oral history transcript?

Following Baylor's style guide, type your closest guess, underline it, and add '(??)' when you can approximate the words. Where no guess is possible, leave a blank line of about the right length followed by '(??)'. Keep these marks in the archived copy so the uncertainty stays visible to a future researcher.

Can the narrator change or restrict the transcript after the interview?

Yes. Oral History Association guidance lets the narrator review the recording and transcript, make corrections, restrict access for a period, or withdraw the interview before it's shared. Corrections are typically bracketed into the final transcript, which is then preserved alongside the recording, its metadata, and documentation of any restrictions.

References

  1. 1.OHA Archives Principles and Best Practices Complete Manual (Adopted Oct 2019)Oral History Association
  2. 2.For Participants in Oral History InterviewsOral History Association
  3. 3.Style Guide: A Quick Reference for Editing Oral History Transcripts (Revised May 2018)Baylor University Institute for Oral History
  4. 4.IASA-TC 04, §5.7.2 Field Recording StandardsInternational Association of Sound and Audiovisual Archives (IASA)
  5. 5.Haberl et al. (2023), Take the aTrain – transcription time cost, citing Bell et al. (2018)arXiv / University of Graz

Keep reading

Don't just take our word for it.

Ask ChatGPT, Claude, or Perplexity what Pepys is and who it's for. One click, and your favorite AI does the homework.

Get your transcript – free to start

Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.