Pepys

Guide

How to transcribe a podcast

Turn an episode into a transcript you can publish, repurpose, and rank – without rewatching or retyping a thing.

The short answer

To transcribe a podcast, upload the episode audio (or paste the link) to a transcription tool and get a clean, speaker-labeled, timestamped transcript in minutes. Then repurpose it: pull show notes and chapters from the timestamps, lift quotes for social, and publish the transcript on the episode page so search engines and listeners can find it. One episode becomes a transcript, notes, clips, and a searchable page from a single source file.

Use the cleanest master you have

Transcribe from the highest-quality file you can reach – ideally the edited master (WAV or a high-bitrate MP3), not the compressed version your host re-encodes for streaming. The clearer the audio, the fewer corrections you'll make, especially on guest names and technical terms. If your episode lives only as a published link, you can paste that instead, but a clean export will always read better.

If you record multitrack (each speaker on their own track), keep those tracks – they make speaker labeling far more reliable than a single mixed-down file, because the tool isn't untangling two voices that overlap. For a solo show none of this matters; for a two-host plus guest episode it's the difference between clean labels and a manual cleanup pass.

Transcribe after your edit, not before. If you cut tangents, fix audio, or drop a segment in post, transcribing the final cut means your transcript, chapters, and show notes all match what listeners actually hear – no phantom timestamps pointing at content you removed.

Turn the transcript into show notes and chapters

A timestamped transcript is the raw material for everything you publish around the episode. Read it once and mark the natural beats – the intro, each major topic, the guest's best story, the takeaway. Those marks become your chapter markers (with the timestamps already attached) and the skeleton of your show notes. You're editing a structure that exists, not inventing one from a blank page.

Built-in AI summaries shortcut the first draft: get a tight episode summary and a chapter list straight from the transcript, then tighten the wording in your own voice. The goal isn't to publish the AI's words verbatim – it's to skip the slowest part (figuring out what the episode was even about) and spend your time on polish.

Pull three or four quotable lines while you're in there. Punchy, self-contained sentences make the best episode descriptions, newsletter teasers, and quote cards – and you already have them timestamped, so you can grab the exact audio for an audiogram.

Publish the transcript for SEO and accessibility

Audio is invisible to search engines; a transcript is not. Posting the full text on the episode's page turns a 45-minute conversation into thousands of indexable words, so people searching for the topics, names, and phrases you discussed can actually land on your show. For many independent podcasts, transcript pages are the single biggest source of organic discovery.

Transcripts are also an accessibility baseline. Listeners who are deaf or hard of hearing, people in sound-off environments, and anyone who'd rather skim than listen all get access when the text is on the page. It's the same artifact doing double duty – reach and inclusion from one transcript.

Format it for humans, not just crawlers: keep speaker labels, break it into the same sections as your chapters, and link timestamps back to the player if your site supports it. A wall of unbroken text technically ranks but nobody reads it; a structured transcript gets used.

Make captions and audiograms from the same file

If you cut video clips or audiograms for social, you need captions – and your transcript already contains them. Export the transcript as SRT or VTT and you've got subtitle files with correct timestamps, ready to drop onto a clip. Captioned clips dramatically outperform silent, uncaptioned ones in feeds where most people watch with sound off.

For an audiogram, find the standout 30-to-60-second moment using your quote index, grab that timestamp range, and pair the waveform with the matching caption lines. Because everything traces back to one transcript, your clip captions and your published transcript say exactly the same thing – no retyping, no drift.

Translate the transcript to reach non-English listeners: a translated transcript plus translated captions opens the episode to audiences who'd otherwise bounce. It's a low-effort way to extend a single episode's reach across languages.

The steps, in order

  1. 01

    Start from the final, clean master

    Use the edited episode export (WAV or high-bitrate MP3), or paste the published link. Transcribe the final cut so notes and chapters match what airs.

  2. 02

    Get a speaker-labeled transcript

    Upload it to Pepys for a clean, timestamped, speaker-labeled transcript in minutes – with AI summary and chapters built in.

  3. 03

    Build show notes and chapters

    Use the summary and timestamps to draft notes and chapter markers, then tighten them in your own voice. Pull three or four quotable lines.

  4. 04

    Publish the transcript on the episode page

    Post the full, structured transcript for SEO and accessibility – keep speaker labels and section it like your chapters so people actually read it.

  5. 05

    Export captions and clips

    Export SRT/VTT for captioned audiograms and video clips, and translate the transcript to reach listeners in other languages.

Tips from people who do this a lot

  • Transcribe the final edit, never a rough cut – it keeps your transcript, chapters, and timestamps pointing at content that's actually in the episode.

  • Multitrack recordings make speaker labels far cleaner; keep the per-speaker tracks instead of only the mixdown.

  • Treat the transcript page as a real landing page: structure it with headings and speaker labels so it ranks and reads, not just ranks.

  • Captioned clips beat silent ones in sound-off feeds – your SRT export gives you captions for free, so always add them.

  • Keep a running quote index from the timestamps; it's the fastest way to source audiograms, newsletter teasers, and pull quotes later.

Try it now

Drop in your recording or paste a link and get a clean, speaker-labeled transcript in minutes. Your first 60 minutes are free.

or paste a link
InstagramTikTokYouTubeFacebookSpotifyApple Podcasts

60 min free · no card required · we never train on your audio

PodcasterJournalistContent creatorResearcherStudent
Trusted by 100,000+ creators, podcasters, journalists & researchers

How to transcribe a podcast – questions, answered

How do I transcribe a podcast episode?

Upload the episode audio – or paste its link – to a transcription tool and get a clean, speaker-labeled, timestamped transcript in minutes. From that one file you can pull show notes, chapters, quotes, captions, and a publishable transcript page, without rewatching or retyping anything.

Can I publish the transcript on my website for SEO?

Yes – and you should. Audio isn't indexable, but a transcript turns a long episode into thousands of searchable words, so people searching your topics and guests can find the show. Keep speaker labels and structure it into sections so it reads well, not just ranks.

How do I get show notes and chapters from a transcript?

Use the built-in AI summary and the timestamps as a first draft: the summary gives you the episode gist and the timestamped beats become chapter markers. Then tighten the wording in your own voice and pull a few quotable lines for descriptions and social.

Can I make captions for clips from the transcript?

Yes. Export the transcript as SRT or VTT and you have subtitle files with correct timestamps, ready to drop onto audiograms or video clips. Captioned clips perform far better in sound-off social feeds than uncaptioned ones.

What audio quality do I need for a good transcript?

Transcribe from the cleanest master you have – ideally the edited WAV or a high-bitrate MP3 rather than a re-compressed streaming version. Multitrack recordings also improve speaker labeling, since the tool isn't untangling overlapping voices from a single mixed file.

Keep reading

Get your transcript – free to start

Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.