Pepys
12,438,517 minutes transcribed

Video to Text

Turn any video into accurate, speaker-labeled text – upload a video file or paste a link and get a clean transcript of every word spoken, in minutes.

Or paste a link: Instagram · TikTok · YouTube · Facebook · Spotify · Apple Podcasts

Accepts MP4, MOV, MKV, AVI, WebM, M4V and other video files – or a link · returns a clean, timestamped transcript of everything spoken, ready to edit and export.

60 min free · no card required · we never train on your audio

Podcaster · Journalist · Content creator · Researcher · Student
Trusted by 100k+ users · Rated 4.9 out of 5 by 100k+ users

How do I convert a video to text for free?

Pepys converts video to text by transcribing the spoken audio with AI. Upload a video file (MP4, MOV, MKV and more) or paste a link, and you get a clean, timestamped, speaker-labeled transcript in minutes, in 99+ languages, with AI summaries built in. This reads the speech, not on-screen text. First 60 minutes free, no card.

How video to text works

01. Upload a video or paste a link

Drop in any video file or paste a link – any format, any language. We read the audio track.

02. Get your transcript

AI transcribes the spoken words into clean, speaker-labeled text with timestamps, ready in minutes.

03. Edit and export

Fix anything inline, then export to TXT, Markdown, DOCX, PDF, SRT, VTT, or JSON.

Got a recorded webinar, a course lecture, a customer call, or a clip you want in writing? Typing it out by hand means scrubbing back and forth for hours. Pepys does it in minutes: upload the video or paste a link and get back a clean, accurate transcript of every word spoken that you can search, quote, and repurpose.

To be clear, this transcribes the speech in your video, not the text printed on screen – Pepys listens to the audio track, so slides, captions burned into the frame, and signs in the background aren't what you get. Every transcript comes speaker-labeled and timestamped, with AI summaries and chapters built in, and we never train on your video. Pay only for what you transcribe; credits never expire.

Clean paragraphs. No more um's and ah's.

The left is what Pepys hands back – logical paragraphs with the filler stripped out, punctuated and readable. The right is the raw, one-line-per-segment dump most transcribers leave you with.

Raw (reel-voiceover.mp4)

um so yeah everyone keeps telling you to like lead with your best line right but uh honestly if you give away the whole answer in the first second you know there's basically no reason for anyone to keep watching so the hook isn't kind of the smartest thing you say it's like a loop you open that they need to close and um that's the part that actually keeps people around
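For a rough sense of what filler stripping involves, here is a minimal Python sketch. It is not how Pepys cleans transcripts: the filler list and the `strip_fillers` helper are hypothetical, and it only removes unambiguous fillers such as "um" and "uh". Words like "like" or "you know" depend on context, so this sketch leaves them alone, and it does not add punctuation or paragraph breaks.

```python
import re

# Hypothetical list of unambiguous fillers (an assumption for illustration).
# Context-dependent words such as "like" or "you know" are left untouched.
FILLERS = ("um", "uh", "ah", "er")

# Match a filler as a whole word, plus an optional trailing comma or period
# and any whitespace that follows it.
_FILLER_RE = re.compile(
    r"\b(?:" + "|".join(FILLERS) + r")\b[,.]?\s*",
    re.IGNORECASE,
)


def strip_fillers(text: str) -> str:
    """Remove standalone filler words and collapse leftover whitespace."""
    cleaned = _FILLER_RE.sub("", text)
    return re.sub(r"\s{2,}", " ", cleaned).strip()


if __name__ == "__main__":
    raw = "um so yeah everyone keeps telling you uh honestly"
    print(strip_fillers(raw))
```

The word boundaries matter here: without them, "ah" would match inside "yeah" and "um" inside "umbrella".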
  • Transcribes the spoken audio in your video – any format, any length

  • Speaker labels and timestamps on every transcript

  • AI summaries, chapters, and chat built in – not a separate ChatGPT trip

  • 99+ languages, auto-detected · we never train on your video

Any language – 99+ detected automatically

Works with the platforms you live in.

Paste a link from YouTube, TikTok, Instagram, Facebook, Spotify, or Apple Podcasts – or drop in any audio or video file. We transcribe it once, then you export it however your workflow needs.

  • YouTube
  • TikTok
  • Instagram
  • Facebook
  • Spotify
  • Apple Podcasts
  • or any file

Export to any format

  • TXT
  • Markdown
  • DOCX
  • PDF
  • SRT
  • VTT
  • JSON

Timestamps, speaker labels, and subtitle timing carry through to every export.
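To show what carrying timestamps and speaker labels into a subtitle export can look like, here is a minimal Python sketch that renders transcript segments as SRT. The `Segment` shape, the field names, and the "Speaker: text" layout are assumptions for illustration, not Pepys's actual data model or export code.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Segment:
    """Hypothetical transcript segment; field names are assumptions."""
    start: float   # start time in seconds
    end: float     # end time in seconds
    speaker: str
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the SRT timestamp HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        timing = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        blocks.append(f"{index}\n{timing}\n{seg.speaker}: {seg.text}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        Segment(0.0, 2.5, "Speaker 1", "The hook is a loop you open."),
        Segment(2.5, 5.0, "Speaker 2", "And the viewer needs to close it."),
    ]
    print(to_srt(demo))
```

A VTT export would follow the same pattern, with a `WEBVTT` header line and a period instead of a comma before the milliseconds.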

Video to text – questions, answered

How do I convert a video to text for free?

Upload your video file or paste a link on this page – your first 60 minutes are free, no card required. Pepys transcribes the spoken audio into a clean, timestamped transcript in minutes that you can edit and export.

What video formats can I transcribe?

MP4, MOV, MKV, AVI, WebM, M4V, and most other common video formats. You can also paste a link to a video instead of uploading a file.

Does this read the text shown on screen?

No – Pepys transcribes the speech in the video's audio, not on-screen text or graphics. It's not OCR. If no one is talking, there's nothing to transcribe.

Do I need to extract the audio first?

No. Upload the video as-is – Pepys reads the audio track for you, so there's no need to convert it to MP3 or pull the audio out beforehand.

Do you train on my video?

Never. We don't train AI on your video or transcripts, and you can auto-delete your files after processing.

Video to text – free to start

Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.