Video to Text
Turn any video into accurate, speaker-labeled text – upload a video file or paste a link and get a clean transcript of every word spoken, in minutes.
Accepts MP4, MOV, MKV, AVI, WebM, M4V and other video files – or a link · returns a clean, timestamped transcript of everything spoken, ready to edit and export.
60 min free · no card required · we never train on your audio
How do I convert a video to text for free?
Pepys converts video to text by transcribing the spoken audio with AI. Upload a video file (MP4, MOV, MKV and more) or paste a link, and you get a clean, timestamped, speaker-labeled transcript in minutes, in 99+ languages, with AI summaries built in. This reads the speech, not on-screen text. First 60 minutes free, no card.
How video to text works
Upload a video or paste a link
Drop in any video file or paste a link – any format, any language. We read the audio track.
Get your transcript
AI transcribes the spoken words into clean, speaker-labeled text with timestamps, ready in minutes.
Edit and export
Fix anything inline, then export to TXT, Markdown, DOCX, PDF, SRT, or VTT.
Got a recorded webinar, a course lecture, a customer call, or a clip you want in writing? Typing it out by hand means scrubbing back and forth for hours. Pepys does it in minutes: upload the video or paste a link and get back a clean, accurate transcript of every word spoken that you can search, quote, and repurpose.
To be clear, this transcribes the speech in your video, not the text printed on screen – Pepys listens to the audio track, so slides, captions burned into the frame, and signs in the background aren't what you get. Every transcript comes speaker-labeled and timestamped, with AI summaries and chapters built in, and we never train on your video. Pay only for what you transcribe; credits never expire.
Clean paragraphs. No more um's and ah's.
The left is what Pepys hands back – logical paragraphs with the filler stripped out, punctuated and readable. The right is the raw, one-line-per-segment dump most transcribers leave you with.
um so yeah everyone keeps telling you to like lead with your best line right but uh honestly if you give away the whole answer in the first second you know there's basically no reason for anyone to keep watching so the hook isn't kind of the smartest thing you say it's like a loop you open that they need to close and um that's the part that actually keeps people around
RawTranscribes the spoken audio in your video – any format, any length
Speaker labels and timestamps on every transcript
AI summaries, chapters, and chat built in – not a separate ChatGPT trip
99+ languages, auto-detected · we never train on your video
Any language – 99+ detected automatically
- English
- 中文
- Español
- العربية
- हिन्दी
- Français
- 日本語
- Português
- Русский
- Deutsch
- 한국어
- Italiano
- বাংলা
- Türkçe
- فارسی
- Tiếng Việt
- தமிழ்
- Polski
- ไทย
- Українська
- Nederlands
- עברית
- Ελληνικά
- తెలుగు
- Bahasa Indonesia
- اردو
- Svenska
- मराठी
- Română
- Magyar
- Čeština
- ગુજરાતી
- Kiswahili
- ქართული
- Tagalog
- አማርኛ
Works with the platforms you live in.
Paste a link from YouTube, TikTok, Instagram, Facebook, Spotify, or Apple Podcasts – or drop in any audio or video file. We transcribe it once, then you export it however your workflow needs.
- YouTube
- TikTok
- Spotify
- Apple Podcasts
- or any file
Export to any format
- TXT
- Markdown
- DOCX
- SRT
- VTT
- JSON
Timestamps, speaker labels, and subtitle timing carry through to every export.
Video to text – questions, answered
How do I convert a video to text for free?
Upload your video file or paste a link on this page – your first 60 minutes are free, no card required. Pepys transcribes the spoken audio into a clean, timestamped transcript in minutes that you can edit and export.
What video formats can I transcribe?
MP4, MOV, MKV, AVI, WebM, M4V, and most other common video formats. You can also paste a link to a video instead of uploading a file.
Does this read the text shown on screen?
No – Pepys transcribes the speech in the video's audio, not on-screen text or graphics. It's not OCR. If no one is talking, there's nothing to transcribe.
Do I need to extract the audio first?
No. Upload the video as-is – Pepys reads the audio track for you, so there's no need to convert it to MP3 or pull the audio out beforehand.
Do you train on my video?
Never. We don't train AI on your video or transcripts, and you can auto-delete your files after processing.
More free tools
Keep reading
Video to text – free to start
Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.