Video to Word (DOCX) Transcript
Upload a video and download its transcript as a clean, formatted Word document – speakers and timestamps included.
Accepts a video file – MP4, MOV, MKV, AVI, WEBM and more · returns a formatted DOCX transcript with speakers and timestamps.
60 min free · no card required · we never train on your audio
How do I turn a video into a Word document?
To turn a video into a Word document, upload it to Pepys and it transcribes the speech, then exports a clean, formatted .docx with speaker labels and timestamps in minutes, in 99+ languages. Open it in Word or Google Docs and edit like any document. Your first 60 minutes are free, no card required.
How video to word works
Upload your video
Drop in the file – Pepys extracts the audio track for you, so there's nothing to convert first.
Pepys writes the transcript
AI transcribes the speech into clean, speaker-labeled paragraphs with timestamps, ready in minutes.
Download the DOCX
Tidy anything inline, then export a formatted .docx – or grab TXT, SRT, VTT, or JSON instead.
When the transcript has to land in a report, a brief, or a shared draft, you want a Word document, not a wall of plain text. Pepys delivers exactly that: upload your video and it returns a clean, properly formatted .docx with speaker labels and timestamps, ready to open in Word or Google Docs and edit like anything else.
The DOCX is built from an accurate, timestamped transcript, so paragraphs break sensibly and every line is attributed and time-stamped. It works in 99+ languages with auto-detect, there's nothing to install, and you pay only for the minutes you transcribe – credits never expire and we never train on your video.
Clean paragraphs. No more um's and ah's.
The left is what Pepys hands back – logical paragraphs with the filler stripped out, punctuated and readable. The right is the raw, one-line-per-segment dump most transcribers leave you with.
um so yeah everyone keeps telling you to like lead with your best line right but uh honestly if you give away the whole answer in the first second you know there's basically no reason for anyone to keep watching so the hook isn't kind of the smartest thing you say it's like a loop you open that they need to close and um that's the part that actually keeps people around
RawClean, formatted .docx – not raw text – ready for Word or Google Docs
Speaker labels and timestamps carried straight into the document
Also export TXT, SRT, VTT, or JSON from the same transcript
99+ languages, auto-detected · we never train on your video · credits never expire
Any language – 99+ detected automatically
- English
- 中文
- Español
- العربية
- हिन्दी
- Français
- 日本語
- Português
- Русский
- Deutsch
- 한국어
- Italiano
- বাংলা
- Türkçe
- فارسی
- Tiếng Việt
- தமிழ்
- Polski
- ไทย
- Українська
- Nederlands
- עברית
- Ελληνικά
- తెలుగు
- Bahasa Indonesia
- اردو
- Svenska
- मराठी
- Română
- Magyar
- Čeština
- ગુજરાતી
- Kiswahili
- ქართული
- Tagalog
- አማርኛ
Works with the platforms you live in.
Paste a link from YouTube, TikTok, Instagram, Facebook, Spotify, or Apple Podcasts – or drop in any audio or video file. We transcribe it once, then you export it however your workflow needs.
- YouTube
- TikTok
- Spotify
- Apple Podcasts
- or any file
Export to any format
- TXT
- Markdown
- DOCX
- SRT
- VTT
- JSON
Timestamps, speaker labels, and subtitle timing carry through to every export.
Video to word – questions, answered
How do I turn a video into a Word document?
Upload your video on this page – the first 60 minutes are free, no card. Pepys transcribes the speech and exports a clean, formatted .docx transcript you can download in minutes.
Will the DOCX include speakers and timestamps?
Yes. The Word document keeps speaker labels and timestamps, formatted into readable paragraphs, so it's ready to share or edit without cleanup.
Can I open and edit it in Google Docs?
Yes – it's a standard .docx, so it opens cleanly in Microsoft Word, Google Docs, Pages, or LibreOffice, and you can edit it like any document.
Can I get the transcript in another language?
Yes – language is auto-detected across 99+ languages, and you can translate the finished transcript before exporting it to DOCX.
Do you keep my video?
Only as long as needed to transcribe it, and you can auto-delete it after. We never train AI on your video or transcripts.
More free tools
Keep reading
Video to word – free to start
Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.