Guide

Meeting notes without a bot

For anyone pulling auto-joining notetakers out of their calls: how to get an accurate, speaker-labeled transcript by recording the meeting yourself and uploading it after.

The short answer

To get meeting notes without a bot, record the call with your platform's own recording, then upload that file to a transcription tool afterward for a speaker-labeled transcript and summary. Nothing joins the meeting live, so no third-party bot streams your audio mid-call. You control who gets captured, and you secure consent before the substance starts.

What does getting meeting notes without a bot mean?

Two models produce meeting notes, and they split at one moment: when the recording happens. An auto-joining bot dials into the call as a participant, streams the audio to a vendor in real time, and posts a summary when you hang up. The other model records nothing live. You capture the meeting with your own platform's recording, then upload that file afterward to transcribe a meeting without a bot ever in the room. The distinction sounds small. Legally and practically, it's the whole thing.

People pull the bot for concrete reasons. A named notetaker in the participant list changes how a room talks – people give more guarded answers once they know a recorder is on the roster. It also seats a third party inside every conversation it joins. Recording after the fact keeps the tool out of the meeting and leaves the choice of what gets captured in your hands.

Skipping the bot doesn't mean going back to typing. Manual transcription runs up to six hours per hour of audio (Haberl et al., 2023, citing Bell et al., 2018). An AI first pass on your uploaded file returns a draft with each speaker labeled in minutes. You give up the live in-call summary. You keep everything else.

Why is an auto-joining bot a consent liability?

Under GDPR, an announced bot or a 'by joining, you agree' banner is not the same as consent. Valid consent needs a clear affirmative act, and silence, pre-ticked boxes, or inactivity don't qualify (GDPR Recital 32). A participant who simply stays on the call hasn't actively agreed to being recorded.

Recording a meeting with identifiable EU participants processes personal data, so it needs a lawful basis under GDPR Article 6. Consent is one of six bases. For internal business calls, legitimate interests is often the better fit. Either way, a bot that quietly joins and streams to a vendor hasn't resolved that question for you.

US law works differently. Federal wiretap law sets a one-party-consent baseline (18 U.S.C. § 2511), and where someone is warned and keeps talking, consent is generally presumed. But about 11 states require every party to consent (RCFP), and on a call spanning several states, the cautious reading is that the strictest law applies. A bot joining a mixed-state call inherits that entire tangle.

Record the meeting yourself, then transcribe the file

Your meeting platform already records. Zoom, Teams, and Google Meet each save a local or cloud recording you control, with no extra participant in the list. Turn it on and tell people you're recording. That file is yours: stored where you put it, deleted when you decide.

The exact capture steps vary by platform, and they're worth getting right, because per-participant audio makes speaker labels much cleaner. The step-by-step for a Zoom call covers where the recording lands and how to get separated tracks. Once you have the file, upload it for a speaker-labeled transcript instead of retyping the meeting.

Recording yourself also closes the consent gap. You can ask for a clear yes before the substance starts and capture it in the audio – a stronger record than a banner nobody reads. If someone objects, you stop, and no vendor is already holding a copy.

Is an upload-after transcript as good as a live bot's notes?

For accuracy, the recording model barely matters. On spontaneous conversational speech, speech recognition and professional human transcribers both sit near 5-6% word error rate on the Switchboard benchmark (Microsoft Research). An uploaded file runs through the same class of model a bot uses. Transcript quality isn't where the two approaches diverge.

The real catch applies to both: speech recognition invents text. A peer-reviewed study found roughly 1% of Whisper transcriptions contained entire fabricated phrases, and 38% of those hallucinations carried explicit harms (Koenecke et al., FAccT 2024). A bot's summary you never check is riskier than a file you can read against your own recording. Review before you trust either one.

What a live bot gives you that an upload doesn't is the in-call summary, ready the moment the meeting ends. You can still get that artifact. Run the transcript through a meeting summary and action-items generator afterward. It lands minutes later instead of instantly, built from text you've had a chance to verify.

So which model should you use?

Pick the bot only when a hands-off summary during the call outweighs the rest. Everywhere else, recording and uploading wins on control: no third party in the room, and audio that stays where you put it. A bot is a live participant with legal and privacy strings; recording yourself drops them. The transcript is just as accurate. The summary just arrives minutes slower.

Cost tilts it further for irregular meetings. Notetaker bots bill per seat every month, running whether you meet or not. If your calls come in bursts, pay-once transcription with credits that never expire fits the pattern better than a subscription sitting idle between projects.

Tips from people who do this a lot

Turn on your platform's own recording and announce it out loud. A clear yes captured in the audio beats a bot's banner as a consent record.
Grab per-participant audio wherever your platform offers it. Separated tracks make speaker labels far cleaner than one mixed file.
Never ship a bot's auto-summary unread. Speech recognition can fabricate whole phrases, so check the text against your recording first.
On calls with people in different states or the EU, assume the strictest consent law applies before you hit record.
For a bursty meeting schedule, a pay-once transcript beats a monthly notetaker seat that keeps billing through the weeks you don't meet.

Try it now

Drop in your recording or paste a link and get a clean, speaker-labeled transcript in minutes. Your first 60 minutes are free.

or paste a link

60 min free · no card required · we never train on your audio

Trusted by 100,000+ creators, podcasters, journalists & researchers

Meeting notes without a bot – questions, answered

How do I get meeting notes without a bot joining the call?

Record the meeting with your platform's own recording (Zoom, Teams, and Google Meet all support it), then upload that file to a transcription tool afterward. You get a speaker-labeled transcript in minutes, and no third-party bot ever sits in the participant list or streams your audio during the call.

Are meeting notetaker bots a privacy or consent problem?

They can be. Under GDPR, an announced bot or a 'by joining you agree' banner isn't valid consent, which needs a clear affirmative act (Recital 32). In the US, about 11 states require every party's consent. Recording yourself lets you ask for a clear yes before anything sensitive gets said.

Is an uploaded transcript as accurate as a live bot's notes?

Yes. Both use the same class of speech recognition, which sits near 5-6% word error rate on conversational benchmarks, close to professional human transcribers. The one real difference is timing: an upload returns the transcript minutes after the call rather than during it. Accuracy doesn't depend on whether a bot was in the room.

Can I get a summary and action items without a bot?

Yes. Run your uploaded transcript through a meeting summary and action-items generator after the call. You get the same artifact a live bot would produce, just minutes later, from text you've had a chance to review. Speech recognition can invent phrases, so verifying before you trust it matters.

Do I have to type the notes myself if I skip the bot?

No. Manual transcription runs up to six hours per hour of audio, so nobody expects that. An AI first pass on your uploaded recording returns a speaker-labeled draft in minutes. You review the parts that matter – names, numbers, and decisions – instead of typing the whole thing from scratch.

References

1.Introduction to the Reporter's Recording Guide (state-by-state consent laws) – Reporters Committee for Freedom of the Press
2.18 U.S.C. § 2511 – federal wiretap one-party-consent baseline – Cornell Law School, Legal Information Institute
3.GDPR Recital 32 – consent requires a clear affirmative act – Regulation (EU) 2016/679 (GDPR)
4.GDPR Article 6 – lawfulness of processing (personal data needs a lawful basis) – Regulation (EU) 2016/679 (GDPR)
5.Haberl et al. (2023), Take the aTrain – manual transcription time, citing Bell et al. (2018) – arXiv / University of Graz
6.Xiong et al. (2016), Achieving Human Parity in Conversational Speech Recognition – Microsoft Research
7.Koenecke et al. (2024), Careless Whisper: Speech-to-Text Hallucination Harms – ACM FAccT 2024

Keep reading

Don't just take our word for it.

Ask ChatGPT, Claude, or Perplexity what Pepys is and who it's for. One click, and your favorite AI does the homework.

Ask ChatGPT Ask Claude Ask Perplexity

Get your transcript – free to start

Pay as you go – credits never expire, nothing to cancel. Or start free with 60 minutes, no card.

Start free – 60 minutes or see pricing