Speaker identification for uploaded vs virtual meeting recordings
How it works
Speak AI automatically detects different speakers in your recordings, but the level of detail depends on how the recording was captured.
Virtual meeting recordings (Meeting Assistant)
When the Speak AI Meeting Assistant joins your Zoom, Google Meet, Teams, or Webex call:
Speakers are automatically identified by their meeting participant names
If calendar sync is enabled, names from the calendar invite are used
Each speaker gets their own label from the start
Speaker analytics (word count, speaking time, pace) are calculated per person
Uploaded in-person recordings
When you upload a recording from a phone, handheld recorder, or other device:
Speakers are detected by voice patterns and labeled as Speaker 0, Speaker 1, Speaker 2, etc.
The system separates speakers based on voice differences, but cannot identify names automatically
You need to rename speakers manually after transcription
Renaming speakers
For uploaded recordings, rename speakers right after transcription:
Open the transcribed file
Click on any speaker label (e.g., "Speaker 0")
Type the person's name and press Enter
The name applies throughout the entire transcript. You can also use AI Chat: "Change Speaker 0 to Dr. Smith".
Tips for better speaker detection in uploaded recordings
Clear audio helps: Minimize background noise and crosstalk
Separate microphones: If possible, use individual microphones for each speaker
Central placement: Place the recording device in the center of the table
Speak one at a time: Overlapping speech is the hardest scenario for speaker detection
Once speakers are named, AI Chat becomes much more powerful. You can ask "What did Dr. Smith say about the treatment plan?" and get speaker-specific answers.
