
Transcription and Diarization based on OpenAI's Whisper


Majdoddin/lexicaps


lexicaps

www.lexicaps.com
Transcription and diarization based on OpenAI's Whisper. It augments Whisper's transcription by adding speaker tags, so you know who says what. It currently works for two speakers and has been tested on English.
I trained a classifier on top of Whisper model features (medium.en) that identifies any two speakers. No third-party package is used for diarization.
Integrated with Whisper, it provides a full transcription and diarization service.
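As a rough illustration of what "speaker tags" mean in practice, here is a minimal sketch of merging transcript segments with per-segment speaker labels. It is not the code behind lexicaps. It assumes segments shaped like the dictionaries in Whisper's transcription result (`start`, `end`, `text`) and assumes a classifier has already produced one label (0 or 1) per segment. The function name `tag_speakers` and the `SPEAKER n:` output format are hypothetical.

```python
from typing import Dict, List


def tag_speakers(segments: List[Dict], labels: List[int]) -> List[str]:
    """Merge consecutive same-speaker segments into tagged turns.

    `labels` is assumed to hold one speaker label (0 or 1) per segment,
    e.g. the output of a two-speaker classifier (hypothetical here).
    """
    if len(segments) != len(labels):
        raise ValueError("need exactly one speaker label per segment")

    turns = []  # each entry: [speaker_label, accumulated_text]
    for seg, label in zip(segments, labels):
        text = seg["text"].strip()
        if turns and turns[-1][0] == label:
            # Same speaker as the previous segment: extend the current turn.
            turns[-1][1] += " " + text
        else:
            # Speaker changed (or first segment): start a new turn.
            turns.append([label, text])

    return [f"SPEAKER {label + 1}: {text}" for label, text in turns]


# Made-up example data in the shape of Whisper segments.
segments = [
    {"start": 0.0, "end": 2.1, "text": " Hello, how are you?"},
    {"start": 2.1, "end": 3.4, "text": " I'm fine, thanks."},
    {"start": 3.4, "end": 5.0, "text": " And you?"},
]
labels = [0, 1, 1]

for line in tag_speakers(segments, labels):
    print(line)
```

Consecutive segments from the same speaker are joined into one turn, so the output reads like a dialogue rather than a list of short fragments.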
Give it a try or view a sample.
Thanks to @karpathy for the fun project.
Thanks to @sidhantls for the inspiring repo.

ToDo

-- Add support for the large-v2 Whisper model
