lexicaps

www.lexicaps.com
Transcription and Diarization based on OpenAI's Whisper It augments Whisper's transcription, by adding "speaker tags", so you know who says what. Currently it works for 2 speakers, and is tested for English.
I trained a classifier on top of Whisper model features (medium.en), that identifies any two speakers. No third-party package is used for Diarization.
Integrated with Whisper, it provides a full Transcription-Diarization service.
Give it a try or show a Sample.
Thanks @karpathy for the fun project
Thanks @sidhantls for inspirative repo

ToDo

-- Add large-v2 Whisper model

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

lexicaps

ToDo

About

Releases

Packages

Majdoddin/lexicaps

Folders and files

Latest commit

History

Repository files navigation

lexicaps

ToDo

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages