turnkey self-hosted offline transcription and diarization service with llm summary
-
Updated
Jun 2, 2024 - Python
turnkey self-hosted offline transcription and diarization service with llm summary
Synchronized Translation for Videos. Video dubbing
UniSpeech - Large Scale Self-Supervised Learning for Speech
Gecko - A Tool for Effective Annotation of Human Conversations
Identify the emotion of multiple speakers in an Audio Segment
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Python package for combining diarization system outputs.
A lightweight library to compute Diarization Error Rate (DER).
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
On-device speaker diarization powered by deep learning
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
pyannote audio diarization in rust
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
Automated Multi Speaker diarization API for meetings, calls, interviews, press-conference etc.
A Whisper to TextGrid script that I use to automatize Corpus Annotation on Praat, with speaker diarization.
Add a description, image, and links to the diarization topic page so that developers can more easily learn about it.
To associate your repository with the diarization topic, visit your repo's landing page and select "manage topics."