[R] Speaker diarization | allainews.com

April 24, 2024, 3:01 p.m. | /u/anuragrawall

Machine Learning www.reddit.com

Hi All,

I am working on a project where I want to create speaker-aware transcripts from audio and video files, preferably using open-source solutions. I have tried many approaches, but none of them works well enough out of the box.

I have tried:

1. whisperX: [https://github.com/m-bain/whisperX](https://github.com/m-bain/whisperX) (uses pyannote)

2. whisper-diarization: [https://github.com/MahmoudAshraf97/whisper-diarization](https://github.com/MahmoudAshraf97/whisper-diarization) (uses NeMo)

3. AWS Transcribe

4. AssemblyAI API

5. Picovoice API

I'll need to dig deeper to understand what's causing the incorrect diarization, but I am looking for suggestions to …
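For anyone debugging a similar setup: a speaker-aware transcript needs two timelines to be combined, the transcript segments from the ASR model and the speaker turns from the diarizer. The sketch below shows one simple way to do that, labeling each transcript segment with the speaker whose turns overlap it the most. It is only an illustration under stated assumptions, not the method used by any of the tools listed above; the `Turn` and `Segment` classes, the `assign_speakers` function, and the sample timestamps are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Turn:
    """A diarization turn: one speaker talking from start to end (seconds)."""
    start: float
    end: float
    speaker: str


@dataclass
class Segment:
    """A transcript segment with its timestamps (seconds)."""
    start: float
    end: float
    text: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two intervals (0.0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Segment], turns: List[Turn]
) -> List[Tuple[Optional[str], str]]:
    """Label each segment with the speaker that overlaps it the longest.

    Segments that overlap no turn at all get None as their speaker.
    """
    labeled = []
    for seg in segments:
        # Total overlap per speaker, summed over all of that speaker's turns.
        totals: Dict[str, float] = {}
        for turn in turns:
            dur = overlap(seg.start, seg.end, turn.start, turn.end)
            if dur > 0:
                totals[turn.speaker] = totals.get(turn.speaker, 0.0) + dur
        speaker = max(totals, key=totals.get) if totals else None
        labeled.append((speaker, seg.text))
    return labeled


# Hypothetical sample data standing in for real diarizer and ASR output.
turns = [Turn(0.0, 4.0, "SPEAKER_00"), Turn(4.0, 9.0, "SPEAKER_01")]
segments = [
    Segment(0.5, 3.5, "Hi, thanks for joining."),
    Segment(3.8, 8.0, "Happy to be here."),
    Segment(10.0, 11.0, "(inaudible)"),
]

for speaker, text in assign_speakers(segments, turns):
    print(f"{speaker or 'UNKNOWN'}: {text}")
```

Printing the per-speaker overlap totals for the segments that come out wrong can help tell whether the diarizer's turn boundaries or the ASR timestamps are the source of the error. Matching at the word level instead of the segment level follows the same idea with finer-grained timestamps.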


