Nov. 14, 2023, 1:11 p.m. | /u/jrstelle

Artificial Intelligence www.reddit.com

For instance, if I'm recording an interview between two people and transcribing it with something like Whisper, can it split the dialogue by speaker? This seems like it would be a fairly simple feature, but I'm not sure if it exists.



It doesn't have to be Whisper per se, but is there a known speech-to-text (S2T) model or solution for this?
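
To make the ask concrete, here is a rough sketch of the output I have in mind. It assumes two hypothetical inputs that I'm making up for illustration: timestamped text segments from a transcriber, and a list of "who spoke when" turns from some separate speaker-detection step (this task is usually called speaker diarization). The function name `label_segments` and the tuple layouts are my own assumptions, not any library's API. Each text segment simply gets the speaker whose turn overlaps it the most.

```python
from typing import List, Tuple

# Hypothetical input shapes (assumptions for illustration only):
#   Segment: (start_sec, end_sec, text) from a transcriber
#   Turn:    (start_sec, end_sec, speaker_label) from a speaker-detection step
Segment = Tuple[float, float, str]
Turn = Tuple[float, float, str]


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the length in seconds of the overlap between two intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_segments(segments: List[Segment], turns: List[Turn]) -> List[Tuple[str, str]]:
    """Attach to each text segment the speaker whose turn overlaps it most.

    Segments that overlap no turn at all are labeled "UNKNOWN".
    """
    labeled = []
    for start, end, text in segments:
        best_speaker = "UNKNOWN"
        best_overlap = 0.0
        for t_start, t_end, speaker in turns:
            ov = overlap(start, end, t_start, t_end)
            if ov > best_overlap:
                best_overlap = ov
                best_speaker = speaker
        labeled.append((best_speaker, text))
    return labeled


if __name__ == "__main__":
    # Made-up example data, not real model output.
    segments = [(0.0, 2.0, "Thanks for joining me."), (2.0, 5.0, "Happy to be here.")]
    turns = [(0.0, 2.1, "SPEAKER_A"), (2.1, 6.0, "SPEAKER_B")]
    for speaker, text in label_segments(segments, turns):
        print(f"{speaker}: {text}")
```

So the question is really whether some tool gives me both pieces (the text and the speaker turns), or does this merge for me out of the box.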

