[P] Fish Speech TTS: clone OpenAI TTS in 30 minutes | allainews.com

May 22, 2024, 10:11 a.m. | /u/lengyue233

Machine Learning www.reddit.com

While we are still figuring out ways to improve the agent's emotional response to OpenAI GPT-4o, we have already made significant progress in aligning OpenAI's TTS performance. To begin this experiment, we collected 10 hours of OpenAI TTS data to perform supervised fine-tuning (SFT) on both the LLM (medium) and VITS models, which took approximately 30 minutes. After that, we used 15 seconds of audio as a prompt during inference.

Demos Available: [here](https://firefly-ai.notion.site/OpenAI-Examples-34975ae263a9496c84e89fb7b1ea25a4?pvs=4).

As you can see, the model's emotion, …

agent clone data experiment fine-tuning fish gpt gpt-4o llm machinelearning medium openai openai gpt performance progress sft speech supervised fine-tuning tts while

More from www.reddit.com / Machine Learning

[R] M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec 8 hours ago | www.reddit.com

audio codec machinelearning spatial +1

[P] C-GAN based MNIST model evaluator/validator 11 hours ago | www.reddit.com

building gan gans generative +5

[R] [CVPR 2024] AV-RIR: Audio-Visual Room Impulse Response Estimation 13 hours ago | www.reddit.com

audio cvpr machinelearning room +1

[Research] Exploiting the Layered Intrinsic Dimensionality for Practical Adversarial Training 14 hours ago | www.reddit.com

adversarial adversarial training aes algorithm +16

[D] Patenting in ML 15 hours ago | www.reddit.com

academia algorithms application applications +10

[R] Weight Rescaling: Applying Initialization Strategies During Training 20 hours ago | www.reddit.com

machinelearning strategies training

[P] llama.ttf: A font which is also an LLM 1 day ago | www.reddit.com

llama llm machinelearning

[D] Thought Space in LLMs? 1 day, 3 hours ago | www.reddit.com

concepts create generate image +12

Cuda advanced learning materials, [D] 1 day, 7 hours ago | www.reddit.com

advanced books course cuda +9

Senior Data Engineer

@ Displate | Warsaw

View on ai-jobs.net

Content Designer

@ Glean | Palo Alto, CA

View on ai-jobs.net

IT&D Data Solution Architect

@ Reckitt | Hyderabad, Telangana, IN, N/A

View on ai-jobs.net

Python Developer

@ Riskinsight Consulting | Hyderabad, Telangana, India

View on ai-jobs.net

Technical Lead (Java/Node.js)

@ LivePerson | Hyderabad, Telangana, India (Remote)

View on ai-jobs.net

Backend Engineer - Senior and Mid-Level - Sydney Hybrid or AU remote

@ Displayr | Sydney, New South Wales, Australia

View on ai-jobs.net