June 14, 2024, 3:26 p.m. | /u/Pritish-Mishra

Machine Learning www.reddit.com

I recently had an idea: what if I fine-tuned a large language model (LLaMA-3-8B) using years of my WhatsApp chat history to see if it could impersonate the way I write messages?

First, I downloaded my chat history. WhatsApp has a feature that lets you export chat history in plain text (.txt format). After downloading the data, I did some light pre-processing (like removing extremely long messages) but kept it mostly raw to retain authenticity. I formatted the messages using …

chat chat history export feature history language language model large language large language model llama llm machinelearning messages the way whatsapp you

Senior Data Engineer

@ Displate | Warsaw

Professor/Associate Professor of Health Informatics [LKCMedicine]

@ Nanyang Technological University | NTU Novena Campus, Singapore

Research Fellow (Computer Science (and Engineering)/Electronic Engineering/Applied Mathematics/Perception Sciences)

@ Nanyang Technological University | NTU Main Campus, Singapore

Java Developer - Assistant Manager

@ State Street | Bengaluru, India

Senior Java/Python Developer

@ General Motors | Austin IT Innovation Center North - Austin IT Innovation Center North

Research Associate (Computer Engineering/Computer Science/Electronics Engineering)

@ Nanyang Technological University | NTU Main Campus, Singapore