[D] Seeking advice on curating a DPO dataset for a 7B model | allainews.com

April 10, 2024, 3:29 a.m. | /u/aadityaura

Machine Learning www.reddit.com

Hi everyone,

I am curating a dataset for a specific domain (finance) and need to create a DPO dataset for a custom 7B model. I was wondering if anyone could share their experience or advice on the best practices for creating DPO datasets.

As a starting point, I was thinking of using GPT-4 answers as the "chosen" responses and the 7B model's answers as the "rejected" ones. However, I am concerned that if I choose accepted answers from a different …

advice best practices dataset datasets domain experience finance machinelearning practices

More from www.reddit.com / Machine Learning

[R] NExT: Teaching Large Language Models to Reason about Code Execution 4 hours ago | www.reddit.com

abstract code debug debugging +20

How much coursework is required to land an entry-level ML job? [D] 6 hours ago | www.reddit.com

berkeley building epidemiology job +4

[D] Foundational papers for Graph Adversarial Learning? 7 hours ago | www.reddit.com

machinelearning papers understanding

[D] Suggestions for NLP Papers Commonly Implemented in ML Interviews 18 hours ago | www.reddit.com

companies implementation interview interviews +10

[D] How can attention mechanisms retrieve meaningful information over long distances when using RoPE or … 21 hours ago | www.reddit.com

attention attention mechanisms information machinelearning +3

[D] Do Lead's in an AI/DS/ML team always have PhDs, is it a requirement? 22 hours ago | www.reddit.com

hello lecture machinelearning masters +3

[D] Correct me if I'm wrong, use KL divergence for NLP, and MMD for CV. … 1 day, 2 hours ago | www.reddit.com

distribution divergence fields found +5

[R] New Teleoperation Tool with VisionPro 1 day, 6 hours ago | www.reddit.com

machinelearning teleoperation tool

[R] Dynamic Gaussians Mesh 1 day, 6 hours ago | www.reddit.com

dynamic machinelearning mesh

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Research Scientist (Computer Science)

@ Nanyang Technological University | NTU Main Campus, Singapore

View on ai-jobs.net

Intern - Sales Data Management

@ Deliveroo | Dubai, UAE (Main Office)

View on ai-jobs.net