Feb. 14, 2024, 9:35 p.m. | /u/threevox

Machine Learning www.reddit.com

Let’s say I’m attempting to fine-tune a pretrained language model, and I’d like to alter its response format. Normally, I’d fine-tune on a bunch of examples of responses in the new format. But doing so would also change the model’s semantic behavior to more closely mimic the kind of text present in the SFT examples. Is there a way to fine-tune on an example in the new format, then effectively *negatively* fine-tune on the same text in the fine-tuning example …
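One way to sketch the idea in the question is a contrastive fine-tuning loss, similar in spirit to unlikelihood training: take a standard cross-entropy (descent) term on the example in the desired format, and subtract a down-weighted cross-entropy (ascent) term on the same content in the unwanted style, so the format is reinforced while the semantic imitation is penalized. The function below is a hypothetical illustration, not an established API; the name `contrastive_ft_loss` and the `neg_weight` parameter are assumptions for the sketch.

```python
import torch
import torch.nn.functional as F

def contrastive_ft_loss(logits_pos, labels_pos, logits_neg, labels_neg,
                        neg_weight=0.1):
    """Hypothetical combined loss for 'positive' and 'negative' fine-tuning.

    logits_*: (batch, seq_len, vocab) model outputs on each example.
    labels_*: (batch, seq_len) target token ids.
    neg_weight: how strongly to push away from the unwanted example;
                kept small so training does not diverge.
    """
    # Standard LM loss on the desired-format example (gradient descent).
    loss_pos = F.cross_entropy(
        logits_pos.reshape(-1, logits_pos.size(-1)), labels_pos.reshape(-1))
    # Same loss on the unwanted-style example, but subtracted,
    # which turns it into gradient ascent on that text.
    loss_neg = F.cross_entropy(
        logits_neg.reshape(-1, logits_neg.size(-1)), labels_neg.reshape(-1))
    return loss_pos - neg_weight * loss_neg
```

In practice this kind of naive gradient ascent can be unstable (the negative term is unbounded below), which is one reason preference-based objectives such as DPO, which frame the same "toward this, away from that" idea as a bounded pairwise loss, are often preferred.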

