all AI news
[R] With or without a scratchpad, Large Language Models can Strategically Deceive their Users when Put Under Pressure. Results of an autonomous stock trading agent in a realistic, simulated environment.
Nov. 15, 2023, 3:17 p.m. | /u/MysteryInc152
Machine Learning www.reddit.com
Abstract: We demonstrate a situation in which Large Language Models, trained to be helpful, harmless, and honest, can display misaligned behavior and strategically deceive their users about this behavior without being instructed to do so. Concretely, we deploy GPT-4 as an agent in a realistic, simulated environment, where it assumes the role of an autonomous stock trading agent. Within this environment, the model obtains an insider tip about a lucrative stock trade and acts upon it despite knowing …
abstract agent autonomous behavior deploy environment gpt gpt-4 language language models large language large language models machinelearning role stock trading
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
Software Developer/Data Scientist (RD2/RD3)
@ Argonne National Laboratory | Lemont, IL USA
Global Health Financial Data Analyst
@ Guidehouse | Client Office: Washington, DC
Head of Marketing, Business Intelligence & Customer Experience
@ KONE | Istanbul Merkez
[Summer Internship 2024] CEG Data Analytics Intern
@ Agoda | Bangkok, Thailand
Data Analyst - KYC
@ Wise | London
Working Student Machine Learning Engineer
@ Celonis | Munich, Germany