Feb. 13, 2024, 5:44 a.m. | Nathan I. N. Henry Mangor Pedersen Matt Williams Jamin L. B. Martin Liesje Donkin

cs.LG updates on arXiv.org arxiv.org

The value-loading problem is a significant challenge for researchers aiming to create artificial intelligence (AI) systems that align with human values and preferences. This problem requires a method to define and regulate safe and optimal limits of AI behaviors. In this work, we propose HALO (Hormetic ALignment via Opponent processes), a regulatory paradigm that uses hormetic analysis to regulate the behavioral patterns of AI. Behavioral hormesis is a phenomenon where low frequencies of a behavior have beneficial effects, while high …

alignment apocalypse artificial artificial intelligence challenge cs.ai cs.cy cs.lg cs.ma econ.th halo human intelligence loading researchers systems value values via work

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote