Longitudinal changes following the introduction of socially assistive robots in nursing homes
Search
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models
Longitudinal changes following the introduction of socially assistive robots in nursing homes
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models