Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
arxiv.org/abs/2311.07590