Large Language Models can Strategically Deceive their Users when Put Under Pressure
Large Language Models can Strategically Deceive their Users when Put Under Pressure
arxiv.org /abs/2311.07590
Large Language Models can Strategically Deceive their Users when Put Under Pressure