Tag: Language model truthfulness benchmarks

The undesirable phenomenon of flattery in large language models and what can be done about it