Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly
Several Chinese frontier AI models can detect when they are undergoing safety evaluations and change their behaviour in response, according to research published by