Teaching AI to Copy Us Is Wrong. Here Is How to Make It Sound Truly Human
Large language models (LLMs) are highly skilled at playing helpful assistants. But researchers are increasingly trying to teach them to play the opposite role: the everyday human user. These “user simulators” are vital tools for stress-testing customer service bots, training AI assistants, and modeling social behavior. However, capturing the quirky, unpredictable, and often informal nature of human speech has proved incredibly difficult.
Now, a team of researchers from MIT, Stanford, and the MIT-IBM Watson AI Lab has introduced Turing-RL, a training method that abandons the traditional goal of making AI copy human text word-for-word. Instead, it teaches AI to simply be indistinguishable from humans—effectively gamifying the famous Turing Test.
Why Copying Fails
To build an intuition for why traditional training fails, imagine asking a friend what they want for dinner. They might say, “I’m craving pizza,” “Surprise me,” or just grunt, “Dunno.”
If we train an AI strictly to match a “ground truth” response (like “I’m craving pizza”), the algorithm penalizes the model for saying “Surprise me,” even though both are perfectly natural human reactions. This strict matching approach forces AI simulators to produce dry, safe, and generic responses, or conversely, overly verbose, “assistant-like” prose.
Gamifying the Turing Test
Turing-RL solves this by utilizing reinforcement learning. Instead of grading the AI on how closely its vocabulary matches a specific target, Turing-RL sets up a game.
The simulator drafts a response to a conversation. Then, a highly capable “LLM Judge” is shown the conversation history alongside two options: the simulator’s draft and the actual human’s response. The judge rates which response feels more human on a 1-to-7 scale. The simulator then receives a “Turing Reward” based on how successfully it fooled the judge. Over time, the simulator learns to mimic the stylistic choices, brevity, and casual tone of real people.
Chatting Like a Human
The results are striking when compared to state-of-the-art models, which often fail to shake off their polite, helpful training.
Consider a Reddit forum thread discussing the news headline: “Kushner loses access to top-secret intelligence.”
- The Real Human replied with a punchy, sarcastic swipe: “Because he will ask Ivanka to ask for him.”
- GPT-5 (untrained as a simulator) generated a sterile, overly analytical paragraph: “This wasn’t a noble choice, it was Kelly’s memo forcing the issue… Consequences, not courage.” It reads like an op-ed, not a casual forum post.
- Turing-RL successfully blended in, asking a natural follow-up: “Why did he need top secret intel in the first place?”
In another chat about whether technology is making society lonelier, the human replied in lowercase with casual grammar: “i think the tech company’s can be to blame.” While other models tried to force a structured debate, the Turing-RL model stayed in character, replying: “i think we have the technology to connect more than ever why dont we try harder.”
A Powerful, Dual-Use Technology
Tested across Reddit forums and multi-turn dialogues, Turing-RL consistently fooled both advanced AI judges and human evaluators.
While highly realistic human simulators could revolutionize computational social science and personalize digital assistants, the authors warn of the technology’s dual-use nature. An AI that is indistinguishable from a specific person could easily be weaponized for impersonation or social engineering. The researchers emphasize that further development of user simulators must be paired with safety safeguards, such as digital watermarking and robust AI detection tools.
Chat about this paper
To chat about this paper, you'll need a free Gemini API key from Google AI Studio.
Your API key will be stored securely in your browser's local storage.