CMU's Sherry Tongshuang Wu releases DITTO, training an 8B LLM to match GPT-5.4 at simulating human behavior using verbal feedback
The code is open-sourced in the OdysSim GitHub repository.
——0——
The code is open-sourced in the OdysSim GitHub repository.
Many users are celebrating DITTO's verbal feedback method for RL-based human behavior simulation, calling the presentations excellent and the research a string of awesome advances.
3 comments with sentiment.