Improving agents from trajectories, in token space, with no weight updates · HackerLangs