Discretizing Reward Models · HackerLangs