Show HN: A benchmark for the failure modes of agent memory · HackerLangs