Accelerating LLM Inference on AMD GPUs with Low-Latency GEMMs · HackerLangs