architecture 3 vLLM's op IR, or: where the inference engine meets the compiler Jun 17, 2026 Loop Unrolling in the ML Era Jun 16, 2026 "Hello, World!" in a Heterogeneous System Jun 13, 2026