AI News — Sunday, April 12, 2026

DMax: Aggressive Parallel Decoding for dLLMs

A new paper introduces DMax, an aggressive parallel decoding method aimed at improving the inference speed and efficiency of diffusion large language models (dLLMs).

Hugging Face · research