Apple taught an LLM to predict tokens up to 5x faster in math and coding tasks

Apple taught an LLM to predict tokens up to 5x faster in math and coding tasks

<div class="feat-image">

</div><p>A new research paper from Apple details a technique that speeds up large language model responses, while preserving output quality. Here are the details.</p>

<a data-layer-pagetype="post" data-layer-postcategory="apple-research" data-layer-viewtype="unknown" data-post-id="1013494" href="https://9to5mac.com/2025/08/08/apple-research-teaches-llms-to-think-faster/#more-1013494" class="more-link">more�Apple taught an LLM to predict tokens up to 5x faster in math and coding tasks