Apple taught an LLM to predict tokens up to 5x faster in math and coding tasks<div class="feat-image">

</div><p>A new research paper from Apple details a technique that speeds up large language model responses, while preserving output quality. Here are the details.</p>
<a data-layer-pagetype="post" data-layer-postcategory="apple-research" data-layer-viewtype="unknown" data-post-id="1013494" href="
https://9to5mac.com/2025/08/08/apple-research-teaches-llms-to-think-faster/#more-1013494" class="more-link">moreā
Apple taught an LLM to predict tokens up to 5x faster in math and coding tasks