CVE-2025-46570 – Apache vLLM PageAttention Chunk Prefill Timing Vulnerability

The following table lists the changes that have been made to the
CVE-2025-46570 vulnerability over time.

Vulnerability history details can be useful for understanding the evolution
of a vulnerability, and for identifying the most recent changes that may
impact the vulnerability’s severity, exploitability, or other characteristics.

  • New CVE Received
    by [email protected]

    May. 29, 2025

    Action Type Old Value New Value
    Added Description vLLM is an inference and serving engine for large language models (LLMs). Prior to version 0.9.0, when a new prompt is processed, if the PageAttention mechanism finds a matching prefix chunk, the prefill process speeds up, which is reflected in the TTFT (Time to First Token). These timing differences caused by matching chunks are significant enough to be recognized and exploited. This issue has been patched in version 0.9.0.
    Added CVSS V3.1 AV:N/AC:H/PR:L/UI:R/S:U/C:L/I:N/A:N
    Added CWE CWE-208
    Added Reference https://github.com/vllm-project/vllm/commit/77073c77bc2006eb80ea6d5128f076f5e6c6f54f
    Added Reference https://github.com/vllm-project/vllm/pull/17045
    Added Reference https://github.com/vllm-project/vllm/security/advisories/GHSA-4qjh-9fv9-r85r
Share the Post:

Related Posts