From e5a32a4398e023e353faac4e67145c52ea56160d Mon Sep 17 00:00:00 2001 From: FlintyLemming Date: Sat, 25 Apr 2026 15:31:49 +0800 Subject: [PATCH] fix post --- content/post/34d779ab6468808eb676e337f03ef378/index.zh-cn.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/post/34d779ab6468808eb676e337f03ef378/index.zh-cn.md b/content/post/34d779ab6468808eb676e337f03ef378/index.zh-cn.md index 9d82a6f..6393f67 100644 --- a/content/post/34d779ab6468808eb676e337f03ef378/index.zh-cn.md +++ b/content/post/34d779ab6468808eb676e337f03ef378/index.zh-cn.md @@ -117,7 +117,7 @@ services: ## 瓶颈深度分析 -测试发现核心瓶颈并非显存容量限制,而是 **vLLM 调度器的 Chunked Prefill 准入控制机制** 导致的逻辑冲突。 +测试发现目前核心瓶颈并非显存容量限制,而是 **vLLM 调度器的 Chunked Prefill 准入控制机制** 导致的逻辑冲突。不过剩余显存太少没有充足的 KV Cache 总体上来说仍然是最关键因素。 ### 抢占与死循环问题