Commit 446ee64
committed
[bug] Fix "Current vLLM config is not set." warnings when FlashInfer attention is used
VLLM config is set only during initialization stage, not during runtime
stage. Therefore, we should not call get_current_vllm_config() during
dunrime stage. Instead, cache the config we want during initialization
stage and reuse it during runtime stage.
Signed-off-by: Po-Han Huang <[email protected]>1 parent c6df05e commit 446ee64
2 files changed
+10
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
269 | 269 | | |
270 | 270 | | |
271 | 271 | | |
| 272 | + | |
| 273 | + | |
272 | 274 | | |
273 | 275 | | |
274 | 276 | | |
| |||
296 | 298 | | |
297 | 299 | | |
298 | 300 | | |
| 301 | + | |
| 302 | + | |
299 | 303 | | |
300 | 304 | | |
301 | 305 | | |
302 | 306 | | |
303 | | - | |
304 | 307 | | |
305 | 308 | | |
306 | 309 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
43 | 43 | | |
44 | 44 | | |
45 | 45 | | |
| 46 | + | |
46 | 47 | | |
47 | 48 | | |
48 | 49 | | |
| |||
357 | 358 | | |
358 | 359 | | |
359 | 360 | | |
360 | | - | |
361 | 361 | | |
362 | 362 | | |
363 | 363 | | |
| |||
499 | 499 | | |
500 | 500 | | |
501 | 501 | | |
| 502 | + | |
| 503 | + | |
| 504 | + | |
| 505 | + | |
502 | 506 | | |
503 | 507 | | |
504 | 508 | | |
| |||
779 | 783 | | |
780 | 784 | | |
781 | 785 | | |
| 786 | + | |
782 | 787 | | |
783 | 788 | | |
784 | 789 | | |
| |||
0 commit comments