vllm.model_executor.warmup.kernel_warmup
Warmup kernels used during model execution. This is useful specifically for JIT'ed kernels as we don't want JIT'ing to happen during model execution.
Warmup kernels used during model execution. This is useful specifically for JIT'ed kernels as we don't want JIT'ing to happen during model execution.
kernel_warmup ¶vllm/model_executor/warmup/kernel_warmup.py