LLM Inference Sampling: What Temperature, Top-p, and Top-k Actually Control
· â 7 min read · âī¸ k4i
A small 5-token example for understanding temperature, top-p, and top-k during LLM inference, with source-reading notes from the vLLM V1 sampler.


