Tag: latency

  • Power-SMC: Low-Latency Sequence-Level Power Sampling for Training-Free LLM Reasoning

    arXiv:2602.10273v1. Announce Type: new. Abstract: Many recent reasoning gains in large language models can be explained as distribution sharpening: biasing generation toward high-likelihood trajectories already supported by the pretrained model, rather than modifying its weights. A natural formalization is the sequence-level power distribution $\pi_\alpha(y\mid x)\propto p_\theta(y\mid…