JetSpec: Parallel Tree Drafting Overcomes Bottleneck in Speculative Decoding26. June 2026AI Models, Claude AIJetSpec overcomes scaling limits of speculative decoding through parallel tree drafting with causal conditioning, achieving up to 9.64x speedup in LLM inference. Share on: