Post · Indieweb Studio

Post

Log in

Open Research DevRoom

@FosdemResearch@fosstodon.org · 3 days ago

Eldar Kurtić will now present:

*Accelerating vLLM Inference with Quantization and Speculative Decoding*

https://fosdem.org/2026/schedule/event/WJUJ3R-accelerating_vllm_inference_with_quantization_and_speculative_decoding/

#LLM #FOSDEM #OpenScience

FOSDEM 2026 - Accelerating vLLM Inference with Quantization and Speculative Decoding

Indieweb Studio

This is a relaxed, online social space for the indieweb community, brought to you by indieweb.social.

Please abide by our code of conduct and have a nice time!

Indieweb Studio: About · Code of conduct · Privacy · Users · Instances

Bonfire social · 1.0.2-alpha.7 no JS en

Automatic federation enabled

Log in Create account