Discussion
Loading...

Post

Log in
  • Sign up
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Open Research DevRoom
Open Research DevRoom
@FosdemResearch@fosstodon.org  ·  activity timestamp 3 days ago

Eldar Kurtić will now present:

*Accelerating vLLM Inference with Quantization and Speculative Decoding*

https://fosdem.org/2026/schedule/event/WJUJ3R-accelerating_vllm_inference_with_quantization_and_speculative_decoding/

#LLM #FOSDEM #OpenScience

FOSDEM 2026 - Accelerating vLLM Inference with Quantization and Speculative Decoding

  • Copy link
  • Flag this post
  • Block

Indieweb Studio

This is a relaxed, online social space for the indieweb community, brought to you by indieweb.social.

Please abide by our code of conduct and have a nice time!

Indieweb Studio: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.2-alpha.7 no JS en
Automatic federation enabled
Log in Create account
  • Explore
  • About
  • Members
  • Code of Conduct