


Toronto vLLM Meetup
Join us for the Toronto vLLM Meetup!
We’re excited to invite you to vLLM’s first 🍁Canada meetup in Toronto, hosted by NVIDIA, Red Hat, and Vector Institute on September 25th at the beautiful Schwartz Reisman Innovation Campus, located next to University of Toronto’s downtown campus at University and College Street.
This meetup brings together vLLM users, developers, maintainers, and engineers to explore the latest in optimized inference. Expect deep technical talks, practical demos, and plenty of time to connect with the community.
Agenda (Subject to Change)
5:00-5:30: Doors Open & Meet the vLLM Team
5:30-5:55: Intro to vLLM and Project Update
5:55-6:30: Tackling Distributed Inference at Scale with vLLM
6:30-6:45: Break
6:45-7:00: Reducing Latency with EAGLE Speculative Decoding
7:00-7:15: Accelerating Inference Kernels with FlashInfer
7:15-7:30: Ways to Contribute and Closing Remarks
7:30-9:00: Networking with Light Refreshments 🥪 🤝 🧃
Important Information
Registration Deadline: Registration closes 24 hours prior to the event. We will be unable to admit any attendees who are not registered.
We look forward to seeing you there!