🏷️Topic

Sglang

4 articles
First tracked: Jan 22, 2026
Last updated: May 29, 2026

Latest Coverage

Liquid AI reveals 8B-A1B MoE trained on 38T

↗

Liquid AI unveils LFM2.5-8B-A1B, an 8B parameter MoE edge model with 128K context, 38T pretraining, expanded tokenizer, and strong on-device benchmarking and tool-calling capabilities.

May 29, 20261%

Interaction Models

↗

Thinking Machines unveils 'Interaction Models'—a real-time, multimodal, two-model architecture with 200ms micro-turns and encoder-free fusion, aiming to embed interactivity directly into the model and benchmarked against multiple rivals.

May 12, 20261%

Boosting multimodal inference performance by >10% with a single Python dict

↗

A technical blog post shows a 16% throughput and ~11% end-to-end latency improvement in multimodal inference by caching CUDA IPC pool handles in a Python dict, reducing host-side overhead in SGLang.

May 9, 20261%

Sources: Project SGLang spins out as RadixArk with $400M valuation as inference market explodes

↗

SGLang spun out as RadixArk with a $400M valuation, backed by Accel, built on Ion Stoica’s UC Berkeley open-source work.

Jan 22, 20261%

Unlock 4+ topic insights

Subscribe for real-time topic updates and unlimited access to our intelligence platform.

Get WatchSign in

Related Entities

🏷️TopicBenchmarking
7
📈StockMLX
4
🏷️TopicMultimodal
9
🏷️TopicGpu/cpu Throughput
1
🏷️TopicBenchmarks
15