🏷️Topic

Moe

11 articles

First tracked: Dec 9, 2025

Last updated: Jul 9, 2026

Overview

Moe is a topic tracked in our intelligence system with 6 linked articles.

Latest Coverage

The only AI glossary you’ll need this year

↗

TechCrunch publishes a living AI glossary with concise, practical definitions of key terms (e.g., AGI, LLM, RLHF) and notes its ongoing updates, plus a small event promo embedded in the page.

Jul 4, 20261%

A 10 year old Xeon is all you need (for 26B-A4B MTP Drafters without GPU)

↗

A 2016 Intel Xeon server with 128 GB DDR3 RAM and no GPU runs a 26B Mixture-of-Experts model using CPU-optimized inference and a long, flag-heavy tuning process, illustrating memory-bandwidth limits and the claimed viability of open-weight AI on commodity hardware.

Jun 1, 20261%

Rotary GPU: Exploring Local Execution for Large MoE Models Under Limited VRAM

↗

A research paper demonstrates Rotary GPU enabling local execution of large Mixture-of-Experts models on consumer hardware (8 GB VRAM), achieving 2048 tokens at ~6.3 GB VRAM and ~21 tokens/sec, signaling edge-deployment viability under VRAM constraints.

May 31, 20261%

Liquid AI reveals 8B-A1B MoE trained on 38T

↗

Liquid AI unveils LFM2.5-8B-A1B, an 8B parameter MoE edge model with 128K context, 38T pretraining, expanded tokenizer, and strong on-device benchmarking and tool-calling capabilities.

May 29, 20261%

Real-time LLM Inference on Standard GPUs: 3k tokens/s per request

↗

Kog claims real-time LLM inference on standard datacenter GPUs can reach about 3,000 tokens/s per request on a 2B model by co-designing a monokernel runtime, GPU code, and a Laneformer architecture, with scalability toward frontier MoEs as memory bandwidth grows.

May 29, 20261%

ZAYA1-8B: An 8B Moe Model with 760M Active Params Matching DeepSeek-R1 on Math

↗

ZAYA1-8B is a sub-1B active-parameter open-source MoE model (8.4B total) trained entirely on AMD hardware, achieving competitive math/coding benchmarks and highlighting an AMD-focused pathway with open weights and proprietary inference tech.

May 7, 20261%

Unlock 11+ topic insights

Subscribe for real-time topic updates and unlimited access to our intelligence platform.

Get Watch Sign in

Related Entities

🏷️TopicHugging Face

🏷️TopicOpen-source

235

🏷️TopicMemory Bandwidth

🏷️TopicAnthropic

369

🏷️TopicNvidia

162