🏷️Topic

H100

6 articles

First tracked: Dec 9, 2025

Last updated: May 29, 2026

Overview

H100 is a topic tracked in our intelligence system with 6 linked articles.

Latest Coverage

Real-time LLM Inference on Standard GPUs: 3k tokens/s per request

Kog claims real-time LLM inference on standard datacenter GPUs can reach about 3,000 tokens/s per request on a 2B model by co-designing a monokernel runtime, GPU code, and a Laneformer architecture, with scalability toward frontier MoEs as memory bandwidth grows.

May 29, 2026 · 1%

Bluesky embraces long-form content to counter X Articles

A multi-article TechCrunch digest highlights mega AI funding, open-social/content interoperability, emerging AI-token futures markets, startup milestone metrics, and notable cybersecurity/regulatory risk signals.

May 28, 2026 · 1%

Matrix Multiplications on GPUs Run Faster When Given "Predictable" Data

GPU matmul throughput is driven more by power constraints and input data patterns than by theoretical compute: zero-valued inputs can yield higher sustained FLOPS because fewer transistors switch. CUTLASS shows gains over cuBLAS in profiler benchmarks, but real-world results depend on the framework, and power-limited performance lands far below marketed peaks.

May 27, 2026 · 1%

Boosting multimodal inference performance by >10% with a single Python dict

A technical blog post reports a 16% throughput gain and a ~11% end-to-end latency reduction in multimodal inference from caching CUDA IPC pool handles in a Python dict, which reduces host-side overhead in SGLang.

May 9, 2026 · 1%

Anthropic Gets in Bed With SpaceX as the AI Race Turns Weird

Anthropic inks a compute deal with SpaceXAI to access Colossus 1’s ~220,000 Nvidia GPUs and ~300 MW capacity in Memphis, as SpaceXAI eyes an IPO and orbital compute ambitions, all amid regulatory and environmental scrutiny and large cloud-spend implications.

May 6, 2026 · 1%

Mistral Releases Devstral 2 (72.2% SWE-Bench Verified) and Vibe CLI

Mistral launches Devstral 2 (123B) and Devstral Small 2 (24B) under open-source licenses, with strong SWE-bench results, cost-efficiency claims, and a new Vibe CLI, plus detailed deployment and pricing information.

Dec 9, 2025 · 1%
