Apple Silicon is a topic tracked in our intelligence system with 5 linked articles.
PrismML unveils Bonsai Image 4B, two ultra-compact on-device diffusion models (1-bit and ternary) with footprints under 2 GB, enabling local image generation on iPhone and other devices with open weights under Apache 2.0.
MacOS-only ARM64 assembly static web server (ymawky) with explicit safety controls, PUT uploads up to 1 GiB, and GPL-3.0 licensing; portability to Linux requires significant changes.
Ghost Pepper is a 100% local on-device macOS speech-to-text app (v1.9.0) that runs WhisperKit transcription and a local Qwen cleanup model, emphasizes privacy with no data leaving the device, and includes enterprise-ready deployment details via MDM/PPPC and Accessibility permissions.
Parlor demonstrates real-time, on-device AI on Apple M3 Pro using Gemma 4 E2B and Kokoro TTS with end-to-end latency ~2.5–3.0s and ~2.6 GB model size, highlighting low server reliance and open-source licensing.
A practical, data-heavy guide to running Google Gemma 4 26B-A4B locally on macOS via LM Studio 0.4.0’s headless CLI, detailing MoE efficiency, hardware/memory requirements, performance metrics, and integrating Claude Code for offline coding tasks.
TL;DR: A practical, data-heavy setup guide to run Gemma 4 26B on an Apple Silicon Mac mini with Ollama, including auto-start, model preload, and keep-alive.
Subscribe for real-time topic updates and unlimited access to our intelligence platform.