On-device Ai is a topic tracked in our intelligence system with 5 linked articles.
On-device AI should be the default to improve privacy, reliability, and developer control, demonstrated with Apple tooling and a Brutalist Report on-device summarization example.
Chrome may download a 4GB on-device AI model file (weights.bin) tied to Gemini Nano, causing storage usage when AI features are enabled.
Ghost Pepper is a 100% local on-device macOS speech-to-text app (v1.9.0) that runs WhisperKit transcription and a local Qwen cleanup model, emphasizes privacy with no data leaving the device, and includes enterprise-ready deployment details via MDM/PPPC and Accessibility permissions.
Parlor demonstrates real-time, on-device AI on Apple M3 Pro using Gemma 4 E2B and Kokoro TTS with end-to-end latency ~2.5–3.0s and ~2.6 GB model size, highlighting low server reliance and open-source licensing.
Gemma Gem embeds the Gemma 4 model in a browser extension with ~500MB local download and 128K context, running entirely on-device without API keys or cloud.
Google’s AI Edge Gallery adds Gemma 4 support for on-device LLMs on iPhone with offline privacy, while detailing data collection and EU compliance posture.
Subscribe for real-time topic updates and unlimited access to our intelligence platform.