MiniMax M3 Decodes 1M Tokens 15x Faster — and It Shouldn’t Be This Cheap
Author(s): Chew Loong Nian – AI ENGINEER. Originally published on Towards AI. On June 1, a Shanghai lab quietly shipped a model that decodes a 1-million-token context 15.6x …
I Ran a 1.5B-Active Model on My Laptop That Embarrassed a 26B by 46 Points
Author(s): Chew Loong Nian – AI ENGINEER. Originally published on Towards AI. I did not expect a model that activates 1.5 billion parameters to walk all over …
NVIDIA’s 550B Nemotron Embarrassed Every US Open Model — and It Shouldn’t Run This Fast
Author(s): Chew Loong Nian – AI ENGINEER. Originally published on Towards AI. NVIDIA just shipped a 550B-parameter open model that scores 48 on the Artificial Analysis Intelligence …
I Ran Claude Code on My MacBook With vllm-mlx — It Embarrassed llama.cpp by 87%
Author(s): Chew Loong Nian – AI ENGINEER. Originally published on Towards AI. I did something this week that I assumed would be a slow, frustrating downgrade: I …
Microsoft Just Embarrassed Browser Web Agents — 1,000 Lines Made GPT-5.4 Beat Opus 4.6 on 200 Web Tasks
Author(s): Chew Loong Nian – AI ENGINEER. Originally published on Towards AI. A Microsoft Research lab spent the last few weeks watching every other …
Sebastian Raschka’s New Repo Builds a DeepSeek-R1 Clone in 8 Chapters — and It Shouldn’t Be This Simple
Author(s): Chew Loong Nian – AI ENGINEER. Originally published on Towards AI. For the last year I have treated reasoning models the way most …
Two HTML Attributes Now Turn Your Website Into an AI Agent Tool — Inside Chrome’s WebMCP
Author(s): Chew Loong Nian – AI ENGINEER. Originally published on Towards AI. At Google I/O 2026, buried under Gemini 3.5 and a press release bragging about …
Merve Noyan Stopped Writing Training Scripts — Her Agent Just Fine-Tuned 18 Models Solo for $11.40
Author(s): Chew Loong Nian – AI ENGINEER. Originally published on Towards AI. The 17,300-view AI Engineer Singapore talk that quietly killed half my MLOps job. I watched Merve Noyan’s “Your Agent Can Now Train Models” talk three times this week. It went …