GLM-5.2 open weights land the day Fable 5 goes dark — a direct power transfer in the model stack.
Top Signal
GLM-5.2 open weights: first model >80% Terminal-Bench, #1 Design Arena
new tool
r/LocalLLaMA
Zhipu's GLM-5.2 released open weights on HuggingFace and hit Ollama within hours of launch. It's the first open-weights model to cross 80% on Terminal-Bench — a benchmark testing real shell/coding task execution — and claimed #1 on Design Arena (above the now-export-controlled Fable 5) and #2 on WebDew Arena. API is live, GGUF weights available, HuggingChat already serving it. Timing is not coincidental: Anthropic's Fable 5 and Mythos remain unavailable outside the US. For builders, this is a direct drop-in for agentic coding tasks: strong terminal-command execution, design/code generation, MIT-licensed weights. GLM-5.2's terminal-task strength maps well onto agent scaffolding needing reliable shell execution. Pull it via Ollama now or hit the API — this is the most credible open-weights Fable replacement to date.
Read more →
Fast Signals
SpaceX acquiring Cursor (Anysphere) for $60B
platform change
HN Front Page
Reuters reports SpaceX is buying Anysphere, maker of Cursor, for $60B — the largest AI coding-tool acquisition on record. No operational changes announced, but builders who've standardized on Cursor should watch: enterprise lock-in dynamics, pricing, and API terms can shift materially post-acquisition. Diversify your editor dependencies if you haven't.
Link →
Vicki Boykis + llama.cpp creator declare local models ready for daily coding
emerging signal
HN Front Page, Simon Willison
Vicki Boykis's 'Running local models is good now' hit HN front page today. Same day, Georgi Gerganov (llama.cpp creator) stated he's been running Qwen3.6-27B daily for coding on his M2 Ultra and RTX 5090, calling it sufficient for 'small to medium tasks.' Two credible, independent signals in one day: local coding has crossed from experimental to routine for senior engineers.
Link →
Qwen-Robot Suite: foundation models for physical-world intelligence
new tool
HN Front Page, r/LocalLLaMA
Alibaba's Qwen team released a suite of models targeting robotics and physical-world intelligence, cross-sourced on HN and r/LocalLLaMA. If you're building with embodied AI, vision-language-action pipelines, or robotic control loops, this is the first serious open-weight alternative to Google's RT-2 lineage worth evaluating.
Link →
"Stop using Ollama" gains traction — llama.cpp server cited as faster alternative
workflow
r/LocalLLaMA
A r/LocalLLaMA post arguing against Ollama as the default local serving layer is circulating widely. Core critiques: model format lock-in, opaque quantization decisions, and performance overhead vs. bare llama.cpp server or LM Studio. If you're routing any production or high-throughput workloads through Ollama, benchmark your setup against llama.cpp's native HTTP server.
Link →
SubQ 1.1 Small: new small model with public technical report
new tool
HN Front Page
SubQ released a technical report for their 1.1 Small model, hitting HN front page with modest traction. The company appears focused on structured/tabular reasoning tasks. Too early to benchmark against Qwen3.6 or Gemma 4, but worth reading if you need compact models with documented training methodology and reproducible claims.
Link →
Radar
Le Gros Chaton: mystery model running on edge hardware
Multiple r/LocalLLaMA posts about 'Le Gros Chaton' — including someone running it on a 1984 Corolla radio. Open-source status unclear. Could be a new ultra-efficient French model or a joke that went viral; watch for weights to surface.
Link →
llama.cpp: CUDA + Vulkan simultaneous compile confirmed
A r/LocalLLaMA post confirmed you can compile llama.cpp to run CUDA and Vulkan backends simultaneously, enabling GPU+iGPU offloading on mixed hardware. Underdocumented capability — useful if you're serving models on systems with both NVIDIA and AMD/Intel graphics.
Link →
VibeThinker-3B claims frontier math/coding at 3B scale
The VibeThinker model scaled from 1.5B to 3B now claims frontier-level math and coding benchmark performance. If methodology holds, this is directly relevant for constrained deployments where 7B+ is too large to fit.
Link →
Convergence Watch
glm 5.2
TRENDING
8 mentions across r/LocalLLaMA
GLM-5.2 generated 7+ independent r/LocalLLaMA posts in a single day — extraordinary single-source density. The Fable 5 export-control vacuum is directly accelerating this. Expect HN and Simon Willison coverage tomorrow. The Design Arena #1 claim is the headline stat driving attention.
local model coding adoption
TRENDING
9 mentions across HN Front Page, Simon Willison, r/LocalLLaMA
Third consecutive day of cross-source convergence on local model viability. Today adds Vicki Boykis (HN), Georgi Gerganov (Simon Willison), and multiple r/LocalLLaMA coding agent threads. The inflection appears tied to Qwen3.6-27B hitting a threshold that satisfies experienced engineers, not just hobbyists.
claude fable 5
3 mentions across Simon Willison, r/LocalLLaMA
Coverage is shifting from 'what happened' to 'what fills the gap.' Builder signal from Fable 5 itself is exhausted — track GLM-5.2 and local model adoption as the active story instead.
STALE: Latent Space newest item is >48h old