Models — Page 2 · NeuralCoreNews

Models

Weights, releases, and the race to scale

27 articles in this section.

Google Gemma 4 12B: The Ideal Balance for Local LLM Deployment

Google’s new 12B model targets the gap between 8B and 70B models, offering high reasoning capabilities for 16GB RAM devices.

Jun 3, 2026 · 3 min read

Models

Alibaba’s Qwen3.7-Plus: Analyzing Hardware Requirements and Reasoning Capabilities

An analysis of Qwen3.7-Plus’s multimodal capabilities, the VRAM demands of its reasoning engine, and the implications of its licensing for developers.

Jun 2, 2026 · 3 min read

Models

MiniMax M3: The Reality of Million-Token Context Windows in Open-Weight Models

An analysis of the hardware constraints and retrieval quality challenges facing the MiniMax M3’s million-token context window for local deployment.

Jun 1, 2026 · 3 min read

Models

Liquid AI LFM2.5-8B-A1B: Efficient On-Device MoE Model Analysis

Liquid AI’s new MoE model balances 8.3B total parameters with 1.5B active parameters to optimize local inference speed and reasoning.

May 29, 2026 · 3 min read

Models

Claude Opus 4.8: A Polished Refinement Rather Than a Cognitive Leap

An analysis of the Claude Opus 4.8 update, arguing that minor refinements in steerability and pricing are not substitutes for genuine intelligence gains.

May 28, 2026 · 3 min read

Models

Soro: A Specialized Gemma 3 Fine-Tune for the Tajik Language

Soro leverages Gemma 3 to provide a local, culturally nuanced LLM specialized for Tajik, prioritizing efficiency and local inference over generalist models.

May 28, 2026 · 3 min read

Models

Evaluating the Trade-offs of the 4B Parameter Zerank-2 Reranker

An analysis of the latency and VRAM costs of using the 4B parameter Zerank-2 reranker in production RAG pipelines.

May 27, 2026 · 3 min read

Models

Stability AI Releases Stable Audio 3 Open Weights for Local Inference

Stability AI releases open weights for Stable Audio 3 Small and Medium variants, enabling high-quality audio generation on consumer GPUs.

May 27, 2026 · 3 min read

Models

Alibaba’s Qwen3.7-Max: The Gap Between Proprietary Power and Open Weights

An analysis of Qwen3.7-Max’s autonomous coding capabilities and the growing divide between proprietary APIs and open-weight AI models.

May 24, 2026 · 3 min read

Models

Microsoft Releases Fara1.5: Specialized Browser Automation Agents

Microsoft’s new Fara1.5 family of browser agents outperforms competitors in computer-use tasks, offering a high-performance 27B model for local deployment.

May 22, 2026 · 3 min read