Publisher · verified by editors

MarkTechPost

AI news source. Articles are auto-selected and adapted by Hamidun News editors.

290 articles in Hamidun·Latest: July 17· Active·marktechpost.com ↗

Latest publications

Sakana AI Launches Sakana Translate Translator for Japanese, English, and Chinese

Sakana AI added Sakana Translate translator to Sakana Chat, built on Namazu models — with three modes: direct translation, proofreading, and document dialogue.

Jul 6, 2026·2 min

LLMMarkTechPost

Fine-tuning Gemma-3 on Mathematics Using GRPO and LoRA Adapters

A step-by-step GRPO workflow for fine-tuning Gemma-3 on the GSM8K dataset—from environment setup and LoRA adapters to reward functions and final model export.

Jul 6, 2026·3 min

LLMMarkTechPost

Page Agent from Alibaba Manages Web Interfaces via DOM Without Screenshots and Multimodal Models

Alibaba introduced Page Agent — a client-side JavaScript agent that understands text commands and manages web pages via DOM without screenshots and server-side code.

Jul 6, 2026·2 min

LLMMarkTechPost

Anthropic Launches Claude Science Beta: Multi-Agent Environment for Genomics and Proteomics

On June 30, 2026, Anthropic opened beta access to Claude Science — a multi-agent platform for reproducible pipelines in genomics, proteomics, and cheminformatics.

Jul 5, 2026·2 min

LLMMarkTechPost

NVIDIA Introduces ASPIRE — Self-Learning Robotics Framework Achieving 31% Zero-Shot Success on Complex Tasks

NVIDIA released ASPIRE — a robotics framework that writes and improves control programs, accumulates verified solutions in a skill library, and transfers them to new tasks without fine-tuning.

Jul 5, 2026·2 min

LLMMarkTechPost

NVIDIA HORIZON: Autonomous Agent Closes 100% of RTL Benchmarks in Chip Design

NVIDIA introduced HORIZON — an agent that independently writes and verifies RTL code for chip design, achieving 100% completion on standard benchmarks.

Jul 5, 2026·2 min

LLMMarkTechPost

Former Qwen Technical Lead: Why Hybrid Reasoning Failed and What's Next

Zunyuan Lin, former Qwen development lead at Alibaba, explained where Qwen3's dynamic reasoning failed—and why agentic AI requires a fundamentally different training approach.

Jul 5, 2026·2 min

LLMMarkTechPost

LlamaIndex released legal-kb: agentic document search through four tools on Index v2

LlamaIndex published the legal-kb reference application, giving agents file-like access to a knowledge base via retrieve, find, read, and grep on the Index v2 engine.

Jul 5, 2026·2 min

LLMMarkTechPost

NVIDIA Released Nemotron-Labs-TwoTower — Diffusion Language Model with Open Weights

NVIDIA has opened the weights of the diffusion language model Nemotron-Labs-TwoTower, which combines a frozen AR backbone with a diffusion head to accelerate text generation.

Jul 4, 2026·2 min

LLMMarkTechPost

Interfaze released an open-source diffusion ASR model for speech recognition in six languages

Interfaze published diffusion-gemma-asr-small — the first open-source ASR adapter built on Google's DiffusionGemma: 42 million parameters, six languages, and inference cost tied to the number of denoising steps.

Jul 3, 2026·2 min

LLMMarkTechPost

WebBrain: open-source browser AI agent runs without the cloud or a subscription

WebBrain is a free MIT-licensed agent for Chrome and Firefox that reads pages and automates tasks, running locally via llama.cpp or Ollama.

Jul 3, 2026·2 min

LLMMarkTechPost

Anthropic Releases Claude Fable 5 and Mythos 5: One Model, Two Access Tiers

Anthropic opened Fable 5 to the public with full safety classifiers, while Mythos 5—with cyber restrictions removed—rolls out only through the closed Project Glasswing program.

Jun 30, 2026·3 min

LLMMarkTechPost

How to speed up transformer training with NVIDIA Apex: FusedAdam, FusedLayerNorm, and torch.amp

A step-by-step guide to building NVIDIA Apex from source and using FusedAdam, FusedLayerNorm, and mixed precision — up to 2.5× faster transformer training.

Jun 30, 2026·2 min

LLMMarkTechPost

JetBrains released Mellum2 — an open 12B MoE model for fast tasks in AI pipelines

JetBrains open-sourced Mellum2 under Apache 2.0 — a compact MoE model with 12 billion parameters, trained on 10.6 trillion tokens for integration into multi-model AI pipelines.

Jun 30, 2026·2 min

LLMMarkTechPost

Alibaba launched Qwen3.7-Plus: multimodal agent with video analysis and autonomous coding

Alibaba released Qwen3.7-Plus, a multimodal agent, on the Bailian platform — the model can process images and video, invoke tools, and iterate on code on its own without operator involvement.

Jun 30, 2026·2 min

LLMMarkTechPost

PyGraphistry: Graph analytics for threat detection in enterprise data

Researchers published a PyGraphistry Colab workflow that turns enterprise access data into an interactive graph with ML-based anomalies and three visualization modes for incident investigation.

Jun 30, 2026·2 min

LLMMarkTechPost

Google DeepMind released DiffusionGemma — an open 26B MoE model with 4x faster generation

Google DeepMind released DiffusionGemma, an experimental 26B-parameter open model that uses text diffusion instead of autoregression — generation is up to 4x faster on GPU.

Jun 30, 2026·2 min

LLMMarkTechPost

Microsoft SkillOpt: automatic prompt optimization instead of manual trial and error

Microsoft SkillOpt optimizes AI prompts through an automatic six-step loop — reflection, selection, validation gating — and shows measurable accuracy gains as early as the first iterations.

Jun 30, 2026·2 min

LLMMarkTechPost

Cohere's North Mini Code: open-weight 30B MoE model for agentic coding

Cohere has released its first open-weight model for developers — North Mini Code, built on MoE architecture with 30B parameters, a 256K context window, and the ability to run on a single H100 GPU.

Jun 30, 2026·2 min

LLMMarkTechPost

MiniMax released MSA: sparse attention speeds up 1M-token context processing by 28x

MiniMax unveiled MSA, a two-branch sparse attention mechanism trained on a 109B-parameter MoE model that is 28x faster than standard GQA at a context length of 1M tokens.

Jun 30, 2026·2 min

LLMMarkTechPost

Docling Parse: PDF parsing pipeline with character- and coordinate-level precision

The tutorial shows how to use Docling Parse for structured PDF analysis: coordinates for every word, visual overlays, and export to JSON and CSV for RAG systems.

Jun 30, 2026·3 min

LLMMarkTechPost

TimeCopilot: How to Build a Forecasting Pipeline with Foundation Models and Anomaly Detection

TimeCopilot lets you build a complete forecasting pipeline — from statistical to foundation models — with automatic anomaly detection and an LLM agent that explains predictions.

Jun 29, 2026·2 min

LLMMarkTechPost

Cisco open-sources FAPO: automatic LLM pipeline optimization with step-by-step diagnostics

Cisco Foundation AI has open-sourced FAPO — a Claude Code-based tool that finds the weak link in a prompt chain on its own and optimizes the pipeline to a target accuracy.

Jun 29, 2026·3 min

LLMMarkTechPost

MoonMath AI releases open-source HIP kernel for AMD MI300X that outperforms official AITER v3

Startup MoonMath AI has open-sourced an attention kernel for AMD MI300X that beats AMD's official library across all tensor shapes and rounding modes.

Jun 29, 2026·2 min

LLMMarkTechPost

xAI launches /goal in Grok Build: an autonomous agent plans and verifies multi-step tasks

Grok Build now has a /goal mode: the agent receives a single goal, builds its own execution plan, works through a checklist, and verifies the result at every stage until full completion.

Jun 29, 2026·2 min

LLMMarkTechPost

How to build an AI agent in Google Colab without frameworks: a tutorial with memory and MCP

A step-by-step tutorial on building a lightweight AI agent in Google Colab — with tool registration, session memory, skills, and an MCP server, without external frameworks.

Jun 28, 2026·2 min

LLMMarkTechPost

OpenAI unveils GPT-5.6 with three tiers — Sol, Terra, and Luna — and reasoning modes

OpenAI has announced the GPT-5.6 family with three models — Sol, Terra, and Luna — and new reasoning modes, max and ultra. Access is currently limited to selected developers.

Jun 28, 2026·2 min

LLMMarkTechPost

Perplexity launches Computer for Counsel — an AI agent for legal teams

Perplexity has launched a specialized AI agent for lawyers: routing across 20+ models, integration with Microsoft 365 and Midpage, and all answers backed by verifiable source links.

Jun 28, 2026·2 min

LLMMarkTechPost

NVIDIA Open-SWE-Traces: data preparation for fine-tuning coding agents

The tutorial shows how to work with the NVIDIA Open-SWE-Traces dataset — parse agent trajectories, analyze code patches, and build an SFT dataset for training AI agents.

Jun 28, 2026·2 min

LLMMarkTechPost

DeepSeek released DSpark — a framework that speeds up generation on DeepSeek-V4 by 57–85%

DeepSeek open-sourced DSpark — a speculative decoding framework for DeepSeek-V4 that speeds up generation by 57–85% without loss of quality.

Jun 28, 2026·2 min