Habr AI

Smart Service Group tests voice control for pallet transport robot
Smart Service Group's initial test showed that voice can trigger pallet robot scenarios in a warehouse, but only with strictly defined comma

Anthropic removes Claude Code from $20 plan, SpaceX prepares Cursor acquisition
Anthropic tests removal of Claude Code from $20 subscription, Duolingo removes AI metrics for employees, and closed Claude Mythos model foun

OpenAI released GPT-5.5: stronger in programming, agents, and computer work
OpenAI launched GPT-5.5 focused on code, agentic tasks, and computer work: the model is already available in ChatGPT and Codex, but the API

NextFilm describes movie recommendation model: cold start, taste vector and GPT layer
NextFilm's author showed how to recommend movies to new users: collect initial ratings, build a taste vector, compare it with MovieLens and

n0x Developer Taught His Browser Agent to Open Sites and Take Screenshots
The n0x project evolved from a regular language chatbot into a browser agent with MCP support: it now opens websites, takes screenshots, and

Anthropic Tests Claude Mythos: Leak Reveals Model with 10 Trillion Parameters
An internal Anthropic leak has revealed Claude Mythos — a model the company considers its most powerful AI and is not yet ready to release p

Anthropic and OpenClaude: why 'free' Claude Code in 2026 isn't really free
After Claude Code's source code leak, the community quickly assembled OpenClaude, but behind the promise of free AI coding lie compatibility

How a single system instruction turns an LLM into a reliable tool: tests on Qwen and DeepSeek
A single system prompt can eliminate LLM hallucinations: an experiment with Qwen and DeepSeek showed that an 'exoskeleton' of rules transfor

T-Technologies on Open Source in AI/ML: Inside the LLM Development Process
Interview with the AI/ML team at T-Technologies Group — about LLM development, participation in open source, and research directions.

NVIDIA at GTC 2026 Shifts Focus From Chips to Token Factories and Agent-as-a-Service
At GTC 2026, NVIDIA showcased a bet not on individual GPUs, but on token factories, the modular Vera Rubin architecture, and AI agents as a

PageIndex from VectifyAI offers embedding-free search for long documents
PageIndex builds a tree-structured document outline and searches for relevant sections through LLM reasoning, promising RAG without embeddin

GolangConf 2026 and Ontiko: Why Go Teams Need to Fix Architecture, Not Code Speed
Ontiko is restructuring GolangConf 2026 around the real pain points of Go teams: AI has accelerated code writing, but architectural decision

ruGPT3XL Gains 8k Context: Restored Model Transcends 2k Limit with Minimal Losses
The ruGPT3XL restoration author fixed sparse attention, increased model context from 2k to 8k, and preserved quality on short sequences with

OpenAI, MiniMax and Nvidia Set the Tone for March in AI: Sora, GPT-5.4 and the Bet on Mira Murati
March in AI was defined by major product shifts: OpenAI reconsiders Sora's future, Google and Anthropic accelerate their assistants, while M

AI-first startups: why growth marketing stalls and what breaks in the funnel
Strong top-of-funnel metrics for AI-first products often mask false demand: people come for novelty, not to solve a problem, so the conventi

Habr: AI agents change delivery, and teams must rebuild the entire development cycle
Habr explains why with the emergence of AI agents, teams need to restructure not only code writing but the entire delivery: context, checks,

M2 delegated 40% of marketing texts to AI and maintained content quality
The M2 team embedded an AI copywriter into its internal editorial department, assigned 40% of product and marketing texts to the model, and

Sova AI Released Android Assistant That Controls Phone Without PC and Root
Sova AI unveiled an Android application with an AI agent that opens apps, presses buttons, scrolls screens, and executes voice commands with

OpenAI API and GPT Fan-Out Queries: How SEO Specialists Analyze AI Search
The author demonstrates how to retrieve hidden GPT fan-out queries via the OpenAI API and use them to analyze how AI models gather sources a

Hack The Box: How MCP Inspector Turns AI Tools into a New Attack Vector
A breakdown of Kobold from HTB Season 10 shows how a single dev utility for AI servers can lead to RCE, LFI, credential reuse, and complete

LLM-based system reduced quality control map preparation at metallurgical plant from 2 hours to 5 minutes
At a metallurgical plant, an LLM-system began assembling quality control maps in 3–5 minutes instead of two hours: not a universal prompt, b

Habr AI Shows How to Add Memory and Context to an LLM Chat in Python with Ollama and LiteLLM
In a new part of the tutorial on Python chat with Ollama and LiteLLM, the author demonstrates how to store message history, pass context to

Claude Sonnet and Jarvis Pattern: why AI agents might not need more than an operating system
On Habr, developers proposed building personal AI agents not around complex frameworks, but on a combination of LLM, operating system, and f

Yandex showed how to reach Alice answers and measure search visibility
Yandex launched a 'Website Visibility in Alice' section in Webmaster. The message to businesses is clear: assistant answers come not from tr

Flant: How a Go Developer Turned Zed and Gemini into a Useful AI Agent
A Go developer from Flant described the path from slow IDE plugins to a combination of Zed, Gemini 3 Flash, and gopls-mcp, which provides an

Agentis Memory: Redis-Compatible Storage with Vector Search and Local Embeddings
Agentis Memory transforms a Redis-compatible store into shared memory for AI agents: with local embeddings, built-in vector search, and no e

Habr: How synthetic data helps train models and why self-training leads to collapse
Synthetic data helps AI compensate for a shortage of quality human-generated corpus, but with uncontrolled self-training, models begin to lo

Why ChatGPT and Gemini Won't Recommend Your B2B SaaS, Even if Your Website Is Well-Built
Even a well-designed B2B SaaS website may not appear in ChatGPT, Gemini, and Perplexity responses if your brand lacks a clear category, exte

Why AI in UI Design Matters Not for Production, but as a Source of Visual Mutations
AI-generated UI is valuable not only for rapid sketching: its power lies in rare visual combinations that help designers discover new approa

Qwen and llama.cpp: how to run a local neural network without the cloud on your computer or server
A practical guide explaining how to run the Qwen model through llama.cpp on your own computer or server to work with a local neural network