Habr AI

Claude Code without the magic: Habr breaks down the architecture, context noise, and engineering practices
Habr has published a detailed breakdown of Claude Code: why the agent loses efficiency in long sessions, how context gets consumed, and why

Ollama and LiteLLM: how to turn a Python script into a complete console-based LLM chat
In the second part of the practical guide, the author shows how to turn a basic Python script with Ollama and LiteLLM into a console chat wi

Sber showed how RAG and LLM in the IDE turn manual scenarios into automated tests
Sber described an IDE plugin that uses RAG and LLM to search for examples in the codebase and turn manual scenarios into automated tests, wh

VS Code 1.111 launched Autopilot: AI agent writes, tests, and deploys on its own
Microsoft released VS Code 1.111 with Autopilot mode — the AI agent now works autonomously and completes the entire task without the develop

Habr explained why language models and classic RAG lose their understanding of relationships
Habr examines why large language models with classic RAG are good at finding text fragments but struggle when they need to reconstruct relat

BotHub listed 10 free AI tools for text, code, video, 3D and documents
BotHub published a roundup of 10 free AI tools, including options for text, code, video, 3D models, document work and autonomous tasks.

LLMs in development: the 4 approaches teams use and how they differ
LLMs are no longer used only for autocomplete: this piece breaks down four modes of AI-assisted development along two axes — how much code a

Cursor, Copilot, and Claude Code are included in a roundup of 12 popular AI agents for developers
The roundup features 12 AI tools for coding — from Cursor and Copilot to GigaCode and Snyk: editors, cloud IDEs, and security services with

OpenAI, Google, and Tesla set the week's agenda: GPT-5.4 mini, AI Studio, and Terafab
OpenAI opened GPT-5.4 mini to broad access, Google turned AI Studio into an app-building environment, and Tesla promised its own $20–25 bill

AI Independence Bench compared 49 models and measured their resilience to user pressure
The new benchmark, AI Independence Bench, tests whether 49 language models can maintain their own stance, avoid apologizing without reason,

How the CEO of digital agency bijobs.ru built a brochure website over a weekend with Claude Code
Bogdan Nepryakhin of bijobs.ru explained how he replaced a month of work by a designer and a front-end developer with Claude Code over a wee

A way to sync sessions between a PC and laptop is now available for OpenAI Codex
OpenAI Codex has no built-in synchronization between devices, so a CLI tool called codexSync has appeared to transfer sessions, context, and

Python developer reined in Claude Code with a public anti-regression config
A solo developer at CREATMAN built a configuration for Claude Code that keeps the agent from straying beyond the task and breaking projects

Gemini: how Google turned Bard's failure into 750 million users and AI leadership
In 2023, an error in a Bard ad wiped $100 billion off Alphabet's market value — today Gemini outperforms GPT-5.1 in benchmarks and reaches 7

Perplexity turned AI search into an agent that builds Excel files and checks data on its own
In seven minutes, Perplexity Computer assembled a four-sheet Excel file with 33 sources, wrote its own Python script, and showed how AI is m

Developer built a statusline for Claude Code with VPS monitoring in a single session
A developer got tired of guessing the context state and server load during long Claude Code sessions — and built a custom statusline right i

DeepMind proposed ten cognitive scales for measuring progress toward AGI
Google DeepMind published the framework "Measuring Progress Toward AGI" — ten independent cognitive scales that for the first time make it p

AI agents vs RAG: how ReAct works and why multi-agent systems are needed
A single LLM response is no longer enough — a chain of actions is needed. We explain what AI agents are, how they differ from RAG, how ReAct

Lemana Tech showed how it combined LLM, RAG, and traditional ML in tech support
The company described a hybrid support setup: fast ML classifiers handle high-volume tickets, while LLMs with RAG are used where Wiki-based

Raiffeisenbank implemented a RAG assistant in Kotlin without Python or new expertise
Raiffeisenbank’s team built an internal RAG assistant on the JVM stack — Kotlin and Spring Boot — without Python and without bringing additi

Garage Eight: how AI is changing the work of analysts and why junior positions are disappearing
A Garage Eight analyst described six trends that are already reshaping the profession: AI is taking over routine work, raising the bar for j

OpenClaw broke multi-agent work down into three modes: standalone agents, subagents, and ACP
OpenClaw explained how it structures multi-agent work through Telegram: standalone agents get memory and a workspace, subagents handle one-o

Habr AI Demonstrates How Reflex Architecture for LLM-Agents Eliminates Lag Down to 60 FPS
Habr AI explored dual-process architecture, where a fast reflex layer handles instantaneous reactions, while the LLM handles semantics, plan

Apple bets on local AI in M-series chips, not giant models
While the market measures itself by data centers and GPU clusters, Apple is pushing a different scenario: running AI near the user, on devic

Developer revealed how he built Roomify — an AI interior visualizer on React and Puter
Roomify converts floor plans into photorealistic 3D renders in seconds, with the entire project logic running on React, Puter, and Claude/Ge

Entrepreneur built auto service CRM with ChatGPT and Cursor
An auto service owner without IT background described how since 2022 he built a CRM, Telegram bot and Android app for offline businesses usi

How Personal Data Anonymization Affects LLM Agents: Hivetrace Dataclean Experiment
The authors examined how much a banking LLM agent loses in quality after data anonymization by comparing clean requests, masks, and pseudony

Capsules for AI-Agents: How Packaged Developer Experience Becomes Machine Knowledge
The capsule framework's author demonstrates how a capsule's rigid structure — context, constraints, and history — becomes the ideal format f

How to Run DeepSeek on Your Server: Memory, Config, and Complete Privacy
A detailed guide to installing DeepSeek on a cloud server — how much memory you need for different model variants, which tools to use, and h

Stable Diffusion XL at Home: A Guide for Those Who Thought It Was Too Complicated
Publisher BHV has released a practical guide to Stable Diffusion XL — for those who want to generate images without subscriptions, locally a