Habr AI

Surf explains why teams sabotage AI implementation in product development
Surf executives described five stages of AI adoption in development—from denial and anger to equilibrium—and explained why forced rollout al

Google NotebookLM, Canva and Gamma: 13 Conditionally Free AI Services for Presentations in 2026
A new roundup compares 13 AI services for presentations — from Google NotebookLM and Gamma to Canva and Decktopus; free access almost everyw

NGT Memory Module Shows How to Give an LLM Persistent Memory Without a Vector Database
The open-source NGT Memory module offers persistent memory for LLMs without an external vector database: it stores a user profile, links facts, and retri

Claude Code: Which keyboard shortcuts accelerate work and where they can interfere
Claude Code users discovered useful key combinations for terminal work: quick change rollback, background tasks, mode switching, and tips fo

agent-pool for Google AI and Gemini CLI released with pipelines, cron, and bounce-back
The agent-pool MCP server for Google AI and Gemini CLI automates multi-agent chains: it launches steps based on triggers, returns tasks for rev

Agentic Legal RAG Challenge 2026: How Sparks of intelligence Tested the Limits of Agentic RAG
The Sparks of intelligence team analyzed their experience in the Agentic Legal RAG Challenge 2026 and demonstrated why chunking, search, gro

RooCode for VS Code: How to Set Up Multi-Agent Development Without Chaos and Unnecessary Model Requests
A detailed RooCode configuration guide shows how to split roles across modes, connect MCP servers, and gradually improve AI agent performanc

Yandex 360 Explained How to Teach AI Assistants to Work with Internal UI Kits
The Yandex 360 team shared how to make a frontend repository understandable to AI — and why without this, the assistant will hallucinate com

LinkedIn and SHRM: AI Made Resumes a Weak Signal and Turned Them Into a Hiring Filter
Candidates polish resumes with AI, companies filter them with algorithms, and ultimately the value of a 'perfect' CV drops: case studies, in

MCP LLM-Agent in Fusion360: First Vibe-Design Test on a Wooden Staircase
Fusion360 rolled out an MCP LLM-assistant — a Habr author immediately tested it on a wooden staircase he had previously created manually ove

Claude Code Raised Legal RAG to 0.791, but ARLC 2026 Final Hit Scaling Limits
The ARLC 2026 case demonstrates how Legal RAG can be improved across 17 iterations, then lose 42% of results as the corpus scales from 30 to

OpenAI and Grok Lose to Custom RAG in Legal Agentic RAG Challenge
A team on Mac Studio M3 Ultra compared RAG, CAG, BM25, and built-in databases from OpenAI and Grok in a legal challenge and found: grounding

Habr: How an Outdated Knowledge Base Breaks LLM Agents and How to Fix It
Habr explored why outdated documentation is more dangerous for LLM agents than no documentation at all, and proposed three practices: automated checks,

Vibe Coding Promises 10x Productivity Gains — PyPI Hasn't Confirmed It
Developers claim 10–100x acceleration thanks to AI tools, but if that's true — why doesn't PyPI show explosive growth in Python packages?

Anthropic Strengthens Claude: Why New Corporate Customers Choose It Over ChatGPT
Ramp data from March 2026 shows a shift in corporate demand: among companies purchasing AI tools for the first time, Anthropic's Claude nota

NotebookLM and Gemini Automate Test Creation via Google Apps Script and Forms
An author demonstrated how to integrate NotebookLM, Gemini, Google Apps Script, and Google Forms to quickly assemble educational tests from

Doka launches a local AI agent for Russia without a VPN, subscription, or cloud
A developer bundled a custom AI agent into the Doka desktop app: the service runs locally, uses the Qwen3 model, and requires no cloud, VPN,

Why Language Models Make Mistakes Even When They Know the Right Answer: Breaking Down LLM Limitations
TechWill researcher Valery Shabashev analyzed why LLMs can make mistakes even when they have the correct answer: the problem isn't in knowle

LLM Ensemble Examined Theological Interpretations: 1 Tim. 2:15 as a Static Analysis Case
Five language models compared two interpretations of 1 Tim. 2:15 and demonstrated that an LLM ensemble can avoid seeking 'truth' while uncov

Bootik: a console AI agent for servers that runs locally and doesn't require much memory
Bootik is a lightweight console AI agent for SSH operations: it launches on demand, understands server context, and is designed for local mo

Anthropic: agentic AI and GitHub Copilot are changing development rules in 2026
In 2026, AI coding tools no longer just suggest code: Claude Code, Copilot, and Cursor now read repositories themselves, plan changes,

SkillMarket built a marketplace of skills for AI agents on FastAPI and Claude Sonnet
The SkillMarket team showed an MVP of a platform where industry experts convert their experience into sellable skills for AI agents, with Fa

Sber Explained Why AI-Generated Code Looks Reliable But Breaks Under Real Load
Sber analyzed why AI-generated code often looks high-quality in code review but fails in production, and what engineering practices help reduce

"Advanced Payment Solutions" Launched Voice AI Assistant for Calls to Pilot Without ML Team
"Advanced Payment Solutions" brought a voice AI assistant for calls to pilot in six months, despite assembling the team from 12 backend deve

Habr Explained How to Force LLMs to Calculate Without Errors Through Python Code Generation
Habr analyzed why language models regularly make arithmetic mistakes and demonstrated a working approach: LLM writes Python code, while the

Paperclip Promises a Company of AI Agents but Drowns in Bureaucracy During Testing
A Habr author tested Paperclip — a system for managing teams of AI agents — and instead of accelerating development, got two days of coordin

Krok Shows How It Built an Internal RAG Assistant for Corporate Data
Krok explained how it assembled an internal RAG assistant for searching corporate data with on-prem deployment, ACL control, quality benchma

Sergey Smirnov explained how to prepare an AI agent for reliable operation in production
AI engineer Sergey Smirnov outlined a roadmap for deploying an AI agent to production: from concept and initial launch to reliable, predicta

Google NotebookLM Helps Build a Personal AI Prompt Engineer in 15 Minutes
Instead of endless prompting courses, the author proposes creating a personal assistant in Google NotebookLM that answers only based on sele

Microsoft GraphRAG and Ollama: How Graph-Based RAG Performed on Local Models
The author tested Microsoft GraphRAG with Ollama on the "Johnny Mnemonic" narrative and concluded: graph-based RAG delivers more relevant an