Source

Habr AI

708
total articles
403
this week
30 апреля
last update
RSSOriginal site →
LLM
LLM·Habr AI

Anthropic and ETH Zurich: a long CLAUDE.md worsens agent performance and raises costs

A study by ETH Zurich across 138 repositories found that long CLAUDE.md and AGENTS.md files often reduce coding-agent success rates while al

2026-04-30·3 мин
LLM
LLM·Habr AI

Gemini 3.1 Pro outperformed ChatGPT 5.4 and Claude Opus 4.6 in a text generation test

An author-led comparison of three top models across four literary tasks found that Gemini 3.1 Pro handled genre, emotion and brevity better,

2026-04-30·3 мин
LLM
LLM·Habr AI

Anthropic, OpenAI, and Cursor: Eight Levels of Agent Engineering Maturity

Habr AI broke down eight levels of agent engineering — from tab-complete and context engineering to background agents and autonomous teams t

2026-04-30·3 мин
LLM
LLM·Habr AI

AMD RX580 ran an LLM locally: how to tame ROCm, Ollama, and get GPU inference

An engineer showed how to make an old AMD RX580 run an LLM reliably with ROCm and Ollama, breaking down false signs of GPU activity, hipMemG

2026-04-30·3 мин
LLM
LLM·Habr AI

Why Claude 4.6 Isn't Enough Without Context: The Main Blind Spot in LLM Development

Even a strong model like Claude 4.6 loses effectiveness without systematically gathered context: a knowledge base, links between services, a

2026-04-30·3 мин
LLM
LLM·Habr AI

Developer builds AI-powered news aggregator with MCP, DeepSeek, and Telegram bot

After the drone attack over Dubai, a developer built a multilingual news aggregator: 80+ sources, an MCP server for AI agents, an AI chat, a

2026-04-30·3 мин
LLM
LLM·Habr AI

BorisovAI tested MoE on an RTX 4090 and showed why perplexity breaks LLM evaluation

BorisovAI tested MoE with plug-in experts on a single RTX 4090 and found that an impressive perplexity score does not guarantee real quality

2026-04-30·3 мин
LLM
LLM·Habr AI

LLM experiment showed how a model’s “personality” emerges in latent space

An experiment with a modular LLM showed that a separate latent vector can store not only text style, but also stable behavioral traits resem

2026-04-30·2 мин
LLM
LLM·Habr AI

Nvidia hints at an optical chip that could reshape AI data centers ahead of GTC 2026

Ahead of its March 16, 2026 presentation, Nvidia raised expectations with a promise of “a chip that will shake the world,” and silicon photo

2026-04-30·3 мин
LLM
LLM·Habr AI

Why DeepMind's AGI advances do not answer the key question of machine consciousness

The author explains why growing computing power and DeepMind's AGI advances do not amount to the emergence of consciousness: intelligence ca

2026-04-30·3 мин
LLM
LLM·Habr AI

Habr AI on the future of work: how AI and robots could return society to a new antiquity

Habr AI argues that the combination of AI, robotics, and neural implants could do more than reshape the labor market — it could divide socie

2026-04-30·2 мин
LLM
LLM·Habr AI

Unity showed how to build voice NPCs with memory and world context

A step-by-step guide shows how to build voice NPCs in Unity with a local model, dialog memory, knowledge of the game world, and spoken respo

2026-04-30·2 мин
LLM
LLM·Habr AI

Harvard: AI is cutting junior hiring, and in three years that could affect the entire industry

Harvard reports a drop in junior hiring after AI adoption, while METR points to growing dependence of experienced developers on AI assistant

2026-04-30·3 мин
LLM
LLM·Habr AI

DeepSeek and GLM-5 beat Yandex in a test of 34 AI models for managers without VPN

The authors of a large test of management scenarios found that DeepSeek V3.2 and GLM-5, available in Russia without VPN, are markedly strong

2026-04-30·3 мин
LLM
LLM·Habr AI

Google released Gemini Embedding 2 for multimodal RAG with video, audio, and PDF

Google released Gemini Embedding 2, a model that embeds text, images, video, audio, and PDF into a single space and simplifies building mult

2026-04-30·2 мин
LLM
LLM·Habr AI

Bitrix24 listed eight common mistakes in developing MCP servers for LLMs

A Bitrix24 developer explained why MCP servers fail on authorization, call chains, poor tool descriptions, testing, security, and context ov

2026-04-30·3 мин
LLM
LLM·Habr AI

Why Yann LeCun's world model idea does not solve the main crisis in LLM development

After Yann LeCun's departure from Meta, his world model concept is once again being discussed as a path beyond LLMs, but critics argue that

2026-04-30·3 мин
LLM
LLM·Habr AI

A Physical AI pipeline for SO-101 was assembled on top of ROS2 and LeRobot for 30,000 rubles

An open-source stack based on ROS2 and LeRobot makes it possible to build a full Physical AI pipeline on the low-cost SO-101: teleoperation,

2026-04-30·3 мин
LLM
LLM·Habr AI

SimpleOne launched SimpleGen — an AI tool for development and deployment on the platform

SimpleOne introduced SimpleGen — an AI tool for generating solutions on its platform: developers only need to prepare a repository, access t

2026-04-30·3 мин
LLM
LLM·Habr AI

Google AI Ultra: how to turn a subscription into a pool of parallel agents and model consensus

Google AI Ultra is being proposed as the foundation for a multi-agent stack: run parallel Gemini workers, delegate routine tasks, and cross-

2026-04-30·2 мин
LLM
LLM·Habr AI

Sam Altman and the Pentagon: how military contracts could become insurance for OpenAI

An op-ed on the OpenAI-Pentagon connection argues that military contracts give Sam Altman's company not only money and data, but also protec

2026-04-30·2 мин
LLM
LLM·Habr AI

OpenAI released GPT-5.4 Pro: new records in ARC-AGI-2, FrontierMath, and logic

OpenAI introduced GPT-5.4 Pro — a model that made sharp gains on difficult benchmarks, solves reasoning tasks better, and handles unconventi

2026-04-30·2 мин
LLM
LLM·Habr AI

nullClaw on Zig outperformed OpenClaw in memory and startup in local AI agent tests

nullClaw, a lightweight AI runtime on Zig, showed near-instant startup and several times lower memory usage in a local comparison with OpenC

2026-04-30·3 мин
LLM
LLM·Habr AI

OpenAI's ChatGPT 5.4 beat Claude Opus 4.6 and Gemini 3.1 Pro in a Habr comparison

Habr published a comparison of three flagship models in routine tasks: ChatGPT 5.4 ranked first by total score, Gemini 3.1 Pro was the cheap

2026-04-30·3 мин
LLM
LLM·Habr AI

Study: Cursor speeds up early development, but later adds to the team's technical debt

A study of Cursor found that the AI assistant sharply accelerates code delivery in the first weeks, but then increases complexity, the numbe

2026-04-30·2 мин
LLM
LLM·Habr AI

Yandex at AI Dev Day Showed How AI Is Already Changing Development at Avito, Ozon, and T-Bank

At AI Dev Day, companies Yandex, Avito, Ozon, T-Bank, and Sber demonstrated where AI is already accelerating development and where the effec

2026-04-30·3 мин
LLM
LLM·Habr AI

How AI is changing indie development: it's getting harder for solo developers to compete

The columnist argues that AI has sped up MVP launches, but at the same time raised the barrier to entry, intensified marketing competition,

2026-04-30·3 мин
LLM
LLM·Habr AI

A company without managers: three traps companies fall into when implementing AI

Of 50 executives at ProIT Fest, only three said AI had actually made decision-making easier — even as companies have fewer and fewer manager

2026-04-30·2 мин
LLM
LLM·Habr AI

Bitrix24 showed how to add four automation robots to a business portal

Bitrix24 released the next part of its practical series and showed how to integrate four robots into a business portal: for cleaning phone n

2026-04-30·3 мин
LLM
LLM·Habr AI

Anthropic explained how to build skills for Claude Code and why teams need their own marketplace

Anthropic showed which skills actually work in Claude Code, how to write them without extra noise, and why large teams need their own extens

2026-04-30·3 мин