Habr AI→ original

Claude Code and Cursor: how context engineering turns AI memory into a working tool

Development AI assistants still forget everything after closing a chat, so teams waste time repeating the stack, rules, and past solutions. The solution is…

AI-processed from Habr AI; edited by Hamidun News
Claude Code and Cursor: how context engineering turns AI memory into a working tool
Source: Habr AI. Collage: Hamidun News.
◐ Listen to article

AI-assistants for development have become significantly stronger, but they retain a fundamental problem: almost every new session begins with a loss of accumulated context. The model doesn't remember what tech stack the team uses, what conventions are adopted in the project, why the port was moved from 8501 to 8505 long ago, and which components have already proven to be poor choices. As a result, the developer repeatedly reproduces the same initial conditions.

An article on context engineering suggests looking at this not as an inevitable defect, but as an engineering problem: memory can be extracted to a separate layer and made part of the workflow. The key distinction here is between short-term and long-term memory. Short-term memory lives within a single context window: the current chat, open files, recent actions.

Once the session ends, that memory disappears. This is precisely why, without additional configuration, an assistant might first suggest a React dashboard, then after revision switch to Streamlit, and only on the third attempt account for Altair and other requirements. Each such iteration seems like a minor detail, but over distance it becomes a constant "repetition tax".

The more projects and team members there are, the more expensive the absence of stable context becomes. The first and most practical level of solution is explicit rule files within the project. For different tools these can be CLAUDE.

md, AGENTS.md, or directories with rules-files. In them it makes sense to fix the stack, interface and code style conventions, commands for running, testing, and linting, as well as important historical decisions that are difficult to re-explain in each session.

If the team prefers Material icons instead of emojis, wide layout in Streamlit, Altair for charts, and specific caching mechanisms, all of this is better described once next to the code. The advantage of such an approach is that it lives in version control: a new developer clones the repository, and with it the assistant receives a ready-made set of rules of the game. The next layer is global rules that relate not to a specific project, but to the work style of the user or team itself.

Here there is no need for a list of libraries and ports; what matters is to set the response format, code completeness requirements, attitude toward comments, approach to optimization, and choice between brevity and readability. The idea is to separate technical context from behavioral. Technological details should be in the project, while stable preferences should be at the global level.

The article also notes a more portable format for skills, when the assistant is given not only rules of behavior, but also procedures for executing typical tasks. This brings AI-tools closer to the model of working with a real employee: they are given not only briefing materials, but also standard ways to act. More advanced levels are connected with implicit memory and external infrastructure.

This includes systems that collect traces of work themselves: code fragments, IDE activity, browser history, debugging insights, and project patterns. The article mentions examples such as Pieces, Claude Code's auto-memory, and cloud mechanisms like ChatGPT Memory. An important role is beginning to be played by Model Context Protocol, or MCP: it provides a unified way to connect the assistant to external data sources, instead of building unique integrations for each tool.

For teams that need a full-fledged organizational memory layer, there is an even heavier path — services like Mem0, Zep, and Supermemory, or own RAG-infrastructure based on Pinecone and Weaviate. But this is no longer an evening configuration, but a separate engineering system with its own costs for embeddings, search, deduplication, and resolution of contradictions in data. The main conclusion is quite practical: most teams don't need a complex memory stack right away.

It's enough to start with one rules file in the project root, fix the stack, commands and conventions, and then gradually move recurring habits into global settings. If after a productive session you ask the assistant to briefly formulate what it learned, such a knowledge layer will grow without chaos. On April 11, 2026, the author of the article already noted that major players like Anthropic have begun to effectively embed memory into their tools, but the models themselves remain stateless.

This is the main change in perspective: the problem is not that LLMs are "bad", but that context needs to be designed as consciously as code, pipelines, and documentation. Teams that understand this before others will gain not magic, but sustainable acceleration of development.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…