Beyond Stateless Conversations: Adding Long-Term Memory to Your Foundry Agents

If you’ve worked with AI agents in production, you’ve almost certainly run into this problem: a user tells your agent something important in one session, and the next time they come back, the agent has completely forgotten about it. The user has to repeat themselves. Context is lost. The experience feels disjointed. I ran into this exact issue while working on a project where users were interacting with an enterprise agent multiple times throughout the…

Azure AI Model Router: Implementation and Production Patterns

After three months running Model Router in our private ChatGPT app, I figured it was time to share what actually works in production versus what the documentation says should work. Part 1 covered the architecture and decision framework. This post walks through the real implementation – deployment, code, monitoring, and the edge cases that aren’t obvious until you hit them. Fair warning: this gets technical. I’m showing you the .NET code we use, the telemetry patterns…