BLOG

Technical writing on dialogue systems, NLP architecture, and conversational AI from the Equmenopolis engineering team.

March 28, 2026 Engineering

Building Intent Graphs Instead of Intent Lists

Flat intent lists work until users start combining requests. "Book a flight and cancel my hotel" is not two separate utterances - it's a compound intent with shared context. Here's how we represent that as a typed graph.

Read article
March 14, 2026 Performance

How We Keep Context State Updates Under 10ms

Context management has a latency budget. If state operations consume 150ms, you have 50ms left for inference - which forces you to use smaller, less capable models. We describe the state engine design that keeps overhead below 10ms.

Read article