Current Focus
- ML serving architectures - batching, caching, quantisation and real-world latency tradeoffs
- Evaluation frameworks for AI apps - moving beyond vibes-based testing
- Operational realities of agentic systems beyond the demos