Senior/ Staff Backend

Zyoin Group
Job description
๐Ÿ“ India
๐Ÿ’ผ Full time
๐Ÿ’ฐ Competitive
๐Ÿ“… Posted 27/05/2026

Job description

Inference Optimization Drive TTFT below 400ms for multi-step agent pipelines Streaming optimization: first token to user while sub-agents are still running KV cache strategy, prompt compression, dynamic context window management Multi-provider routing: model selection by latency, cost, and task type across OpenAI, Anthropic, Gemini, and open-weight models Agent Architecture Design and implement Plan-Execute-Synthesize pipelines that run sub-agents in parallel DAGs, not sequential chains Build rโ€ฆ

๐Ÿš€ Apply now for free

Sign up to Kokobeo Jobs in 30 seconds.
AI-written CV, 1-click applications.

Sign up and apply โ†’