Skip to main content
Kayvan Mazaheri
open to opportunities

> Senior backend engineer — shipping AI in production.

Backend systems at scale. AI features that ship.

Senior backend engineer, 7+ years at Cafebazaar (51M users) and Divar (30M). For the last year I've been shipping production AI for B2B SaaS: RAG, agent platforms, the caching-and-fallbacks work that decides if an LLM feature holds up in front of a real customer. I do my best work on systems that have to keep running at 3 AM, and I have strong opinions about observability.

Kayvan Mazaheri
production metrics
users reached 51M+ p99 latency 25ms events/sec 50K/s ad revenue 10×
Career

7+ Years of Building at Scale

From startup foundations to 51M-user platforms

AI & Backend Engineer (Contract)

Independent

Apr 2025 - Present

Independent engagements building production AI for B2B SaaS clients — RAG, agent platforms, multi-tenant retrieval. The caching-and-fallbacks work that decides whether an LLM feature holds up in front of a real customer.

RAG Embeddings Vector Stores Pinecone Elasticsearch

Technical Team Lead

Cafebazaar

Mar 2022 - Apr 2025

Tech lead for the ad platform at Cafebazaar (Iran's largest Android store, 51M users). Owned a 10+ engineer team through a live microservices migration and the systems work that grew ad revenue 10× over two years.

Python Go Kafka Redis Kubernetes

Senior Software Engineer

Cafebazaar

Oct 2020 - Mar 2022

Shipped 15+ high-reliability backend APIs at 51M-user scale, decomposed the legacy payment monolith into independently deployable services, and cut data loss on the stats aggregation service from ~15% to under 1%.

Python Django FastAPI PostgreSQL Redis
Technical Expertise

What I Bring to the Table

Backend and Languages

Python (Django, FastAPI) Go JavaScript / Node.js C++ gRPC, REST, async APIs

AI in Production

RAG over PDF, DOCX, plain text Embeddings and vector stores (Pinecone, Elasticsearch) Citation-anchored retrieval Agent design under predictability constraints LLM API integration (OpenAI, Anthropic, open-source) AI-assisted SDLC (Claude Code, evals, agentic dev loops)

Cloud and Infrastructure

Docker, Kubernetes GCP, AWS CI/CD pipelines Linux / shell

Data and Messaging

PostgreSQL Redis (caching, pub/sub, dedup) Apache Kafka RabbitMQ MongoDB

Observability

Grafana, Prometheus Sentry Distributed tracing Custom telemetry tooling

Engineering Practices

Microservices and service boundaries System design at scale Code review and mentorship Agile / Scrum leadership Unit and integration testing

Also available

Consulting for backend architecture and AI integration

Outside of full-time roles, I take on a small number of engagements with teams who need a senior pair of eyes — usually 1–4 weeks of focused work with written deliverables.

See services

Hiring? Let's talk.

Email is the fastest path — skim the resume first if you want context.