01. about
An Engineer & Athlete
6+ years building distributed systems that handle millions of daily requests. I specialize in high-throughput microservices, event-driven architecture, and performance engineering at scale.
Currently at 7-Eleven leading backend infrastructure for the 7NOW food delivery platform. Previously built logistics systems at Amazon impacting hundreds of thousands of sellers across 8 countries. Now targeting Senior Software Engineer roles where I can lead architecture and mentor teams.
MS Computer Science
University of Texas at Dallas · 2021
Microsoft Applied Agentic AI
April 2026
Microsoft AI Engineer Program
August 2026
beyond the code
02. experience
Where I've Worked
03. skills
Technical Arsenal
core competencies
04. insights
Engineering Insights
Learning from systems built at massive scale
OpenAI
source ↗Scaling LLM Inference
key ideas
- ▸LLMs are served through distributed inference clusters
- ▸Requests are batched to maximize GPU utilization
- ▸Scheduling layers prioritize latency-sensitive workloads
- ▸Horizontal scaling handles traffic spikes
Stripe
source ↗Designing Idempotent APIs
key ideas
- ▸APIs must be safe to retry in distributed systems
- ▸Clients send an idempotency key with each request
- ▸The backend stores the request and response for that key
- ▸Retries return the cached response instead of re-executing
Netflix
source ↗Building Resilient Streaming Systems
key ideas
- ▸Microservices communicate through resilient service meshes
- ▸Circuit breakers prevent cascading failures
- ▸Chaos engineering validates system reliability
- ▸Services degrade gracefully instead of failing completely
Uber
source ↗Real-Time Dispatch Architecture
key ideas
- ▸Real-time location streams processed via distributed event pipelines
- ▸Kafka handles high-throughput event ingestion
- ▸Dispatch decisions rely on low-latency geospatial computations
- ▸Systems balance throughput with real-time responsiveness
Cloudflare
source ↗Edge Computing at Global Scale
key ideas
- ▸Edge networks run workloads close to the user
- ▸Global request routing improves latency and reliability
- ▸Stateless services allow horizontal scaling
- ▸Edge workers enable lightweight serverless compute
The Tail at Scale
key ideas
- ▸Slow requests at the tail significantly impact overall user experience
- ▸Even rare slow responses can delay an entire request chain
- ▸Request hedging and replication reduce tail latency
- ▸Optimize for worst-case latency, not just average
05. projects
Things I've Built
Order Orchestration Engine
7-Eleven · 2024
Event-driven state machine orchestrating the full order lifecycle from placement to delivery. Handles 1M+ API requests/day at 99% uptime.
QSR Integration Platform
7-Eleven · 2024
Rearchitected Cart, Checkout, and Order microservices to onboard Quick Service Restaurant chains, unlocking a 10% revenue increase.
Split Payment System
7-Eleven · 2023
SNAP EBT + Cashback + Card split-payment architecture enabling government benefit spending. Doubled average basket size from $30 to $60.
Carrier Selection Engine
Amazon · 2022
Distributed carrier orchestration selecting optimal shipping partners based on cost, SLA, and capacity across 800K+ orders.
Global Seller Onboarding
Amazon · 2022
Multi-country shipping charge configuration system for new marketplace launches. Scaled to 100K+ seller enrollments across 8 countries.
06. contact
Let's Build Something
Open to senior SWE roles and interesting distributed systems challenges. Reach out directly or send a message.
akshaykbkale@gmail.com
linkedin.com/in/akshaykbkale