akshay.dev

// software engineer & athlete

Akshay Kale

6+ years building systems at Amazon & 7-Eleven that handle millions of requests daily.

6+
Years
3+
Open Source Contributions
3
Top Companies

01. about

An Engineer & Athlete

6+ years building distributed systems that handle millions of daily requests. I specialize in high-throughput microservices, event-driven architecture, and performance engineering at scale.

Currently at 7-Eleven leading backend infrastructure for the 7NOW food delivery platform. Previously built logistics systems at Amazon impacting hundreds of thousands of sellers across 8 countries. Now targeting Senior Software Engineer roles where I can lead architecture and mentor teams.

recognition

★  Recognized by 7-Eleven · 2025

Donations · Split Payments · SNAP EBT · QSR Integration · Developer Testing Tools

education

MS Computer Science

University of Texas at Dallas · 2021

certifications
IN PROGRESS

Microsoft Applied Agentic AI

April 2026

Microsoft AI Engineer Program

August 2026

beyond the code

Ultimate Frisbee Athlete

Competed for UTD varsity sports team · 2019

World Championship Coach

Coached India Club Team at World Ultimate Club Championships · Cincinnati, Ohio · 2022

Mountaineer

Basic Mountaineering Course · A Grade · ABVIMAS · Indian Government Institution

02. experience

Where I've Worked

7-Eleven

Software Engineer II

May 2023PresentDallas, TX
  • Owned microservices handling 1M+ API requests/day with 99% availability
  • Rearchitected Cart, Checkout, Order services to integrate QSR → 10% revenue increase
  • Led AWS Lambda → Spring WebFlux on ECS migration → 30% higher throughput
  • Optimized APIs replacing sequential calls with caching/async → 70% latency reduction, 40% more conversions
  • Built event-driven order orchestration state machine
  • Designed split-payment architecture (SNAP EBT, Cashback, Card) → basket size $30→$60
  • Standardized observability via OpenTelemetry, CloudWatch, New Relic
  • Built AI-assisted internal developer tooling
  • Conducted 30+ technical interviews across US and India
tech stack
JavaSpring WebFluxAWS ECSDynamoDBKafkaSNS/SQSTerraformOpenTelemetry

03. skills

Technical Arsenal

core competencies

Distributed SystemsMicroservicesEvent-Driven ArchitecturePerformance OptimizationTechnical LeadershipSystem DesignFault-Tolerant SystemsObservabilityInfrastructure as Code
AI & Agenticlearning
LLM APIsAgentic AIRAGLangChainPrompt EngineeringAzure AIVector DBs
Languagesexperienced
JavaJavaScriptGolangPythonCC++Bash
Cloud & AWSexperienced
LambdaECSDynamoDBSNS/SQSS3KinesisCloudWatchRedshift
Frameworksexperienced
Spring BootSpring WebFluxKafkaReact.jsNode.jsFlask
Databasesexperienced
PostgreSQLCassandraMongoDBDynamoDBRedisSQLNoSQL
DevOpsexperienced
DockerKubernetesTerraformGitMavenGradle

04. insights

Engineering Insights

Learning from systems built at massive scale

Scaling LLM Inference

key ideas

  • LLMs are served through distributed inference clusters
  • Requests are batched to maximize GPU utilization
  • Scheduling layers prioritize latency-sensitive workloads
  • Horizontal scaling handles traffic spikes

↑ hover to expand

These patterns apply to any compute-heavy backend where batching and scheduling improve efficiency — not just AI.

Designing Idempotent APIs

key ideas

  • APIs must be safe to retry in distributed systems
  • Clients send an idempotency key with each request
  • The backend stores the request and response for that key
  • Retries return the cached response instead of re-executing

↑ hover to expand

This pattern prevents duplicate operations like payments or order creation — critical for any financial system.

Netflix

source ↗

Building Resilient Streaming Systems

key ideas

  • Microservices communicate through resilient service meshes
  • Circuit breakers prevent cascading failures
  • Chaos engineering validates system reliability
  • Services degrade gracefully instead of failing completely

↑ hover to expand

Fault tolerance is essential when operating globally distributed systems — design for failure, not against it.

Real-Time Dispatch Architecture

key ideas

  • Real-time location streams processed via distributed event pipelines
  • Kafka handles high-throughput event ingestion
  • Dispatch decisions rely on low-latency geospatial computations
  • Systems balance throughput with real-time responsiveness

↑ hover to expand

Event-driven architecture is essential for real-time systems where milliseconds determine user experience.

Cloudflare

source ↗

Edge Computing at Global Scale

key ideas

  • Edge networks run workloads close to the user
  • Global request routing improves latency and reliability
  • Stateless services allow horizontal scaling
  • Edge workers enable lightweight serverless compute

↑ hover to expand

Moving computation closer to users drastically reduces latency — a core principle of modern distributed systems.

The Tail at Scale

key ideas

  • Slow requests at the tail significantly impact overall user experience
  • Even rare slow responses can delay an entire request chain
  • Request hedging and replication reduce tail latency
  • Optimize for worst-case latency, not just average

↑ hover to expand

At scale, reducing tail latency is critical to consistency. A p99 problem affects millions of users daily.

05. projects

Things I've Built

01

Kodai — Local-First AI Coding Agent

In Progress

Open Source · 2025

A local-first AI coding agent built ground-up on hexagonal architecture — the core knows nothing about any host environment, model provider, or IDE, and communicates with the outside world through typed ports that adapters plug into. Currently available as a VS Code extension, with CLI and additional IDE support planned. Designed to run well on locally-hosted LLMs like Ollama and LM Studio alongside cloud providers, so a developer on a 16GB laptop gets a genuinely useful agent, not a degraded cloud fallback. Multi-model by design: different jobs route to different models — planning to Claude, edits to a fast local coder, embeddings to a tiny embedder. Never silently truncates context, every dangerous action goes through an explicit approval card, and the plugin architecture lets contributors ship new tools, providers, and retrievers without forking.

TypeScriptOllamaLM StudioAnthropic APIVS Code APIHexagonal ArchitectureNode.js
02

VS Code Extension — Function Call Tree Hover

10+ installs

Open Source · 2026

VS Code extension that displays an interactive call tree on hover, recursively showing all functions a given function calls up to a configurable depth. Supports JavaScript, TypeScript, and JSX/TSX via the VS Code Call Hierarchy API.

TypeScriptVS Code APIesbuildD3.jsNode.js
03

Code Understanding Agent

Open Source · 2026

AI-powered CLI and web app that indexes any public GitHub repository and answers natural language questions about the codebase. Uses a multi-strategy retrieval pipeline (semantic search, BM25, graph traversal, RRF reranking) and an agentic reasoning loop with tool-calling to trace logic across files — returning cited, business-level explanations with exact file:line references.

PythonClaude APIChromaDBtree-sitterNetworkXFastAPINext.jsTailwind CSSZustandSentence Transformers
04

QSR Integration Platform

10% revenue ↑

7-Eleven · 2024

Rearchitected Cart, Checkout, and Order microservices to onboard Quick Service Restaurant chains, unlocking a 10% revenue increase.

JavaAWS SNS/SQSREST APIsTerraform
05

Split Payment System

$30 → $60 cart price increase

7-Eleven · 2023

SNAP EBT + Cashback + Card split-payment architecture enabling government benefit spending. Doubled average basket size from $30 to $60.

JavaPayment APIsDynamoDBSpring Boot
06

VS Code Extension — JSON Pro

10+ installs

Open Source · 2026

VS Code extension providing a complete JSON toolbox: formatter, minifier, auto-fixer, diff checker, JSONPath query, stats dashboard, TypeScript interface generator, YAML converter, deep merge, flatten/unflatten, search, sidebar tree view, hover intelligence for Base64/timestamps/dates, and real-time diagnostics with duplicate key detection.

TypeScriptVS Code APIesbuildjsonc-parserNode.js

06. contact

Let's Build Something

Open to senior SWE roles and interesting distributed systems challenges. Reach out directly or send a message.

email

akshaykbkale@gmail.com

linkedin

linkedin.com/in/akshaykbkale