Rohit Bhardwaj

Rohit Bhardwaj is a Director of Architecture working at Salesforce. Rohit has extensive experience architecting multi-tenant cloud-native solutions in Resilient Microservices Service-Oriented architectures using AWS Stack. In addition, Rohit has a proven ability in designing solutions and executing and delivering transformational programs that reduce costs and increase efficiencies.

As a trusted advisor, leader, and collaborator, Rohit applies problem resolution, analytical, and operational skills to all initiatives and develops strategic requirements and solution analysis through all stages of the project life cycle and product readiness to execution.
Rohit excels in designing scalable cloud microservice architectures using Spring Boot and Netflix OSS technologies using AWS and Google clouds. As a Security Ninja, Rohit looks for ways to resolve application security vulnerabilities using ethical hacking and threat modeling. Rohit is excited about architecting cloud technologies using Dockers, REDIS, NGINX, RightScale, RabbitMQ, Apigee, Azul Zing, Actuate BIRT reporting, Chef, Splunk, Rest-Assured, SoapUI, Dynatrace, and EnterpriseDB. In addition, Rohit has developed lambda architecture solutions using Apache Spark, Cassandra, and Camel for real-time analytics and integration projects.

Rohit has done MBA from Babson College in Corporate Entrepreneurship, Masters in Computer Science from Boston University and Harvard University. Rohit is a regular speaker at No Fluff Just Stuff, UberConf, RichWeb, GIDS, and other international conferences.

Rohit loves to connect on http://www.productivecloudinnovation.com.
http://linkedin.com/in/rohit-bhardwaj-cloud or using Twitter at rbhardwaj1.

Presentations

Scaling APIs for Millions of AI-Driven Calls

AI agents are becoming a new class of API consumers. Unlike human users, agents can create bursty traffic, retry aggressively, call multiple tools in parallel, and accidentally amplify downstream failures. A single user request can become a large chain of API calls, model calls, vector searches, database lookups, and workflow events.

This talk explains how to design APIs for this new reality.

We will cover agent-aware rate limiting, budget-aware throttling, backpressure, load shedding, idempotency, deduplication, deterministic caching, async workflows, event-driven APIs, tail-latency SLOs, and cost observability.

Participants will learn how to tag and trace agent traffic, control runaway tool calls, prevent retry amplification, design graceful degradation, and build runbooks for cache storms, retry storms, dependency brownouts, and cost spikes.

The core message:

APIs exposed to AI agents must be contract-safe, retry-safe, cost-aware, observable, and degradation-ready.

Classic API scaling assumed relatively predictable traffic.

AI-driven API traffic is different because:

One prompt can create many downstream API calls.
Agents can retry, loop, and fan out.
Tool-calling creates bursty and non-human traffic patterns.
Cost grows with requests, retries, context size, model calls, and downstream work.
Failures can amplify quickly across gateways, SDKs, queues, databases, and model APIs.

Agenda

Why AI Changes API Scaling
Human traffic versus agent traffic, tool chains, fan-out, retries, and burst patterns.
New Failure Modes
Retry storms, cache-miss storms, malformed tool calls, version drift, DB saturation, and cost spikes.
Traffic Control for AI Agents
Agent-aware rate limits, per-tenant budgets, per-tool quotas, fair queuing, and adaptive backpressure.
Resilience Patterns
Idempotency keys, deduplication, bounded retries, circuit breakers, bulkheads, timeouts, and load shedding.
Caching for AI Workloads
Deterministic-result caching, semantic-aware caching, stale-while-revalidate, negative caching, and cache warming.
Async and Event-Driven APIs
Queue-first design, workflows, webhooks, streaming responses, outbox patterns, and dead-letter handling.
Observability and Cost Governance
Chain IDs, tool IDs, agent IDs, tail-latency SLOs, per-agent cost attribution, anomaly detection, and loop detection.
Runbooks and Readiness
Playbooks for retry storms, cache storms, provider brownouts, cost spikes, and safe degradation.

Your API Is Not Ready for AI (Yet): A Lifecycle Readiness Guide

PIs built for humans often fail when consumed by AI agents. They rely on documentation instead of contracts, return unpredictable structures, and break silently when upgraded. Large Language Models (LLMs) and autonomous agents need something different: machine-discoverable, deterministic, idempotent, and lifecycle-managed APIs.
This session introduces a five-phase API readiness framework—from discovery to deprecation—so you can systematically evolve your APIs for safe, predictable AI consumption. You’ll learn how to assess current APIs, prioritize the ones that matter, and apply modern readiness practices: function/tool calling, schema validation, idempotency, version sunset headers, and agent-aware monitoring.

Problems Solved

LLMs fail due to polymorphic or unpredictable API responses
Agents retry or loop because APIs aren’t idempotent
Ambiguous error messages block autonomous remediation
Silent breaking changes halt long-lived agent integrations
Lack of lifecycle management creates risk and rework

What “AI-Readiness” Means

Machine-Discoverable: APIs described in OpenAPI 3.1 + JSON Schema; self-describing operations and data types.
Deterministic: Same input → same output shape; no hidden conditional payloads.
Idempotent: Safe retries using Idempotency-Key or request signature patterns.
Guardrailed: Strict schema validation, quota enforcement, and prompt-injection defense.
Lifecycle Managed: Semantic versioning, Deprecation/Sunset headers, contract testing, and migration guides.

Common Failure Modes Today

Polymorphic responses that confuse function-calling agents.
Ambiguous errors without remediation guidance.
Non-idempotent endpoints causing duplicate orders or charges.
Hidden side effects undocumented or triggered by retries.
Breaking changes without warning → agents silently fail.

Agenda
Introduction: The Shift from Human → Machine Consumption Why LLMs and agents fundamentally change API design expectations. Examples of human-centric patterns that break agent workflows.
Pattern 1: Assessment & Readiness Scorecard How to audit existing APIs for AI-readiness. Scoring dimensions: discoverability, determinism, idempotency, guardrails, lifecycle maturity. Sample scorecard matrix and benchmark scoring.
Pattern 2: Prioritization Strategy How to choose where to start:

High traffic + high risk first (payments, claims, healthcare, orders)
Partner/customer-facing before internal
Regulated domains (HIPAA, PCI, SOX) before unregulated
Consolidate schema, security, and idempotency changes together
Pattern 3: Five-Phase Readiness Roadmap
Discovery: Audit specs, tag agent traffic, document gaps.
Redesign: Harden schemas, fix errors, add idempotency keys and prompt-injection defenses.
Versioning: Adopt SemVer, support multiple versions, and emit Deprecation/Sunset headers.
Monitoring: Track agent vs human usage, retries, anomalies, cost attribution.
Deprecation: Communicate timelines, throttle old versions, enable fallback modes.
Pattern 4: Security & Guardrails Inject prompt-defense filters at the edge. Schema validation and rate-limiting. Automated regression testing against contract schemas to ensure safety.
Pattern 5: Case Studies
Stripe Idempotency: Eliminating duplicate charges with the Idempotency-Key pattern.
Deprecation Done Right: APIs that use Sunset headers for graceful agent migration.
Agent Tool Example: Mapping operationId=ReserveInventory directly to an LLM tool schema.
Wrap-Up & Discussion Recap of framework and quick wins. Using the Readiness Scorecard and KPI checklist to measure progress from human-centric APIs → agent-ready APIs. Discussion on embedding readiness audits in CI/CD governance.

Key Framework References

OpenAPI 3.1 + JSON Schema: Machine-readable API contracts
FinOps + AI Cost Governance: Tagging and metering agent usage
OWASP LLM Top 10: Prompt-injection and misuse defenses
API Lifecycle Standards: RFC 8594 (Deprecation), RFC 9457 (Sunset Header)
ISO/IEC 38507: Governance implications for AI-integrated systems

Takeaways

API Readiness Scorecard to evaluate current maturity
5-phase modernization roadmap: Discovery → Redesign → Versioning → Monitoring → Deprecation
Checklist + KPIs to align API modernization with AI readiness
Case patterns demonstrating resilient, agent-safe API evolution

Director of Architecture, Expert in cloud-native solutions

Presentations

Scaling APIs for Millions of AI-Driven Calls

Your API Is Not Ready for AI (Yet): A Lifecycle Readiness Guide