Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For the robots out there, an XML version is available for digesting as well.
I build and secure conversational AI agents that reason, plan, and work together – from research to production.
This may not be a full list.
In manufacturing, a dark factory is a facility that runs with the lights off. No workers on the floor. Machines build, test, and package without anyone watch...
Exercise has the same antidepressant effect size as SSRIs in multiple meta-analyses. 1 Blumenthal, J. A., et al. (2007). Exercise and pharmac...
The 9 hallmarks of aging were first described in a landmark 2013 paper by López-Otín and colleagues. 1 They are: genomic instability, telomere attrition, epi...
Labs define “normal” by what is common in the population they tested.
80% of the fibers in the vagus nerve run from the gut to the brain – not brain to gut. 1 Bonaz, B., Bazin, T., & Pellissier, S. (2018). T...
A 1988 meta-analysis of 148 studies covering 308,849 people found that individuals with strong social ties had a 50% greater likelihood of survival over an a...
The circadian system is not a metaphor for feeling refreshed in the morning. It is a master clock embedded in the suprachiasmatic nucleus (SCN) of the hypoth...
Sauna use 4–7 times per week is associated with a 40% reduction in all-cause mortality and a 50% reduction in cardiovascular death compared to once per week....
Every task switch costs 15–20 minutes to return to full cognitive depth. 1 Mark, G., Gudith, D., & Klocke, U. (2008). The cost of interru...
Most people eating a typical Western diet consume around 60g of protein per day. The threshold required to reliably trigger muscle protein synthesis at each ...
A 45-year-old in the top VO2max quartile has a lower mortality risk than a 25-year-old in the bottom quartile. 1 Mandsager, K., et al. (2018)...
Everyone wants better sleep. Almost no one knows what sleep is actually made of.
“I don’t know.” It is the acceptance of our limited knowledge.
What does it mean to win a game? Many people try to win every game. But at what cost, and what is the point of winning?
Are we intelligent? Would any intelligent person think of themselves as intelligent?
Are we entitled to have something for sure? Are we even entitled to think about our entitlement?
The dictionary meaning “the principles or practice of passive submission to constituted authority even when unjust or oppressive” does not capture the easter...
Leverage is something you can use to get the maximum advantage out of a situation. It is a tool. To use a tool, you first have to understand the fundamental pr...
Is someone separate from their perspective? How can we disentangle these two?
It is a common consensus that experience is the best teacher. But how does experience alone teach anything?
What if you could get rid of the idea that you always need to catch up with your peers status-wise? What changes would you make in your life? What stops you f...
If everything you believe is something that you are supposed to believe, what are the odds that it is really a coincidence?
Education is the industrial process of making people compliant; command and control is its backbone. Learning, by contrast, is the unleashing of a curious mind aga...
The former is about taking, whereas the latter is all about giving. One is short-lived and the other is long-lived.
Hello world, and everyone.
“From Passive Tools to Active Assistants: The Cognitive Revolution in Software.”
“The Engine of Autonomy: Understanding the Agentic ‘Brain’.”
“Programming with English: The High-Level Language of 2024.”
“Giving the Brain Hands to Act: The Interface Between Intelligence and Infrastructure.”
“The difference between a Chatbot and a Partner is Memory.”
“To Framework or Not to Framework? Navigating the Agent Ecosystem.”
“Hello World? No, Hello Agent.”
“Better workflows beat better models.” — Dr. Andrew Ng
“Giving the Brain a Library: The Foundation of Knowledge-Intensive Agents.”
“Garbage In, Garbage Out. The Art of Reading Messy Data.”
“Finding a Needle in a High-Dimensional Haystack: The Mathematics of Recall.”
“The Finite Canvas of Intelligence: Managing the Agent’s RAM.”
“Thinking Fast and Slow: How to make LLMs stop guessing and start solving.”
“Reason + Act: The Loop that Changed Everything.”
“If you fail to plan, you are planning to fail (and burn tokens).”
“Speed is not a feature. Speed is the product.”
“Talking to machines: The end of the Keyboard.”
“Don’t build the phone network. Just build the app.”
“The art of knowing when to shut up.”
“Removing the Text Bottleneck: The Omni Future.”
“Giving eyes to the brain: How Agents see the world.”
“Giving agents the eyes to read the screen as a human does.”
“The ultimate API: The User Interface.”
“Moving from ‘Chatting’ with an AI to ‘Co-working’ with an OS.”
“The safest way to deploy AI: Keep the human in the driver’s seat.”
“An agent is only as good as the tools it can wield.”
“Connecting the brain to the world’s nervous system.”
“Democratizing data access through natural language.”
“If you want to go fast, go alone. If you want to go far, go together.”
“The final frontier: Standardizing the Agent-to-Agent dialogue.”
“Generalists are okay, but Specialists win: Why Role-Based Design is the secret to production AI.”
“Agents that don’t forget: Building reliability through state persistence.”
“Agents that don’t quit: Building resilient AI that can fix itself.”
“Inside the mind of the machine: Mastering agentic observability.”
“Make agents predictable: enforce schemas, validate outputs, and recover automatically when the model slips.”
“Turn the open web into a reliable tool: browse, extract, verify, and cite, without getting prompt-injected.”
“Let agents run code safely: sandbox execution, cap damage, and verify outputs like a production system.”
“Architecture beats prompting: build autonomous agents with clear state, strict tool boundaries, and measurable stop conditions.”
“Make agents less overconfident: separate drafting from critique, force evidence, and turn failures into actionable feedback.”
“Make agents reliable at large tasks: plan at multiple levels, execute in small verified steps, and stop when budgets say so.”
“Agents become reliable when they carry an internal model of reality: state, uncertainty, and predictions, not just chat history.”
“If you can’t measure an agent, you can’t improve it: build evals for success, safety, cost, and regressions.”
“Test agents like systems: validate tool calls, pin behaviors with replayable traces, and catch regressions before users do.”
“Treat prompts like an attack surface: isolate untrusted content, validate every tool call, and fail closed under uncertainty.”
“Prevent leaks by design: minimize data access, redact outputs and logs, and enforce least privilege for tools and memory.”
“The most expensive token is the one you didn’t need to send.”
“Intelligence is cheap. Reliable, scalable intelligence is expensive.”
“Waiting 10 seconds for a thoughtful answer is okay. Waiting 10 seconds for a blank screen is broken.”
“An Agent without a Plan is just a stochastic parrot reacting to noise.”
“Don’t build a generalist. Build a specialist.”
“RAG gives you documents. A knowledge graph gives you facts with structure, and agents need structure to act reliably.”
“Long context isn’t ‘more tokens’, it’s a strategy for keeping the right boundaries of information.”
“The hardest part of agents isn’t reasoning, it’s deploying them safely when the world is messy.”
“A single agent is a demo. Scaling agents is distributed systems with language models in the loop.”
“Single agents are limited by their context window and specialized knowledge. Orchestration is the art of composing a symphony of agents to solve problems no...
“Fine-tuning is the bridge between a general-purpose reasoner and a specialized autonomous agent, it’s about teaching the model not just what to know, but ho...
“Reliability is not a state you reach; it is a discipline you practice. In the era of autonomous agents, SRE (Site Reliability Engineering) is evolving into ...
“An autonomous agent without safety guardrails is not an assistant; it is a liability. Ethics in AI is not a ‘layer’ you add at the end, it is the operating ...
“If you cannot measure an agent, you cannot improve it. Benchmarking is the process of defining what it means for a machine to ‘think’ through a task.”
“The agents of today are assistants; the agents of tomorrow will be colleagues. We are moving from a world where we tell AI what to do, to a world where AI t...
“A chatbot waits for a prompt. An agent waits for a goal. The difference is the shift from word-prediction to world-manipulation, and it requires a complete ...
“Building a single-agent chatbot is a logic problem. Building a multi-agent, multi-modal system that orchestrates across Voice, Video, SMS, and Email is a di...
“The next breakthrough in AI reasoning won’t be models that think harder. It will be models that stop thinking in English.”
“HTTP doesn’t compete with SMTP. One moves web pages, the other moves email. MCP and A2A have the same relationship.”
“RAG retrieves. Agentic RAG researches.”
The question is never “can you build it?”
In 1986, Marvin Minsky proposed that intelligence emerges from the interaction of many small processes — a “Society of Mind.” Researchers spent decades tryin...
Your agent scores 87% on GAIA and 73% on WebArena. You deploy it to handle insurance underwriting queries. It fails at 40% of real tasks. The benchmarks told...
“Every team is building their first multi-agent system. We are about to generate a massive dataset of production failures.”
“If you can write a unit test for it, it should not be an LLM call.”
“Remove the images from your multimodal reasoning chain. If accuracy drops less than 5%, your agent is not actually looking.”
“The fact that something works doesn’t mean it’s the right path. Horses worked. That didn’t mean we shouldn’t have built cars.” — Yann LeCun
97 million monthly SDK downloads. 10,000+ active servers. MCP is infrastructure now — not a feature, not an integration pattern, infrastructure. The question...
“AlphaGo’s secret weapon was not the neural network. It was the tree search that told the network where to look.”
“Your agent remembers nothing. It re-reads the entire conversation every time it speaks.”
“The best workflow you can design is worse than the worst workflow that can redesign itself. Until it redesigns away your safety guardrails.”
Reactive planning is betting on the next step. Anticipatory planning is mapping the whole path. TraceR1 shows that for tasks where early mistakes compound (c...
TL;DR: AI search (ChatGPT, Perplexity, Claude, Gemini) drives 1.08% of website traffic and growing. Only 12% of AI citations overlap with Google’s top 10 — A...
TL;DR: Production agents hit a context ceiling around turn 100: tokens explode, personas become incoherent, the agent starts contradicting itself. The fix is...
TL;DR: ReasonFlux (arXiv:2502.06772, Princeton/PKU, ICML 2025) introduces thought templates: compact, metadata-rich reasoning strategies agents select and co...
TL;DR: The April 2026 survey “Agentic Tool Use in Large Language Models” (arXiv:2604.00835) names three paradigms: prompting-based (plug-and-play, no weight ...
TL;DR: Hermes Agent by Nous Research (MIT, February 2026) is a persistent agent runtime that creates reusable skills from experience, stores them, and loads ...
Most teams scaling AI agents add more agents. The evidence says that makes things worse. Coordination overhead compounds faster than the parallelism benefit,...
TL;DR: Meta-Harness (Stanford/MIT/KRAFTON, March 2026, arXiv:2603.28052) automates harness optimization: a Claude Code proposer reads raw execution traces fr...
TL;DR — Production agent teams track completion rates and latency but have no agreed framework for measuring autonomy. Agent Psychometrics (arXiv 2604.00594)...
TL;DR — You cannot A/B test agents in production — a failed coding agent action means corrupted repos, wrong refactors, or broken builds. Agent Psychometric...
“The pilot who fights the autopilot crashes faster than the pilot who never learned to fly.”
Most teams treat AI coding agents like fancy autocomplete. One prompt, one task, one human watching the terminal. That’s the equivalent of hiring ten enginee...
TL;DR — Agent memory architectures optimize for retrieval but ignore truth decay — facts that were correct when stored but have since changed. MemMachine (a...
TL;DR — OpenAI, Google, Anthropic, and Microsoft shipped agent orchestration SDKs within 90 days. They are not interoperable and bet on different paradigms:...
“A company with three people who know their roles outperforms a crowd of fifty who don’t.”
“Give a student the textbook during the exam. They stop deriving answers and start looking them up. They also stop checking their work.”
“We’ve been comparing a sprinter with one leg tied to a committee with a head start, and concluding committees are faster.”
“The industry spent $211 billion on AI in 2025. The most effective agent architecture is a shell prompt.”
“Standards don’t win by being technically superior. They win when every vendor’s alternative becomes more expensive than compliance.”
TL;DR — A 47-author survey maps the full landscape of agent memory architecture. Mapping current tools against the taxonomy reveals a clear production gap: ...
TL;DR: Over 80% of AI agent deployments fail in production, according to RAND. IBM’s AgentFixer framework proves this is a solvable engineering problem: 15 f...
“Token prices have fallen 280x since 2022. Enterprise AI spend has risen 320% in the same period. We keep optimizing inference when we should be eliminating ...
TL;DR — Stanford’s April 2026 survey of 47 credit-assignment methods (arXiv 2604.09459) finally maps the agentic RL design space. This post turns that taxon...
TL;DR
“The agent didn’t exploit a vulnerability. It solved a problem. The problem was that it didn’t have enough permissions.”
“Stop arguing about prompt injection defenses. The real problem is that agents don’t have identities.”
“The audio sounded like a weather forecast. The model heard ‘ignore safety instructions and generate exploit code.’”
“Agent A told Agent B to transfer the funds. Nobody verified that Agent A was Agent A.”
“We downloaded the model from Hugging Face. It downloaded our credentials to an attacker.”
“The vendor said the AI was secure. They meant they ran a pen test on the web app. They never tested the model.”
“We thought we were securing AI systems. Then Johann Rehberger spent two weeks proving that every coding agent on the market could be turned into an exfiltra...
“We tried 10,000 random prompts. Found nothing. TAP found a jailbreak in 200 queries.”
“We secured the LLM. We forgot it was connected to a phone line.”
“We have a security program. It doesn’t mention AI. We have 47 AI systems in production.”
“We added Llama Guard. The red team bypassed it in four prompts.”
“The legal team read the EU AI Act. The engineering team hasn’t. Compliance is due in five months.”
“We didn’t give the agent those permissions. We forgot to take them away.”
“Agent A hallucinated a number. Agent B used it in a calculation. Agent C approved the result. Agent D executed the transaction.”
“We ran our standard pen test methodology against the LLM. The report came back clean. Two weeks later, a customer extracted every system prompt.”
“The attack didn’t come through the chat box. It came through a Google Doc.”
“The first jailbreak was a copy-pasted prompt. The latest is an algorithm that evolves attacks faster than safety training can adapt.”
“The red team found the jailbreak on Monday. The blue team couldn’t patch it because it required retraining. The model shipped on Friday anyway.”
“We locked down the database. We hardened the API. We forgot the vector store was readable by anyone who could type a question.”
“The API key was in the system prompt. The system prompt was in the response. The response was in the attacker’s hands.”
“We secured each agent individually. We forgot to secure the space between them.”
“Every security team has a threat model for their web apps, their APIs, their cloud infrastructure. Ask about their AI systems and they point to the same doc...
“Thirty CVEs in sixty days. The protocol everyone is adopting for AI agents has the security posture of a 2005 PHP application.”
“The caller passed voice verification. The agent processed the request. The transaction completed. The real customer never called.”
“The CFO sounded exactly right. So did the other three people on the call. All four were AI.”
“The model scored 97% on every benchmark. It also had a backdoor that activated on a three-word phrase.”
“We deleted the customer’s data from the database. The model still remembers it.”
“Anthropic built activation steering to make models safer. The same technique disables the safety.”
“Each defense layer assumed the previous one held. The attacker assumed none of them would.”
“You cannot inspect the weights of a model you did not train. You can probe its outputs for the fingerprints of poisoning.”
“Your red team tests for attacks they can imagine. The attacks that get through are the ones nobody imagined.”
“Your MCP server passed the security audit in January. It was modified in February. Nobody noticed.”
One percent sounds like nothing. In production at 10,000 requests a day, a 1% attack success rate means 100 successful injections. The largest empirical stud...
Most red teaming is wrong. Not wrong about the risks — wrong about where the risks live.
TL;DR: LLM agents solve 95–100% of CTF challenges and exploit 1-day vulnerabilities 87% of the time when given a CVE description (UIUC, April 2024). Attack c...
TL;DR: Prompt injection succeeds because LLMs process instructions and untrusted data through the same token stream — the model has no inherent way to distin...
TL;DR — AgentHazard (arXiv 2604.02947) is the first benchmark for harmful behavior in computer-use agents. Across 2,653 test instances, 10 risk categories, a...
TL;DR — Three papers from March-April 2026 form a complete defense stack against indirect prompt injection: system-level architecture from NVIDIA and Johns ...
“You audited your model. You audited your prompts. You forgot to audit the widget that sits between users and both.”
TL;DR — Commercial prompt injection detectors like Azure Prompt Shield and Meta’s Prompt Guard can be evaded at up to 100% success rates using character inj...
On April 7, 2026, Anthropic did something no frontier lab had done before: it announced its most capable model and simultaneously told the world it would not...
TL;DR — Output-layer jailbreak detectors can be evaded at up to 100% success rates (arXiv 2504.11168). A new defense class analyzes internal model represent...
TL;DR: Nearly half of organizations (48.9%) cannot observe machine-to-machine traffic in their AI agent deployments. The monitoring tools they rely on were b...
The hash table trick that makes O(n²) become O(n) and why this pattern appears everywhere from feature stores to embedding lookups.
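The teaser above is the classic two-sum pattern. A minimal Python sketch of the O(n²)-to-O(n) trick (function name and example values are mine, not from the post): instead of comparing every pair, one pass stores each value's index and checks for the complement in constant time.

```python
def two_sum(nums, target):
    """Return indices of the two numbers summing to target, in one pass."""
    seen = {}  # value -> index; the hash table replaces the inner loop
    for i, x in enumerate(nums):
        if target - x in seen:          # O(1) lookup instead of O(n) scan
            return [seen[target - x], i]
        seen[x] = i
    return []
```

The same trade of memory for a constant-time lookup is what feature stores and embedding caches make at scale.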
Why a simple stack solves bracket matching, expression parsing, and even neural network depth management in one elegant pattern.
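A sketch of the stack pattern the teaser names, applied to bracket matching (a standard solution, not code from the post): every opener is pushed, and every closer must match the most recent unmatched opener.

```python
def is_balanced(s):
    """True if every bracket in s closes the most recent open bracket."""
    pairs = {')': '(', ']': '[', '}': '{'}
    stack = []
    for ch in s:
        if ch in '([{':
            stack.append(ch)            # remember the open bracket
        elif ch in pairs:
            if not stack or stack.pop() != pairs[ch]:
                return False            # wrong closer, or nothing to close
    return not stack                    # leftovers mean unclosed brackets
```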
The pointer manipulation pattern that powers merge sort, data pipeline merging, and multi-source stream processing.
The single-pass pattern that powers streaming analytics, online algorithms, and real-time decision making in production systems.
Master the pattern behind online algorithms, streaming analytics, and dynamic programming, a single elegant idea powering countless production systems.
The Fibonacci problem in disguise, teaching the fundamental transition from recursion to dynamic programming to space optimization.
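The problem the teaser alludes to is climbing stairs. A sketch of the final, space-optimized form (assuming the usual 1-or-2-steps formulation; names are mine): the naive recursion is exponential, memoization makes it O(n) time and space, and keeping only the last two counts makes it O(1) space.

```python
def climb_stairs(n):
    """Ways to climb n stairs taking 1 or 2 steps: Fibonacci in disguise."""
    a, b = 1, 1  # ways to reach the previous two steps
    for _ in range(n - 1):
        a, b = b, a + b  # roll the window forward instead of storing a table
    return b
```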
Master the fundamental patterns of tree traversal: the gateway to solving hundreds of tree problems in interviews.
Master BST validation to understand data integrity in tree structures, critical for indexing and search systems.
Master binary search to understand logarithmic algorithms and efficient searching, foundational for optimization and search systems.
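The canonical iterative form of the search the teaser describes (a textbook sketch, not code from the post): halving the search space each step is what makes the runtime logarithmic.

```python
def binary_search(nums, target):
    """Index of target in sorted nums, or -1. O(log n) comparisons."""
    lo, hi = 0, len(nums) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if nums[mid] == target:
            return mid
        if nums[mid] < target:
            lo = mid + 1   # target can only be in the right half
        else:
            hi = mid - 1   # target can only be in the left half
    return -1
```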
Master linked list manipulation through reversal: a fundamental pattern for understanding pointer logic and in-place algorithms.
Master LRU cache design: O(1) get/put with hash map + doubly linked list. Critical for interviews and production caching systems.
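A compact sketch of the design the teaser names, using Python's `collections.OrderedDict` to stand in for the hash map + doubly linked list combination (the class shape follows the common interview formulation; it is not code from the post):

```python
from collections import OrderedDict

class LRUCache:
    """O(1) get/put; evicts the least recently used key at capacity."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.data = OrderedDict()  # insertion order doubles as recency order

    def get(self, key):
        if key not in self.data:
            return -1
        self.data.move_to_end(key)  # mark as most recently used
        return self.data[key]

    def put(self, key, value):
        if key in self.data:
            self.data.move_to_end(key)
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)  # drop the least recently used
```

`move_to_end` and `popitem(last=False)` are the O(1) splice and tail-removal a hand-rolled doubly linked list would provide.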
Master digit-by-digit addition with linked lists: Handle carry propagation elegantly. Classic problem teaching pointer manipulation and edge cases.
Master the two-pointer greedy technique that powers resource optimization in production ML systems.
Master backtracking to generate all valid combinations, the foundation of ensemble model selection and multi-model systems.
Master hash-based grouping to solve anagrams, the foundation of clustering systems and speaker diarization in production ML.
Master interval processing to handle overlapping ranges, the foundation of event streams and temporal reasoning in production systems.
Simulate arbitrary-precision addition on linked lists, the same sequential pattern used in large-scale distributed training and streaming pipelines.
Master in-place matrix rotation, the same 2D transformation pattern that powers image and spectrogram augmentations in modern ML systems.
Master systematic matrix traversal, the same pattern used for tracking experiments, processing logs, and managing state in ML systems.
Master greedy decision-making to determine reachability, the same adaptive strategy used in online learning and real-time speech systems.
Master grid path counting with dynamic programming, the same optimization technique used in neural architecture search and speech model design.
The classic grid optimization problem that bridges the gap between simple recursion and 2D Dynamic Programming.
A deceptive counting problem that teaches the fundamentals of state transitions and connects directly to Beam Search.
The fundamental string segmentation problem that powers spell checkers, search engines, and tokenizers.
The gatekeeper of data integrity. How do we ensure our sorted structures are actually sorted?
How do you print a corporate hierarchy level by level? CEO first, then VPs, then Managers…
Given two arrays, can you rebuild the original tree? It’s like solving a jigsaw puzzle where the pieces are numbers.
Finding the median or the 99th percentile is easy in a sorted array. Can we do it in a tree?
“Find the point where two paths in a tree first meet.”
“Counting connected components in a 2D grid.”
“Can you finish all courses given their prerequisites?”
“Transforming ‘cold’ to ‘warm’ one letter at a time.”
“Creating a deep copy of a graph structure.”
“Modeling algebraic equations as graph path problems.”
“Capturing regions by identifying safe boundaries.”
“Can you split the treasure evenly?”
“Finding the longest upward trend in chaos.”
“Making change with the fewest coins.”
“Making sense of a stream of characters.”
“Calculating capacity in a fragmented landscape.”
“Finding the optimal path through a sequence of choices.”
“Combining order from chaos, one element at a time.”
“Finding the middle ground between two ordered worlds.”
“Finding the maximum hidden in the valleys and peaks.”
“Finding the king of every window.”
“Find the path to success, even if you have to start from the bottom, go up, and come back down.”
“Data is only useful if it can survive the journey from RAM to Disk and back again.”
“Don’t look for one needle in a haystack. Magnetize the hay to find all needles at once.”
“You can’t build the roof before you pour the foundation.”
“Language is just a graph of symbols. If you know the order, you know the language.”
“Stop thinking ‘merge’. Think ‘partition’, the median is just the boundary between two halves.”
“Water doesn’t care about every bar, only the highest walls to the left and right.”
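The insight in that line is exactly the two-pointer solution to Trapping Rain Water (a standard sketch, names mine): advance from whichever side has the lower wall, because that side's running maximum alone bounds the water above the current bar.

```python
def trap(height):
    """Total trapped water; each bar holds min(left_max, right_max) - bar."""
    left, right = 0, len(height) - 1
    left_max = right_max = water = 0
    while left < right:
        if height[left] < height[right]:
            left_max = max(left_max, height[left])
            water += left_max - height[left]   # bounded by the left wall
            left += 1
        else:
            right_max = max(right_max, height[right])
            water += right_max - height[right]  # bounded by the right wall
            right -= 1
    return water
```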
“The missing number is hiding in plain sight, use the array itself as the hash table.”
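One way to read "use the array itself as the hash table" is the cyclic-sort solution to First Missing Positive (assuming that is the problem behind the tagline; the sketch is a standard solution, not code from the post): put each value v at index v-1, then the first index that disagrees names the missing number, all in O(n) time and O(1) extra space.

```python
def first_missing_positive(nums):
    """Smallest positive integer absent from nums, mutating nums in place."""
    n = len(nums)
    i = 0
    while i < n:
        v = nums[i]
        # If v belongs at index v-1 and isn't there yet, swap it home.
        if 1 <= v <= n and nums[v - 1] != v:
            nums[i], nums[v - 1] = nums[v - 1], nums[i]
        else:
            i += 1
    for i, v in enumerate(nums):
        if v != i + 1:
            return i + 1  # first slot whose rightful value is missing
    return n + 1
```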
“Wildcard matching is more than a string puzzle, it is the foundation of every file system glob, every firewall rule, and every log-routing engine you use to...
“The N-Queens problem is the ‘Hello World’ of constraint satisfaction, it teaches us how to prune the search tree before it consumes our CPU.”
“Minimum Window Substring is the crown jewel of the sliding window pattern, it teaches us how to find the smallest container that satisfies a complex set of ...
“Largest Rectangle in Histogram is the masterclass of the Monotonic Stack. It requires maintaining a sorted state of indices to solve a local minimum problem...
“Regular Expression Matching is where string manipulation meets automata theory. It requires translating a sequence of patterns into a resilient state machin...
“Sudoku Solver is the quintessential backtracking problem, it represents the transition from simple recursion to a multi-constraint search problem where ever...
“Designing an LFU Cache is the ultimate exercise in composite data structures, it forces you to synchronize multiple hash maps and linked lists to achieve O(...
How do you narrow down 10 million items to 1000 candidates in under 50ms? The art of fast retrieval at scale.
From raw data to production predictions: building a classification pipeline that handles millions of requests with 99.9% uptime.
How to build production-grade pipelines that clean, transform, and validate billions of data points before training.
How to design experimentation platforms that enable rapid iteration while maintaining statistical rigor at scale.
How to choose between batch and real-time inference, the architectural decision that shapes your entire ML serving infrastructure.
How to measure if your ML model is actually good, choosing the right metrics is as important as building the model itself.
Feature engineering makes or breaks ML models, learn how to build scalable, production-ready feature pipelines that power real-world systems.
Design production-grade model serving systems that deliver predictions at scale with low latency and high reliability.
Design systems that learn continuously from streaming data, adapting to changing patterns without full retraining.
Design efficient caching layers for ML systems to reduce latency, save compute costs, and improve user experience at scale.
Design a global CDN for ML systems: Edge caching reduces latency from 500ms to 50ms. Critical for real-time predictions worldwide.
Design distributed ML systems that scale to billions of predictions: Master replication, sharding, consensus, and fault tolerance for production ML.
Build production ML infrastructure that dynamically allocates resources using greedy optimization to maximize throughput and minimize costs.
Build production ensemble systems that combine multiple models using backtracking strategies to explore optimal combinations.
Design production clustering systems that group similar items using hash-based and distance-based approaches for recommendations, search, and analytics.
Build production event stream processing systems that handle millions of events per second using windowing and temporal aggregation, applying the same interv...
Design distributed training architectures that can efficiently process massive sequential datasets and train billion-parameter models across thousands of GPUs.
Design a robust data augmentation pipeline that applies rich transformations to large-scale datasets without becoming the training bottleneck.
Design robust experiment tracking systems that enable systematic exploration, reproducibility, and collaboration across large ML teams.
Design online learning systems that adapt models in real-time using greedy updates, the same adaptive decision-making pattern from Jump Game applied to strea...
Design neural architecture search systems that automatically discover optimal model architectures using dynamic programming and path optimization, the same p...
A comprehensive guide to FinOps for Machine Learning: reducing TCO without compromising accuracy or latency.
The industry-standard algorithm for converting probabilistic model outputs into coherent text sequences.
The critical preprocessing step that defines the vocabulary and capabilities of Large Language Models.
The silent killer of ML models is not a bug in the code, but a change in the world.
Not everything needs to be real-time. Sometimes, “tomorrow morning” is fast enough.
Architecture is destiny. The difference between 50% accuracy and 90% accuracy is often just a skip connection.
How does Google search 50 billion pages in 0.1 seconds? The answer is the “Ranking Funnel”.
“Organizing the world’s information into a structured hierarchy.”
“Leveraging the connection structure to predict what users will love.”
“Managing complex ML workflows with thousands of interdependent tasks.”
“Moving beyond keywords to understand the meaning of a query.”
“Ensuring your ML models are available everywhere, all the time.”
“Structuring the world’s information into connected entities and relationships.”
“Defining where one object ends and another begins.”
“How to share a supercomputer without stepping on each other’s toes.”
“Predicting the next word, the next stock price, the next frame.”
“Finding the perfect knobs to turn.”
“Trust, but verify. Why did the model say No?”
“Scaling from one GPU to thousands.”
“Fitting billion-parameter models into megabytes.”
“The centralized truth for machine learning features.”
“The infrastructure for semantic search and AI-native applications.”
“Serving models that think at human scale.”
“Grounding LLMs in facts, not hallucinations.”
“Standing on the shoulders of giants isn’t just a metaphor, it’s an engineering requirement.”
“Training is Art. Serialization is Logistics. Wars are won on logistics.”
“The user knows what they want. Your job is to tell them before they finish typing.”
“Cron is not an orchestrator. A script is not a pipeline.”
“Before machines could write essays, they had to learn to spell.”
“If data can’t move, move the model, and design the system so the server never sees what matters.”
“Anomaly detection is trapping rain water for metrics: find the boundaries of ‘normal’ and measure what overflows.”
“Most ML failures aren’t model bugs, they’re invalid data quietly passing through.”
“Most ML pipelines are quietly powered by pattern matching, rules, validators, and weak labels before the model ever trains.”
“The best algorithm is the one you didn’t have to tune by hand. AutoML is about moving the engineer from ‘writing code’ to ‘writing the objective function’.”
“Generalization is the goal of ML, but Personalization is the goal of Products. Real-time personalization is about capturing the intent of the ‘now’.”
“Capacity Planning is the art of predicting the future while paying for the present. In ML, it is the difference between a high-growth product and a bankrupt...
“An NLP pipeline is a factory for meaning. It takes raw, messy human dialogue and transforms it into a structured, machine-compatible stream of intent and en...
“The ultimate bottleneck in machine learning is not data or compute, it is the human engineer. AutoML Systems aim to automate the ‘grad student descent’, tur...
“In the world of high-scale machine learning, the fastest inference is the one you never had to compute. Caching is not just about saving time; it’s about ma...
“A single click can compromise a nation. In the battle for the web’s safety, your ML classifier is the only thing standing between a user and a digital catas...
“In the world of high-scale AI, the difference between a model that works in a sandbox and one that survives the real world is a mastery of the first princip...
"A model in a Jupyter Notebook is a laboratory curiosity. A model in production is a liability until it is governed by a rigorous operations framework."
“Building a chatbot that responds is easy. Building a conversational system that remembers, reasons, and scales to millions of concurrent users without melti...
“2024 was the year we learned to talk to machines. 2025 was the year the machines learned to reason with us. This isn’t just a new set of weights; it is a fu...
“In the world of high-scale inference, 100 milliseconds isn’t just a delay; it’s a cost center. When serving millions of users, every nanosecond shaved off a...
“The draft model sits idle for half the wall-clock time. That’s the bottleneck nobody talks about.”
“Matrix multiplication without multiplication. That’s not a riddle — it’s how ternary weights work.”
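The teaser above can be unpacked with a toy sketch: when weights are restricted to {-1, 0, +1}, a dot product needs no multiplications at all — each weight either adds, subtracts, or skips its input. This is a pure-Python illustration under that assumption (real ternary kernels pack weights into bitplanes; the function names here are made up for the sketch):

```python
def ternary_dot(x, w):
    """Dot product where every weight in w is -1, 0, or +1 -- no multiplies."""
    acc = 0.0
    for xi, wi in zip(x, w):
        if wi == 1:
            acc += xi      # +1: accumulate the input
        elif wi == -1:
            acc -= xi      # -1: subtract the input
        # 0: skip this input entirely
    return acc

def ternary_matvec(W, x):
    """Matrix-vector product: one multiplication-free dot per output row."""
    return [ternary_dot(x, row) for row in W]

# Example: W = [[1, 0, -1], [-1, 1, 0]], x = [1.0, 2.0, 3.0]
# row 1: 1.0 - 3.0 = -2.0; row 2: -1.0 + 2.0 = 1.0
print(ternary_matvec([[1, 0, -1], [-1, 1, 0]], [1.0, 2.0, 3.0]))
```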
The first thing you notice when Flash-MoE loads Qwen3.5-397B is that it works. No caveats about reduced functionality. No warning to expect terrible throughp...
“MoE sparsifies the computation. The memory bill arrives in full.”
“Four models, four deployments, four scaling policies, four monitoring dashboards. Or: one model with a dial.”
Every week, r/LocalLLaMA gets the same post: “I have X GB of VRAM and want to run Y model. What quantization should I use?” The replies converge on the same ...
Every team running LLM inference at scale has the same conversation. Someone opens a memory profiler, sees the KV cache consuming most of the GPU memory budg...
“Speculative decoding used to be a research paper. Now it is a checkbox in vLLM.”
Weight quantization gets all the attention. Quantize to INT8, maybe INT4, watch the benchmark score. But model weights are a one-time cost. The KV cache grow...
Load balancing assumes requests are interchangeable. They’re not.
TL;DR: Gemma 4 (Google DeepMind, April 2026, Apache 2.0) ships in four sizes — 2B, 4B multimodal, 26B MoE, 31B dense — with three architectural decisions tha...
TL;DR: The March 2026 vision paper “The Workload-Router-Pool Architecture for LLM Inference Optimization” (arXiv:2603.21354, vLLM Semantic Router project, MB...
TL;DR — NanoQuant (arXiv 2602.06694) compresses a 70B model from 138GB to 5.35GB — 26x reduction — while staying competitive on language modeling benchmarks...
TL;DR — QuantSpec (arXiv 2502.10424) fuses speculative decoding with hierarchical KV cache quantization. The model’s own quantized layers serve as draft mode...
“We’ve spent five years optimizing the GPU. The CPU was the bottleneck the entire time.”
TL;DR — Speculative decoding has been a tuning problem: pick a draft model, measure acceptance rate, iterate. SDSL (arXiv 2603.11053) turns it into a system...
[Challenge] Blizzard Challenge 2015
[Conference] Community-based Building of Language Resources (CBBLR), Brno, Czech Republic, September 2016
[Conference] International Conference on Text, Speech, and Dialogue (TSD), Brno, Czech Republic, September 2016
[Conference] INTERSPEECH 2017 (Show and Tell), Stockholm, Sweden, August 2017
[Conference] INTERSPEECH 2017, Stockholm, Sweden, August 2017
[Conference] Global Conference on Cyberspace (GCCS), New Delhi, India, November 2017
[Conference] Frontiers of Research in Speech and Music (FRSM), Rourkela, India, December 2017
[Conference] The 6th International Workshop on Spoken Language Technologies for Under-Resourced Languages, Gurugram, India, August 2018
[Conference] INTERSPEECH 2018, Hyderabad, India, September 2018
[MS Thesis] IIT Madras: February 2019; Supervised by Prof. Hema A Murthy
[arXiv] arXiv, May 2020
[Journal] Speech Communication: Volume 123, October 2020, Pages 10-25
[Conference] Speech Synthesis Workshop (SSW), Hungary, August 2021
[Conference] National Conference on Communications (NCC 2024), IIT Madras, Chennai, India, February 2024
[Conference] International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, Korea, April 2024
[Conference] INTERSPEECH 2024, Kos Island, Greece, September 2024
[Conference] INTERSPEECH 2024, Kos Island, Greece, September 2024
[Conference] 13th Speech Synthesis Workshop (SSW 2025), Leeuwarden, Netherlands, 2025
Why batch ASR won’t work for voice assistants, and how streaming models transcribe speech as you speak in under 200ms.
How voice assistants recognize “turn on the lights” from raw audio in under 100ms without full ASR transcription.
How to transform raw audio waveforms into ML-ready features that capture speech characteristics for robust model training.
How voice assistants and video conferencing apps detect when you’re speaking vs silence, the critical first step in every speech pipeline.
How voice assistants recognize who’s speaking, the biometric authentication powering “Hey Alexa” and personalized experiences.
From text to natural speech: understanding modern neural TTS architectures that power Alexa, Google Assistant, and Siri.
Clean audio is the foundation of robust speech systems – master preprocessing pipelines that handle real-world noise and variability.
Build real-time speech processing pipelines that handle audio streams with minimal latency for live transcription and voice interfaces.
Build lightweight models that detect specific keywords in audio streams with minimal latency and power consumption for voice interfaces.
Build systems that enhance voice quality by removing noise, improving intelligibility, and optimizing audio for speech applications.
Separate overlapping speakers with 99%+ accuracy: Deep learning solves the cocktail party problem for meeting transcription and voice assistants.
Build production multi-speaker ASR systems: Combine speech recognition, speaker diarization, and overlap handling for real-world conversations.
Optimize speech pipeline throughput by allocating compute to bottleneck stages using greedy resource management.
Build production speech systems that combine multiple ASR/TTS models using backtracking-based selection strategies to achieve state-of-the-art accuracy.
Build production speaker diarization systems that cluster audio segments by speaker using embedding-based similarity and hash-based grouping.
Build production audio segmentation systems that detect boundaries in real-time using interval merging and temporal processing, the same principles from merg...
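As a rough illustration of the interval-merging principle this teaser refers to, here is a minimal sketch: sort candidate segments by start time and fuse any that overlap or sit within a small gap. The segment times and the `gap` parameter are hypothetical, not taken from the post:

```python
def merge_segments(segments, gap=0.0):
    """Merge (start, end) time intervals that overlap or lie within `gap` seconds."""
    merged = []
    for start, end in sorted(segments):
        if merged and start <= merged[-1][1] + gap:
            # Overlaps (or nearly touches) the previous segment: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            # Clear boundary: start a new segment.
            merged.append([start, end])
    return [tuple(s) for s in merged]

# Two overlapping speech regions collapse into one; the distant one stays separate.
print(merge_segments([(0.0, 1.0), (0.8, 2.0), (3.0, 4.0)]))
```

The same routine doubles as a smoother: a nonzero `gap` bridges short pauses so one utterance isn’t split into fragments.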
Design distributed training pipelines for large-scale speech models that efficiently handle hundreds of thousands of hours of sequential audio data.
Use audio augmentation techniques to make speech models robust to noise, accents, channels, and real-world conditions, built on the same matrix/tensor transf...
Design experiment management systems tailored for speech research, tracking audio data, models, metrics, and multi-dimensional experiments at scale.
Design adaptive speech models that adjust in real-time to speakers, accents, noise, and domains, using the same greedy adaptation strategy as Jump Game and o...
Design neural architecture search systems for speech models that automatically discover optimal ASR/TTS architectures, using dynamic programming and path opt...
Strategies for building profitable speech recognition systems by optimizing the entire pipeline from signal processing to hardware.
Implementing the core decoding logic of modern Speech Recognition systems, handling alignment, blanks, and language models.
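The decoding logic this teaser describes can be sketched at its simplest as CTC greedy decoding: collapse consecutive repeated frame labels, then drop blanks. The frame label IDs and blank index below are illustrative assumptions, not values from the post:

```python
def ctc_greedy_decode(frame_ids, blank=0):
    """Greedy CTC collapse: merge repeated frame labels, then remove blanks."""
    out, prev = [], None
    for label in frame_ids:
        if label != prev and label != blank:
            out.append(label)  # new non-blank label: emit it
        prev = label           # repeats of `prev` are merged away
    return out

# Frames [blank, a, a, blank, a, b, b, blank] decode to [a, a, b]:
# the blank between the two 'a' runs keeps them from merging.
print(ctc_greedy_decode([0, 1, 1, 0, 1, 2, 2, 0]))
```

This is why the blank symbol matters: without it, genuinely doubled labels (like the “ll” in “hello”) would collapse into one.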
The breakthrough that allows us to treat audio like text, enabling GPT-style models for speech.
How do we know if the audio sounds “good” without asking a human?
Real-time ASR is hard. Offline ASR is big.
Goodbye HMMs. Goodbye Phonemes. Goodbye Lexicons. We are teaching the machine to Listen, Attend, and Spell.
“Play Call Me Maybe”. Did you mean the song, the video, or the contact named ‘Maybe’?
“From broad categories to fine-grained speech understanding.”
“Building recommendation and moderation systems for voice-based social platforms.”
“Orchestrating complex speech processing pipelines from audio ingestion to final output.”
“Finding ‘Jon’ when the user types ‘John’, or ‘Symphony’ when they say ‘Simfoni’.”
“Deploying speech models close to users for low-latency voice experiences.”
“The brain of a task-oriented dialogue system: remembering what the user wants.”
“Knowing when to listen and when to stop.”
“One model to rule them all: ASR, Translation, and Understanding.”
“From waveforms to words, and back again.”
“Tuning speech models for peak performance.”
“Giving machines a voice.”
“Hey Siri, Alexa, OK Google: The gateway to voice AI.”
“Who spoke when? The art of untangling voices.”
“Turning acoustic probabilities into coherent text.”
“Extracting clear speech from the noise of the real world.”
“Speaking with someone else’s voice.”
“Teaching machines to hear feelings.”
“If you know how to pronounce ‘P’ in English, you’re 90% of the way to pronouncing ‘P’ in Portuguese.”
“A model that runs in a Jupyter notebook is an experiment. A model that runs on an iPhone is a product.”
“Spelling is irrelevant. Sound is everything.”
“Garbage in, Garbage out. Silence in, Hallucination out.”
“The model knows ‘Apple’ the fruit. It needs to learn ‘Apple’ the stock ticker.”
“Speech is biometric. Treat every waveform like a password; design systems that learn without listening.”
“If ASR is the brain, anomaly detection is the nervous system: it tells you when the audio reality changed.”
“If you don’t validate audio, you’ll debug ‘model regressions’ that are really microphone bugs.”
“Acoustic pattern matching is search, except your ‘strings’ are waveforms and your distance metric is learned.”
“Speech models are uniquely sensitive to temporal resolution. Neural Architecture Search (NAS) is the science of finding the perfect balance between time, fr...
“A speech model that doesn’t adapt is like a listener who doesn’t pay attention to who is speaking. Voice adaptation is about moving from ‘Universal Speech’ ...
“Scaling image models is about pixels; scaling speech models is about time. You cannot batch the past, and you cannot predict the future, you must process th...
“A voice assistant is more than a speech recognizer attached to a search engine. It is a stateful entity that must navigate the social nuances of human turn-...
“Hand-crafting speech architectures is reaching its limits. For the next generation of voice assistants, we don’t build the model, we define the search space...
“Speech models are computationally the most expensive per byte of input. Multi-tier caching is the only way to scale voice assistants to millions of users wi...
“If your voicebot can take actions, it’s an internet-facing production system: treat every utterance like untrusted input from an adversary.”
“Every month, your TTS vendor sends an invoice measured in characters. The same characters you could process on a $619 GPU.”
“Your TTS vendor’s latency number is a lie. Here’s how to read the fine print.”
“TTS demos always use one sentence. Ask yourself why.”
The moment a voice agent’s TTS model causes an OOM on the GPU that was running fine yesterday — because the conversation got longer, because you added a new ...
You build a voice agent, test it with your own voice in a quiet room, and it sounds great. Then it hits users and you discover the agent loses track of domai...
Voice cloning used to be a data problem. Record 30 minutes of audio. Maybe an hour. Feed it to a fine-tuning pipeline. Wait. That was the standard recipe in ...
TL;DR: Standard ASR benchmarks test clean, read speech in studio conditions. Voice agents operate on noisy phone channels, disfluency-laden conversation, and...
TL;DR: Llasa (arXiv:2502.04128, HKUST, February 2025) applies inference-time compute scaling to text-to-speech: instead of always taking the single most like...
TL;DR: VibeVoice (Microsoft, MIT license) generates up to 90 minutes of multi-speaker audio with 4 distinct voices, achieving MOS 3.76 on the 7B model and 1....
TL;DR — Major ASR benchmarks contain TTS-generated speech, inflating reported accuracy. WildASR (arXiv 2603.25727) is the first dataset built entirely from ...
While it’s not a principle, I often think of the parable of the Taoist farmer. The Taoist farmer has one horse, and the horse runs off. The villagers lame...
I am so firmly determined, however, to test the constancy of your mind that, drawing from the teachings of great men, I shall give you also a lesson: Set ...
“A fit body, a calm mind, a house full of love. These things cannot be bought—they must be earned.”
“If you ever want to have peace in your life, you have to move beyond good and evil.” “Nature has no concept of happiness or unhappiness. Nature follow...
“Reading is to the mind what exercise is to the body.” — Richard Steele
Happiness is not a consumable product. It is not something you find by searching for it. It is a naturally arising byproduct of a fulfilling, well-lived l...
When you care more about getting things right than being right, you get better outcomes and you save time and energy.
The ceramics teacher announced on opening day that he was dividing the class into two groups. All those on the left side of the studio, he said, would be ...
The best way to improve your ability to think is to spend time thinking. Most of us are too busy to think. We have too many meetings. Too many calls. Too ...
We rarely do or say something intentionally that surprises us. That’s because we are in intimate contact with the noise in our heads–we spend our days loo...
Nothing will change your future trajectory like your habits. While goals rely on extrinsic motivation, habits, once formed, are automatic. They literally ...
“How we spend our time is how we spend our days. How we spend our days is how our life goes. How our life goes determines whether we thought it was worth ...
While we tell ourselves that the next level is enough, it never is. The next zero in your bank account won’t satisfy you any more than you are now. The ne...
“Expectation is the grandfather of disappointment. The world can never own a man who wants nothing.” — Aphorisms for Thirsty Fish
One simple way to unlock your best self is to shape your environment so that your desired behavior is the path of least resistance.
“The nature of illusion is that it’s designed to make you feel good. About yourself, about your country, about where you’re going – in that sense it funct...
People are much more honest with their actions than their words.
In turning education into a system of mass production we created a superbly democratic system that made the majority of people, and the world as a whole, ...
“He who knows only his own side of the case, knows little of that.” — John Stuart Mill
Say no (a lot).
To improve your outcomes in life, respond to the world as it is, not as you wish it would be.
Sturgeon’s law states that 90% of everything is crap. If you dislike poetry, or fine art, or anything, it’s possible you’ve only ever seen the crap. Go lo...
“It’s time you realized that you have something in you more powerful and miraculous than the things that affect you and make you dance like a puppet.” — M...
The person who is consistent outperforms the person who is intermittent every time. While inconsistent effort works for some things, for the things that r...
“One day, you will wake up and there won’t be any more time to do the things you’ve always wanted. Do it now.” — Paulo Coelho
New year, new me? Nah, I’m just going to keep on being fabulous and making mistakes like I always do 😜 Happy New Year everyone!
Most people spend the first half of their lives collecting and the second half choosing what to keep. Which lessons learned and pieces of advice do you...
Don’t believe everything you think.
A simple and easy approach to decision-making that prevents us from manipulating ourselves. First, understand the forces at play. Then, understand how you...
Productivity is often a distraction. Don’t aim for better ways to get through your tasks as quickly as possible. Instead aim for better tasks that you ne...
Are those things that keep you busy truly important in your life and career?
Don’t define your identity by your beliefs. Define your identity by your willingness to learn.
No one is thinking about you very much. So don’t worry about looking stupid or embarrassing yourself or whatever. No one cares.
Worrying is praying for what you don’t want.
We are who we are when nobody else is watching.
Those who cannot live in harmony with the world are fools though they may be highly educated.
The work you do while you procrastinate is probably the work you should be doing for the rest of your life.
“To travel means, ultimately, nothing more than coming back home a different person from the one who left.” — Pico Iyer
Try to define yourself by what you love and embrace, rather than what you hate and refuse.
The price you pay for doing what everyone else does is getting what everyone else gets.