Utkarsh Mittal | Towards AI

Part 13 — Design the Recommender System

15 likes

June 22, 2026

Author(s): Utkarsh Mittal. Originally published on Towards AI. Part 12: https://medium.com/p/75cf0a345156 The article explains how to design a production recommender system using a real end-to-end scenario and concrete latency, data, and training considerations. It …

Artificial Intelligence Latest Machine Learning

Part 12 - The 80GB Wall: GPU Infrastructure and Scheduling, Worked End to End

Utkarsh Mittal

9 likes

June 21, 2026

Author(s): Utkarsh Mittal. Originally published on Towards AI. Part 11: https://pub.towardsai.com/ml-systems-design-series-retrieval-augmented-generation-rag-why-your-llm-doesnt-know-about-00e885bdbea9?source=friends_link&sk=55c086d99d3f6b7dfadd3d7c5226b4e0 The article walks through how GPU infrastructure, scheduling, and memory constraints determine the design of large-model training and inference systems. Starting from a …

Artificial Intelligence Latest Machine Learning

Machine Learning System Design - The Model Serving Triangle, With One Forward Pass Flowing Through Every Trade-off (Part 3)

Utkarsh Mittal

49 likes

April 28, 2026

Author(s): Utkarsh Mittal. Originally published on Towards AI. Part 1: https://pub.towardsai.com/the-ml-system-design-interview-with-numbers-flowing-through-every-stage-part-1-a77888339297?source=friends_link&sk=9064640f37c84a131ef24b1126bc0cf9 Three pieces of memory math that every candidate must have memorized. This article discusses the complexities and trade-offs of …

Artificial Intelligence Latest Machine Learning

The L1 Loss Gradient, Explained From Scratch

Utkarsh Mittal

137 likes

April 10, 2026

Author(s): Utkarsh Mittal. Originally published on Towards AI. A complete, step-by-step walkthrough of how gradient descent works with absolute-value loss, with diagrams you can actually follow. If you’ve ever read a deep learning tutorial and hit a derivative that seems to …

Artificial Intelligence Latest Machine Learning

Agentic RAG & Semantic Caching: Building Smarter Enterprise Knowledge Systems

Utkarsh Mittal

27 likes

February 22, 2026

Author(s): Utkarsh Mittal. Originally published on Towards AI. Section 1: The Rise (and Limitations) of RAG. Enterprise data is messy. It lives in Slack threads, Google Drive folders, SharePoint libraries, spreadsheets buried three levels deep in someone’s OneDrive, and meeting transcripts that …

Artificial Intelligence Latest Machine Learning

Inside the Forward Pass: Pre-Fill, Decode, and the GPU Economics of Serving Large Language Models

Utkarsh Mittal

23 likes

February 17, 2026

Author(s): Utkarsh Mittal. Originally published on Towards AI. Why Inference Is the Endgame. Pre-training a frontier large language model typically consumes somewhere between 15 trillion and 30 trillion tokens. That sounds like an enormous number, until you do the arithmetic on …

Artificial Intelligence Latest Machine Learning

Understanding XGBoost: A Deep Dive into the Algorithm

Utkarsh Mittal

65 likes

December 9, 2025

Author(s): Utkarsh Mittal. Originally published on Towards AI. Introduction. XGBoost (Extreme Gradient Boosting) has become the go-to algorithm for winning machine learning competitions and solving real-world prediction problems. But what makes it so powerful? In this comprehensive tutorial, we’ll unpack the mathematical …

Latest Machine Learning

Understanding Gradient Boosted Trees: The Foundation of XGBoost

Utkarsh Mittal

20 likes

December 1, 2025

Author(s): Utkarsh Mittal. Originally published on Towards AI. Gradient Boosted Trees have revolutionized machine learning, powering some of the most successful algorithms in data science. Before diving into the complexities of XGBoost, it’s essential …

Latest Machine Learning

RoPE (Rotary Position Embeddings): A Detailed Example

Utkarsh Mittal

27 likes

November 10, 2025

Author(s): Utkarsh Mittal. Originally published on Towards AI. In transformer models, knowing the order of tokens is essential, even though the model processes tokens in parallel. Traditional positional embeddings rely on a fixed “lookup table” (learned for positions up to a …
