Showing posts with label RAG. Show all posts

Wednesday, June 4, 2025

Complete Beginner's Guide to Building an AI-Powered Backend with Model Context Protocol (MCP)

Welcome to your first AI programming adventure! By the end of this tutorial, you'll have built a smart AI assistant that can read data files, perform calculations, and make intelligent decisions about which tools to use. Don't worry if you've never coded before – we'll go through everything step by step.

What You'll Build

Imagine having a personal assistant who can:

  • Read your business data (like a customer database)
  • Perform calculations when needed
  • Answer questions by combining information from different sources
  • Know exactly which tool to use for each task

That's exactly what we're building – an AI-powered backend application using something called the Model Context Protocol (MCP).

Part 1: Understanding MCP (The Simple Version)

Think of MCP like a translator at the United Nations. Just as the translator helps delegates from different countries communicate, MCP helps your AI assistant communicate with different tools and data sources.

Here's the analogy:

  • Your AI is like a smart intern
  • Your tools (calculator, file reader) are like specialized departments
  • MCP is like the office manager who knows which department can help with each request
  • Your data files are like filing cabinets with important information

Without MCP, your AI would be like an intern who doesn't know where anything is or who to ask for help. With MCP, your AI becomes incredibly efficient because it knows exactly where to find information and which tools to use.

Part 2: Setting Up Your Development Environment

Let's get your computer ready for AI development!

Step 1: Install Python

For Windows or Mac:

  1. Go to python.org
  2. Download Python 3.9 or newer
  3. Run the installer and follow the prompts

Step 2: Verify Your Installation

Open your terminal (Command Prompt on Windows, Terminal on Mac/Linux) and type:

bash
python --version

You should see something like "Python 3.9.x" or newer.

Step 3: Create Your Project Folder

bash
# Create a new folder for your project
mkdir ai_assistant_tutorial
cd ai_assistant_tutorial

# Create a Python virtual environment (like a clean workspace)
python -m venv ai_env

# Activate your virtual environment
# On Mac/Linux:
source ai_env/bin/activate
# On Windows, run: ai_env\Scripts\activate

You'll know it worked when you see (ai_env) at the beginning of your command line.

Step 4: Install Required Packages

bash
pip install openai pandas python-dotenv

What these packages do:

  • openai: Lets us talk to OpenAI's GPT models
  • pandas: Helps us read and work with data files
  • python-dotenv: Keeps our API keys secure

Part 3: Getting Your AI Model Ready

Step 1: Get an OpenAI API Key

  1. Go to platform.openai.com
  2. Sign up for an account
  3. Go to "API Keys" in your dashboard
  4. Click "Create new secret key"
  5. Copy the key (it starts with "sk-")

Important: Keep this key secret! Never share it or put it in your code directly.

Step 2: Create Your Environment File

In your project folder, create a file called .env:

bash
# Create the .env file
touch .env

Open .env in any text editor and add:

OPENAI_API_KEY=your_api_key_here

Replace your_api_key_here with your actual API key.

Part 4: Create Your Data Source

Let's create a simple customer database as a CSV file.

Create a file called customer_data.csv:

csv
customer_id,name,email,purchase_amount,product_category,join_date
1,Alice Johnson,alice@email.com,1250.00,Electronics,2023-01-15
2,Bob Smith,bob@email.com,890.50,Clothing,2023-02-20
3,Carol Davis,carol@email.com,2100.75,Electronics,2023-01-10
4,David Wilson,david@email.com,450.25,Books,2023-03-05
5,Eva Brown,eva@email.com,1800.00,Electronics,2023-02-28
6,Frank Miller,frank@email.com,320.90,Clothing,2023-03-12
7,Grace Lee,grace@email.com,975.60,Books,2023-01-25
8,Henry Taylor,henry@email.com,1650.30,Electronics,2023-02-15

This represents a simple customer database with purchase information.
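Before wiring the file to the AI, you can sanity-check that pandas (installed in Step 4) reads it. The two helpers below are purely illustrative:

python
```python
import pandas as pd

def load_customers(path="customer_data.csv"):
    """Read the customer CSV into a DataFrame."""
    return pd.read_csv(path)

def total_revenue(df):
    """Sum of all purchase amounts."""
    return float(df["purchase_amount"].sum())
```

With the eight rows above, `total_revenue(load_customers())` should come to 9438.30.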

Part 5: Building Your MCP-Powered AI Assistant

Now for the exciting part – let's build our AI assistant! Create a file called ai_assistant.py:

ai_assistant.py
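Here is a minimal sketch of ai_assistant.py, assuming the packages from Step 4, the customer_data.csv file from Part 4, and OpenAI's function-calling chat API (the model name gpt-3.5-turbo is one reasonable choice, not a requirement). It is a sketch of the idea, not a full production implementation:

python
```python
# ai_assistant.py -- a minimal sketch, not a full production implementation.
import json

import pandas as pd


class CustomerDataTool:
    """Searches customer_data.csv by name or product category."""

    name = "query_customer_data"
    spec = {
        "type": "function",
        "function": {
            "name": "query_customer_data",
            "description": "Search customers by name or product category",
            "parameters": {
                "type": "object",
                "properties": {
                    "search_term": {
                        "type": "string",
                        "description": "Name or category to search for",
                    }
                },
                "required": ["search_term"],
            },
        },
    }

    def __init__(self, path="customer_data.csv"):
        self.path = path

    def execute(self, search_term):
        df = pd.read_csv(self.path)
        mask = (df["name"].str.contains(search_term, case=False)
                | df["product_category"].str.contains(search_term, case=False))
        return df[mask].to_dict(orient="records")


class CalculatorTool:
    """Evaluates simple arithmetic expressions."""

    name = "calculate"
    spec = {
        "type": "function",
        "function": {
            "name": "calculate",
            "description": "Evaluate an arithmetic expression like '2000 * 0.15'",
            "parameters": {
                "type": "object",
                "properties": {"expression": {"type": "string"}},
                "required": ["expression"],
            },
        },
    }

    def execute(self, expression):
        # Restricted eval: no builtins, so only plain arithmetic works.
        return eval(expression, {"__builtins__": {}}, {})


def ask(question, tools):
    """Send the question to the model and route any tool call it makes."""
    # Imported here so the tool classes above work without these packages.
    from dotenv import load_dotenv
    from openai import OpenAI

    load_dotenv()  # reads OPENAI_API_KEY from .env
    client = OpenAI()
    registry = {t.name: t for t in tools}
    msg = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": question}],
        tools=[t.spec for t in tools],
        tool_choice="auto",
    ).choices[0].message
    if msg.tool_calls:  # the model chose a tool
        call = msg.tool_calls[0]
        args = json.loads(call.function.arguments)
        return registry[call.function.name].execute(**args)
    return msg.content  # the model answered directly


if __name__ == "__main__":
    tools = [CustomerDataTool(), CalculatorTool()]
    print("🚀 Welcome to Your AI-Powered Business Assistant!")
    while True:
        q = input("\nAsk something (or type 'exit'): ")
        if q.lower() == "exit":
            break
        print(ask(q, tools))
```

Each tool carries its own JSON description (`spec`), so the coordinator never hardcodes which tools exist; it simply hands the catalog to the model and executes whatever the model selects.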

Part 6: Running Your AI Assistant

Now let's run your AI assistant! Make sure you're in your project directory with your virtual environment activated.

bash
python ai_assistant.py

You should see output like this:

🚀 Welcome to Your AI-Powered Business Assistant!
==================================================
This assistant can help you with:
- Finding customer information
- Calculating discounts and percentages
- Analyzing sales data
- Answering business questions

🤖 Initializing AI Assistant...
📁 Loading customer data...
✅ Loaded 8 customer records
✅ AI Assistant ready with the following tools:
   📊 Customer Data Access Tool
   🧮 Calculator Tool

The program will first run through some test queries to show you what it can do, then enter interactive mode where you can ask your own questions!

Part 7: Understanding What's Happening

Let's break down the magic behind your AI assistant:

The MCP Architecture

  1. AI Brain (OpenAI GPT): The smart decision-maker that understands your questions
  2. Tool Registry: A catalog of available tools (like a phone book for services)
  3. Data Access Tool: Reads and searches your customer database
  4. Calculator Tool: Performs mathematical operations
  5. Assistant Coordinator: Routes requests to the right tools

How MCP Makes It Smart

When you ask "Who is Alice?", here's what happens:

  1. Understanding: The AI analyzes your question
  2. Decision: It realizes it needs customer data
  3. Tool Selection: It chooses the query_customer_data tool
  4. Execution: The tool searches your CSV file
  5. Response: The AI formats the result in a friendly way

This is MCP in action – the AI doesn't just give pre-programmed responses; it intelligently chooses which tools to use based on your specific question.
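The five steps can be sketched as a tiny routing loop. Here a keyword heuristic stands in for the model's decision step (purely illustrative; the real assistant lets the LLM make this choice):

python
```python
def choose_tool(question):
    """Stand-in for the AI's decision step: map a question to a tool name."""
    has_digit = any(ch.isdigit() for ch in question)
    has_operator = any(op in question for op in "+-*/%")
    if has_digit and has_operator:
        return "calculate"
    return "query_customer_data"

def route(question, tools):
    """Execute whichever tool the decision step selected."""
    name = choose_tool(question)
    return name, tools[name](question)
```

For example, "What's 15% of 2000?" routes to the calculator, while "Who is Alice?" routes to the data tool.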

Part 8: Testing Your Assistant

Try these example questions:

Customer Queries:

  • "Who is Bob Smith?"
  • "Find information about Carol"
  • "Show me all Electronics customers"

Calculations:

  • "What's 15% of 2000?"
  • "Calculate a 25% discount on $1800"
  • "Add 1250 and 890"

Data Analysis:

  • "Who are the top customers?"
  • "What's our total revenue?"
  • "Show me category statistics"

Part 9: What Makes This "MCP-Powered"

Traditional approaches would hardcode specific responses. Our MCP approach:

  1. Dynamic Tool Discovery: The AI discovers available tools at runtime
  2. Intelligent Routing: Based on the question, it picks the right tool
  3. Flexible Integration: Adding new tools doesn't require changing the AI logic
  4. Context Preservation: The AI maintains context across tool calls
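Point 1, dynamic tool discovery, can be illustrated with a registry that builds its catalog from whatever tools are plugged in at runtime. The class and method names below are illustrative, not the tutorial's actual code:

python
```python
class ToolRegistry:
    """Catalog of tools, discovered at runtime rather than hardcoded."""

    def __init__(self):
        self._tools = {}

    def register(self, tool):
        # The registry learns each tool's name from its own description.
        self._tools[tool.describe()["name"]] = tool

    def catalog(self):
        """Descriptions handed to the AI so it can choose a tool."""
        return [t.describe() for t in self._tools.values()]

    def execute(self, name, **kwargs):
        return self._tools[name].execute(**kwargs)


class EchoTool:
    """Toy tool used to demonstrate registration."""

    def describe(self):
        return {"name": "echo", "description": "Repeat the input back"}

    def execute(self, text):
        return text
```

Adding a new capability is just another `register()` call; the AI logic never changes.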

Part 10: Troubleshooting Common Issues

"ModuleNotFoundError": Make sure your virtual environment is activated and packages are installed:

bash
source ai_env/bin/activate  # or ai_env\Scripts\activate on Windows
pip install openai pandas python-dotenv

"OpenAI API Error": Check that your API key is correct in the .env file

"FileNotFoundError": Make sure customer_data.csv is in the same folder as your Python script

Part 11: What to Try Next

Now that you have a working MCP-powered AI assistant, here are some exciting extensions:

1. Add a Weather Tool

Create a new tool that fetches weather data:

python
class WeatherTool:
    def get_tool_description(self):
        return {
            "name": "get_weather",
            "description": "Get current weather information for a city",
            "parameters": {
                "type": "object",
                "properties": {
                    "city": {"type": "string", "description": "City name"}
                },
                "required": ["city"]
            }
        }

2. Connect to a Real Database

Replace the CSV file with SQLite:

python
import sqlite3

class DatabaseTool:
    def __init__(self):
        self.conn = sqlite3.connect('customers.db')
        # Create and populate your database

3. Add Email Functionality

Create a tool that can send emails:

python
class EmailTool:
    def execute(self, recipient, subject, message):
        # Use smtplib to send actual emails
        pass

4. Web API Integration

Add a tool that fetches data from public APIs:

python
import requests

class APITool:
    def get_stock_price(self, symbol):
        # Fetch stock prices from a financial API
        pass

5. File Operations

Add tools for reading/writing different file types:

python
class FileOperationTool:
    def read_json(self, filename):
        # Read JSON files
        pass
    
    def write_report(self, data, filename):
        # Generate reports
        pass

Part 12: Key Concepts Recap

Model Context Protocol (MCP): A standardized way for AI to communicate with external tools and data sources.

Tool Registry: A catalog that tells the AI what tools are available and how to use them.

Dynamic Tool Selection: The AI chooses which tools to use based on the user's question, not pre-programmed rules.

Context Preservation: Information flows seamlessly between the AI, tools, and data sources.

Part 13: Real-World Applications

Your MCP skills can be applied to build:

  • Customer Service Bots: That can access real customer data and perform calculations
  • Business Intelligence Tools: That combine data analysis with AI insights
  • Personal Assistants: That can interact with your personal data and services
  • Automated Workflows: That can trigger actions across multiple systems

Conclusion

Congratulations! You've built a fully functional AI-powered backend application using Model Context Protocol principles. You now understand:

  • How AI can intelligently choose between different tools
  • How to structure data access in an MCP-compatible way
  • How to create tool descriptions that AI can understand
  • How to coordinate between AI models and external systems

The beauty of MCP is that it makes AI systems more flexible and powerful. Instead of being limited to what they were trained on, AI assistants can now access real-time data and perform real-world actions.

Your AI assistant demonstrates the core principle of MCP: seamless integration between AI intelligence and external capabilities. As you continue building, remember that MCP is about creating smart systems that know how to use the right tool for each job – just like a great manager who knows which team member to assign to each task.

Keep experimenting, and you'll soon be building AI systems that can interact with any service or data source you can imagine!

📘 Beginner-Friendly AI Backend Tutorial with Model Context Protocol (MCP)

 Welcome! In this tutorial, we will build a simple AI-powered backend application that integrates an AI model with tools and data using the Model Context Protocol (MCP) — all on your local machine.

No experience with AI or coding? No problem! This guide will walk you through every step, like a friendly tutor.


🧠 What is Model Context Protocol (MCP)?

Imagine you’re talking to a super-smart assistant (like ChatGPT). Now imagine that assistant can look things up, calculate results, and read data from your files — just like a helpful human would.

MCP is the set of rules that tells the AI:

  • "Hey, if you see a question that needs a calculator, use it!"

  • "If the user asks about data, check the file first before guessing."

Think of it as a walkie-talkie between the AI and real tools/data.


🛠️ Step 1: Set Up Your Environment

1.1 Install Python

If you don’t already have Python, download version 3.9 or newer from python.org.

To check if Python is installed:

python --version

1.2 Create a Folder for Your Project

mkdir mcp-ai-demo
cd mcp-ai-demo

1.3 Set Up a Virtual Environment (Optional but Recommended)

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

1.4 Install Required Packages

pip install openai pandas

📦 Step 2: Prepare Your Data (CSV File)

We’ll use a simple product catalog as our "database."

Create a file called products.csv:

id,name,price,stock
1,Red T-shirt,19.99,25
2,Blue Jeans,49.99,10
3,Green Hat,14.99,5
4,Black Sneakers,89.99,2

🧠 Step 3: Set Up AI Access (GPT Model)

3.1 Get an OpenAI API Key

3.2 Store the Key in Your Code (DO NOT share it)

For simplicity we’ll keep the key in a small config module; for anything beyond a local demo, prefer a .env file loaded with python-dotenv, as in the previous tutorial.

Create a file config.py:

# config.py
OPENAI_API_KEY = "your-api-key-here"  # Replace this with your real key

🧰 Step 4: Create Tools the AI Can Use

4.1 Data Tool (Search CSV File)

# data_tool.py
import pandas as pd

def search_products(keyword):
    df = pd.read_csv('products.csv')
    results = df[df['name'].str.contains(keyword, case=False)]
    return results.to_dict(orient='records')

4.2 Calculator Tool

# calculator_tool.py
def calculate(expression):
    try:
        # Restrict eval to bare arithmetic: no builtins, no names.
        # Never pass untrusted input to an unrestricted eval().
        return eval(expression, {"__builtins__": {}}, {})
    except Exception as e:
        return str(e)

🧠 Step 5: Connect AI via MCP Logic

5.1 Main Script

# main.py
from openai import OpenAI
from config import OPENAI_API_KEY
from data_tool import search_products
from calculator_tool import calculate

client = OpenAI(api_key=OPENAI_API_KEY)

def ai_call(user_input):
    print("\nUser asked:", user_input)

    # Step 1: Let AI decide what to do
    prompt = f"""
You are a smart AI assistant. You can:
1. Search product data using 'search_products(keyword)'
2. Use a calculator with 'calculate(expression)'

If the user asks something like "find red shirt" -> call search_products('red shirt')
If the user says "what's 5 * 20" -> call calculate('5 * 20')
Otherwise, reply directly.

Now, here's what the user asked:
"{user_input}"
What should you do?
"""
    # The legacy Completion API and text-davinci-003 are retired;
    # use the chat completions API instead.
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
        max_tokens=150,
        temperature=0
    )

    action = response.choices[0].message.content.strip()
    print("\nAI decided:", action)

    # Step 2: Execute based on AI decision
    if "search_products(" in action:
        keyword = action.split("(")[1].split(")")[0].strip("'\"")
        result = search_products(keyword)
        print("\nSearch Results:", result)
    elif "calculate(" in action:
        expr = action.split("(")[1].split(")")[0].strip("'\"")
        result = calculate(expr)
        print("\nCalculation Result:", result)
    else:
        print("\nAI Response:", action)

# Simple CLI Loop
if __name__ == "__main__":
    while True:
        user_input = input("\nAsk something (or type 'exit'): ")
        if user_input.lower() == 'exit':
            break
        ai_call(user_input)

▶️ Step 6: Run and Test It Locally

In your terminal:

python main.py

Try typing:

  • Find red t-shirt

  • How much is 15 * 3.5?

  • Show green hat

🎉 You’ve just built a mini AI system that talks to tools and data!


💡 What to Try Next

  • Add a second data file (like customer info).

  • Add a weather tool using a public API.

  • Let the AI update your CSV (e.g., mark item as "out of stock").

  • Try switching the AI model to something open-source (e.g., using HuggingFace Transformers).


📦 Summary

  • CSV File: Acts as your mock database

  • AI Model: Interprets user queries

  • Tools: Execute actions (search, calculate, etc.)

  • MCP Logic: Bridges AI intent with tool invocation

You now understand the basics of Model Context Protocol — using plain Python and local files. 🧠🔌📊

Happy hacking!

Sunday, May 25, 2025

Core Architecture Concepts in RAG, LLMs & GenAI

 

1. Embeddings

  • What it is: A dense vector representation of data (e.g., words, sentences, code).

  • Why it matters: Converts discrete data (like text) into continuous numerical space that models can process.

  • Example:

    • “Dog” → [0.25, -0.12, ..., 0.83]

    • Words with similar meanings have vectors close in space (semantic similarity).

  • Used in:

    • Semantic search in RAG

    • Input for LLMs

    • Vector databases


2. Vector Spaces

  • What it is: A high-dimensional space where embeddings live.

  • Why it matters: Vectors allow fast similarity search using measures like cosine similarity or dot product.

  • Used in:

    • Finding relevant documents in RAG

    • Nearest neighbor searches in FAISS or similar vector DBs
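As a concrete illustration, a brute-force nearest-neighbor lookup by cosine similarity takes only a few lines of plain Python; this is exactly the operation a vector database accelerates at scale:

python
```python
import math

def cosine_similarity(a, b):
    """cos(theta) = (a . b) / (|a| |b|); 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def nearest(query, vectors):
    """Brute-force nearest neighbor: scan every vector, keep the best."""
    return max(vectors, key=lambda v: cosine_similarity(query, v))
```

Libraries like FAISS replace the linear scan with index structures so the search stays fast across millions of vectors.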


3. Attention Mechanism

  • What it is: A technique that allows the model to focus on relevant parts of the input sequence when producing output.

  • Types:

    • Self-attention: Used in Transformers; compares all tokens in a sequence to each other.

    • Cross-attention: Used in RAG; queries from LLM attend to retrieved documents.

  • Why it matters:

    • Solves long-range dependency problems in sequences.

    • Enables parallelism (vs. RNNs).

  • Key math:

    Attention(Q, K, V) = softmax(QKᵀ / √d_k) V
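The formula can be implemented directly with numpy. This is a toy illustration on small matrices, not an optimized kernel:

python
```python
import numpy as np

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(QK^T / sqrt(d_k)) V."""
    d_k = K.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    # Row-wise softmax (subtract the max for numerical stability)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V
```

Each output row is a weighted average of the rows of V, with weights determined by how well the query matches each key.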


4. Transformers

  • What it is: The architecture underlying modern LLMs.

  • Components:

    • Input Embedding + Positional Encoding

    • Multi-head Attention

    • Feed-forward Neural Networks

    • Layer Normalization

    • Residual Connections

  • Why it matters: Allows LLMs to scale, understand context, and generate coherent text.


5. Large Language Models (LLMs)

  • What it is: Neural networks (typically Transformers) trained on massive corpora to predict and generate human-like language.

  • Examples: GPT, BERT, Claude, Gemini

  • Key Traits:

    • Pretraining: On vast text data using next-token prediction or masked language modeling.

    • Fine-tuning: For specific tasks (e.g., chat, summarization).

    • Inference: Generates text one token at a time using learned probabilities.


6. Generative AI (GenAI)

  • What it is: Any AI model that can generate new content (text, images, code, etc.).

  • In NLP:

    • Models that produce novel text based on prompts or questions.

    • LLMs are a subset of GenAI.

  • Modalities:

    • Text (GPT, Claude)

    • Code (Codex)

    • Images (DALL·E, Midjourney)

    • Video (Sora)

    • Audio (MusicGen)


7. Retrieval-Augmented Generation (RAG)

  • What it is: A hybrid GenAI method that augments LLMs with retrieval from external knowledge.

  • Flow:

    1. Embed Query → vector space

    2. Retrieve Documents → from vector DB using similarity search

    3. Augment Prompt → LLM receives query + retrieved context

    4. Generate Answer → grounded, up-to-date, accurate

  • Why it matters:

    • Reduces hallucination

    • Enables up-to-date, domain-specific responses

    • Keeps LLMs smaller and more efficient (vs. training on entire domain data)
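Steps 1–3 of the flow can be mimicked end-to-end with a toy word-count "embedding". Real systems use learned embeddings and a vector database; this sketch only makes the flow concrete:

python
```python
import math
from collections import Counter

def embed(text):
    """Toy embedding: a bag-of-words count vector."""
    return Counter(text.lower().split())

def similarity(a, b):
    """Cosine similarity between two count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def retrieve(query, docs, k=1):
    """Step 2: return the k documents most similar to the query."""
    q = embed(query)
    return sorted(docs, key=lambda d: similarity(q, embed(d)), reverse=True)[:k]

def build_prompt(query, context_docs):
    """Step 3: augment the prompt with the retrieved context."""
    context = "\n".join(context_docs)
    return f"Answer using this context:\n{context}\n\nQuestion: {query}"
```

Step 4 would pass the built prompt to an LLM, which generates an answer grounded in the retrieved text.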


8. Tokenization

  • What it is: Breaking text into tokens (smaller pieces) before inputting into a model.

  • Example:

    • “ChatGPT is smart.” → [‘Chat’, ‘G’, ‘PT’, ‘ is’, ‘ smart’, ‘.’]

  • Why it matters:

    • LLMs operate on tokens, not raw text.

    • Affects context length and cost.


9. Context Window

  • What it is: The maximum number of tokens a model can consider at once.

  • LLMs have limits (e.g., GPT-4 Turbo can handle 128k tokens).

  • Why it matters: Limits how much data (prompt + docs) you can include during RAG.


10. Prompt Engineering

  • What it is: Crafting input prompts to guide the LLM’s behavior.

  • In RAG: Used to incorporate retrieved documents properly.

  • Example:

    You are a Java expert. Based on the following context, answer the user’s question. Context: [...]. Question: [...]



11. Vector Databases

  • What it is: Specialized databases that store and search high-dimensional vectors.

  • Popular tools: FAISS, Pinecone, Weaviate, Qdrant

  • Role in RAG:

    • Store document embeddings

    • Retrieve semantically relevant docs during generation


12. Similarity Search

  • What it is: Finding vectors in the database closest to the query vector.

  • Common Metrics:

    • Cosine Similarity

    • Dot Product

    • Euclidean Distance
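Each of the three metrics takes only a line or two of plain Python (illustrative only; production systems use vectorized library implementations):

python
```python
import math

def dot(a, b):
    """Dot product: large when vectors point the same way and are long."""
    return sum(x * y for x, y in zip(a, b))

def euclidean(a, b):
    """Straight-line distance: smaller means more similar."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cosine(a, b):
    """Angle-based similarity: ignores vector length, 1.0 = same direction."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))
```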


13. Fine-tuning vs. Prompting vs. RAG

  • Fine-tuning: You want the model to learn new tasks or domain behavior via additional training

  • Prompting: Quick instructions using existing model knowledge

  • RAG: Inject external, non-memorized knowledge at query time


          ┌─────────────┐
          │  User Query │
          └─────┬───────┘
                │
                ▼
        ┌──────────────┐
        │  Embed Query │
        └─────┬────────┘
              ▼
    ┌─────────────────────┐
    │   Vector DB Search  │  ←— uses cosine similarity
    └─────┬───────────────┘
          ▼
  ┌───────────────────────┐
  │  Retrieved Documents  │
  └─────┬─────────────────┘
        ▼
┌────────────────────────────┐
│ Prompt + Retrieved Context │
└─────┬──────────────────────┘
      ▼
┌────────────────┐
│     LLM        │
│  (e.g. GPT-4)  │
└─────┬──────────┘
      ▼
┌─────────────┐
│   Answer    │
└─────────────┘
