Tech Unpacked – Research & Fundamentals with Nitin Sharma

Wednesday, June 4, 2025

📘 Beginner-Friendly AI Backend Tutorial with Model Context Protocol (MCP)

Welcome! In this tutorial, we will build a simple AI-powered backend application that integrates an AI model with tools and data using the Model Context Protocol (MCP) — all on your local machine.

No experience with AI or coding? No problem! This guide will walk you through every step, like a friendly tutor.

🧠 What is Model Context Protocol (MCP)?

Imagine you’re talking to a super-smart assistant (like ChatGPT). Now imagine that assistant can look things up, calculate results, and read data from your files — just like a helpful human would.

MCP is the set of rules that tells the AI:

"Hey, if you see a question that needs a calculator, use it!"
"If the user asks about data, check the file first before guessing."

Think of it as a walkie-talkie between the AI and real tools/data.

🛠️ Step 1: Set Up Your Environment

1.1 Install Python

If you don’t already have Python:

Go to https://www.python.org/downloads/
Download and install Python 3.9 or higher.
Make sure to check "Add Python to PATH" during installation.

To check if Python is installed:

python --version

1.2 Create a Folder for Your Project

mkdir mcp-ai-demo
cd mcp-ai-demo

1.3 Set Up a Virtual Environment (Optional but Recommended)

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

1.4 Install Required Packages

pip install openai pandas

📦 Step 2: Prepare Your Data (CSV File)

We’ll use a simple product catalog as our "database."

Create a file called `products.csv`:

id,name,price,stock
1,Red T-shirt,19.99,25
2,Blue Jeans,49.99,10
3,Green Hat,14.99,5
4,Black Sneakers,89.99,2

🧠 Step 3: Set Up AI Access (GPT Model)

3.1 Get an OpenAI API Key

Go to https://platform.openai.com/account/api-keys
Create a free account and get your key.

3.2 Store the Key in Your Code (DO NOT share it)

We’ll use a .env-style setup for this tutorial.

Create a file config.py:

# config.py
OPENAI_API_KEY = "your-api-key-here"  # Replace this with your real key

🧰 Step 4: Create Tools the AI Can Use

4.1 Data Tool (Search CSV File)

# data_tool.py
import pandas as pd

def search_products(keyword):
    df = pd.read_csv('products.csv')
    results = df[df['name'].str.contains(keyword, case=False)]
    return results.to_dict(orient='records')

4.2 Calculator Tool

# calculator_tool.py
def calculate(expression):
    try:
        return eval(expression)
    except Exception as e:
        return str(e)

🧠 Step 5: Connect AI via MCP Logic

5.1 Main Script

# main.py
import openai
from config import OPENAI_API_KEY
from data_tool import search_products
from calculator_tool import calculate

openai.api_key = OPENAI_API_KEY

def ai_call(user_input):
    print("\nUser asked:", user_input)

    # Step 1: Let AI decide what to do
    prompt = f"""
You are a smart AI assistant. You can:
1. Search product data using 'search_products(keyword)'
2. Use a calculator with 'calculate(expression)'

If the user asks something like "find red shirt" -> call search_products('red shirt')
If the user says "what's 5 * 20" -> call calculate('5 * 20')
Otherwise, reply directly.

Now, here's what the user asked:
"{user_input}"
What should you do?
"""
    response = openai.Completion.create(
        engine="text-davinci-003",
        prompt=prompt,
        max_tokens=150,
        temperature=0
    )

    action = response.choices[0].text.strip()
    print("\nAI decided:", action)

    # Step 2: Execute based on AI decision
    if "search_products(" in action:
        keyword = action.split("(")[1].split(")")[0].strip("'\"")
        result = search_products(keyword)
        print("\nSearch Results:", result)
    elif "calculate(" in action:
        expr = action.split("(")[1].split(")")[0].strip("'\"")
        result = calculate(expr)
        print("\nCalculation Result:", result)
    else:
        print("\nAI Response:", action)

# Simple CLI Loop
if __name__ == "__main__":
    while True:
        user_input = input("\nAsk something (or type 'exit'): ")
        if user_input.lower() == 'exit':
            break
        ai_call(user_input)

▶️ Step 6: Run and Test It Locally

In your terminal:

python main.py

Try typing:

Find red t-shirt
How much is 15 * 3.5?
Show green hat

🎉 You’ve just built a mini AI system that talks to tools and data!

💡 What to Try Next

Add a second data file (like customer info).
Add a weather tool using a public API.
Let the AI update your CSV (e.g., mark item as "out of stock").
Try switching the AI model to something open-source (e.g., using HuggingFace Transformers).

📦 Summary

Component	Description
CSV File	Acts as your mock database
AI Model	Interprets user queries
Tools	Execute actions (search, calculate, etc.)
MCP Logic	Bridges AI intent with tool invocation

You now understand the basics of Model Context Protocol — using plain Python and local files. 🧠🔌📊

Happy hacking!

Monday, June 2, 2025

Distributed Key-Value Store: System Design Principles for Scalability and Consistency

Introduction

In modern distributed systems, key-value stores are fundamental for high-performance applications requiring fast read/write operations. Designing a distributed key-value database involves balancing durability, availability, performance, and consistency while ensuring scalability.

This blog explores the core principles of a strongly consistent distributed key-value store, covering:

Architecture & Components
Consistency Models & Replication Strategies
Conflict Resolution & Fault Tolerance
Scalability & Performance Optimizations

1. Key Characteristics & Priorities

A well-designed distributed key-value store prioritizes:

Durability – Data must never be lost once written.
Availability – The system should remain operational despite failures.
Performance – Low-latency reads/writes are essential but secondary to durability and availability.

"If you lose customer data, you won’t be in business for long. It’s better to return data slowly than not at all."

Key Takeaways:

✔ Durability is non-negotiable—data loss is catastrophic.
✔ Availability trumps performance—slow responses are better than downtime.
✔ Security is assumed (e.g., client-side encryption) but not the focus here.

2. Strong Consistency Model

Unlike eventually consistent systems (e.g., Cassandra), this design enforces strong consistency:

A read after a successful write will always reflect the latest data.
ACID compliance is partial—Atomicity and Isolation are not guaranteed at row/table level.

Key Takeaways:

✔ Read-after-write consistency ensures predictable behavior.
✔ No full ACID support—trade-offs are made for scalability.
✔ Last-write-wins (LWW) conflict resolution via a sequencer.

3. Core Data Structure: Key-Value with Sequencer

Each record consists of:

Key (unique identifier)
Value (associated data)
16-byte Sequencer (monotonically increasing for conflict resolution)

"The last write wins, and the sequencer determines which write is the latest."

Key Takeaways:

✔ Sequencer resolves conflicts in concurrent writes.
✔ Simple schema enables fast lookups and horizontal scaling.

4. Distributed System Architecture

The system comprises several key components:

A. Load Balancer

Distributes incoming requests across Request Managers.

B. Request Manager

Routes requests to the correct Replication Group using metadata.
Maintains an in-memory metadata cache for efficiency.

C. Metadata Manager

Stores table-to-Replication-Group mappings.
Handles leader election (must be strongly consistent).
High read, low write workload.

D. Replication Group (RG)

Leader-Follower model (odd number of nodes for quorum).
All writes go through the leader.
Followers replicate data for redundancy.

E. Controller (Scheduler)

Monitors "hot" tables (high traffic/large size).
Splits tables across multiple RGs for scalability.

5. Replication & Consistency Mechanisms

Write Process (Strong Consistency Guarantee)

Client sends a PUT request.
Leader appends data to an append-only log.
Followers replicate the log entry.
Write succeeds only when a majority (quorum) acknowledge it.

Read Process

Reads can be served by leader or followers (configurable).
Followers may lag but strong consistency ensures latest data is returned.

"A PUT is acknowledged only when a majority of nodes (including leader) confirm it."

Key Takeaways:

✔ Quorum writes ensure durability and consistency.
✔ Append-only log enables efficient sequential writes.
✔ B+ Tree or LSM Tree indexes speed up reads.

6. Handling Failures & Edge Cases

Failure Scenario	Resolution Mechanism
Split Brain (Two Leaders)	Quorum voting ensures only one leader is valid.
Leader Failure	New leader elected via majority consensus.
Network Partition	Outdated Request Managers refresh metadata.
Node Crash Before Indexing	Data recovered from append-only log.

Key Takeaways:

✔ Consistency > Availability in conflict scenarios.
✔ Majority quorum prevents split-brain issues.

7. Scalability & Performance Optimizations

A. Handling "Hot" Tables

Controller detects large/high-traffic tables.
Splits them into smaller ranges across new RGs.

B. Data Storage & Indexing

Append-only log (fast sequential writes).
B+ Tree / LSM Tree (efficient indexing for reads).

C. Estimated Scalability

Petabyte-scale with sufficient RGs.
Key-value size limit: ~1MB (optimized for small records).

Key Takeaways:

✔ Automatic table splitting prevents bottlenecks.
✔ Efficient indexing balances read/write performance.

Conclusion

Designing a distributed key-value store requires careful trade-offs between consistency, availability, and performance. This system prioritizes:

Strong consistency via quorum writes and sequencers.
Fault tolerance through leader-follower replication.
Scalability via automatic table splitting.

By leveraging append-only logs, B+ trees, and a robust metadata layer, this architecture ensures durability, high availability, and efficient scaling.

Popular Posts

Search This Blog

Wednesday, June 4, 2025

📘 Beginner-Friendly AI Backend Tutorial with Model Context Protocol (MCP)

🧠 What is Model Context Protocol (MCP)?

🛠️ Step 1: Set Up Your Environment

1.1 Install Python

1.2 Create a Folder for Your Project

1.3 Set Up a Virtual Environment (Optional but Recommended)

1.4 Install Required Packages

📦 Step 2: Prepare Your Data (CSV File)

Create a file called products.csv:

🧠 Step 3: Set Up AI Access (GPT Model)

3.1 Get an OpenAI API Key

3.2 Store the Key in Your Code (DO NOT share it)

🧰 Step 4: Create Tools the AI Can Use

4.1 Data Tool (Search CSV File)

4.2 Calculator Tool

🧠 Step 5: Connect AI via MCP Logic

5.1 Main Script

▶️ Step 6: Run and Test It Locally

💡 What to Try Next

📦 Summary

Monday, June 2, 2025

Distributed Key-Value Store: System Design Principles for Scalability and Consistency

Introduction

1. Key Characteristics & Priorities

Key Takeaways:

2. Strong Consistency Model

Key Takeaways:

3. Core Data Structure: Key-Value with Sequencer

Key Takeaways:

4. Distributed System Architecture

A. Load Balancer

B. Request Manager

C. Metadata Manager

D. Replication Group (RG)

E. Controller (Scheduler)

5. Replication & Consistency Mechanisms

Write Process (Strong Consistency Guarantee)

Read Process

Key Takeaways:

6. Handling Failures & Edge Cases

Key Takeaways:

7. Scalability & Performance Optimizations

A. Handling "Hot" Tables

B. Data Storage & Indexing

C. Estimated Scalability

Key Takeaways:

Conclusion

My Profile

!! IMPORTANT LINKS !!

!! INTERESTING TALKS !!

Contact Form

Labels

Total Pageviews

Create a file called `products.csv`: