Different Tools

Now that you have gained an initial idea of how MCPs work through a simple example, it is time to explore them further. Here we provide you with several ideas to gain more experience with building your own MCP. Feel free to choose the option that is most interesting to you, as you can always come back later to practice with the other options. Here are the possible directions:

Extend the MCP for a user database that we discussed in the Quickstart.
Build a MCP based upon a different public dataset.
Build a MCP unrelated to a dataset

For each of the options, you will be writing your own functions for the MCPs. The following tips may be helpful for each of the exercises.

When composing functions in a MCP, it is vital that the functions are as descriptive as possible. If not, the LLM may misunderstand how the function works and not use it correctly. Pay attention to the following things:

Add the type to each of the input variables of the function (e.g. n: int = 5, instead of n = 5).

At the beginning of each function, add descriptive documentation. Have a look at the functions in the Quickstart for examples.

The function should return a descriptive sentence. To illustrate, a function should return “There are 50 men in the dataset”, instead of returning 50, for the LLM to understand what is happening.

If you wish to test a python function, we recommend you to do so locally. This can either be in your IDE of preference (e.g. VSCode), or through an online IDE such as Google Colab. It is recommended to run quick tests to see whether your code is functional, for ease of debugging.

When launching a tool server, you can use tmux to avoid it blocking your terminal, as discussed in the Quickstart. Alternatively, you can simply log onto your server again in a second terminal.

Extend the user database MCP

In this exercise, you will add more functionalities to the MCP for a user database that we constructed in the Quickstart. You can choose the new functionality yourself, based on the attributes available in the dataset. Alternatively, you can choose one of the suggestions below. We recommend you to try and compose the functions yourself, to gain experience with setting up such a MCP. Once you are done, or if you are short on time, you can verify your work with the solution that is available below. If you run into persistent problems, feel free to ask for help to one of the workshop organizers. Good luck!

Find the most common first and last names in the dataset, both for men and women.
For a given e-mail domain, count the number of men and women using this domain.
Get a random sample of users from the dataset.

Show solution

@mcp.tool
def count_by_age_and_job(min_age: int, max_age: int, keyword: str):
    """
    Count the number of males and females within a specific age range who have a job title containing a given keyword.

    Parameters:
    - min_age (int): Minimum age (inclusive)
    - max_age (int): Maximum age (inclusive)
    - keyword (str): Keyword to search for in job titles (e.g., 'manager'). Case-insensitive.

    Returns:
    - dict: {"male": int, "female": int}
    """
    keyword = keyword.lower()
    filtered = df[
        (df['Age'] >= min_age)
        & (df['Age'] <= max_age)
        & (df['Job Title'].str.contains(keyword, na=False))
    ]
    males = filtered[filtered['Sex'] == 'male'].shape[0]
    females = filtered[filtered['Sex'] == 'female'].shape[0]
    return (
        f"There are {males} men and {females} women within the ages "
        f"{min_age} and {max_age} with a job title containing the keyword "
        f"{keyword}."
    )


@mcp.tool
def most_common_names(gender: str, top_n: int = 10):
    """
    Return the most common first and last names for a given gender.

    Parameters:
    - gender (str): "male" or "female" (case-insensitive). Accepts variations like "man", "woman", "f", "m".
    - top_n (int): Number of top names to return (default: 10)

    Returns:
    - dict: {"first_names": {name: count}, "last_names": {name: count}}
    """
    gender = normalize_gender(gender)
    filtered = df[df['Sex'] == gender]
    first_names = filtered['First Name'].value_counts().head(top_n).to_dict()
    last_names = filtered['Last Name'].value_counts().head(top_n).to_dict()
    return (
        f"The top {top_n} most common first names for the gender {gender} "
        f"are {first_names}, the most common last names are {last_names}."
    )


@mcp.tool
def email_domain_stats(domain: str):
    """
    Count the number of males and females using a specific email domain.

    Parameters:
    - domain (str): Email domain to search for (e.g., 'gmail.com'). Case-insensitive.

    Returns:
    - dict: {"male": int, "female": int, "total": int}
    """
    domain = domain.lower()
    filtered = df[df['Email'].str.endswith(domain)]
    males = filtered[filtered['Sex'] == 'male'].shape[0]
    females = filtered[filtered['Sex'] == 'female'].shape[0]
    return (
        f"There are in total {filtered.shape[0]} users with emails ending "
        f"with {domain}, of which {males} men and {females} women."
    )


@mcp.tool
def random_sample(n: int = 5, gender: str = None):
    """
    Return a random sample of users, optionally filtered by gender.

    Returns:
    - dict:
        {
          "description": str,
          "count": int,
          "users": list[dict]
        }
    """
    sample_df = df
    gender_label = "all users"

    if gender:
        gender = normalize_gender(gender)
        sample_df = sample_df[sample_df['Sex'] == gender]
        gender_label = f"{gender} users"

    total_available = sample_df.shape[0]
    sample_size = min(n, total_available)
    sampled = sample_df.sample(sample_size)

    users = sampled[
        ['First Name', 'Last Name', 'Age', 'Job Title']
    ].to_dict(orient='records')

    description = (
        f"Here is a random sample of {sample_size} {gender_label} "
        f"out of {total_available} matching users."
    )

    return {
        "description": description,
        "count": sample_size,
        "users": users,
    }

When you have added new functions to the tools.py file, and you have started the tool server, do not forget to add the function names in the Open WebUI instance. Specifically, in the Function Name Filter List. When in doubt, you can go through the Quickstart again to look at the sequence of actions for launching a MCP.

Build a MCP on a different data source

Before, you made a MCP with tools for analysing a toy CSV data file. Now, you can take the time to build a MCP server with tools for a more realistic dataset. Below, we provide several suggestions, but you can also choose a dataset of your own. Given that this is an open exercise, we do not provide a sample solution here. If you do wish to do an exercise with an available solution, have a look at the first or third option on this page. If you wish to do build another MCP using a small toy dataset, there are different CSV files available here. Several suggestions for real datasets are:

Data from the stock market: through Stooq you can obtain free historical market data. To illustrate, https://stooq.com/q/d/l/?s=msft.us&i=d would give you daily data on the Microsoft stock (abbreviation msft). Using different abbreviations, you can obtain data for different stocks.
Covid data: through this link, you can obtain data on Covid cases in the Netherlands.

You can download data in a python file by using the requests library. For an example, see the code in the quickstart.

LLM- or API-powered tools

The third option is to build a MCP server with LLM-powered tools. You can use the Ollama server that is spinning for your Open WebUI instance to power the tools in your MCP server. Below, you will find a template for the python file that you can use to launch your tool server. Now is the time to be creative, and construct your own LLM-powered tools. If you would like some inspiration, below are several options. For the first two cases, we provide a solution below.

Compose a tool chaining multiple LLM calls, such as first summarizing a piece of text and then rewriting it in a specific style (e.g. humorous). Solution available below
Compose a tool for getting the weather forecast in a specific city, using this API, where we suggest to use the JSON output format of the API. Solution available below
Compose a tool that gets live sports data, through an API endpoint such as TheSportsDB or SportsDataIO.
Compose a tool for verifying whether a text adheres to a pre-specified set of guidelines, such as company guidelines.

import requests
from fastmcp import FastMCP

OPENAI_URL = "https://cf54665a-c32f-4d45-ad99-23e70fd0f175.inference.at-vie-2.exoscale-cloud.com/v1/chat/completions"
MODEL = "mistralai/Mistral-7B-Instruct-v0.3"  # or whatever model name the endpoint exposes
API_KEY = "YOUR_API_KEY_HERE"

# Initialize FastMCP
mcp = FastMCP("llm-api-tools", port=8001)
mcp.settings.host = "0.0.0.0"

def openai_generate(
    prompt: str,
    system: str | None = None,
    temperature: float = 0.7,
    max_tokens: int = 512,
) -> str:
    headers = {
        "Authorization": f"Bearer {API_KEY}",
        "Content-Type": "application/json",
    }

    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})

    payload = {
        "model": MODEL,
        "messages": messages,
        "temperature": temperature,
        "max_tokens": max_tokens,
    }

    response = requests.post(OPENAI_URL, headers=headers, json=payload, timeout=60)
    response.raise_for_status()

    return response.json()["choices"][0]["message"]["content"]


######### Tools #############
# To complete

Show solution

import requests
from fastmcp import FastMCP

OPENAI_URL = "https://cf54665a-c32f-4d45-ad99-23e70fd0f175.inference.at-vie-2.exoscale-cloud.com/v1/chat/completions"
MODEL = "mistralai/Mistral-7B-Instruct-v0.3"
API_KEY = "YOUR_API_KEY_HERE"

# Initialize FastMCP
mcp = FastMCP("llm-tools-demo", port=8001)
mcp.settings.host = "0.0.0.0"

def openai_generate(
    prompt: str,
    system: str | None = None,
    temperature: float = 0.7,
    max_tokens: int = 512,
) -> str:
    headers = {
        "Authorization": f"Bearer {API_KEY}",
        "Content-Type": "application/json",
    }

    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})

    payload = {
        "model": MODEL,
        "messages": messages,
        "temperature": temperature,
        "max_tokens": max_tokens,
    }

    response = requests.post(OPENAI_URL, headers=headers, json=payload, timeout=60)
    response.raise_for_status()

    return response.json()["choices"][0]["message"]["content"]


# -----------------------------
# Summary and rewriting tool
# -----------------------------

@mcp.tool
def summarize_and_rewrite_text(text: str, style: str) -> str:
    """
    Summarizes a document for downstream use.

    Parameters:
    - text (str): the text that should be summarized
    - style (str): the style in which the summary should be rewritten

    Returns:
    - str: the summary rewritten in the specified style
    """
    prompt_summary = f"""
    Summarize the following text, be concise and stick to the information that is provided in the original text.

    TEXT:
    {text}
    """

    summary = openai_generate(prompt_summary)

    prompt_rephrase = f"""
    Rewrite the following text, such that it adheres to the following style: {style}. 

    TEXT:
    {summary}
    """

    rewritten_summary = openai_generate(prompt_rephrase)

    return f"The summary of the document, rewritten in the style {style}, is: {rewritten_summary}"


# -----------------------------
# Weather tool
# -----------------------------

@mcp.tool
def get_weather(city: str):
    """Returns the current weather for a city."""
    try:
        response = requests.get(f"https://wttr.in/{city}?format=j1")
        data = response.json()
        current = data["current_condition"][0]
        return {
            "city": city,
            "temperature_C": current["temp_C"],
            "temperature_F": current["temp_F"],
            "weather": current["weatherDesc"][0]["value"],
            "humidity": current["humidity"],
        }
    except Exception as e:
        return {"error": str(e)}


# ------------------ Run Server ------------------

if __name__ == "__main__":
    mcp.run(transport="streamable-http")

Author: Alexander Sternfeld