Commit 92d31778 authored by Patrick McFadin, committed by GitHub

Cassandra database tool (#13423)

parent cad1eaf8
Showing with 1410 additions and 0 deletions
%% Cell type:markdown id: tags:
# Cassandra Database Tools
Apache Cassandra® is a widely used database for storing transactional application data. The introduction of functions and tooling in Large Language Models has opened up some exciting use cases for existing data in Generative AI applications. The Cassandra Database toolkit enables AI engineers to efficiently integrate Agents with Cassandra data, offering the following features:
- Fast data access through optimized queries. Most queries should run in single-digit ms or less.
- Schema introspection to enhance LLM reasoning capabilities
- Compatibility with various Cassandra deployments, including Apache Cassandra®, DataStax Enterprise™, and DataStax Astra™
- Currently, the toolkit is limited to SELECT queries and schema introspection operations. (Safety first)
## Quick Start
- Install the cassio library
- Set environment variables for the Cassandra database you are connecting to
- Initialize CassandraDatabase
- Pass the tools to your agent with spec.to_tool_list()
- Sit back and watch it do all your work for you
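Put together, those steps look roughly like the sketch below. This is a minimal illustration, not the full walkthrough; the model name and environment handling mirror the notebook that follows, and connection settings are assumed to come from your environment variables.
``` python
# A minimal sketch of the Quick Start steps above; connection settings are
# read from environment variables by cassio (see Environment Setup below).
import cassio
from llama_index.agent.openai import OpenAIAgent
from llama_index.llms.openai import OpenAI
from llama_index.tools.cassandra.base import CassandraDatabaseToolSpec
from llama_index.tools.cassandra.cassandra_database_wrapper import CassandraDatabase

cassio.init(auto=True)  # reads CASSANDRA_* / ASTRA_DB_* variables

spec = CassandraDatabaseToolSpec(db=CassandraDatabase())
agent = OpenAIAgent.from_tools(
    spec.to_tool_list(), llm=OpenAI(model="gpt-4-1106-preview"), verbose=True
)
print(agent.chat("What tables are in my keyspace?"))
```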
## Theory of Operation
Cassandra Query Language (CQL) is the primary *human-centric* way of interacting with a Cassandra database. While offering some flexibility when generating queries, it requires knowledge of Cassandra data modeling best practices. LLM function calling gives an agent the ability to reason and then choose a tool to satisfy the request. Agents using LLMs should reason with Cassandra-specific logic when choosing the appropriate tool or chain of tools. This reduces the randomness introduced when LLMs are forced to provide a top-down solution. Do you want an LLM to have complete, unfettered access to your database? Yeah. Probably not. To keep the agent's reasoning grounded in Cassandra-specific logic, we provide a prompt for use when constructing questions for the agent:
```json
You are an Apache Cassandra expert query analysis bot with the following features
and rules:
- You will take a question from the end user about finding specific
data in the database.
- You will examine the schema of the database and create a query path.
- You will provide the user with the correct query to find the data they are looking
for, showing the steps provided by the query path.
- You will use best practices for querying Apache Cassandra using partition keys
and clustering columns.
- Avoid using ALLOW FILTERING in the query.
- The goal is to find a query path, so it may take querying other tables to get
to the final answer.
The following is an example of a query path in JSON format:
{
    "query_paths": [
        {
            "description": "Direct query to users table using email",
            "steps": [
                {
                    "table": "user_credentials",
                    "query": "SELECT userid FROM user_credentials WHERE email = 'example@example.com';"
                },
                {
                    "table": "users",
                    "query": "SELECT * FROM users WHERE userid = ?;"
                }
            ]
        }
    ]
}
```
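One way to apply this prompt (a sketch, not the only option) is to prepend it to each question sent to the agent, or to pass it as the agent's system prompt. The `QUERY_PATH_PROMPT` name below is hypothetical, and `agent` refers to the OpenAIAgent constructed later in this notebook.
``` python
# Hedged sketch: apply the query-analysis prompt by prepending it to the
# user's question. QUERY_PATH_PROMPT is a hypothetical constant holding the
# full prompt text shown above; `agent` is the OpenAIAgent built later on.
QUERY_PATH_PROMPT = """You are an Apache Cassandra expert query analysis bot ...
(full prompt text from above)
"""

question = "What videos did the user with email 'example@example.com' upload?"
response = agent.chat(f"{QUERY_PATH_PROMPT}\n\nQuestion: {question}")
print(response)
```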
## Tools Provided
### `cassandra_db_schema`
Gathers all schema information for the connected database or a specific schema. Critical for the agent when determining actions.
### `cassandra_db_select_table_data`
Selects data from a specific keyspace and table. The agent can pass parameters for a predicate and a limit on the number of returned records.
### `cassandra_db_query`
Experimental alternative to `cassandra_db_select_table_data` that takes a query string fully formed by the agent instead of individual parameters. *Warning*: This can lead to unusual queries that may not be as performant (or even work). This may be removed in future releases. If it does something cool, we want to know about that too. You never know!
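For a quick sanity check, the tool spec methods can also be called directly, outside of any agent. The sketch below assumes `cassio` has already connected to a cluster containing the `llamaindex_agent_test` keyspace created later in this notebook.
``` python
# Sketch: exercising the tool spec methods directly, without an agent.
# Assumes cassio.init(...) has already connected to a cluster containing
# the llamaindex_agent_test keyspace created later in this notebook.
from llama_index.tools.cassandra.base import CassandraDatabaseToolSpec
from llama_index.tools.cassandra.cassandra_database_wrapper import CassandraDatabase

spec = CassandraDatabaseToolSpec(db=CassandraDatabase())

# Schema introspection for one keyspace
print(spec.cassandra_db_schema("llamaindex_agent_test")[0].text)

# Parameterized SELECT: keyspace, table, predicate (on the primary key), limit
rows = spec.cassandra_db_select_table_data(
    keyspace="llamaindex_agent_test",
    table="user_credentials",
    predicate="user_email = 'patrick@datastax.com'",
    limit=1,
)
print(rows[0].text)
```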
%% Cell type:markdown id: tags:
## Environment Setup
Install the following Python modules:
```bash
pip install ipykernel python-dotenv cassio llama-index llama-index-agent-openai llama-index-llms-openai llama-index-tools-cassandra
```
### .env file
Connection is via `cassio` using the `auto=True` parameter, and the notebook uses OpenAI. You should create a `.env` file accordingly.
For Cassandra, set:
```bash
CASSANDRA_CONTACT_POINTS
CASSANDRA_USERNAME
CASSANDRA_PASSWORD
CASSANDRA_KEYSPACE
```
For Astra, set:
```bash
ASTRA_DB_APPLICATION_TOKEN
ASTRA_DB_DATABASE_ID
ASTRA_DB_KEYSPACE
```
For example:
```bash
# Connection to Astra:
ASTRA_DB_DATABASE_ID=a1b2c3d4-...
ASTRA_DB_APPLICATION_TOKEN=AstraCS:...
ASTRA_DB_KEYSPACE=notebooks
# Also set
OPENAI_API_KEY=sk-....
```
(You may also modify the code below to connect directly with `cassio`.)
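For example, a minimal sketch of an explicit connection is shown below. The keyword arguments are assumptions based on `cassio`'s documented parameters; adjust them for your deployment.
``` python
import os

import cassio

# Sketch: explicit cassio initialization instead of auto=True.
# For Astra, cassio accepts a token and database ID (assumed keyword names;
# check the cassio documentation for your version):
cassio.init(
    token=os.environ["ASTRA_DB_APPLICATION_TOKEN"],
    database_id=os.environ["ASTRA_DB_DATABASE_ID"],
    keyspace=os.environ.get("ASTRA_DB_KEYSPACE"),
)

# For a self-managed Cassandra cluster, contact points and credentials can
# be passed instead (again, assumed keyword names):
# cassio.init(
#     contact_points=os.environ["CASSANDRA_CONTACT_POINTS"].split(","),
#     username=os.environ["CASSANDRA_USERNAME"],
#     password=os.environ["CASSANDRA_PASSWORD"],
#     keyspace=os.environ["CASSANDRA_KEYSPACE"],
# )
```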
%% Cell type:code id: tags:
``` python
from dotenv import load_dotenv
load_dotenv(override=True)
```
%% Cell type:code id: tags:
``` python
# Import necessary libraries
import os
import cassio
from llama_index.tools.cassandra.base import CassandraDatabaseToolSpec
from llama_index.tools.cassandra.cassandra_database_wrapper import (
    CassandraDatabase,
)
from llama_index.agent.openai import OpenAIAgent
from llama_index.llms.openai import OpenAI
```
%% Cell type:markdown id: tags:
## Connect to a Cassandra Database
%% Cell type:code id: tags:
``` python
cassio.init(auto=True)
session = cassio.config.resolve_session()
if not session:
    raise Exception(
        "Check environment configuration or manually configure cassio connection parameters"
    )
```
%% Cell type:code id: tags:
``` python
# Test data prep
session = cassio.config.resolve_session()

session.execute("""DROP KEYSPACE IF EXISTS llamaindex_agent_test; """)

session.execute(
    """
    CREATE KEYSPACE if not exists llamaindex_agent_test
    WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1};
    """
)

session.execute(
    """
    CREATE TABLE IF NOT EXISTS llamaindex_agent_test.user_credentials (
        user_email text PRIMARY KEY,
        user_id UUID,
        password TEXT
    );
    """
)

session.execute(
    """
    CREATE TABLE IF NOT EXISTS llamaindex_agent_test.users (
        id UUID PRIMARY KEY,
        name TEXT,
        email TEXT
    );"""
)

session.execute(
    """
    CREATE TABLE IF NOT EXISTS llamaindex_agent_test.user_videos (
        user_id UUID,
        video_id UUID,
        title TEXT,
        description TEXT,
        PRIMARY KEY (user_id, video_id)
    );
    """
)

user_id = "522b1fe2-2e36-4cef-a667-cd4237d08b89"
video_id = "27066014-bad7-9f58-5a30-f63fe03718f6"

session.execute(
    f"""
    INSERT INTO llamaindex_agent_test.user_credentials (user_id, user_email)
    VALUES ({user_id}, 'patrick@datastax.com');
    """
)

session.execute(
    f"""
    INSERT INTO llamaindex_agent_test.users (id, name, email)
    VALUES ({user_id}, 'Patrick McFadin', 'patrick@datastax.com');
    """
)

session.execute(
    f"""
    INSERT INTO llamaindex_agent_test.user_videos (user_id, video_id, title)
    VALUES ({user_id}, {video_id}, 'Use Langflow to Build an LLM Application in 5 Minutes');
    """
)

session.set_keyspace("llamaindex_agent_test")
```
%% Cell type:code id: tags:
``` python
# Create a CassandraDatabaseToolSpec object
db = CassandraDatabase()
spec = CassandraDatabaseToolSpec(db=db)
tools = spec.to_tool_list()
for tool in tools:
    print(tool.metadata.name)
    print(tool.metadata.description)
    print(tool.metadata.fn_schema)
```
%% Output
cassandra_db_schema
cassandra_db_schema(keyspace: str) -> List[llama_index.core.schema.Document]
Input to this tool is a keyspace name, output is a table description
of Apache Cassandra tables.
If the query is not correct, an error message will be returned.
If an error is returned, report back to the user that the keyspace
doesn't exist and stop.
Args:
keyspace (str): The name of the keyspace for which to return the schema.
Returns:
List[Document]: A list of Document objects, each containing a table description.
<class 'pydantic.main.cassandra_db_schema'>
cassandra_db_select_table_data
cassandra_db_select_table_data(keyspace: str, table: str, predicate: str, limit: int) -> List[llama_index.core.schema.Document]
Tool for getting data from a table in an Apache Cassandra database.
Use the WHERE clause to specify the predicate for the query that uses the
primary key. A blank predicate will return all rows. Avoid this if possible.
Use the limit to specify the number of rows to return. A blank limit will
return all rows.
Args:
keyspace (str): The name of the keyspace containing the table.
table (str): The name of the table for which to return data.
predicate (str): The predicate for the query that uses the primary key.
limit (int): The maximum number of rows to return.
Returns:
List[Document]: A list of Document objects, each containing a row of data.
<class 'pydantic.main.cassandra_db_select_table_data'>
%% Cell type:code id: tags:
``` python
# Choose the LLM that will drive the agent
# Only certain models support this
llm = OpenAI(model="gpt-4-1106-preview")
# Create the Agent with our tools. Verbose will echo the agent's actions
agent = OpenAIAgent.from_tools(tools, llm=llm, verbose=True)
```
%% Cell type:markdown id: tags:
### Invoking the agent with tools
We've created an agent that uses an LLM for reasoning and communication, with a tool list for actions. Now we can simply ask questions of the agent and watch it use the tools we've given it.
%% Cell type:code id: tags:
``` python
# Ask our new agent a series of questions. Watch how the agent uses tools to get the answers.
agent.chat("What tables are in the keyspace llamaindex_agent_test?")
agent.chat("What is the userid for patrick@datastax.com ?")
agent.chat("What videos did user patrick@datastax.com upload?")
```
%% Output
Added user message to memory: What tables are in the keyspace llamaindex_agent_test?
=== Calling Function ===
Calling function: cassandra_db_schema with args: {"keyspace":"llamaindex_agent_test"}
Got output: [Document(id_='4b6011e6-62e6-4db2-9198-046534b7c8dd', embedding=None, metadata={}, excluded_embed_metadata_keys=[], excluded_llm_metadata_keys=[], relationships={}, text='Table Name: user_credentials\n- Keyspace: llamaindex_agent_test\n- Columns\n - password (text)\n - user_email (text)\n - user_id (uuid)\n- Partition Keys: (user_email)\n- Clustering Keys: \n\nTable Name: user_videos\n- Keyspace: llamaindex_agent_test\n- Columns\n - description (text)\n - title (text)\n - user_id (uuid)\n - video_id (uuid)\n- Partition Keys: (user_id)\n- Clustering Keys: (video_id asc)\n\n\nTable Name: users\n- Keyspace: llamaindex_agent_test\n- Columns\n - email (text)\n - id (uuid)\n - name (text)\n- Partition Keys: (id)\n- Clustering Keys: \n\n', start_char_idx=None, end_char_idx=None, text_template='{metadata_str}\n\n{content}', metadata_template='{key}: {value}', metadata_seperator='\n')]
========================
Added user message to memory: What is the userid for patrick@datastax.com ?
=== Calling Function ===
Calling function: cassandra_db_select_table_data with args: {"keyspace":"llamaindex_agent_test","table":"user_credentials","predicate":"user_email = 'patrick@datastax.com'","limit":1}
Got output: [Document(id_='e5620177-c735-46f8-a09a-a0e062efcdec', embedding=None, metadata={}, excluded_embed_metadata_keys=[], excluded_llm_metadata_keys=[], relationships={}, text="Row(user_email='patrick@datastax.com', password=None, user_id=UUID('522b1fe2-2e36-4cef-a667-cd4237d08b89'))", start_char_idx=None, end_char_idx=None, text_template='{metadata_str}\n\n{content}', metadata_template='{key}: {value}', metadata_seperator='\n')]
========================
Added user message to memory: What videos did user patrick@datastax.com upload?
=== Calling Function ===
Calling function: cassandra_db_select_table_data with args: {"keyspace":"llamaindex_agent_test","table":"user_videos","predicate":"user_id = 522b1fe2-2e36-4cef-a667-cd4237d08b89","limit":10}
Got output: [Document(id_='e3ecfba1-e8e1-4ce3-b321-3f51e12077a1', embedding=None, metadata={}, excluded_embed_metadata_keys=[], excluded_llm_metadata_keys=[], relationships={}, text="Row(user_id=UUID('522b1fe2-2e36-4cef-a667-cd4237d08b89'), video_id=UUID('27066014-bad7-9f58-5a30-f63fe03718f6'), description=None, title='Use Langflow to Build an LLM Application in 5 Minutes')", start_char_idx=None, end_char_idx=None, text_template='{metadata_str}\n\n{content}', metadata_template='{key}: {value}', metadata_seperator='\n')]
========================
AgentChatResponse(response='The user `patrick@datastax.com` uploaded the following video in the `llamaindex_agent_test` keyspace:\n\n- Title: "Use Langflow to Build an LLM Application in 5 Minutes"\n- Video ID: `27066014-bad7-9f58-5a30-f63fe03718f6`\n- Description: Not provided', sources=[ToolOutput(content='[Document(id_=\'e3ecfba1-e8e1-4ce3-b321-3f51e12077a1\', embedding=None, metadata={}, excluded_embed_metadata_keys=[], excluded_llm_metadata_keys=[], relationships={}, text="Row(user_id=UUID(\'522b1fe2-2e36-4cef-a667-cd4237d08b89\'), video_id=UUID(\'27066014-bad7-9f58-5a30-f63fe03718f6\'), description=None, title=\'Use Langflow to Build an LLM Application in 5 Minutes\')", start_char_idx=None, end_char_idx=None, text_template=\'{metadata_str}\\n\\n{content}\', metadata_template=\'{key}: {value}\', metadata_seperator=\'\\n\')]', tool_name='cassandra_db_select_table_data', raw_input={'args': (), 'kwargs': {'keyspace': 'llamaindex_agent_test', 'table': 'user_videos', 'predicate': 'user_id = 522b1fe2-2e36-4cef-a667-cd4237d08b89', 'limit': 10}}, raw_output=[Document(id_='e3ecfba1-e8e1-4ce3-b321-3f51e12077a1', embedding=None, metadata={}, excluded_embed_metadata_keys=[], excluded_llm_metadata_keys=[], relationships={}, text="Row(user_id=UUID('522b1fe2-2e36-4cef-a667-cd4237d08b89'), video_id=UUID('27066014-bad7-9f58-5a30-f63fe03718f6'), description=None, title='Use Langflow to Build an LLM Application in 5 Minutes')", start_char_idx=None, end_char_idx=None, text_template='{metadata_str}\n\n{content}', metadata_template='{key}: {value}', metadata_seperator='\n')], is_error=False)], source_nodes=[], is_dummy_stream=False)
llama_index/_static
.DS_Store
# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
*$py.class
# C extensions
*.so
# Distribution / packaging
.Python
bin/
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
etc/
include/
lib/
lib64/
parts/
sdist/
share/
var/
wheels/
pip-wheel-metadata/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST
# PyInstaller
# Usually these files are written by a python script from a template
# before PyInstaller builds the exe, so as to inject date/other infos into it.
*.manifest
*.spec
# Installer logs
pip-log.txt
pip-delete-this-directory.txt
# Unit test / coverage reports
htmlcov/
.tox/
.nox/
.coverage
.coverage.*
.cache
nosetests.xml
coverage.xml
*.cover
*.py,cover
.hypothesis/
.pytest_cache/
.ruff_cache
# Translations
*.mo
*.pot
# Django stuff:
*.log
local_settings.py
db.sqlite3
db.sqlite3-journal
# Flask stuff:
instance/
.webassets-cache
# Scrapy stuff:
.scrapy
# Sphinx documentation
docs/_build/
# PyBuilder
target/
# Jupyter Notebook
.ipynb_checkpoints
notebooks/
# IPython
profile_default/
ipython_config.py
# pyenv
.python-version
# pipenv
# According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
# However, in case of collaboration, if having platform-specific dependencies or dependencies
# having no cross-platform support, pipenv may install dependencies that don't work, or not
# install all needed dependencies.
#Pipfile.lock
# PEP 582; used by e.g. github.com/David-OConnor/pyflow
__pypackages__/
# Celery stuff
celerybeat-schedule
celerybeat.pid
# SageMath parsed files
*.sage.py
# Environments
.env
.venv
env/
venv/
ENV/
env.bak/
venv.bak/
pyvenv.cfg
# Spyder project settings
.spyderproject
.spyproject
# Rope project settings
.ropeproject
# mkdocs documentation
/site
# mypy
.mypy_cache/
.dmypy.json
dmypy.json
# Pyre type checker
.pyre/
# Jetbrains
.idea
modules/
*.swp
# VsCode
.vscode
# pipenv
Pipfile
Pipfile.lock
# pyright
pyrightconfig.json
poetry_requirements(
    name="poetry",
)
GIT_ROOT ?= $(shell git rev-parse --show-toplevel)

help:	## Show all Makefile targets.
	@grep -E '^[a-zA-Z_-]+:.*?## .*$$' $(MAKEFILE_LIST) | awk 'BEGIN {FS = ":.*?## "}; {printf "\033[33m%-30s\033[0m %s\n", $$1, $$2}'

format:	## Run code autoformatters (black).
	pre-commit install
	git ls-files | xargs pre-commit run black --files

lint:	## Run linters: pre-commit (black, ruff, codespell) and mypy
	pre-commit install && git ls-files | xargs pre-commit run --show-diff-on-failure --files

test:	## Run tests via pytest.
	pytest tests

watch-docs:	## Build and watch documentation.
	sphinx-autobuild docs/ docs/_build/html --open-browser --watch $(GIT_ROOT)/llama_index/
# Cassandra Database Tools
## Overview
The Cassandra Database Tools project is designed to help AI engineers efficiently integrate Large Language Models (LLMs) with Apache Cassandra® data. It facilitates optimized and safe interactions with Cassandra databases, supporting various deployments like Apache Cassandra®, DataStax Enterprise™, and DataStax Astra™.
## Key Features
- **Fast Data Access:** Optimized queries ensure most operations complete in milliseconds.
- **Schema Introspection:** Enhances the reasoning capabilities of LLMs by providing detailed schema information.
- **Compatibility:** Supports various Cassandra deployments, ensuring wide applicability.
- **Safety Measures:** Limits operations to SELECT queries and schema introspection to prioritize data integrity.
## Installation
Ensure your system has Python installed and proceed with the following installations via pip:
```bash
pip install python-dotenv cassio llama-index-tools-cassandra
```
Create a `.env` file for environmental variables related to Cassandra and Astra configurations, following the example structure provided in the notebook.
## Environment Setup
- For Cassandra: Configure `CASSANDRA_CONTACT_POINTS`, `CASSANDRA_USERNAME`, `CASSANDRA_PASSWORD`, and `CASSANDRA_KEYSPACE`.
- For DataStax Astra: Set `ASTRA_DB_APPLICATION_TOKEN`, `ASTRA_DB_DATABASE_ID`, and `ASTRA_DB_KEYSPACE`.
## How It Works
The toolkit leverages the Cassandra Query Language (CQL) and integrates with LLMs to determine an efficient query path for the user's request, ensuring querying best practices are followed. With function calling, the LLM's decision making can invoke the tools instead of designing custom queries. The result is faster, more efficient access to Cassandra data for agents.
## Tools Included
- **`cassandra_db_schema`**: Fetches schema information, essential for the agent’s operation.
- **`cassandra_db_select_table_data`**: Allows selection of data from a specific keyspace and table.
- **`cassandra_db_query`**: An experimental tool that accepts fully formed query strings from the agent.
## Example Usage
Initialize the CassandraDatabase and set up the agent with the tools provided. Query the database by interacting with the agent as shown in the example [notebook](https://docs.llamaindex.ai/en/latest/examples/tools/cassandra/).
python_sources()
from llama_index.tools.cassandra.base import CassandraDatabaseToolSpec
__all__ = ["CassandraDatabaseToolSpec"]
"""Tools for interacting with an Apache Cassandra database."""
from typing import List
from llama_index.core.bridge.pydantic import Field
from llama_index.core.schema import Document
from llama_index.core.tools.tool_spec.base import BaseToolSpec
from llama_index.tools.cassandra.cassandra_database_wrapper import (
    CassandraDatabase,
)


class CassandraDatabaseToolSpec(BaseToolSpec):
    """Base tool for interacting with an Apache Cassandra database."""

    db: CassandraDatabase = Field(exclude=True)

    spec_functions = [
        "cassandra_db_query",
        "cassandra_db_schema",
        "cassandra_db_select_table_data",
    ]

    def __init__(self, db: CassandraDatabase) -> None:
        """DB session in context."""
        self.db = db

    def cassandra_db_query(self, query: str) -> List[Document]:
        """Execute a CQL query and return the results as a list of Documents.

        Args:
            query (str): A CQL query to execute.

        Returns:
            List[Document]: A list of Document objects, each containing data from a row.
        """
        documents = []
        result = self.db.run_no_throw(query, fetch="Cursor")
        for row in result:
            doc_str = ", ".join([str(value) for value in row])
            documents.append(Document(text=doc_str))
        return documents

    def cassandra_db_schema(self, keyspace: str) -> List[Document]:
        """Input to this tool is a keyspace name, output is a table description
        of Apache Cassandra tables.
        If the query is not correct, an error message will be returned.
        If an error is returned, report back to the user that the keyspace
        doesn't exist and stop.

        Args:
            keyspace (str): The name of the keyspace for which to return the schema.

        Returns:
            List[Document]: A list of Document objects, each containing a table description.
        """
        return [Document(text=self.db.get_keyspace_tables_str_no_throw(keyspace))]

    def cassandra_db_select_table_data(
        self, keyspace: str, table: str, predicate: str, limit: int
    ) -> List[Document]:
        """Tool for getting data from a table in an Apache Cassandra database.
        Use the WHERE clause to specify the predicate for the query that uses the
        primary key. A blank predicate will return all rows. Avoid this if possible.
        Use the limit to specify the number of rows to return. A blank limit will
        return all rows.

        Args:
            keyspace (str): The name of the keyspace containing the table.
            table (str): The name of the table for which to return data.
            predicate (str): The predicate for the query that uses the primary key.
            limit (int): The maximum number of rows to return.

        Returns:
            List[Document]: A list of Document objects, each containing a row of data.
        """
        return [
            Document(
                text=self.db.get_table_data_no_throw(keyspace, table, predicate, limit)
            )
        ]
[build-system]
build-backend = "poetry.core.masonry.api"
requires = ["poetry-core"]
[tool.codespell]
check-filenames = true
check-hidden = true
skip = "*.csv,*.html,*.json,*.jsonl,*.pdf,*.txt,*.ipynb"
[tool.llamahub]
contains_example = false
import_path = "llama_index.tools.cassandra"
[tool.llamahub.class_authors]
CassandraDatabaseToolSpec = "pmcfadin"
[tool.mypy]
disallow_untyped_defs = true
exclude = ["_static", "build", "examples", "notebooks", "venv"]
ignore_missing_imports = true
python_version = "3.8"
[tool.poetry]
authors = ["Patrick McFadin <patrick@datastax.com>"]
description = "llama-index tools Apache Cassandra® integration"
license = "MIT"
name = "llama-index-tools-cassandra"
packages = [{include = "llama_index/"}]
readme = "README.md"
version = "0.1.0"
[tool.poetry.dependencies]
python = ">=3.8.1,<4.0"
llama-index-core = "^0.10.0"
cassio = "^0.1.7"
[tool.poetry.group.dev.dependencies]
black = {extras = ["jupyter"], version = "<=23.9.1,>=23.7.0"}
codespell = {extras = ["toml"], version = ">=v2.2.6"}
ipython = "8.10.0"
jupyter = "^1.0.0"
mypy = "0.991"
pre-commit = "3.2.0"
pylint = "2.15.10"
pytest = "7.2.1"
pytest-mock = "3.11.1"
ruff = "0.0.292"
tree-sitter-languages = "^1.8.0"
types-Deprecated = ">=0.1.0"
types-PyYAML = "^6.0.12.12"
types-protobuf = "^4.24.0.4"
types-redis = "4.5.5.0"
types-requests = "2.28.11.8" # TODO: unpin when mypy>0.991
types-setuptools = "67.1.0.0"
python_tests()
from llama_index.core.tools.tool_spec.base import BaseToolSpec
from llama_index.tools.cassandra.base import CassandraDatabaseToolSpec
def test_class() -> None:
    names_of_base_classes = [b.__name__ for b in CassandraDatabaseToolSpec.__mro__]
    assert BaseToolSpec.__name__ in names_of_base_classes