What is Definable?
Definable is a Python framework for building AI-powered applications. It provides a unified interface across multiple LLM providers, a powerful agent runtime with tool execution, a complete RAG pipeline for knowledge retrieval, and first-class MCP (Model Context Protocol) support.

Models
Unified interface for OpenAI, DeepSeek, Moonshot, xAI, and any OpenAI-compatible provider. Supports streaming, structured output, vision, and audio.
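To illustrate what a provider-agnostic model layer looks like, here is a minimal stdlib-only sketch. The class and method names (`ChatModel`, `complete`, `ask`) are assumptions for illustration, not Definable's actual API.

```python
from dataclasses import dataclass
from typing import Protocol

class ChatModel(Protocol):
    """Anything with a complete() method can serve as a model."""
    def complete(self, prompt: str) -> str: ...

@dataclass
class OpenAICompatibleModel:
    """Stand-in for any provider exposing an OpenAI-compatible endpoint."""
    base_url: str
    model: str

    def complete(self, prompt: str) -> str:
        # A real implementation would POST to {base_url}/chat/completions.
        return f"[{self.model}] reply to: {prompt}"

def ask(model: ChatModel, prompt: str) -> str:
    # Caller code stays identical regardless of provider.
    return model.complete(prompt)

# Switching providers is a one-line change to the constructor arguments:
gpt = OpenAICompatibleModel("https://api.openai.com/v1", "gpt-4o-mini")
deepseek = OpenAICompatibleModel("https://api.deepseek.com/v1", "deepseek-chat")
```

Because both providers satisfy the same protocol, agent code written against `ChatModel` never needs to change when the provider does.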
Agents
Autonomous agents with tool calling, middleware, tracing, multi-turn conversations, and streaming. Built for production with retries, compression, and testing utilities.
Tools
Define tools with a simple decorator. Supports type-safe parameters, pre/post hooks, result caching, dependency injection, and async execution.
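The general shape of decorator-based tool registration can be sketched as follows. The `tool` decorator and `TOOL_REGISTRY` here are hypothetical names used only to show the pattern; Definable's actual decorator and hooks may differ.

```python
import inspect

# Illustrative registry mapping tool names to callables plus a
# parameter spec derived from type hints.
TOOL_REGISTRY: dict[str, dict] = {}

def tool(fn):
    """Register a function as a tool, deriving parameters from its signature."""
    sig = inspect.signature(fn)
    TOOL_REGISTRY[fn.__name__] = {
        "fn": fn,
        "params": {name: p.annotation.__name__ for name, p in sig.parameters.items()},
        "doc": fn.__doc__,
    }
    return fn

@tool
def get_weather(city: str, unit: str) -> str:
    """Return the current weather for a city."""
    return f"Sunny in {city}, 20 degrees {unit}"

# An agent runtime would look tools up by name and invoke them:
entry = TOOL_REGISTRY["get_weather"]
result = entry["fn"](city="Lisbon", unit="celsius")
```

The derived parameter spec is what would be handed to the LLM so it knows each tool's name, arguments, and purpose.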
Knowledge
Full RAG pipeline with document readers, text chunkers, embedding providers, rerankers, and vector databases. Plugs into agents automatically or on demand.
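One stage of that pipeline, chunking, can be sketched with a fixed-size overlapping window. This is a generic illustration of the technique, not Definable's chunker implementation.

```python
def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into overlapping windows so context isn't lost at boundaries."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap  # step forward, keeping `overlap` chars of context
    return chunks

doc = "word " * 100  # stand-in for a loaded document
chunks = chunk_text(doc, size=120, overlap=20)
```

Each chunk would then be embedded and stored in a vector database, with the overlap ensuring that a sentence split across a boundary still appears whole in at least one chunk.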
MCP
Connect to MCP servers over stdio, SSE, or HTTP transports. Discover and call tools, read resources, and use prompts from any MCP-compatible server.
Toolkits
Bundle related tools into reusable toolkits. Built-in toolkits for knowledge search and MCP server integration.
Interfaces
Connect agents to messaging platforms like Telegram, Discord, and Signal. Automatic session management, hooks, identity resolution, and multi-interface serving.
Memory
Persistent multi-tier memory with automatic recall, distillation, and cross-session context. Supports SQLite, PostgreSQL, Redis, Qdrant, Chroma, Pinecone, and MongoDB backends.
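A toy sketch of persistent, cross-session recall using SQLite (one of the supported backends). The table layout and the `remember`/`recall` functions are assumptions for illustration only; they are not Definable's memory API.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # use a file path for true persistence
conn.execute("CREATE TABLE memories (session TEXT, content TEXT)")

def remember(session: str, content: str) -> None:
    conn.execute("INSERT INTO memories VALUES (?, ?)", (session, content))

def recall(keyword: str) -> list[str]:
    """Cross-session recall: search all sessions, not just the current one."""
    rows = conn.execute(
        "SELECT content FROM memories WHERE content LIKE ?", (f"%{keyword}%",)
    )
    return [content for (content,) in rows]

remember("session-1", "user prefers metric units")
remember("session-2", "user is planning a trip to Kyoto")
matches = recall("Kyoto")
```

A real memory tier would use embeddings rather than keyword matching for recall, but the storage-and-retrieve lifecycle is the same.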
Readers
Extract text from files (PDF, DOCX, XLSX, audio) and inject it into the agent's context automatically before LLM processing.
Architecture Overview
Definable is organized into composable layers that work together.

Key Features
- Multi-provider: Switch between OpenAI, DeepSeek, Moonshot, xAI, or any OpenAI-compatible API with one line.
- Sync and async: Every API has both synchronous and asynchronous variants.
- Streaming: Stream responses token-by-token with rich event types for tool calls, reasoning, and content.
- Structured output: Return Pydantic models directly from LLM calls with JSON Schema validation.
- Middleware: Compose logging, retry, metrics, and custom middleware around agent execution.
- Full RAG pipeline: Ingest documents from text, PDF, or URLs; chunk, embed, store, and retrieve with reranking.
- MCP support: Connect to any MCP server and use its tools, resources, and prompts inside your agents.
- Testing: Mock models, test agents without API calls, and assert on tool execution.
- Cost tracking: Automatic token counting and cost calculation across providers.
- Interfaces: Deploy agents to Telegram, Discord, and Signal with sessions, hooks, identity resolution, and multi-interface serving.
- Memory: Persistent multi-tier cognitive memory with automatic recall, distillation, and multiple storage backends.
- File readers: Extract text from PDF, DOCX, XLSX, and audio files attached to agent messages.
- Type-safe: Built on Pydantic with full type annotations and mypy support.
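The middleware bullet above can be sketched as ordinary function composition. The wrapper names (`retry`, `logging_middleware`) are illustrative, not Definable's middleware API.

```python
import functools

def retry(times: int):
    """Middleware factory: re-invoke the wrapped call on transient errors."""
    def middleware(call):
        @functools.wraps(call)
        def wrapped(prompt: str) -> str:
            last_error = None
            for _ in range(times):
                try:
                    return call(prompt)
                except RuntimeError as err:  # stand-in for a transient provider error
                    last_error = err
            raise last_error
        return wrapped
    return middleware

log: list[str] = []

def logging_middleware(call):
    """Middleware: record each invocation before delegating."""
    @functools.wraps(call)
    def wrapped(prompt: str) -> str:
        log.append(f"calling with: {prompt}")
        return call(prompt)
    return wrapped

attempts = {"n": 0}

def flaky_agent(prompt: str) -> str:
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise RuntimeError("transient failure")
    return f"answer to {prompt}"

# Middleware compose like decorators, outermost first: logging sees one
# call, retry absorbs the two failures underneath it.
agent = logging_middleware(retry(times=3)(flaky_agent))
result = agent("hello")
```

Metrics, tracing, or custom middleware slot into the same chain without the agent function knowing they exist.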