🧩 Integrations

AIMLAPI

Call AIMLAPI's OpenAI-compatible chat models from Haystack pipelines and agents.

Model Provider

Maintained by deepset

AlloyDB

A Document Store for storing and retrieving documents in Google Cloud AlloyDB with pgvector

Document Store

Maintained by deepset

Amazon Bedrock

Use models from AI21 Labs, Anthropic, Cohere, Meta, and Amazon via Amazon Bedrock with Haystack

Model Provider

Maintained by deepset

Amazon SageMaker

Use models from Hugging Face, Anthropic, AI21 Labs, Cohere, Meta, and Amazon via Amazon SageMaker with Haystack

Model Provider

Maintained by deepset

Amazon Textract

Use Amazon Textract with Haystack to extract text, tables, forms, and answers to queries from documents

Data Ingestion

Maintained by deepset

Anthropic

Use Anthropic models with Haystack

Model Provider

Maintained by deepset

Apify

Extract data from the web and automate web tasks using the Apify-Haystack integration.

Data Ingestion

ArangoDB

Use the ArangoDB database as a Document Store with Haystack

Document Store

Maintained by deepset

ArcadeDB

Use ArcadeDB as a document store with native HNSW vector search for Haystack

Document Store

Arize Phoenix

Trace your Haystack pipelines with Arize Phoenix

Monitoring Tool

Arize AI

Trace and monitor your Haystack pipelines with Arize AI

Monitoring Tool

Asqav

Signed audit trails for Haystack pipelines - tamper-evident governance records for every pipeline run

Monitoring Tool

AssemblyAI

Use AssemblyAI transcription, summarization and speaker diarization models with Haystack

Model Provider

AstraDB

A Document Store for storing and retrieving documents in AstraDB.

Document Store

Maintained by deepset

Azure AI Search

Use Azure AI Search with Haystack

Document Store

Maintained by deepset

Azure CosmosDB

Use Azure CosmosDB with Haystack

Document Store

Maintained by deepset

Azure Document Intelligence

Use Azure Document Intelligence with Haystack

Data Ingestion

Maintained by deepset

Azure Form Recognizer

Convert files to Documents using Azure's Document Intelligence service (Form Recognizer SDK)

Data Ingestion

Maintained by deepset

Azure

Use OpenAI models deployed through Azure services with Haystack

Model Provider

Maintained by deepset

Brave Search

Search the web using the Brave Search API with Haystack

Search & Extraction

Maintained by deepset

Bright Data

Extract data from 45+ websites, get search engine results, and access geo-restricted content using Bright Data's web scraping services.

Data Ingestion

Burr

Build an agent from Haystack components

Framework

Cerebras

Use LLMs served by the Cerebras API

Model Provider

Maintained by deepset

Chainlit

Use Chainlit UI for your Haystack apps through Hayhooks

UI

Maintained by deepset

Chonkie

Fast, lightweight text chunking for Haystack indexing pipelines, powered by Chonkie.

Data Ingestion

Maintained by deepset

Chroma

A Document Store for storing and retrieving documents in Chroma

Document Store

Maintained by deepset

Cognee

Add persistent, knowledge-graph-backed memory to your Haystack agents and pipelines with Cognee

Memory Store

Maintained by deepset

Cohere

Use Cohere models with Haystack

Model Provider

Maintained by deepset

Comet API

Use the Comet API for text generation models.

Model Provider

Maintained by deepset

Context AI

A component to log conversations for analytics by Context.ai

Monitoring Tool

Couchbase

Use the Couchbase database with Haystack

Document Store

Datadog

Monitor and trace your Haystack pipelines with Datadog.

Monitoring Tool

Maintained by deepset

DeepEval

Use the DeepEval evaluation framework to calculate model-based metrics

Evaluation Framework

Maintained by deepset

DeepL

Use DeepL translation services with Haystack

Custom Component

Dewey

Connect Haystack pipelines to Dewey — a managed document intelligence backend that handles PDF conversion, chunking, embedding, and hybrid retrieval behind a single API.

Document Store

Docling Serve

Use Docling Serve to convert PDF, DOCX, HTML, and other document types to Haystack Documents via a remote HTTP server, with no local ML dependencies

Data Ingestion

Maintained by deepset

Docling

Use Docling to locally parse and chunk PDF, DOCX, and other document types in Haystack

Data Ingestion

Maintained by deepset

DuckDuckGo

Use the DuckDuckGo API for web searches

Data Ingestion

E2B

Use E2B cloud sandboxes as tools in a Haystack Agent to run bash commands and read, write, and list files in an isolated Linux environment

Tool Integration

Maintained by deepset

Elasticsearch

Use an Elasticsearch database with Haystack

Document Store

Maintained by deepset

ElevenLabs

ElevenLabs Text-to-Speech components for Haystack.

Model Provider

EmpirioLabs AI

Use open and proprietary models served by EmpirioLabs

Model Provider

Exa

Search the web with Exa's AI-powered search, get content, answers, and conduct deep research

Search & Extraction

FAISS

A Document Store for vector search using FAISS

Document Store

Maintained by deepset

FalkorDB

Use FalkorDB as a document store with native vector search for GraphRAG workloads in Haystack

Document Store

Maintained by deepset

FastEmbed

Use the FastEmbed embedding models

Model Provider

Maintained by deepset

fastRAG

fastRAG is a research framework for efficient and optimized retrieval-augmented generative pipelines

Custom Component

Featherless AI

Get access to thousands of open-source language models hosted by Featherless.ai

Model Provider

Firecrawl

Crawl websites, search the web, and extract LLM-ready content using Firecrawl

Data Ingestion

Maintained by deepset

Flow Judge

Evaluate Haystack pipelines using Flow Judge

Evaluation Framework

FunASR

Transcribe audio files to Documents using FunASR — an open-source, self-hosted speech recognition toolkit supporting 50+ languages.

Data Ingestion

Maintained by deepset

Future AGI

OpenTelemetry tracing and evaluation for Haystack pipelines via traceAI.

Monitoring Tool

GitHub

Interact with GitHub repositories, issues, and pull requests within Haystack

Tool Integration

Maintained by deepset

Google AI

Use Google AI models with Haystack

Model Provider

Maintained by deepset

Google Drive

Search and fetch files from Google Drive via the Drive API.

Data Ingestion

Maintained by deepset

Google Gen AI

Use Google's Gemini models with Haystack via the new Google Gen AI SDK

Model Provider

Maintained by deepset

Google Vertex AI

Use Google Vertex AI models with Haystack

Model Provider

Maintained by deepset

Groq

Use open language models served by Groq

Model Provider

Maintained by deepset

HanLP

Use HanLP for Chinese text processing with Haystack

Preprocessor

Maintained by deepset

Hindsight

Add open-source long-term memory to your Haystack agents with Hindsight's retain, recall, and reflect tools

Memory Store

Hugging Face API

Use models through Hugging Face APIs - Inference Providers, Inference Endpoints, TGI and TEI

Model Provider

Maintained by deepset

Hugging Face Transformers

Run Transformers models locally in your Haystack pipelines

Model Provider

Maintained by deepset

IBM Db2

A Document Store for storing and retrieving documents from IBM Db2

Document Store

INSTRUCTOR Embedders

A component for computing embeddings using INSTRUCTOR embedding models.

Model Provider

InterSystems IRIS

Use the InterSystems IRIS database with Haystack

Document Store

Isaacus

Use the latest foundational legal AI models from Isaacus in Haystack.

Model Provider

Jina AI

Use the latest Jina AI embedding models

Model Provider

Maintained by deepset

Keenable

Web search and page fetch built for AI agents, keyless by default

Data Ingestion

Kreuzberg

Locally convert 91+ document formats into Haystack Documents using Kreuzberg's Rust-core engine

Data Ingestion

Maintained by deepset

LanceDB Haystack

A DocumentStore backed by LanceDB

Document Store

langdetect

Detect the language of documents and route text by language with langdetect

Custom Component

Maintained by deepset

langfuse

Monitor and trace your Haystack requests.

Monitoring Tool

Maintained by deepset

Lara

Translate Haystack documents using Translated's Lara adaptive translation API

Custom Component

Maintained by deepset

LibreOffice File Converter

Convert office documents, spreadsheets, and presentations between formats using LibreOffice in Haystack pipelines.

Data Ingestion

Maintained by deepset

LiteLLM

Use any of 100+ LLM providers with Haystack through LiteLLM

Model Provider

Maintained by deepset

Llama.cpp

Use Llama.cpp models with Haystack.

Model Provider

Llama Stack

Use the Llama Stack generation models.

Model Provider

Maintained by deepset

llamafile

Run LLMs locally with llamafile

Model Provider

Maintained by deepset

LM Format Enforcer

Use the LM Format Enforcer to enforce JSON Schema or regex output from your local models.

Model Provider

MarkItDown

Use Microsoft's MarkItDown to locally convert PDF, DOCX, PPTX, XLSX, HTML, images, and more into Markdown in Haystack

Data Ingestion

Maintained by deepset

Mastodon Fetcher

A custom component to fetch a Mastodon user's latest posts

Data Ingestion

Model Context Protocol - MCP

Haystack Tool Integration with the MCP

Tool Integration

Maintained by deepset

Mem0

Add persistent, user-specific memory to your Haystack agents and pipelines with Mem0

Memory Store

Maintained by deepset

Llama API

Use Llama models with Haystack

Model Provider

Microsoft SharePoint

Search and fetch content from Microsoft SharePoint and OneDrive via the Microsoft Graph API.

Data Ingestion

Maintained by deepset

Milvus

Use the Milvus vector database with Haystack

Document Store

Mirage

Give a Haystack Agent a bash shell over Mirage's unified virtual filesystem, mounting S3, Google Drive, Postgres and 50+ other backends as one filesystem

Tool Integration

Maintained by deepset

Mistral

Use the Mistral API for embedding and text generation models.

Model Provider

Maintained by deepset

mixedbread ai

Use mixedbread's models as well as top open-source models in seconds

Model Provider

MLflow

Trace, evaluate, and monitor your Haystack applications with MLflow.

Monitoring Tool

MongoDB

Use a MongoDB Atlas database with Haystack

Document Store

Maintained by deepset

MonsterAPI

Use open language models served by MonsterAPI

Model Provider

Needle

Use Needle document store and retriever in Haystack.

Document Store

Neo4j

Use the Neo4j database with Haystack

Document Store

Notion Extractor

A component to extract pages from Notion to Haystack Documents. Useful for indexing pipelines.

Data Ingestion

NVIDIA

Use NVIDIA models with Haystack.

Model Provider

Maintained by deepset

OAuth

Resolve OAuth 2.0 access tokens at pipeline runtime to authenticate downstream Haystack components.

Custom Component

Maintained by deepset

Ollama

Use Ollama models with Haystack. Ollama allows you to get up and running with large language models locally.

Model Provider

Maintained by deepset

OPEA

Use the OPEA framework for hardware abstraction and orchestration

Distributed Computing

OpenAI

Use OpenAI models with Haystack

Model Provider

Maintained by deepset

OpenAPI

Connect Haystack pipelines to any REST API described by an OpenAPI specification

Connector

Maintained by deepset

OpenLIT

Monitor, evaluate & improve GenAI and LLM applications

Monitoring Tool

OpenRouter

Use the OpenRouter API for text generation models.

Model Provider

Maintained by deepset

OpenSearch

A Document Store for storing and retrieving documents in OpenSearch

Document Store

Maintained by deepset

OpenStreetMap

Haystack component to fetch geographic data via the freely available OpenStreetMap (OSM) Overpass API.

Data Integration

OpenTelemetry

Trace and monitor your Haystack pipelines with OpenTelemetry.

Monitoring Tool

Maintained by deepset

Open WebUI

Use Open WebUI as a chat frontend for your Haystack apps through Hayhooks

UI

Maintained by deepset

Opik

Trace and evaluate your Haystack pipelines with Opik

Monitoring Tool

Optimum

High-performance inference using Hugging Face Optimum

Model Provider

Maintained by deepset

Oracle

Use Oracle AI Vector Search as a Document Store with Haystack

Document Store

OrcaRouter

Use OrcaRouter's OpenAI-compatible API gateway for chat generation in Haystack.

Model Provider

Maintained by deepset

oxidize-pdf

Convert PDFs into Haystack Documents with a fast Rust engine and element-disjoint RAG chunking; accepts paths and ByteStreams

Data Ingestion

PaddleOCR

Use PaddleOCR’s text-recognition and document-parsing capabilities with Haystack

Data Ingestion

Maintained by deepset

Perplexity

Use the Perplexity Agent API, Embeddings API, and grounded Search API in Haystack pipelines.

Model Provider

Maintained by deepset

pgvector

A Document Store for storing and retrieving documents in pgvector

Document Store

Maintained by deepset

Pinecone

Use a Pinecone database with Haystack

Document Store

Maintained by deepset

PraisonAI

Integrate PraisonAI multi-agent workflows into your Haystack pipelines

Custom Component

Presidio

PII detection and anonymization for Haystack Documents and text strings, powered by Microsoft Presidio.

Custom Component

Maintained by deepset

Prior Labs

Tabular data science via MCP - predict missing values, classify, and run regression on tabular datasets using Prior Labs' foundation model.

Tool Integration

MCP

Pyversity

A Ranker component for result diversification in retrieval pipelines

Ranker

Maintained by deepset

Qdrant

Use the Qdrant vector database with Haystack

Document Store

Maintained by deepset

Ragas

Use the Ragas evaluation framework to calculate model-based metrics

Evaluation Framework

Ray

Run and scale Haystack pipelines with Ray in a distributed manner

Distributed Computing

Respan

Trace, monitor, and route Haystack pipelines with Respan observability, gateway, and prompt management.

Monitoring Tool

Sambanova

Use open language models served by Sambanova

Model Provider

Scavio

Search the web using Scavio, a unified search API for AI agents

Search & Extraction

SearchApi

Use SearchApi for web searches

Data Ingestion

Maintained by deepset

Sentence Transformers

Use Sentence Transformers embedding and ranking models in your Haystack pipelines

Model Provider

Maintained by deepset

SerperDev

Use the Serper.dev API for web searches

Search & Extraction

Maintained by deepset

Serpex

Multi-engine web search for Haystack — access Google, Bing, DuckDuckGo, Brave, Yahoo, and Yandex via Serpex API

Search & Extraction

SingleStore

Use SingleStore with Haystack

Document Store

Snowflake

A Snowflake integration that allows table retrieval from a Snowflake database.

Data Ingestion

Maintained by deepset

spaCy

Annotate named entities in your Haystack pipelines with spaCy models

Custom Component

Maintained by deepset

SQLAlchemy

Query any SQL database from a Haystack pipeline using SQLAlchemy

Data Ingestion

Maintained by deepset

STACKIT

Use the STACKIT API for text generation models.

Model Provider

Maintained by deepset

Supabase

Use Supabase as a Document Store for Haystack — pgvector for embedding search, PGroonga for full-text BM25 search, and Supabase Storage for file downloads

Document Store

Maintained by deepset

Superlinked

Use Superlinked (SIE) embeddings, reranking, and extraction in Haystack pipelines.

Model Provider

Synap

Add persistent, cross-session user memory to your Haystack agents and pipelines with Synap

Memory Store

Tavily

Search the web using Tavily's AI-powered search API, optimized for LLM applications

Search & Extraction

Maintained by deepset

TealTiger

Deterministic governance, cost tracking, and PII detection for Haystack pipelines. No LLM in the governance path.

Custom Component

Thunderbolt

Use Thunderbolt as a cross-platform AI client for your Haystack pipelines through Hayhooks

UI

Apache Tika

Convert files of different types (PDF, DOCX, HTML, and more) to Documents using Apache Tika

Data Ingestion

Maintained by deepset

Titan Takeoff Inference Server

Use Titan Takeoff to run local open-source LLMs with Haystack. Titan Takeoff allows you to run the latest models from Meta, Mistral, and Alphabet directly on your laptop.

Model Provider