Tutorials

Whether you’re a beginner or an experienced user, these tutorials walk you through Haystack’s features and show you how to implement them.

22 tutorials & walkthroughs for all levels

Full Walkthrough

Evaluation

A guided walkthrough to learn everything about evaluation

Creating Your First QA Pipeline with Retrieval-Augmentation

Build your first generative QA pipeline with OpenAI GPT models

Build a Tool-Calling Agent

Learn how to create an Agent that can use a web search tool to answer questions

Conversational RAG Agent using InMemoryChatMessageStore

Learn how to use conversational history for RAG to enable multi-turn conversations grounded in documents

Human-in-the-Loop with Haystack Agents

Learn how to use confirmation strategies to get user input before tool execution for safer, more controllable AI systems

Creating Vision+Text RAG Pipelines

Build a multimodal RAG pipeline that can answer questions grounded in both images and text.

Creating a Multi-Agent System with Haystack

Use agents specialized in specific tasks to build more complex, modular agent workflows

Building an Agentic RAG with Fallback to Websearch

Learn how to direct the query to a web-based RAG route when necessary

Filtering Documents with Metadata

Learn how to filter down to specific documents at retrieval time using metadata

Preprocessing Different File Types

Learn how to build an indexing pipeline that will preprocess files based on their file type

Creating Custom SuperComponents

Learn how to use the @super_component decorator to create custom SuperComponents with input and output mappings

Embedding Metadata for Improved Retrieval

Learn how to embed metadata while indexing to improve the quality of retrieval results

Serializing LLM Pipelines

Learn how to serialize and deserialize your pipelines between YAML and Python

Compress the KV Cache with TurboQuant and Haystack

Use TurboQuant KV cache compression to run large LLMs on consumer GPUs with significant memory reduction

Build an Extractive QA Pipeline

Learn how to build a Haystack pipeline that uses an extractive model to show where the answer to your query appears in the source text

Creating a Hybrid Retrieval Pipeline

Learn how to combine keyword-based and dense retrieval to improve search results

Generating Structured Output with OpenAI

Learn how to generate structured output with OpenAI models using Pydantic models or JSON schema

Classifying Documents & Queries by Language

Learn how to classify documents and route queries by language, for both indexing and RAG pipelines

Evaluating RAG Pipelines

Learn how to evaluate your RAG pipelines using statistical and model-based evaluation metrics

Building a Chat Agent with Function Calling

Learn how to build chat applications that have agent-like behavior with OpenAI function calling

Query Classification with TransformersTextRouter and TransformersZeroShotTextRouter

Learn how to route user questions and other text inputs with classification models

Retrieving a Context Window Around a Sentence

Learn how to use the SentenceWindowRetriever to retrieve a context window
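
As a rough illustration of the idea behind that last tutorial, the sketch below returns a matched sentence together with its neighbours. It is plain Python and does not use the Haystack API: the `context_window` function, its parameters, and the sample sentences are all hypothetical, and the actual SentenceWindowRetriever component may work differently.

```python
def context_window(sentences, hit_index, window_size=1):
    """Return the matched sentence plus up to `window_size` neighbours on each side.

    Hypothetical helper for illustration only; not part of Haystack.
    """
    if not 0 <= hit_index < len(sentences):
        raise IndexError("hit_index is outside the sentence list")
    # Clamp the window so it never runs past either end of the list.
    start = max(0, hit_index - window_size)
    end = min(len(sentences), hit_index + window_size + 1)
    return " ".join(sentences[start:end])


# Sample data (made up for this sketch).
sentences = [
    "First sentence.",
    "Second sentence.",
    "Third sentence.",
    "Fourth sentence.",
]

# Suppose a retriever matched the sentence at index 2.
print(context_window(sentences, hit_index=2, window_size=1))
```

Widening `window_size` gives a downstream model more surrounding context at the cost of a longer prompt.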