Skip to content
Cloudflare Docs

Overview

Available on all plans

AutoRAG allows developers to create fully-managed retrieval-augmented generation (RAG) pipelines powered by Cloudflare allowing developers to integrate context-aware AI into their applications without managing infrastructure.

You can use AutoRAG to build:

  • Chatbots: Improve chatbot accuracy by providing relevant up-to-date information.
  • Enterprise Knowledge Search: Enhance knowledge bases and wikis with intelligent search.

Features

Automated indexing

Automatically and continuously index your data source, keeping your content fresh without manual reprocessing.

Workers Binding

Call your AutoRAG instance for search or AI search directly from a Cloudflare Worker using the native binding integration.

Similarity caching

Cache repeated queries and results to improve latency and reduce compute on repeated requests.


Workers AI

Run machine learning models, powered by serverless GPUs, on Cloudflare’s global network.

AI Gateway

Observe and control your AI applications with caching, rate limiting, request retries, model fallback, and more.

Vectorize

Build full-stack AI applications with Vectorize, Cloudflare’s vector database.

Workers

Build serverless applications and deploy instantly across the globe for exceptional performance, reliability, and scale.

R2

Store large amounts of unstructured data without the costly egress bandwidth fees associated with typical cloud storage services.


More resources

Get started

Build and deploy your first Workers AI application.

Developer Discord

Connect with the Workers community on Discord to ask questions, share what you are building, and discuss the platform with other developers.

@CloudflareDev

Follow @CloudflareDev on Twitter to learn about product announcements, and what is new in Cloudflare Workers.