# Overview

> Guides & tutorials to help you build with Arize AX

## Instrument

Capture traces and spans from your LLM and agent applications.

<CardGroup cols={2}>
  <Card title="OpenInference Best Practices" href="/ax/cookbooks/instrument/openinference-best-practice">Enrich auto-instrumented traces with LLM, tool, agent, chain, and session attributes.</Card>
  <Card title="Tracing Integrations" href="/ax/cookbooks/tracing-integrations">Use Arize integrations to automatically collect LLM traces.</Card>
  <Card title="Tracing & Evaluating a Customer Support Agent" href="/ax/cookbooks/instrument/tracing-and-evaluating-agents">Create and evaluate a customer support agent with Arize AX to improve performance.</Card>
  <Card title="OpenAI Agents Guide" href="/ax/cookbooks/instrument/openai-agents-cookbook">Create and evaluate agents with the OpenAI Agents SDK in Arize AX.</Card>
  <Card title="Tracing a Vercel Eve Agent" href="/ax/cookbooks/instrument/tracing-a-vercel-eve-agent">Scaffold a Vercel Eve agent and add Arize AX observability through OpenTelemetry.</Card>
  <Card title="Dual Tracing into Databricks Unity Catalog and Arize AX" href="/ax/cookbooks/instrument/dual-o-tel-ingest-into-unity-catalog-and-arize-ax">Split-stream OpenTelemetry traces into both Arize AX and Databricks Unity Catalog.</Card>
</CardGroup>

## Observe

Monitor your applications in production and surface high-signal issues.

<CardGroup cols={2}>
  <Card title="Online Evals & Monitoring for Agents in Production" href="/ax/cookbooks/observe/arize-+-mosaic-ai-agent-framework">Run online evals and monitor a tool-calling LangGraph agent in production.</Card>
  <Card title="Designing Realtime Guardrails" href="/ax/cookbooks/observe/designing-realtime-guardrails">Decide what to guard at input vs. output and layer guardrails without blocking real users.</Card>
</CardGroup>

## Evaluate

Build evaluators, align them with human judgment, and measure quality.

<CardGroup cols={2}>
  <Card title="Evaluations Quickstart" href="/ax/cookbooks/evaluate/evaluations-quickstart">Run your first evaluations and measure how your model performs.</Card>
  <Card title="Align LLM Evals with Human Judgment" href="/ax/cookbooks/evaluate/align-llm-evals-with-human-judgment">Iteratively refine a custom LLM-as-a-Judge evaluator against human-annotated ground truth.</Card>
  <Card title="Why Public Benchmarks Lie: Building Your Own Eval Harness" href="/ax/cookbooks/evaluate/model-comparison-for-an-email-text-extraction-service">Build your own eval harness instead of trusting public benchmarks, using an email-extraction service as the worked example.</Card>
  <Card title="Trace-Level Evaluations for a Recommendation Agent" href="/ax/cookbooks/evaluate/trace-level-evaluations-for-a-recommendation-agent">Run trace-level evaluations on individual requests to a recommendation agent.</Card>
  <Card title="Session-Level Evaluations for an AI Tutor" href="/ax/cookbooks/evaluate/session-level-evaluations-for-an-ai-tutor">Run multi-dimensional session-level evaluations on multi-turn AI tutor conversations.</Card>
  <Card title="Evaluating RAG Retrieval Quality and Correctness" href="/ax/cookbooks/evaluate/evaluating-rag">Create and evaluate a RAG application to improve retrieval quality and correctness.</Card>
  <Card title="Retrieval Evaluation" href="/ax/cookbooks/evaluate/retrieval-evaluation">Debug RAG retrieval quality with embeddings and LLM-assisted metrics.</Card>
  <Card title="Evaluating Agentic RAG Using Arize AX and Couchbase" href="/ax/cookbooks/evaluate/evaluating-agentic-rag-using-arize-and-couchbase">Build and evaluate an agentic RAG application on a Couchbase vector store.</Card>
  <Card title="Evaluating a RAG-Powered Chatbot" href="/ax/cookbooks/evaluate/llamaindex-evals">Monitor and debug a LlamaIndex RAG-powered chatbot with traces and spans.</Card>
  <Card title="Evaluate a Math Problem-Solving Agent Using Ragas" href="/ax/cookbooks/evaluate/ragas-agents-cookbook">Create and evaluate a math problem-solving agent using Ragas and Arize AX.</Card>
  <Card title="Pydantic Evals" href="/ax/cookbooks/evaluate/pydantic-evals">Evaluate a question-answering task with Pydantic Evals and log results to Arize AX.</Card>
  <Card title="Tracing and Evaluating Voice Applications" href="/ax/cookbooks/evaluate/tracing-and-evaluating-audio">Trace OpenAI Realtime voice agents and run tone evaluation on captured audio.</Card>
  <Card title="Audio Transcription and Evaluation with Gemini Flash" href="/ax/cookbooks/evaluate/gemini-audio-evals">Transcribe and evaluate audio with Gemini Flash, traced in Arize AX.</Card>
  <Card title="More Guides" href="/ax/cookbooks/evaluate/evaluation">Span-level evaluator examples for hallucination, relevance, toxicity, SQL, tool calling, and more.</Card>
</CardGroup>

## Improve

Run experiments, optimize prompts, and add guardrails.

<CardGroup cols={2}>
  <Card title="Build, Test, and Optimize a Prompt" href="/ax/cookbooks/improve/trip-planner-build-test-optimize">An end-to-end walkthrough of the prompt iteration cycle using a trip-planner use case.</Card>
  <Card title="Prompt Experimentation for Summarization" href="/ax/cookbooks/improve/summarization">Experiment with prompts to optimize a summarization task.</Card>
  <Card title="Text2SQL Application for Database Querying" href="/ax/cookbooks/improve/text2sql">Build and optimize a Text2SQL application for database querying from scratch.</Card>
  <Card title="Improving Structured Output Generation with Prompt Learning" href="/ax/cookbooks/improve/improving-structured-output-generation-with-prompt-learning">Use Prompt Learning to improve accuracy on structured output generation.</Card>
  <Card title="Optimizing Coding Agent Prompts for Planning" href="/ax/cookbooks/improve/optimizing-coding-agent-prompts-for-planning">Optimize coding agent prompts for the planning phase with Prompt Learning.</Card>
  <Card title="Optimizing Coding Agent Prompts for Execution" href="/ax/cookbooks/improve/optimizing-coding-agent-prompts-for-execution">Optimize coding agent prompts for execution and track improvement.</Card>
  <Card title="Optimizing Your Eval Prompts" href="/ax/cookbooks/improve/optimizing-your-eval-prompts">Use Prompt Learning to improve your LLM evaluation prompts.</Card>
  <Card title="Guardrails for Realtime Detection" href="/ax/cookbooks/improve/guardrails">Add realtime guardrails so production LLM apps output safe responses.</Card>
</CardGroup>

## Advanced Workflows

End-to-end guides for complex multi-agent, multi-modal, and security-focused systems.

<CardGroup cols={2}>
  <Card title="Product Recommendation Agent: Google Agent Engine & LangGraph" href="/ax/cookbooks/advanced-workflows/tracing-a-langgraph-application-with-agent-engine-in-vertex-ai">Build and deploy a LangGraph product-recommendation agent on Vertex AI Agent Engine.</Card>
  <Card title="A2A Financial Trading Agents - Google ADK / MCP / Llama" href="/ax/cookbooks/advanced-workflows/tracing-a2a-agent">Build a multi-agent trading system with Google ADK, the A2A protocol, MCP, and Llama.</Card>
  <Card title="Multi-modal Autonomous Browser Agent with Llama Models" href="/ax/cookbooks/advanced-workflows/trace-evaluate-browser-use-agent-with-l-lama4">Build and trace a multi-modal autonomous browser agent powered by Llama 4.</Card>
  <Card title="Trace LangChain Agent & Microsoft Risk+Safety Evaluators" href="/ax/cookbooks/advanced-workflows/evaluating-and-improving-ai-agents-at-scale-with-microsoft-foundry">Trace a LangChain agent and run Microsoft Foundry risk and safety evaluators.</Card>
  <Card title="Trace Red Teaming Agent (Microsoft Foundry)" href="/ax/cookbooks/advanced-workflows/foundry-red-team">Trace Microsoft Foundry Red Teaming Agent scans against your LLM or agent.</Card>
  <Card title="Jailbreak and Prompt Injection Defense" href="/ax/cookbooks/advanced-workflows/jailbreak-and-prompt-injection-defense">Red-team an assistant across an attack taxonomy, score Attack Success Rate, and find which defenses work.</Card>
  <Card title="AI Research" href="/ax/cookbooks/ai-research">Advanced experiments and benchmarks in LLM evaluation, instrumentation, and agent systems.</Card>
</CardGroup>
