Skip to main content

Documentation Index

Fetch the complete documentation index at: https://handbook.fiddler.ai/llms.txt

Use this file to discover all available pages before exploring further.

Welcome to Fiddler! Choose your integration path based on what you want to accomplish. Each quick start gets you up and running in 10-20 minutes.

🎯 What Do You Want to Do?

🤖 Monitor AI Agents & Multi-Step Workflows

Best for: Applications using LangGraph, Strands, or custom agentic frameworks Your AI agents make complex decisions across multiple steps. Monitor the complete workflow from initial reasoning to final response. Choose your framework:
FrameworkIntegrationTimeQuick Start
LangGraph / LangChainAuto-instrumentation10 minLangGraph SDK →
Strands AgentsNative integration10 minStrands Agents SDK →
Custom / OtherOpenTelemetry15 minOpenTelemetry →
What you’ll monitor:
  • Agent decision-making and tool selection
  • Multi-step reasoning chains
  • LLM calls with inputs/outputs
  • Tool usage and external API calls
  • Error propagation and recovery

💬 Monitor Simple LLM Applications

Best for: Single-shot LLM inference, chatbots, simple RAG systems You’re using LLMs for straightforward tasks like Q&A, content generation, or basic chat interfaces. Quick Start: Simple LLM Monitoring → ⏱️ 10 min What you’ll monitor:
  • LLM prompts and completions
  • Token usage and costs
  • Response latency
  • Quality metrics (toxicity, PII, sentiment)
  • Embedding visualizations

📊 Monitor Traditional ML Models

Best for: Scikit-learn, XGBoost, TensorFlow, PyTorch models in production You have traditional ML models (classification, regression, ranking) deployed and need to track their performance. Quick Start: Simple ML Monitoring → ⏱️ 10 min What you’ll monitor:
  • Model performance (accuracy, precision, recall)
  • Data drift and distribution shifts
  • Feature importance
  • Prediction analytics
  • Custom business metrics

🧪 Evaluate & Test LLM Applications

Best for: Pre-deployment testing, A/B testing, regression testing You want to systematically evaluate LLM quality before deployment or compare different prompts/models. Quick Start: Experiments → ⏱️ 15 min What you’ll evaluate:
  • Response accuracy and relevance
  • Semantic similarity
  • Custom domain-specific metrics
  • Safety and bias
  • RAG-specific metrics (faithfulness, context relevance)

🛡️ Add Safety Guardrails

Best for: Protecting LLM applications from harmful content, PII leaks, and hallucinations You need real-time protection to prevent your LLM from generating harmful, sensitive, or incorrect content. Quick Start: Guardrails → ⏱️ 10 min What you’ll protect against:
  • Harmful and toxic content
  • PII leaks (emails, SSNs, credit cards)
  • Hallucinations and unsupported claims
  • Jailbreak attempts
  • Content policy violations

🤔 Not Sure Where to Start?

If you’re building with AI agents:

Start with Agentic Observability - it covers everything you need for multi-step workflows.

If you’re using LLMs for simple tasks:

Start with Simple LLM Monitoring - perfect for chat, Q&A, and content generation.

If you have traditional ML models:

Start with Simple ML Monitoring - track performance and drift for any ML model.

If you want to test before deploying:

Start with Experiments - build confidence with systematic testing.

If you need to protect your users:

Start with Guardrails - add safety checks in minutes.

🚀 Quick Comparison

Use CaseMonitoring TypeBest Quick StartTime
Multi-agent systems, complex workflowsAgenticLangGraph / Strands / OTeL10 min
Simple chatbots, Q&A, content generationLLMSimple LLM Monitoring15 min
Classification, regression, ranking modelsMLSimple ML Monitoring10 min
Pre-deployment testing, A/B testingExperimentsExperiments15 min
Safety, PII protection, hallucination preventionGuardrailsGuardrails10 min

📚 After Your Quick Start

Once you’ve completed a quick start:
  1. Explore the UI - View your dashboards, metrics, and insights
  2. Set Up Alerts - Get notified when issues occur
  3. Customize Metrics - Add domain-specific monitoring
  4. Read Advanced Guides - Deep dive into specific features
  5. Join the Community - Get help and share best practices

💡 Pro Tips

  • Start Simple: Pick one quick start, complete it fully, then expand
  • Use Notebooks: Most quick starts have Colab notebooks for hands-on learning
  • Test Data First: Use sample data before connecting production systems
  • Monitor + Evaluate: Combine monitoring with evaluation for comprehensive coverage
  • Layer Guardrails: Add safety checks on both inputs and outputs

Need Help?