Skip to main content

What is a Trace?

A trace in Galtea represents a single operation or function call that occurs during an AI agent’s execution. Traces capture the internal workings of your agent—such as tool calls, retrieval operations, chain orchestrations, and LLM invocations—providing deep visibility into how your agent processes requests. Traces are always linked to an inference result, enabling you to understand not just what your agent responded, but how it arrived at that response. Every trace must belong to a specific inference result.

Why Use Traces?

Debugging

Identify exactly where and why your agent failed or produced unexpected results.

Performance Optimization

Pinpoint slow operations with latency tracking at every step.

Compliance & Auditing

Maintain a complete audit trail of all operations for regulatory requirements.

Cost Analysis

Understand which operations consume the most resources.

Trace Hierarchy

Traces support parent-child relationships, allowing you to visualize the complete execution flow of your agent. When a traced function calls another traced function, the hierarchy is automatically captured.
Agent Call (root)
├─ Route Query (CHAIN)
├─ Retrieve Context (RETRIEVER)
│  └─ Vector Search (TOOL)
├─ Fetch Product Data (TOOL)
└─ Calculate Discount (TOOL)
Each trace includes:
  • id: Unique identifier for the trace
  • parent_trace_id: Reference to the parent trace (null for root traces)
  • name: The operation name
  • type: Classification of the operation (TraceType)
  • description: Human-readable description of what the operation does

Trace Types

Traces are classified by type to help you understand the nature of each operation and debug issues more effectively.
TypeDefinitionWhy This Matters for Tracing
SPANGeneric durations of work in a trace.Default type for general operations that don’t fit other categories. Useful for grouping related work.
GENERATIONAI model generations including prompts, token usage, and costs.This is where cost (tokens) and latency come from. Clearly see these operations and identify expensive calls and bottlenecks.
EVENTDiscrete point-in-time events.Capture important moments without duration, like user interactions or state changes.
AGENTAgent that orchestrates flow and uses tools with LLM guidance.High-level orchestration nodes that coordinate multiple operations and make decisions.
TOOLTool/function calls (e.g., external APIs, calculations).Deterministic or external calls where inputs, outputs, and side effects determine correctness.
CHAINLinks between different application steps.Composite orchestration nodes that run multiple internal steps and pass data between stages.
RETRIEVERData retrieval steps (vector store, database).Operations that fetch contextual data which directly affect prompt relevance and the context window.
EVALUATORFunctions that assess LLM outputs.Operations that evaluate quality, safety, or correctness of generated content.
EMBEDDINGEmbedding model calls.Vector embedding operations for semantic search or similarity.
GUARDRAILComponents that protect against malicious content.Safety checks that filter or validate inputs/outputs.

Best Practices

Choose descriptive names that clearly indicate the operation being traced — e.g., search_documents rather than step_2.
Trace operations that represent logical units of work (tool calls, LLM invocations, retrieval steps), not every single function.
Classify operations correctly (TOOL, GENERATION, RETRIEVER, etc.) to enable better filtering and analysis in the dashboard.
The @trace decorator captures function arguments automatically. Avoid tracing functions that receive very large inputs (e.g., full documents) — pass summaries or IDs instead.

SDK Integration

Tracing Tutorial

Step-by-step guide to instrumenting your agent and collecting traces.

Trace Service

Manage and collect traces for your AI agent operations using the SDK.

Trace Properties

Inference Result
InferenceResult
required
The inference result this trace belongs to. Every trace must be linked to an inference result.
Name
string
required
The name of the traced operation (e.g., function name).
Type
TraceType
The type of operation: SPAN, GENERATION, EVENT, AGENT, TOOL, CHAIN, RETRIEVER, EVALUATOR, EMBEDDING, or GUARDRAIL.
Description
string
A human-readable description of the operation. Can be set manually via start_trace(description=...) or automatically from function docstrings using @trace(include_docstring=True). Maximum size: 32KB.
Parent Trace ID
string
The ID of the parent trace for hierarchical relationships.
Input Data
any
The input parameters passed to the operation. Maximum size: 128KB.
Output Data
any
The result returned by the operation. Maximum size: 128KB.
Error
string
Error message if the operation failed.
Latency (ms)
float
The execution time of the operation in milliseconds.
Start Time
string
ISO 8601 timestamp when the operation started.
End Time
string
ISO 8601 timestamp when the operation completed.
Metadata
any
Additional custom metadata about the trace. Maximum size: 128KB.

Tracing Agent Operations

Step-by-step guide to capturing and analyzing agent traces.

Inference Result

The inference results that traces are linked to.