Truefoundry Docs

This guide demonstrates how to instrument your Python applications with Traceloop and send traces to the TrueFoundry backend. We’ll cover installing the required packages, initializing the tracer, automatic HTTP instrumentation, adding custom attributes and spans, and configuring sampling and debugging options. By the end, you’ll have a clear blueprint for integrating Traceloop tracing into your Python services.

Installation

To start, install the Traceloop SDK and necessary instrumentation packages:

pip install traceloop-sdk \
  opentelemetry-instrumentation-flask \
  opentelemetry-instrumentation-requests \
  flask

Initializing Traceloop

Next, initialize the Traceloop SDK in your application. This involves setting up Traceloop with your TrueFoundry configuration to send traces to the TrueFoundry Backend.

import os
from traceloop.sdk import Traceloop
from opentelemetry.trace.propagation.tracecontext import TraceContextTextMapPropagator

def setup_traceloop():
    """Setup Traceloop SDK with TrueFoundry configuration."""
    
    # Initialize Traceloop
    Traceloop.init(
        api_endpoint="<enter_your_api_endpoint>",
        app_name="<enter_your_tracing_application_name>",
        headers={
            "Authorization": f"Bearer <enter_your_tfy_api_key>",
            "TFY-Tracing-Project": "<enter_your_tracing_project_fqn>",
        },

        # Sets text map propagator for context propagation. Remove this line if you don't want to respect incoming request trace context
        propagator=TraceContextTextMapPropagator(), 
    )

Automatic HTTP Instrumentation

Now that the Traceloop SDK is set up, let’s instrument the HTTP server to automatically trace incoming requests. Traceloop works with OpenTelemetry instrumentation libraries for popular frameworks; for Flask, we use the FlaskInstrumentor. At this point, all incoming HTTP requests are being traced automatically.

from flask import Flask
from opentelemetry.instrumentation.flask import FlaskInstrumentor

app = Flask(__name__)

# Initialize Traceloop
setup_traceloop()

# Instrument Flask
FlaskInstrumentor().instrument_app(app)

@app.route('/')
def hello():
    return 'Hello, World!'

if __name__ == '__main__':
    app.run(debug=True, port=8080)

Context propagation for outgoing requests

To fully benefit from distributed tracing, you should also propagate trace context in your outgoing HTTP requests. This helps downstream services recognize that their requests are part of a larger distributed trace. Configure your HTTP client to automatically inject trace context into outgoing requests using OpenTelemetry’s requests instrumentation:

This functionality is not incorporated into the Complete Application example to reduce complexity

import requests
from opentelemetry.instrumentation.requests import RequestsInstrumentor
from traceloop.sdk import Traceloop

# Initialize Traceloop
setup_traceloop()

# Instrument requests library
RequestsInstrumentor().instrument()

def make_request():
    # This request will automatically include trace context
    response = requests.get("http://localhost:8081")
    return response.text

Adding Attributes to Spans

Automatic instrumentation captures basic request information, but you can add custom data to your traces using attributes. Attributes are key-value pairs that provide additional context about your operations. For example, in order service, you might add order.id to make traces more useful.

from opentelemetry import trace

def handle_request(order_id: str):
    # Get current span from context
    current_span = trace.get_current_span()

    # Add attributes to the current span
    current_span.set_attribute("order.id", order_id)

Creating Custom Spans

Automatic instrumentation captures HTTP requests and external calls, but it doesn’t track your application’s internal logic. For important operations, you can manually create spans to trace specific parts of your code. A span represents a unit of work, and creating sub-spans helps you see detailed timing and context for key processes. For example, if a request triggers a complex function or external call that isn’t automatically captured, you can create a span to trace that specific operation. Manual instrumentation fills these gaps by letting you track what happens inside your application, not just at the edges.

from opentelemetry import trace

def get_order(order_id: str):
    db_tracer = trace.get_tracer("order-server-db")
    
    # Create a new span
    with db_tracer.start_as_current_span("db.query") as span:
        span.set_attribute("order.id", order_id)
        span.set_attribute("order.items", ["item1", "item2", "item3"])
        return f"order.id: {order_id}, items: ['item1', 'item2', 'item3']"

Complete Application Example

Below is a comprehensive example that demonstrates all the Traceloop concepts we’ve covered. This application creates an order service HTTP server that sets up Traceloop tracing with proper configuration, automatically instruments HTTP requests using FlaskInstrumentor, creates custom spans for database operations, and adds custom attributes to provide order-specific context.

from flask import Flask, request
from opentelemetry import trace
from opentelemetry.instrumentation.flask import FlaskInstrumentor
from traceloop.sdk import Traceloop
from opentelemetry.trace.propagation.tracecontext import TraceContextTextMapPropagator

# Global tracer
db_tracer = trace.get_tracer("order-server-db")

def setup_traceloop():
    """Setup Traceloop SDK with TrueFoundry configuration."""
    
    # Initialize Traceloop
    Traceloop.init(
        api_endpoint="<enter_your_api_endpoint>",
        headers={
            "Authorization": f"Bearer <enter_your_tfy_api_key>",
            "TFY-Tracing-Project": "<enter_your_tracing_project_fqn>",
        },
        propagator=TraceContextTextMapPropagator(),
    )

def mock_fetch_order(order_id: str) -> str:
    """Mock database operation with custom span."""
    with db_tracer.start_as_current_span("db.query") as span:
        span.set_attribute("order.id", order_id)
        span.set_attribute("order.items", ["item1", "item2", "item3"])
        return f"order.id: {order_id}, items: ['item1', 'item2', 'item3']"

app = Flask(__name__)

# Initialize Traceloop
setup_traceloop()

# Instrument Flask
FlaskInstrumentor().instrument_app(app)

@app.route('/order-server/orders/<order_id>', methods=['GET'])
def get_order(order_id):
    """Get order endpoint with tracing."""
        
    # Update existing span attributes
    current_span = trace.get_current_span()
    current_span.set_attribute("order.id", order_id)
    
    # Fetch order with custom span
    order = mock_fetch_order(order_id)
    
    return order

if __name__ == '__main__':
    app.run(debug=True, port=8080)

Run your application and view logged trace

Run the application and make a request to test the tracing:

curl http://localhost:8080/order-server/orders/123

Advanced Configuration

Sampling

Tracing sampling is a crucial technique for managing the volume of trace data in production environments. By default, Traceloop traces every request, which works well for debugging or development but can become expensive and noisy in high-traffic production systems. Sampling helps in several ways: it reduces noise in traces, helping you focus on important traces while maintaining visibility into your system. It also helps with cost management in terms of storage, processing, and network bandwidth, making it essential for production deployments.

Sampling Strategies

Traceloop supports OpenTelemetry’s built-in samplers, but in practice, two cover most use cases: 1. TraceIdRatioBased Sampler Samples a fixed percentage of root traces. This sampler makes sampling decisions independently for each trace.

from opentelemetry.sdk.trace.sampling import TraceIdRatioBased
from traceloop.sdk import Traceloop

def setup_traceloop():
    # Initialize Traceloop with sampling
    Traceloop.init(
        api_endpoint="<enter_your_api_endpoint>",
        headers={
            "Authorization": f"Bearer <enter_your_tfy_api_key>",
            "TFY-Tracing-Project": "<enter_your_tracing_project_fqn>",
        },
        propagator=TraceContextTextMapPropagator(),
        sampler=TraceIdRatioBased(0.1)  # Sample 10% of traces
    )    

Pros: Simple to configure, predictable sampling rate, deterministic behavior Cons: May create partial traces if child spans are sampled differently, especially when spans are spread across multiple microservices or services with different sampling configurations. 2. ParentBased Sampler (Recommended) Samples a fixed percentage of root traces and ensures that child spans follow the parent’s sampling decision, maintaining complete trace integrity.

from opentelemetry.sdk.trace.sampling import TraceIdRatioBased, ParentBased
from traceloop.sdk import Traceloop

def setup_traceloop():
    # Initialize Traceloop with sampling
    Traceloop.init(
        api_endpoint="<enter_your_api_endpoint>",
        headers={
            "Authorization": f"Bearer <enter_your_tfy_api_key>",
            "TFY-Tracing-Project": "<enter_your_tracing_project_fqn>",
        },
        propagator=TraceContextTextMapPropagator(),
        sampler=ParentBased(TraceIdRatioBased(0.1))
    )

Pros: Maintains trace integrity, prevents partial traces, ensures complete trace visibility when sampled Cons: Slightly more complex configuration, but worth the additional setup for production environments

Troubleshooting

Partial Traces If you see partial traces (missing spans in the middle of a trace), ensure you’re using ParentBased sampler:

# ❌ May create partial traces across services
sampler = TraceIdRatioBased(0.1)

# ✅ Maintains trace integrity across all services
sampler = ParentBased(TraceIdRatioBased(0.1))

Too Much Data If you’re still collecting too much data or experiencing high costs, reduce the sampling rate:

# Reduce from 10% to 5% for high-traffic environments
sampler = ParentBased(TraceIdRatioBased(0.05))

# For very high traffic, consider 1% sampling
sampler = ParentBased(TraceIdRatioBased(0.01))

Too Little Data If you need more visibility for debugging or monitoring, increase the sampling rate:

# Increase from 10% to 25% for better visibility
sampler = ParentBased(TraceIdRatioBased(0.25))

# For critical debugging, consider 50% or higher
sampler = ParentBased(TraceIdRatioBased(0.5))

Local Debugging

For local development and debugging, you can use the console exporter to see traces in your terminal:

from opentelemetry.sdk.trace.export import ConsoleSpanExporter
from traceloop.sdk import Traceloop

def setup_traceloop():
    # Get API key from environment
    TFY_API_KEY = os.environ.get("TFY_API_KEY")
    
    # Initialize Traceloop with console exporter for debugging
    Traceloop.init(
        api_endpoint="<enter_your_api_endpoint>",
        headers={
            "Authorization": f"Bearer {TFY_API_KEY}",
            "TFY-Tracing-Project": "<enter_your_tracing_project_fqn>",
        },
        console_exporter=True  # Enable console output for debugging
    )

Using Instrumentation Libraries

Traceloop works seamlessly with OpenTelemetry’s instrumentation libraries for many popular frameworks and libraries in the Python ecosystem. These are drop-in packages that automatically generate spans and metrics for operations in those libraries, so you don’t have to instrument everything manually. When you are using third-party libraries or frameworks, you should take advantage of these to save time and ensure consistency. For example, if you are using a web framework like Django, there is an official instrumentation package opentelemetry-instrumentation-django that can create spans for each HTTP request handled by Django. OpenTelemetry’s Python Contrib repository contains many such instrumentation packages for popular libraries.

Getting Started

Train and Deploy Models

Service Deployment

Job Deployment

LLM Deployment

LLM Finetuning

Workflow Deployment

Async Service Deployment

Volumes

ML Repository

LLM Tracing

Advanced Features

Python app tracing with Traceloop

Installation

Initializing Traceloop

Automatic HTTP Instrumentation

Context propagation for outgoing requests

Adding Attributes to Spans

Creating Custom Spans

Complete Application Example

Run your application and view logged trace

Advanced Configuration

Sampling

Sampling Strategies

Troubleshooting

Local Debugging

Using Instrumentation Libraries

Getting Started

Train and Deploy Models

Service Deployment

Job Deployment

LLM Deployment

LLM Finetuning

Workflow Deployment

Async Service Deployment

Volumes

ML Repository

LLM Tracing

Advanced Features

​Installation

​Initializing Traceloop

​Automatic HTTP Instrumentation

​Context propagation for outgoing requests

​Adding Attributes to Spans

​Creating Custom Spans

​Complete Application Example

​Run your application and view logged trace

​Advanced Configuration

​Sampling

​Sampling Strategies

​Troubleshooting

​Local Debugging

​Using Instrumentation Libraries

Installation

Initializing Traceloop

Automatic HTTP Instrumentation

Context propagation for outgoing requests

Adding Attributes to Spans

Creating Custom Spans

Complete Application Example

Run your application and view logged trace

Advanced Configuration

Sampling

Sampling Strategies

Troubleshooting

Local Debugging

Using Instrumentation Libraries