Python app tracing with OpenTelemetry

This guide demonstrates how to instrument your Python applications with OpenTelemetry and send traces to the TrueFoundry backend. We’ll cover installing the required packages, initializing the tracer, automatic HTTP instrumentation, adding custom attributes and spans, and configuring sampling and debugging options. By the end, you’ll have a clear blueprint for integrating OpenTelemetry tracing into your Python services.

Installation

To start, install the OpenTelemetry SDK and necessary instrumentation packages:

pip install opentelemetry-api \
  opentelemetry-sdk \
  opentelemetry-exporter-otlp-proto-http \
  opentelemetry-instrumentation-flask \
  opentelemetry-instrumentation-requests \
  flask

Initializing OpenTelemetry

Next, initialize the OpenTelemetry SDK in your application. This involves setting up a Tracer Provider (which manages tracers and spans) and OTLP exporter to send the traces to TrueFoundry Backend.

import os
from opentelemetry import trace
from opentelemetry.exporter.otlp.proto.http.trace_exporter import OTLPSpanExporter
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor

def setup_otel_sdk():
    """Setup OpenTelemetry SDK with TrueFoundry configuration."""
    
    # Set environment variables for OTLP exporter
    os.environ["OTEL_EXPORTER_OTLP_ENDPOINT"] = "<enter_your_api_endpoint>"
    os.environ["OTEL_EXPORTER_OTLP_HEADERS"] = f"Authorization=Bearer {enter_your_api_key},TFY-Tracing-Project={enter_your_tracing_project}"
        
    # Create tracer provider
    provider = TracerProvider()
    processor = BatchSpanProcessor(otlp_exporter)
    provider.add_span_processor(processor)
    
    # Set global trace provider
    trace.set_tracer_provider(provider)

    # Set global text map propagator for context propagation. Remove this line if you don't want to respect incoming request trace context
    set_global_textmap(TraceContextTextMapPropagator())

    
    return provider

Automatic HTTP Instrumentation

Now that the OpenTelemetry SDK is set up, let’s instrument the Flask HTTP server to automatically trace incoming requests. OpenTelemetry provides middleware for popular frameworks; for Flask, we use the FlaskInstrumentor. At this point, all incoming HTTP requests are being traced automatically.

from flask import Flask
from opentelemetry.instrumentation.flask import FlaskInstrumentor

app = Flask(__name__)

# Initialize OpenTelemetry
setup_otel_sdk()

# Instrument Flask
FlaskInstrumentor().instrument_app(app)

@app.route('/')
def hello():
    return 'Hello, World!'

if __name__ == '__main__':
    app.run(debug=True, port=8080)

Context propagation for outgoing requests

To fully benefit from distributed tracing, you should also propagate trace context in your outgoing HTTP requests. This helps downstream services recognize that their requests are part of a larger distributed trace. Configure your HTTP client to automatically inject trace context into outgoing requests using OpenTelemetry’s requests instrumentation:

This functionality is not incorporated into the Complete Application example to reduce complexity

import requests
from opentelemetry.instrumentation.requests import RequestsInstrumentor

# Instrument requests library
RequestsInstrumentor().instrument()

def make_request():
    # This request will automatically include trace context
    response = requests.get("http://localhost:8081")
    return response.text

Adding Attributes to Spans

Automatic instrumentation captures basic request information, but you can add custom data to your traces using attributes. Attributes are key-value pairs that provide additional context about your operations. For example, in order service, you might add order.id to make traces more useful.

from opentelemetry import trace

def handle_request(order_id: str):
    # Get current span from context
    current_span = trace.get_current_span()

    # Add attributes to the current span
    current_span.set_attribute("order.id", order_id)

Creating Custom Spans

Automatic instrumentation captures HTTP requests and external calls, but it doesn’t track your application’s internal logic. For important operations, you can manually create spans to trace specific parts of your code. A span represents a unit of work, and creating sub-spans helps you see detailed timing and context for key processes. For example, if a request triggers a complex function or external call that isn’t automatically captured, you can create a span to trace that specific operation. Manual instrumentation fills these gaps by letting you track what happens inside your application, not just at the edges.

from opentelemetry import trace

def get_order(order_id: str):
    db_tracer = trace.get_tracer("order-server-db")
    
    # Create a new span
    with db_tracer.start_as_current_span("db.query") as span:
        span.set_attribute("order.id", order_id)
        span.set_attribute("order.items", ["item1", "item2", "item3"])
        return f"order.id: {order_id}, items: ['item1', 'item2', 'item3']"

Complete Application Example

Below is a comprehensive example that demonstrates all the OpenTelemetry concepts we’ve covered. This application creates an order service HTTP server that sets up OpenTelemetry tracing with proper configuration, automatically instruments HTTP requests using FlaskInstrumentor, creates custom spans for database operations, and adds custom attributes to provide order-specific context.

import os
from flask import Flask
from opentelemetry import trace
from opentelemetry.exporter.otlp.proto.http.trace_exporter import OTLPSpanExporter
from opentelemetry.instrumentation.flask import FlaskInstrumentor
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor
from opentelemetry.propagate import set_global_textmap
from opentelemetry.trace.propagation.tracecontext import TraceContextTextMapPropagator

# Global tracer
db_tracer = trace.get_tracer("order-server-db")

def setup_otel_sdk():
    """Setup OpenTelemetry SDK with TrueFoundry configuration."""
    
    # Set environment variables for OTLP exporter
    os.environ["OTEL_EXPORTER_OTLP_ENDPOINT"] = "<enter_your_api_endpoint>"
    os.environ["OTEL_EXPORTER_OTLP_HEADERS"] = f"Authorization=Bearer {enter_your_api_key},TFY-Tracing-Project={enter_your_tracing_project}"

    # Create OTLP exporter
    otlp_exporter = OTLPSpanExporter()
        
    # Create tracer provider
    provider = TracerProvider()
    
    # Add span processors
    provider.add_span_processor(BatchSpanProcessor(otlp_exporter))
    
    # Set global tracer provider
    trace.set_tracer_provider(provider)
        
    # Set global text map propagator for context propagation. Remove this line if you don't want to respect incoming request trace context
    set_global_textmap(TraceContextTextMapPropagator())
    
    return provider

def mock_fetch_order(order_id: str) -> str:
    """Mock database operation with custom span."""
    with db_tracer.start_as_current_span("db.query") as span:
        span.set_attribute("order.id", order_id)
        span.set_attribute("order.items", ["item1", "item2", "item3"])
        return f"order.id: {order_id}, items: ['item1', 'item2', 'item3']"

app = Flask(__name__)

# Initialize OpenTelemetry
setup_otel_sdk()

# Instrument Flask
FlaskInstrumentor().instrument_app(app)

@app.route('/order-server/orders/<order_id>', methods=['GET'])
def get_order(order_id):
    """Get order endpoint with tracing."""

    # Update existing span attributes
    current_span = trace.get_current_span()
    current_span.set_attribute("order.id", order_id)
    
    # Fetch order with custom span
    order = mock_fetch_order(order_id)
    
    return order

if __name__ == '__main__':
    app.run(debug=True, port=8080)

Run your application and view logged trace

Run the application and make a request to test the tracing:

curl http://localhost:8080/order-server/orders/123

Advanced Configuration

Sampling

Tracing sampling is a crucial technique for managing the volume of trace data in production environments. By default, OpenTelemetry Python traces every request, which works well for debugging or development but can become expensive and noisy in high-traffic production systems. Sampling helps in several ways: it reduces noise in traces, helping you focus on important traces while maintaining visibility into your system. It also helps with cost management in terms of storage, processing, and network bandwidth, making it essential for production deployments.

Sampling Strategies

OpenTelemetry supports several built-in samplers, but in practice, two cover most use cases: 1. TraceIdRatioBased Sampler Samples a fixed percentage of root traces. This sampler makes sampling decisions independently for each trace.

from opentelemetry.sdk.trace.sampling import TraceIdRatioBased

def setup_otel_sdk():
    # Configure sampling for production
    sampler = TraceIdRatioBased(0.1)  # Sample 10% of traces
    
    provider = TracerProvider(
        sampler=sampler
    )
    
    # ... rest of initialization

Pros: Simple to configure, predictable sampling rate, deterministic behavior Cons: May create partial traces if child spans are sampled differently, especially when spans are spread across multiple microservices or services with different sampling configurations. 2. ParentBased Sampler (Recommended) Samples a fixed percentage of root traces and ensures that child spans follow the parent’s sampling decision, maintaining complete trace integrity.

from opentelemetry.sdk.trace.sampling import TraceIdRatioBased, ParentBased

def setup_otel_sdk():
    # Configure sampling for production
    sampler = ParentBased(TraceIdRatioBased(0.1))
    
    provider = TracerProvider(
        sampler=sampler
    )
    
    # ... rest of initialization

Pros: Maintains trace integrity, prevents partial traces, ensures complete trace visibility when sampled Cons: Slightly more complex configuration, but worth the additional setup for production environments

Troubleshooting

Partial Traces If you see partial traces (missing spans in the middle of a trace), ensure you’re using ParentBased sampler:

# ❌ May create partial traces across services
sampler = TraceIdRatioBased(0.1)

# ✅ Maintains trace integrity across all services
sampler = ParentBased(TraceIdRatioBased(0.1))

Too Much Data If you’re still collecting too much data or experiencing high costs, reduce the sampling rate:

# Reduce from 10% to 5% for high-traffic environments
sampler = ParentBased(TraceIdRatioBased(0.05))

# For very high traffic, consider 1% sampling
sampler = ParentBased(TraceIdRatioBased(0.01))

Too Little Data If you need more visibility for debugging or monitoring, increase the sampling rate:

# Increase from 10% to 25% for better visibility
sampler = ParentBased(TraceIdRatioBased(0.25))

# For critical debugging, consider 50% or higher
sampler = ParentBased(TraceIdRatioBased(0.5))

Local Debugging

For local development and debugging, you can use the console exporter to see traces in your terminal:

from opentelemetry.sdk.trace.export import ConsoleSpanExporter

def setup_otel_sdk():
    # OTLP exporter for TrueFoundry
    otlp_exporter = OTLPSpanExporter()
    
    # Console exporter for local debugging
    console_exporter = ConsoleSpanExporter()
    
    provider = TracerProvider(resource=resource)
    provider.add_span_processor(BatchSpanProcessor(otlp_exporter))
    provider.add_span_processor(BatchSpanProcessor(console_exporter))
    
    trace.set_tracer_provider(provider)

Using Instrumentation Libraries

OpenTelemetry provides instrumentation libraries for many popular frameworks and libraries in the Python ecosystem. These are drop-in packages that automatically generate spans and metrics for operations in those libraries, so you don’t have to instrument everything manually. When you are using third-party libraries or frameworks, you should take advantage of these to save time and ensure consistency. For example, if you are using a web framework like Django, there is an official instrumentation package opentelemetry-instrumentation-django that can create spans for each HTTP request handled by Django. OpenTelemetry’s Python Contrib repository contains many such instrumentation packages for popular libraries.

LLM Deployment

LLM Finetuning

Prompt Management

LLM Tracing

Python app tracing with OpenTelemetry

Installation

Initializing OpenTelemetry

Automatic HTTP Instrumentation

Context propagation for outgoing requests

Adding Attributes to Spans

Creating Custom Spans

Complete Application Example

Run your application and view logged trace

Advanced Configuration

Sampling

Sampling Strategies

Troubleshooting

Local Debugging

Using Instrumentation Libraries

LLM Deployment

LLM Finetuning

Prompt Management

LLM Tracing

​Installation

​Initializing OpenTelemetry

​Automatic HTTP Instrumentation

​Context propagation for outgoing requests

​Adding Attributes to Spans

​Creating Custom Spans

​Complete Application Example

​Run your application and view logged trace

​Advanced Configuration

​Sampling

​Sampling Strategies

​Troubleshooting

​Local Debugging

​Using Instrumentation Libraries

Installation

Initializing OpenTelemetry

Automatic HTTP Instrumentation

Context propagation for outgoing requests

Adding Attributes to Spans

Creating Custom Spans

Complete Application Example

Run your application and view logged trace

Advanced Configuration

Sampling

Sampling Strategies

Troubleshooting

Local Debugging

Using Instrumentation Libraries