langfuse

1 post tagged “langfuse”

Building a Hybrid LLM Platform on EKS, Part 7: Observability and Cost Telemetry

Part 7 of our hands-on EKS series. We instrument the TypeScript router with OpenTelemetry, upgrade Prometheus to kube-prometheus-stack for GPU and vLLM metrics, add Grafana Tempo for distributed traces, and wire Langfuse so every request shows its backend, token count, and dollar cost.