Fly.io's metrics dashboard service is under a major impact status, with users reporting delayed or missing metrics in the fly-metrics.net dashboard. The cause has been identified, but the issue remains unresolved.
Fly.io provides infrastructure and application hosting services, allowing developers to deploy and monitor applications globally.
The company identified resource contention on a subset of metrics ingestion hosts as the underlying cause. Since the investigation began, Fly.io has been working to rebalance ingestion traffic, increase cluster throughput, and process backlogged metric data. According to the latest update, backlog processing continues and is gradually improving, though ingestion delays persist for some customers.
This is a developing story. The Downtime will continue monitoring for status updates.