Understanding Differences Between Exported Data and the Observability Dashboard

Overview

When you compare numbers from the Observability Dashboard with data retrieved through Exporting your Data or the API Explorer, you may notice small differences in values. This is expected behavior, not data loss or a reporting error.

This page explains the two data pipelines behind these tools and why they produce slightly different numbers.

How the Observability Dashboard Processes Data

The Observability Dashboard uses two different data types depending on how recent the data is, and merges them into a single view.

Recent data (last 2 days): Uncompressed, high-granularity data collected at the finest available resolution.

Historical data (older than 2 days): Compressed data. Once a time window is more than 2 days old, its compression is complete and the data is final.

⚠️ Within the last 2 days, compression may not have completed yet for all data points. The exact cutover point is not visible in the dashboard.
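The 2-day rule above can be turned into a conservative check when you post-process dashboard figures. The following is a minimal sketch, not part of any Bitmovin API: the function name `is_window_final` and the constant `COMPRESSION_WINDOW` are hypothetical, and the fixed 2-day cutoff is taken from the text above even though the real cutover point is not exposed.

```python
from datetime import datetime, timedelta, timezone

# Assumption: the 2-day window stated above, treated as a fixed cutoff.
COMPRESSION_WINDOW = timedelta(days=2)


def is_window_final(window_end, now=None):
    """Return True if a time window ended more than 2 days before `now`.

    Windows that ended within the last 2 days may still be served from
    uncompressed data, so their values should be treated as provisional.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    return now - window_end > COMPRESSION_WINDOW


now = datetime(2024, 1, 10, tzinfo=timezone.utc)
old_window = datetime(2024, 1, 7, tzinfo=timezone.utc)     # ended 3 days earlier
recent_window = datetime(2024, 1, 9, tzinfo=timezone.utc)  # ended 1 day earlier
print(is_window_final(old_window, now), is_window_final(recent_window, now))
```

A window that ended exactly 2 days ago is treated as not yet final here, which errs on the side of caution.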

Why the Dashboard Uses Compressed Data for Historical Metrics

Computing exact distinct counts (such as unique viewers) on large datasets requires scanning massive volumes of data, which makes historical queries prohibitively slow at scale. For data that has not yet been compressed, Bitmovin instead uses HyperLogLog++, an industry-standard cardinality estimation algorithm that estimates distinct counts with an error margin of up to 4%.

Once compression completes for a given time window, all metrics become exact and the HyperLogLog++ error margin no longer applies. Bitmovin continuously monitors compressed data to ensure it stays within 2% of the raw source.
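To give a feel for how this family of estimators behaves, here is a minimal sketch of the classic HyperLogLog algorithm. It is a simplified illustration, not HyperLogLog++ itself (which adds bias correction and a sparse representation) and not Bitmovin's implementation. The function name `hll_estimate`, the choice of SHA-1 as the hash, and the register count are assumptions made for the example, and the viewer IDs are synthetic.

```python
import hashlib
import math


def hll_estimate(items, p=12):
    """Estimate the number of distinct items using 2**p registers (p >= 7)."""
    m = 1 << p
    registers = [0] * m
    for item in items:
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")    # 64-bit hash value
        index = h >> (64 - p)                    # first p bits select a register
        rest = h & ((1 << (64 - p)) - 1)         # remaining 64 - p bits
        rank = (64 - p) - rest.bit_length() + 1  # position of the leftmost 1-bit
        registers[index] = max(registers[index], rank)
    alpha = 0.7213 / (1 + 1.079 / m)             # bias constant, valid for m >= 128
    raw = alpha * m * m / sum(2.0 ** -r for r in registers)
    zeros = registers.count(0)
    if raw <= 2.5 * m and zeros:
        return m * math.log(m / zeros)           # small-range (linear counting) correction
    return raw


# Synthetic data: 200,000 events from 50,000 distinct viewers.
viewers = [f"user-{i % 50_000}" for i in range(200_000)]
estimate = hll_estimate(viewers)
exact = len(set(viewers))
error = abs(estimate - exact) / exact
print(f"exact={exact} estimate={estimate:.0f} relative error={error:.2%}")
```

The estimator keeps only a fixed number of small registers regardless of how many events it sees, which is why it scales to large datasets at the cost of a small relative error.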

The 200-Category Limit

When breaking down a metric by a high-cardinality dimension (for example, Video Title or User ID), the Dashboard returns up to approximately 200 distinct categorical values per query. Categories beyond that threshold are excluded, which causes the sum of individual category values to be lower than the reported Total.

⚠️ This limit applies to both the Dashboard breakdown view and the CSV exported from the API Explorer. See API Explorer: Design Limitations for details.

To work around this limit:

Shorten the time range to reduce the number of active categories below 200.
Apply additional filters (by platform, country, or other dimensions) to narrow the result set.
Use Exporting your Data to export the full raw dataset and perform the breakdown independently.
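The effect of the limit, and the last workaround, can be sketched as follows. This is an illustrative example under stated assumptions: the rows are synthetic (category, value) pairs standing in for an export, the helper name `breakdown` is hypothetical, and keeping the largest categories is an assumption, since the text does not say which categories the Dashboard retains.

```python
from collections import Counter

# Approximate limit described above.
CATEGORY_LIMIT = 200


def breakdown(rows, limit=CATEGORY_LIMIT):
    """Aggregate (category, value) rows and split them at the category limit.

    Returns (top, other, total), where `top` is a list of the `limit`
    largest categories, `other` is the value carried by all remaining
    categories, and `total` is the sum over every category.
    """
    totals = Counter()
    for category, value in rows:
        totals[category] += value
    top = totals.most_common(limit)
    total = sum(totals.values())
    other = total - sum(value for _, value in top)
    return top, other, total


# Synthetic example: 250 video titles with one play each.
rows = [(f"title-{i}", 1) for i in range(250)]
top, other, total = breakdown(rows)
print(len(top), other, total)
```

With 250 categories and a limit of 200, the visible categories sum to less than the total, and the `other` value accounts for the gap.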

How Exported Data and the API Explorer Work

Both the Export Data feature and the API Explorer always return raw, uncompressed (high-granularity) data. The compression pipeline used by the dashboard is not applied.

Exports are well-suited for session-level analysis, custom data pipelines, and long-term storage in your own data warehouse.

Why the Numbers Differ

Data Source	Data Type	Distinct Count Method	Processing Applied
Dashboard (last ~2 days) and Export Data / API Explorer	Uncompressed	HyperLogLog++ (small margin of error, up to 4%)	No
Dashboard (older than ~2 days)	Compressed	Exact	Yes

For historical data, the dashboard shows a compressed and processed version of the data, while exports always provide the raw unprocessed equivalent. This is the most common source of visible discrepancy.
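When you reconcile a dashboard figure against your own calculation from an export, a relative-difference check is usually enough. The sketch below is an assumption-laden example: the function names are hypothetical, and the default 4% threshold simply reuses the HyperLogLog++ margin quoted earlier, so adjust it to whatever tolerance suits your analysis.

```python
def relative_difference(dashboard_value, export_value):
    """Relative difference between two values, using the export as baseline."""
    if export_value == 0:
        return 0.0 if dashboard_value == 0 else float("inf")
    return abs(dashboard_value - export_value) / abs(export_value)


def within_expected_margin(dashboard_value, export_value, margin=0.04):
    """Return True if the two values differ by no more than `margin`.

    Assumption: 0.04 mirrors the "up to 4%" margin quoted in this article.
    """
    return relative_difference(dashboard_value, export_value) <= margin


print(within_expected_margin(10_300, 10_000))  # 3% apart
print(within_expected_margin(11_000, 10_000))  # 10% apart
```

A difference well beyond the chosen margin is a signal to check that both queries use the same time range, filters, and time zone before suspecting the data.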

Replicating Dashboard Numbers from Exports

Exactly reproducing Observability Dashboard numbers from raw exports is not recommended. The dashboard relies on a complex data processing and compression pipeline that is not exposed externally. Raw exports are accurate and reliable for independent analysis, and the small variance between the two will not significantly impact your conclusions.

Long-Term Data Retention and Further Compression

The uncompressed data discussed in this article is also referred to as high-granularity data. It is retained for 30 or 90 days depending on your plan. After that period, it is further compressed into a single record per session, reducing it to aggregated values only (total playback time, error counts, etc.). The resulting compressed-per-session data is available for up to 13 months if you have purchased this as part of your plan.
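The retention tiers above can be expressed as a small lookup, for example to decide which kind of query is still possible for a given date. This is a rough sketch: the function name, the tier labels, and the approximation of 13 months as 13 x 30 days are all assumptions made for illustration, and the actual limits depend on your plan.

```python
# Assumption: 13 months approximated as 13 * 30 days for illustration only.
LONG_TERM_DAYS = 13 * 30


def available_tier(age_days, high_granularity_days=30, has_long_term=False):
    """Return which data tier is expected to exist for data of a given age.

    high_granularity_days: 30 or 90, depending on the plan.
    has_long_term: whether the plan includes long-term per-session retention.
    """
    if age_days <= high_granularity_days:
        return "high-granularity"
    if has_long_term and age_days <= LONG_TERM_DAYS:
        return "per-session"
    return None


print(available_tier(10))
print(available_tier(60, high_granularity_days=90))
print(available_tier(200, has_long_term=True))
print(available_tier(200))
```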

Further Reading