aggregate_column_statistics

Catalog.aggregate_column_statistics(use_default_columns: bool = True, exclude_hats_columns: bool = True, exclude_columns: list[str] | None = None, include_columns: list[str] | None = None, include_pixels: list[HealpixPixel] | None = None) → DataFrame

Read footer statistics in parquet metadata, and report on global min/max values.

Parameters:
use_default_columns : bool, default True

If True, use only the columns that are loaded by default, as set in the metadata by the catalog provider. Defaults to True.

exclude_hats_columns : bool, default True

Exclude HATS spatial and partitioning fields from the statistics. Defaults to True.

exclude_columns : list[str] or None, default None

Additional columns to exclude from the statistics.

include_columns : list[str] or None, default None

If specified, return statistics only for the named columns. Defaults to None, which returns statistics for all non-HATS columns.

include_pixels : list[HealpixPixel] or None, default None

If specified, return statistics only for the indicated pixels. Defaults to None, which returns statistics for all pixels.

Returns:
pd.DataFrame

DataFrame with global summary statistics.
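
As a rough illustration of what "global min/max" means here, the sketch below combines per-pixel minimum and maximum values into a single pair per column, with optional column and pixel filters. It is a plain-Python sketch of the idea only, not the library's implementation: the `per_pixel_stats` dictionary, the `global_min_max` helper, the use of `(order, pixel)` tuples in place of `HealpixPixel` objects, and all values are hypothetical, and the real method reads its inputs from the parquet footer metadata and returns a DataFrame.

```python
# Hypothetical per-pixel footer statistics: {(order, pixel): {column: (min, max)}}.
# These values are made up for illustration, not read from a real catalog.
per_pixel_stats = {
    (1, 44): {"ra": (280.5, 291.0), "dec": (-35.2, -20.1), "mag": (14.2, 21.9)},
    (1, 45): {"ra": (291.0, 303.4), "dec": (-33.8, -19.5), "mag": (13.7, 22.4)},
    (1, 46): {"ra": (303.4, 315.0), "dec": (-30.0, -18.9), "mag": (15.0, 20.8)},
}


def global_min_max(stats, include_columns=None, include_pixels=None):
    """Combine per-pixel (min, max) pairs into one (min, max) per column.

    Hypothetical helper: mirrors the idea of include_columns and
    include_pixels, where None means "no filtering".
    """
    result = {}
    for pixel, columns in stats.items():
        # Skip pixels that were not requested.
        if include_pixels is not None and pixel not in include_pixels:
            continue
        for name, (low, high) in columns.items():
            # Skip columns that were not requested.
            if include_columns is not None and name not in include_columns:
                continue
            if name in result:
                cur_low, cur_high = result[name]
                result[name] = (min(cur_low, low), max(cur_high, high))
            else:
                result[name] = (low, high)
    return result


# Global range of two columns across all three pixels.
summary = global_min_max(per_pixel_stats, include_columns=["ra", "mag"])

# Global range of every column, restricted to two pixels.
subset = global_min_max(per_pixel_stats, include_pixels=[(1, 44), (1, 45)])
```

Because only the stored per-partition minimum and maximum values are combined, a summary of this kind can be produced without scanning the row data itself.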