aggregate_column_statistics
- Catalog.aggregate_column_statistics(use_default_columns: bool = True, exclude_hats_columns: bool = True, exclude_columns: list[str] | None = None, include_columns: list[str] | None = None, include_pixels: list[HealpixPixel] | None = None) → DataFrame
Read the footer statistics in the Parquet metadata and report the global min/max values.
- Parameters:
- use_default_columns : bool, default True
Whether to use only the columns that are loaded by default, as set in the metadata by the catalog provider.
- exclude_hats_columns : bool, default True
Whether to exclude HATS spatial and partitioning fields from the statistics.
- exclude_columns : list[str] or None, default None
Additional columns to exclude from the statistics.
- include_columns : list[str] or None, default None
If specified, return statistics only for the column names provided. If None, statistics are returned for all non-HATS columns.
- include_pixels : list[HealpixPixel] or None, default None
If specified, return statistics only for the pixels indicated. If None, statistics are returned for all pixels.
- Returns:
- pd.DataFrame
DataFrame with global summary statistics.
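
The idea of a "global" min/max can be illustrated with a small, self-contained sketch. It is a conceptual illustration only, not the library's implementation: the function `aggregate_min_max`, the dictionary layout of the per-partition statistics, and the sample values are all hypothetical. It assumes each partition already exposes a `(min, max)` pair per column, as a Parquet footer would, and combines those pairs while honoring include and exclude filters similar in spirit to `include_columns` and `exclude_columns`.

```python
def aggregate_min_max(partition_stats, include_columns=None, exclude_columns=None):
    """Combine per-partition (min, max) pairs into one global pair per column.

    partition_stats is a list of dicts, one per partition, mapping
    column name -> (min, max). This layout is an assumption made for
    illustration; it is not the format used by the library.
    """
    excluded = set(exclude_columns or [])
    global_stats = {}
    for stats in partition_stats:
        for column, (low, high) in stats.items():
            # Keep only the requested columns, if a selection was given.
            if include_columns is not None and column not in include_columns:
                continue
            if column in excluded:
                continue
            if column in global_stats:
                current_low, current_high = global_stats[column]
                # The global minimum is the smallest partition minimum,
                # and the global maximum is the largest partition maximum.
                global_stats[column] = (min(current_low, low), max(current_high, high))
            else:
                global_stats[column] = (low, high)
    return global_stats


# Two hypothetical partitions with made-up values.
partitions = [
    {"ra": (10.0, 20.0), "dec": (-5.0, 5.0), "mag": (14.2, 21.0)},
    {"ra": (15.0, 45.0), "dec": (-30.0, -1.0), "mag": (13.1, 19.5)},
]
result = aggregate_min_max(partitions, exclude_columns=["mag"])
print(result)
```

Because only the per-partition summaries are combined, no row data needs to be read, which is presumably why the method works from the footer statistics. On an already opened catalog, a comparable request would pass the same kind of filters as keyword arguments, for example `include_columns=["ra", "dec"]`, where the column names are placeholders.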