LSDB

Contents

LSDB#

LSDB is a python tool for scalable analysis of large catalogs (e.g. analyzing, querying and/or crossmatching ~10⁹ sources). It aims to address large-scale data processing challenges expected from upcoming surveys, such as LSST, Euclid, Roman, and the Schmidt Observatory System.

Built on top of Dask to efficiently scale and parallelize operations across multiple distributed workers, it uses the HATS data format to efficiently perform spatial operations.

LSDB API surface map with clickable function and class labels open_catalog from_dataframe from_astropy generate_catalog show_versions plot_pixels plot_coverage plot_points Catalog name columns all_columns nested_columns original_schema dtypes head tail sample random_sample rename hc_structure hc_collection MarginCatalog MapCatalog AssociationCatalog len aggregate_column_statistics per_pixel_statistics get_healpix_pixels get_ordered_healpix_pixels partitions npartitions get_partition_index estimate_size get_partition prune_empty_partitions map_partitions map_rows to_dask_dataframe to_delayed query box_search cone_search polygon_search order_search pixel_search id_search search moc_search crossmatch crossmatch_nested merge concat join join_nested nest_lists merge_asof merge_map CatalogStream InfiniteStream compute write_catalog to_hats to_association

Using this Guide#

Getting Started

Installation and QuickStart Guide

Getting Started with LSDB
Tutorials

Learn the LSDB features by working through our guides

Tutorials
Data Access

How LSDB catalogs are served and accessed across providers

Data Access
API Reference

The detailed API documentation

API Reference
Contact Us / Getting Help

Reach out for more help

Contact Us / Getting Help