Data Access
Data sets can be local or remote (e.g. cloud storage), and a wide variety of file formats are supported. The large, multidimensional, datasets required for modern scientific analysis are supported natively. Adding completely new data formats requires only a few lines of code.
Data can be anywhere
Local data
Data on your machine can be loaded directly into memory or lazy-loaded from disk (for very large files)
Remote data
Remote datasets can either be streamed on-demand or downloaded and loaded into memory
Data can be BIG
🧬 A large (~60 GB) multi-resolution image can be interactively panned and zoomed while remaining entirely on disk.
Most common data formats are already supported
Many common scientific data file formats are supported alongside domain-specific datasets, leveraging open-source libraries that provide access to a wide variety of underlying file formats.
🪐 Astro Favorites
🧬 Bio Favorites
New data loaders are easy
If a Python library exists to read your data file format, it is often just a few lines of code to define a custom data loader.