Beacon

A modern open-source data lakehouse for scientific data

Manage, query, and serve climate and scientific datasets from S3 buckets or local storage through SQL and JSON APIs.

time        latitude  longitude  temperature
2024-01-03  36.21     -5.43      21.8
2024-01-03  35.88     -6.10      22.4
2024-01-04  37.02     -4.77      20.9
2024-01-04  36.55     -5.92      23.1
2024-01-05  35.40     -6.58      22.0

Supported formats

Zarr, Parquet, TIFF, NetCDF, CSV, ODV, Arrow

Works with: DataGrip, DBeaver, JDBC, Arrow Flight SQL, Python, Dask, Jupyter Notebooks

Deploy Beacon anywhere

☁️ Cloud (AWS)
💻 Jupyter: remote notebook
Beacon: on EC2
🪣 S3 Bucket: object storage

Managed cloud: Beacon on EC2, data in S3.

🖥️ On-premise
💻 Jupyter: remote notebook
Beacon: your server
💾 Local disk: NetCDF · Parquet

Self-hosted: Beacon and data on one server.

💻 Local
📓 Jupyter: same machine
Beacon: localhost:5001
💾 Local files: on disk

All on one machine: ideal for development.

Built for scientific data

Serve your existing NetCDF, Zarr, Parquet, and other files as-is, with fast, standards-based SQL. Fully open source.

Get started in minutes

Run Beacon with Docker, point it at your files, and query over SQL or JSON.

docker pull ghcr.io/maris-development/beacon:latest
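
Once a Beacon container is running, querying it from Python might look roughly like the sketch below. It only builds the HTTP request and does not send it. The port comes from the local setup above (localhost:5001). The endpoint path `/api/query`, the `{"sql": ...}` payload shape, and the table name `observations` are assumptions made for illustration, so check the Beacon documentation for the real API before relying on them.

```python
import json
import urllib.request

# Assumption: Beacon is listening on the local port shown above (localhost:5001).
BEACON_URL = "http://localhost:5001"
# Hypothetical endpoint path; the real path may differ.
QUERY_PATH = "/api/query"


def build_sql_request(sql: str, base_url: str = BEACON_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying a SQL statement as JSON."""
    # Hypothetical payload shape: a JSON object with a single "sql" field.
    body = json.dumps({"sql": sql}).encode("utf-8")
    return urllib.request.Request(
        url=base_url + QUERY_PATH,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical table name "observations"; the columns match the sample rows above.
request = build_sql_request(
    "SELECT time, latitude, longitude, temperature FROM observations LIMIT 5"
)
print(request.get_method(), request.full_url)
```

To actually run the query, pass `request` to `urllib.request.urlopen` while the container is up, and adjust the path and payload to whatever the server expects.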