Build the Market Leader in Satellite Analytics with us at LiveEO
We are looking for a Senior Data Engineer to build the high-performance data backbone for our multitemporal, multimodal Earth observation models. While our ML Engineers focus on model architecture, you will own the infrastructure, ingestion, and refinement pipelines that combine very high-resolution optical and Synthetic Aperture Radar (SAR) data into production-ready datasets.
This is a high-impact role at the intersection of Big Data and AI. You will ensure that our "data engine" is scalable, deterministic, and capable of handling petabytes of geospatial information to enable semantic understanding across sensors and time.
LiveEO is a young, dynamic team that thrives on big challenges and fast learning cycles—we move quickly, stay curious, and genuinely enjoy building together. We’re on a mission to break the “curse of Earth Observation”: turning incredible satellite data into reliable, actionable decisions that people can trust and use in real operations. In this role, you’ll work in a fun, high-ownership environment where ambitious technical problems (multimodal SAR/optical foundation models) meet real-world impact—and where your ideas can go from whiteboard to production in tight, collaborative iterations.
You’ll sit within LiveEO’s AI team and partner closely with downstream product teams to translate model capabilities into measurable business value and production-ready workflows. You’ll also work hand-in-hand with our dedicated data annotation team to define labeling guidelines, drive feedback loops on data quality, and ensure training/evaluation datasets reflect real-world edge cases.
Tech stack & tools you will work with:
Ray (distributed compute)
Prefect (workflow orchestration)
AWS (cloud infrastructure)
Datastores: PostgreSQL (metadata / operational data)
Python (core development)
PyTorch + PyTorch Lightning (model training, experimentation)
Databricks + MLflow (experiment tracking, model registry)
Geospatial stack: GDAL, Rasterio, GeoPandas, STAC (EO data handling and standardization)