Sr. Staff ML Data Platform Engineer

Boston Dynamics, Inc.
1 day ago
Full-time
On-site
Waltham Office (POST) United States of America

We are seeking a seasoned and creative Senior Staff ML Data Platform Engineer to play a lead role in scaling the data platforms that power our robots, including Spot, Stretch, and Atlas. In this role, you will shape the end-to-end data pipeline that turns raw robot logs into high-quality training data for our teams. You'll work at the intersection of robotics, data engineering, and machine learning, ensuring our models have the right data, at the right scale, with the right quality.

You’ll make an impact by:

  • Scaling the unified data platform for the robotics fleet. Grow the multi-tenant infrastructure that ingests, stores, and serves sensor data, operational telemetry, and behavioral logs from Spot, Stretch, and Atlas — supporting diverse downstream consumers including ML training, safety analysis, fleet operations, and product development.

  • Making robot data usable at scale. Today's bottleneck isn't collecting data — it's finding and accessing the right data. You'll build the indexing, cataloging, and query layers that let engineers across the company ask questions of petabytes of heterogeneous robot data without needing to understand the underlying storage.

  • Designing for the physical world's constraints. Robot data is messy, intermittent, multi-modal, and generated at the edge. You'll solve hard problems around schema evolution, time-series alignment across sensor modalities, graceful handling of connectivity gaps, and data integrity across on-robot, on-prem, and cloud environments.

  • Building the platform as a product. Your users are internal engineering teams, and their adoption is your success metric. You'll define APIs, documentation, SLOs, and self-service workflows that make the platform the obvious default for any team that needs robot data — and you'll deprecate the ad-hoc alternatives that exist today.

  • Setting the technical direction for a growing domain. Drive architecture decisions through design documents and RFCs. Build alignment across infrastructure, ML, autonomy, and operations stakeholders. Mentor engineers and set standards for a platform org that will scale alongside the fleet.

To make an impact in this role you’ll bring:

  • 8+ years of experience architecting and leading large-scale data platforms, ideally within autonomous vehicle, robotics, or global IoT domains.

  • Expertise in high-volume, low-latency data processing technologies such as Spark and Kafka.

  • Demonstrated mastery of data warehousing solutions (e.g., BigQuery or custom architectures) and of data models that optimize storage and retention of massive datasets.

  • Strong foundation in data serialization and in-memory data representations, such as Apache Arrow.

  • Deep expertise in two or three languages, including experience with performance-critical systems languages (Rust, Go, or Java), and a strong foundation in systems engineering principles, memory management, and performance trade-offs.

  • A solid grasp of the Linux development environment and infrastructure management.

  • Practical experience defining and managing data lifecycles, including retention policies and regulatory compliance.

How You’ll Work

You'll be the technical lead for a team of six engineers building and operating the robotics data platform. You won't be a people manager — you'll lead through architecture decisions, code review, design mentorship, and setting the bar for engineering quality.

Day to day, you'll split your time between your platform team and the project teams that depend on it. You'll embed directly with project teams to understand their data needs firsthand, then translate those needs into platform roadmap priorities. This means you'll need to be as effective in a cross-functional planning meeting as you are in a design review.