Job Summary
We are seeking a Robotics Data Engineer with strong experience in machine learning data pipelines, robotics perception systems, and multimodal datasets. The ideal candidate will build scalable data infrastructure to support robotic vision, grasping, manipulation AI, and simulation-to-real (Sim2Real) workflows.
This role requires hands-on Python development, production ML pipeline experience, robotics simulation exposure, and collaboration with cross-functional AI and robotics teams.
Required Technical / Functional Skills
- 3+ years of experience in Data Engineering, Machine Learning Systems, Robotics, or related fields
- Experience building production-grade ML/AI data pipelines
- Strong Python programming skills (data processing & pipeline development)
- Experience handling large-scale multimodal datasets (vision, depth, tactile, sensor data)
- Direct experience supporting robotics perception, grasping, or manipulation AI
- Familiarity with robotics simulation platforms (e.g., Isaac Sim)
- Experience with synthetic data generation and Sim2Real workflows
- Experience with data labeling tools and annotation workflows at scale
- Hands-on knowledge of TensorFlow and/or PyTorch
- Experience with Microsoft data ecosystem (Azure data services, Power BI)
- Exposure to self-supervised or weakly supervised learning techniques
- Strong collaboration and systems-thinking mindset
Roles & Responsibilities
- Design and implement scalable data pipelines for robotic vision and sensor datasets
- Build infrastructure for high-throughput data capture from robots and simulation environments
- Develop semi-supervised and self-supervised data labeling workflows
- Enable Simulation-to-Real (Sim2Real) data workflows including domain randomization
- Manage dataset versioning, metadata, and data governance for ML model training
- Collaborate with Robotics Perception, Grasping AI, and Simulation teams
- Establish and monitor data quality metrics aligned with AI model performance
Technologies / Environments
- Python
- Machine Learning Data Systems
- TensorFlow / PyTorch
- Robotics Perception & Manipulation AI
- Isaac Sim (or similar simulation platforms)
- Azure Data Services
- Multimodal Robotics Datasets