GCP Data Engineer (Vertex AI)

Long Finch Technology LLC

Posted On: Jul 03, 2025

Posted On: Jul 03, 2025

Job Overview

Job Type

Full-time

Experience

10 - 25 Years

Salary

Depends on Experience

Work Arrangement

Remote

Travel Requirement

0%

Required Skills

  • GCP
  • Vertex AI
Job Description

As an Advanced Data Engineer, you will have the opportunity to lead the development of innovative data solutions, enabling the effective use of data across the organization. You will be responsible for designing, building, and maintaining robust data pipelines and platforms to meet business objectives, focusing on data as a strategic asset. Your role will involve collaboration with cross-functional teams, leveraging cutting-edge technologies, and ensuring scalable, efficient, and secure data engineering practices. A strong emphasis will be placed on expertise in GCP, Vertex AI, and advanced feature engineering techniques.

4+ years of professional Data Development experience.
4+ years of experience with SQL and NoSQL technologies.
3+ years of experience building and maintaining data pipelines and workflows.
Data Engineer Experience with GCP services such as Vertex AI Platform, Cloud Storage, AutoMLOps, and Dataflow
Experience developing with Python.
Experience with PySpark and Spark development.
Experience with CI/CD pipelines and processes.
Experience with automated unit, integration, and performance testing.
Experience with version control software such as Git.
Strong understanding of Agile principles (Scrum).
Additional Qualifications: 

Knowledge of Structured Streaming (Spark, Kafka, EventHub, or similar technologies).
Experience with GitHub SaaS/GitHub Actions.
Experience understanding Databricks concepts.
Roles & Responsibilities:
Provide Technical Leadership: Offer technical leadership to ensure clarity between ongoing projects and facilitate collaboration across teams to solve complex data engineering challenges.
Build and Maintain Data Pipelines: Design, build, and maintain scalable, efficient, and reliable data pipelines to support data ingestion, transformation, and integration across diverse sources and destinations, using tools such as Kafka, Databricks, and similar toolsets.
Drive Digital Innovation: Leverage innovative technologies and approaches to modernize and extend core data assets, including SQL-based, NoSQL-based, cloud-based, and real-time streaming data platforms.
Implement Feature Engineering: Develop and manage feature engineering pipelines for machine learning workflows, utilizing tools like Vertex AI, BigQuery ML, and custom Python libraries.
Implement Automated Testing: Design and implement automated unit, integration, and performance testing frameworks to ensure data quality, reliability, and compliance with organizational standards.
Optimize Data Workflows: Optimize data workflows for performance, cost efficiency, and scalability across large datasets and complex environments.
Mentor Team Members: Mentor team members in data principles, patterns, processes, and practices to promote best practices and improve team capabilities.
Draft and Review Documentation: Draft and review architectural diagrams, interface specifications, and other design documents to ensure clear communication of data solutions and technical requirements.
Co st/Benefit Analysis: Present opportunities with cost/benefit analysis to leadership, guiding sound architectural decisions for scalable and efficient data solutions
Support flows for an ML platform in GCP and needs to be able to work with data science and understand the ML concepts in terms of requirements to be met by the data.


Job ID: LF250007


Posted By

Vikrant Singh

Technical Recruiter