AI Lead Engineer – Generative AI & LLM Applications

Longfinch Technologies

Plano, TX

Posted On: May 14, 2026

Job Overview

Job Type

Full-time

Experience

5 - 20 Years

Salary

$110,000 - $130,000 Per Year

Work Arrangement

On-Site

Travel Requirement

Required Skills

GenAI
LLM
Cloud & AI Services

Job Description

8–15 years in AI/ML development, with 3+ years specialized in Generative AI and LLM applications.

Role Overview

The AI Lead Engineer will design, build, and operate production-grade Generative AI solutions for complex enterprise scenarios. The role focuses on scalable LLM-powered applications, robust RAG pipelines, and multi-agent systems with MCP deployed across major cloud AI platforms.

Key Responsibilities:

Technical Leadership & Development

· Design and implement enterprise-grade GenAI solutions using LLMs (GPT, Claude, Llama and similar families).

· Build and optimize production-ready RAG pipelines including chunking, embeddings, retrieval tuning, query rewriting, and prompt optimization.

· Develop single- and multi-agent systems using LangChain, LangGraph, LlamaIndex and similar orchestration frameworks.

· Design agentic systems with robust tool calling, memory management, and reasoning patterns.

· Author MCP (Model Context Protocol) servers, tools, and resources, and integrate them with Cursor, Claude, Codex, Copilot, and internal enterprise systems.

· Build plugins and extensions for Claude, Codex, Cursor and GitHub Copilot ecosystems.

· Building AI Agents and Sub-Agents, Agent Skills for tools like Claude Code, Codex, and GitHub Copilot.

· Build scalable Python + FastAPI/Flask or MCP microservices for AI-powered applications, including integration with enterprise APIs.

· Implement model evaluation frameworks using RAGAS, DeepEval, or custom metrics aligned to business KPIs.

· Implement agent-based memory management using Mem0, LangMem or similar libraries.

· Fine-tune and evaluate LLMs for specific domains and business use cases.

· Deploy and manage AI solutions on Azure (Azure OpenAI, Azure AI Studio, Copilot Studio), AWS (Bedrock, SageMaker, Comprehend, Lex), and GCP (Vertex AI, Generative AI Studio).

· Implement observability, logging, and telemetry for AI systems to ensure traceability and performance monitoring.

· Ensure scalability, reliability, security, and cost-efficiency of production AI applications.

· Deep understanding of RAG architectures, hybrid retrieval, and context engineering patterns.

· Translate business requirements into robust technical designs, architectures, and implementation roadmaps.

· Drive innovation by evaluating new LLMs, orchestration frameworks, and cloud AI capabilities (including Copilot Studio for copilots and workflow automation).

Required Skills & Experience:

Core Technical

· Programming: Expert-level Python with production-quality code, testing, and performance tuning.

· GenAI Frameworks: Strong hands-on experience with LangChain, LangGraph, LlamaIndex, agentic orchestration libraries.

· LLM Integration: Practical experience integrating OpenAI, Anthropic Claude, Azure OpenAI, AWS Bedrock, and Vertex AI models via APIs/SDKs.

· RAG & Search: Deep experience designing and operating RAG workflows (document ingestion, embeddings, retrieval optimization, query rewriting).

· Vector Databases: Production experience with at least two of OpenSearch, Pinecone, Qdrant, Weaviate, pgvector, FAISS.

Cloud & AI Services

· Azure: Azure OpenAI, Azure AI Studio, Copilot Studio, Azure Cognitive Search.

· AWS: Bedrock, SageMaker endpoints, AWS Nova, AWS Transform etc.

· GCP: Vertex AI (models, endpoints), Agentspace, Agent Builder.

Preferred Qualifications

· Master's degree in Computer Science, AI/ML, Data Science, or related field.

· Experience with multi-agent systems, Agent-to-Agent (A2A) communication, and MCP-based ecosystems.

· Familiarity with LLMOps / observability platforms such as LangSmith, Opik, Azure AI Foundry.

· Experience integrating graph databases and knowledge graphs to enhance retrieval and reasoning.

Job ID: LFT121256

Posted By

Chandramani Prakash