We are looking for a Gen AI Solution Architect to design, build, and scale reusable Generative AI modules and APIs. This role blends product vision, technical leadership, and hands-on implementation of LLM workflows, hybrid ML/LLM systems, and enterprise integrations.
Roles and Responsibilities
- Define product roadmap and architect reusable Gen AI modules (RAG, prompting frameworks, hybrid ML/LLM).
- Design APIs and microservices to expose AI capabilities as plug-and-play components.
- Standardize patterns across use cases: prompts, chunking, few-shot pipelines.
- Integrate LLMs with traditional ML and enterprise systems, ensuring performance, scalability, and cost efficiency.
- Enable adoption through documentation, tutorials, sandboxes, and team training.
- Track adoption and impact metrics (module reuse, deployment speed, contribution volume).
Required Skills & Experience
- Hands-on experience with LLMs (OpenAI, Anthropic, LLaMA 2) and frameworks (LangChain, LlamaIndex).
- Expertise in RAG, document chunking, vector DBs (Pinecone, FAISS), text-to-SQL, and hybrid ML workflows.
- Strong software engineering skills: Python, API design (FastAPI/Flask), cloud AI platforms, CI/CD, Docker, Terraform.
- Frontend/middleware integration (React/Streamlit, message queues, auth systems).
- Proven track record in building reusable ML/API products or internal AI platforms.