AI/ML Platform Architect
Leading Fortune 100 - BFSI domain client
Years of exp. required: 15+
Role: AI/ML Platform Engineer
Contact: 12 to 24 months (Possible conversion to FTE)
Location: Concord, CA
Hybrid working model – 3 days each week onsite
Our client is looking for a talented AI/ML Platform Engineer with a strong emphasis on Google Cloud Platform (GCP) to join our dynamic team. In this role, you will be responsible for designing, implementing, and maintaining AI/ML infrastructure and solutions that leverage the full power of GCP. Your expertise will be crucial in enabling our data scientists and machine learning engineers to develop, deploy, and scale AI models efficiently and effectively.
Key Responsibilities:
• Build robust AI/ML platforms on GCP, ensuring scalability, reliability, and performance.
• Set up and maintain GCP services such as AI Platform, BigQuery, Cloud Storage, Compute Engine, and Kubernetes Engine.
• Develop automated workflows and pipelines for model training, validation, deployment, and monitoring.
• Work closely with data scientists, ML engineers, and other stakeholders to understand their needs and provide optimal solutions.
• Continuously optimize the AI/ML infrastructure for cost, performance, and security.
• Implement monitoring solutions to ensure the health and performance of AI/ML systems, and troubleshoot any issues that arise.
• Maintain comprehensive documentation of the architecture, workflows, and best practices.
Preferred Qualifications:
• Proven experience as an AI/ML engineer with a focus on platform engineering and GCP.
• Experience with machine learning frameworks and libraries such as TensorFlow, PyTorch, or Scikit-learn.
• Experience in Terraform and Helm will be an advantage.
• Experience in implementing inferencing, benchmarking, and fine-tuning Generative AI models using GCP services.
• Strong understanding of LLMs and proven experience building platforms and applications that leverage them.
• Experience with CI/CD pipelines, containerization (Docker), and orchestration (Kubernetes).
• Strong understanding of data storage, processing, and ETL workflows.
• Knowledge of chunking strategies for vector database
• Knowledge on classification & Embedding models.
• Familiarity with Agile methodologies and project management tools.
Desired Qualifications:
• Excellent problem-solving skills and the ability to work in a fast-paced environment.
• Strong communication and collaboration skills to work effectively with cross-functional teams.
• 2+ years of designing, developing, testing, and optimizing Python microservices using gRPC
• 2+ years of Python development experience
• 3+ years of Linux experience
EEO:
“Mindlance is an Equal Opportunity Employer and does not discriminate in employment based on – Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.”