Generative AI Engineer

This role is for a Generative AI Engineer on a contract basis, 100% remote, with a pay rate of "unknown". Key skills include AWS Sagemaker, Generative AI, and model finetuning for LLMs. A degree in Computer Science or related field is required.
🌎 - Country
United States
💱 - Currency
$ USD
💰 - Day rate
Unknown
Unknown
480
🗓️ - Date discovered
January 17, 2025
🕒 - Project duration
Unknown
🏝️ - Location type
Remote
📄 - Contract type
Unknown
🔒 - Security clearance
Unknown
📍 - Location detailed
United States
🧠 - Skills detailed
#AI (Artificial Intelligence) #Deployment #Cloud #Python #AWS SageMaker #SQL (Structured Query Language) #Model Evaluation #AWS (Amazon Web Services) #Programming #SageMaker #TensorFlow #Databases #Docker #Computer Science #Scala #Data Science #PyTorch #ML (Machine Learning) #Kubernetes #Data Processing
Role description
Log in or sign up for free to view the full role description and the link to apply.

Role: Generative AI Engineer with AWS

Location: 100% Remote

Contract role

Mandatory Skills : AWS Sagemaker , AWS BedRock ,Generative AI

Job Overview:

AWS Experience - AWS Sagemaker is required, AWS BedRock would be a nice to have.

Model Building, Accuracy Metrics, Finetuning - standard Data Science skillset.

Proven expertise in model finetuning for LLMs - PEFT, LORA techniques would be a big plus.

Able to understand what technique to use for data type.

RAG Experience would be great to have - similar to AI Engineer.

Machine Learning Engineering:
• Develop, train, and deploy ML models, ensuring they are optimized for production environments.
• Create and maintain automated feedback loops to enhance model accuracy and performance.
• Implement ML pipelines for continuous evaluation and refinement of models in production.

AI Orchestration & Integration:
• Integrate Large Language Models (LLMs) into business applications.
• Build AI orchestration systems to manage the end-to-end lifecycle of AI models, including deployment and scaling.
• Work with Vector Databases (VectorDB) to store and query high-dimensional data for AI applications.

Model Evaluation & Feedback Loops:
• Set up evaluation metrics and processes to assess model performance over time.
• Create feedback loops using real-world data to improve model reliability and accuracy.

Text-to-SQL & Generative AI-driven Solutions:
• Develop GenAI-driven Text-to-SQL solutions to automate database queries based on natural language input.
• Optimize GenAI workflows for database interactions and information retrieval.

Embedding/Chunking & Prompt Engineering:
• Design and implement embedding and chunking strategies for scalable data processing.
• Utilize prompt engineering techniques to fine-tune the performance of AI models in production environments.

Required Qualifications:
• Bachelor's or Master's degree in Computer Science, AI, Machine Learning, or a related field.
• Proven experience in building, deploying, and maintaining ML models in production environments.
• Proficiency in programming languages like Python, and frameworks such as TensorFlow, PyTorch, or similar.
• Familiarity with LLMs, VectorDB, embedding/chunking strategies, and AI orchestration tools.
• Strong understanding of model evaluation techniques and feedback loop systems.
• Hands-on experience with Text-to-SQL and prompt engineering methodologies.
• Knowledge of cloud platforms (AWS) and containerization tools (Docker, Kubernetes).