Experienced LLM Engineer

This role is for an "Experienced LLM Engineer" with a contract length of "more than 6 months" and a remote work location. Key skills include "Python, FastAPI, Generative AI development," and experience with "AWS or Azure."

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

Unknown

🗓️ - Date discovered

January 20, 2025

🕒 - Project duration

More than 6 months

🏝️ - Location type

Remote

📄 - Contract type

Unknown

🔒 - Security clearance

Unknown

📍 - Location detailed

United States

🧠 - Skills detailed

#Python #Programming #AWS (Amazon Web Services) #"ETL (Extract #Transform #Load)" #Azure #FastAPI #ML (Machine Learning) #Model Deployment #Deployment #AI (Artificial Intelligence) #Data Science #Databases #Scala #Cloud #API (Application Programming Interface)

Role description

Saroe, Inc. is hiring an Experienced LLM Engineer on behalf of our client, a forward-thinking technology company specializing in Generative AI solutions. This role is ideal for candidates with hands-on experience in GenAI development and a strong foundation in backend engineering. As part of this position, you will work on diverse and impactful projects, including client-facing and internal initiatives to develop innovative AI-driven use cases such as AI assistants and content generation tools.

Key Responsibilities
• RAG Pipeline Development: Design and implement Retrieval-Augmented Generation (RAG) pipelines for efficient, context-aware information retrieval.
• Prompt Engineering: Apply prompt tuning and fine-tuning strategies to enhance the relevance and quality of AI outputs.
• API Development: Build, maintain, and optimize backend APIs using FastAPI or similar frameworks for seamless integration with AI applications.
• Model Deployment: Deploy and manage Large Language Models (LLMs) such as OpenAI models or equivalents, ensuring production readiness and performance.
• Cloud Integration: Utilize cloud platforms like AWS or Azure to deploy, scale, and optimize AI models.
• Validation and Testing: Validate and test generative AI outputs to ensure reliability and high-quality performance.
• Vector Databases and Caching: Work with vector databases and caching systems to enhance data retrieval and minimize latency.
• Collaboration: Collaborate with data scientists, engineers, and stakeholders to deliver robust AI solutions for diverse projects.
• Performance Optimization: Profile, debug, and optimize machine learning systems to ensure scalability and optimal system performance.

Qualifications and Requirements
• Experience:
• At least 9+ months of relevant experience in Generative AI development.
• 2+ years of overall experience in software development or machine learning engineering.
• Technical Skills:
• Proficiency in Python programming.
• Experience with FastAPI or similar frameworks for backend API development.
• Hands-on expertise with large language models (e.g., OpenAI or similar).
• Proficiency with cloud platforms like AWS or Azure for deployment and scaling.
• Experience deploying LLM-based products into production environments.
• Project Experience:
• Involvement in AI-driven projects such as building AI assistants or content generation tools.
• Familiarity with RAG pipeline creation, prompt tuning, and validation of generative outputs.
• Experience with vector databases and caching for optimizing data retrieval.
• Soft Skills:
• Strong communication skills for collaborative teamwork and occasional client interactions.
• Problem-solving mindset and ability to work in dynamic environments.

Performance Expectations
• Design and implement efficient RAG pipelines and GenAI solutions aligned with project goals.
• Optimize backend systems and APIs for seamless integration with AI models.
• Ensure scalable, reliable, and high-performance deployments.
• Collaborate effectively with cross-functional teams and stakeholders.
• Contribute to client-facing projects, including interviews and presentations, when required.

Location

This position is remote,

Why Join Us?

At Saroe, Inc., we collaborate with industry leaders to provide transformative technology solutions. This role offers an opportunity to work on high-impact Generative AI projects, contributing to advancements in AI innovation.

How to Apply

If you are passionate about pushing the boundaries of Generative AI and ready to make an impact, we want to hear from you. Apply now to be a part of this exciting journey!

Apply now Sign up

 See all roles

Go to role

Palantir Developer – Data Engineering & AI

This role is for a "Palantir Developer – Data Engineering & AI" on a contract basis, paying $65/hour. It requires expertise in Palantir Foundry, data engineering, AI/ML frameworks, and strong programming skills. Remote work location; Bachelor's/Master’s in relevant field required.

🌎 - Country

United States

Experienced LLM Engineer

Ready for your next role? Let us help you land it—here’s how!

Palantir Developer – Data Engineering & AI

Enterprise Data Modeler

Project Engineer

Data Modeler

Ready for your next role? Let us help you land it—here’s how!