ℹ️ - 1 of 5 free roles viewed today. Upgrade to premium for unlimited from only $19.99 with a 2-day free trial.

GenAI Ops Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a GenAI Ops Engineer in Austin, TX, with a long-term contract and a pay rate of $55-$62. Key skills include LLMs, PyTorch, and cloud services (AWS, GCP). Experience in LLM training and optimization is required.
🌎 - Country
United States
💱 - Currency
$ USD
💰 - Day rate
Unknown
Unknown
496
🗓️ - Date discovered
April 3, 2025
🕒 - Project duration
Unknown
🏝️ - Location type
On-site
📄 - Contract type
Unknown
🔒 - Security clearance
Unknown
📍 - Location detailed
Austin, TX
🧠 - Skills detailed
#Datasets #AI (Artificial Intelligence) #AWS (Amazon Web Services) #GCP (Google Cloud Platform) #Databases #PyTorch #Python #ML (Machine Learning) #Kubernetes #Docker #Cloud
Role description
You've reached your limit of 5 free role views today.
Upgrade to premium for unlimited access - from only $19.99.

Job Title: GenAI Ops Engineer

Location: Austin TX

Duration: Long Term

Rate: $55-$62

Must Have Skills:-LLMs,PyTorch,DeepSpeed,LoRA,ONNX,vLLM,TensorRT,GPU,AWS,GCP

Key Responsibilities:

   • Train and fine-tune LLMs using PyTorch, DeepSpeed, and LoRA.

   • Optimize inference using ONNX, vLLM, TensorRT, and GPU acceleration.

   • Manage datasets, preprocess data, and implement RAG with vector databases (FAISS, Chroma, Pinecone).

   • Automate training workflows using ML flow, Weights & Biases, and Ray.

   • Deploy models using Kubernetes, Docker, and cloud AI services (AWS or GCP).

   • Monitor model performance, mitigate drift, and optimize resource utilization.

Requirements:

   • Experience with LLM training, fine-tuning, and inference optimization.

   • Proficiency in Python, cloud AI services, and distributed training.

   • Familiarity with retrieval-augmented generation (RAG) and prompt engineering.

   • Strong problem-solving skills and ability to work in fast-paced AI environments.

Preferred:

   • Experience with open-weight models (LLaMA, Mistral, Gemma, Falcon, etc.).

   • Hands-on knowledge of multi-agent architectures and synthetic data generation.

Abhijeet A

Lead Technical Recruiter @ CoreTek Labs

Cell : +18164630256

E-Mail : - Abhijeet@coretek.io