

GenAI Ops Engineer
Job Title: GenAI Ops Engineer
Location: Austin TX
Duration: Long Term
Rate: $55-$62
Must Have Skills:-LLMs,PyTorch,DeepSpeed,LoRA,ONNX,vLLM,TensorRT,GPU,AWS,GCP
Key Responsibilities:
• Train and fine-tune LLMs using PyTorch, DeepSpeed, and LoRA.
• Optimize inference using ONNX, vLLM, TensorRT, and GPU acceleration.
• Manage datasets, preprocess data, and implement RAG with vector databases (FAISS, Chroma, Pinecone).
• Automate training workflows using ML flow, Weights & Biases, and Ray.
• Deploy models using Kubernetes, Docker, and cloud AI services (AWS or GCP).
• Monitor model performance, mitigate drift, and optimize resource utilization.
Requirements:
• Experience with LLM training, fine-tuning, and inference optimization.
• Proficiency in Python, cloud AI services, and distributed training.
• Familiarity with retrieval-augmented generation (RAG) and prompt engineering.
• Strong problem-solving skills and ability to work in fast-paced AI environments.
Preferred:
• Experience with open-weight models (LLaMA, Mistral, Gemma, Falcon, etc.).
• Hands-on knowledge of multi-agent architectures and synthetic data generation.
Abhijeet A
Lead Technical Recruiter @ CoreTek Labs
Cell : +18164630256
E-Mail : - Abhijeet@coretek.io
Job Title: GenAI Ops Engineer
Location: Austin TX
Duration: Long Term
Rate: $55-$62
Must Have Skills:-LLMs,PyTorch,DeepSpeed,LoRA,ONNX,vLLM,TensorRT,GPU,AWS,GCP
Key Responsibilities:
• Train and fine-tune LLMs using PyTorch, DeepSpeed, and LoRA.
• Optimize inference using ONNX, vLLM, TensorRT, and GPU acceleration.
• Manage datasets, preprocess data, and implement RAG with vector databases (FAISS, Chroma, Pinecone).
• Automate training workflows using ML flow, Weights & Biases, and Ray.
• Deploy models using Kubernetes, Docker, and cloud AI services (AWS or GCP).
• Monitor model performance, mitigate drift, and optimize resource utilization.
Requirements:
• Experience with LLM training, fine-tuning, and inference optimization.
• Proficiency in Python, cloud AI services, and distributed training.
• Familiarity with retrieval-augmented generation (RAG) and prompt engineering.
• Strong problem-solving skills and ability to work in fast-paced AI environments.
Preferred:
• Experience with open-weight models (LLaMA, Mistral, Gemma, Falcon, etc.).
• Hands-on knowledge of multi-agent architectures and synthetic data generation.
Abhijeet A
Lead Technical Recruiter @ CoreTek Labs
Cell : +18164630256
E-Mail : - Abhijeet@coretek.io