1 of 5 free roles viewed today. Upgrade to premium for unlimited from only $19.99 with a 2-day free trial.

AIML / LLM Engineer @ Major Financial Services

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an AIML / LLM Engineer in Charlotte, NC (Hybrid). Contract length is unspecified, with a pay rate for W2 candidates. Key skills include LLMs, GCP, Django, and 7–10 years of technical experience in AI model deployment and data processing.
🌎 - Country
United States
💱 - Currency
$ USD
💰 - Day rate
Unknown
Unknown
🗓️ - Date discovered
April 6, 2025
🕒 - Project duration
Unknown
🏝️ - Location type
Hybrid
📄 - Contract type
W2 Contractor
🔒 - Security clearance
Unknown
📍 - Location detailed
Charlotte Metro
🧠 - Skills detailed
#GCP (Google Cloud Platform) #AI (Artificial Intelligence) #Model Deployment #API (Application Programming Interface) #Computer Science #Distributed Computing #Cloud #Data Processing #Deployment #Scala #Swagger #Apache Spark #PySpark #Apache Kafka #TensorFlow #Kubernetes #FastAPI #Python #Django #Docker #Spark (Apache Spark) #PyTorch #Kafka (Apache Kafka) #Deep Learning
Role description
You've reached your limit of 5 free role views today.
Upgrade to premium for unlimited access - from only $19.99.

Job Title: AIML / LLM Engineer

Client: Major Financial Services

Location: Charlotte, NC (Hybrid: 3 days onsite, 2 days remote)

Compensation: W2 candidates only

Local preferred, but open to candidates willing to relocate

✅ Must-Have Skills:

   • Experience with LLMs (Large Language Models) – e.g., LLaMA, Mistral

   • Strong hands-on with GCP (Google Cloud Platform) and GPU-based model deployment

   • Backend experience with Django framework

   • API development expertise with FastAPI, Uvicorn, and Swagger

   • 🎓 Education: Minimum Bachelor’s degree in Computer Science, IT, or related field with

🧠 Required Technical Experience (7–10 years total):

   • Python and Apache Spark (PySpark) for data processing

   • Kubernetes for container orchestration

   • Apache Kafka for real-time data streaming

   • Cloud-native architecture and scalable API development

   • Optimization and deployment of AI models on GPU clusters using:

   • TensorFlow Distributed

   • PyTorch Distributed

   • Horovod

   • Managing and configuring NVIDIA GPUs and GCP (TPUs, GPU instances)

   • Experience with distributed computing frameworks and parallel processing for large-scale deep learning or generative AI

💡 Nice-to-Haves:

Prior experience in financial services or banking

Exposure to MLOps pipelines

Familiarity with containerized model deployment using Docker + Kubernetes

Must have skill: LLM, GCP, GPU and Django

TopTech Talent is proud to be an equal opportunity workplace and is an affirmative action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, age, national origin, citizenship status, disability, protected veteran status, gender identity or any other factor protected by applicable federal, state, or local laws.