

AIML / LLM Engineer @ Major Financial Services
Job Title: AIML / LLM Engineer
Client: Major Financial Services
Location: Charlotte, NC (Hybrid: 3 days onsite, 2 days remote)
Compensation: W2 candidates only
Local preferred, but open to candidates willing to relocate
✅ Must-Have Skills:
• Experience with LLMs (Large Language Models) – e.g., LLaMA, Mistral
• Strong hands-on with GCP (Google Cloud Platform) and GPU-based model deployment
• Backend experience with Django framework
• API development expertise with FastAPI, Uvicorn, and Swagger
• 🎓 Education: Minimum Bachelor’s degree in Computer Science, IT, or related field with
🧠 Required Technical Experience (7–10 years total):
• Python and Apache Spark (PySpark) for data processing
• Kubernetes for container orchestration
• Apache Kafka for real-time data streaming
• Cloud-native architecture and scalable API development
• Optimization and deployment of AI models on GPU clusters using:
• TensorFlow Distributed
• PyTorch Distributed
• Horovod
• Managing and configuring NVIDIA GPUs and GCP (TPUs, GPU instances)
• Experience with distributed computing frameworks and parallel processing for large-scale deep learning or generative AI
💡 Nice-to-Haves:
Prior experience in financial services or banking
Exposure to MLOps pipelines
Familiarity with containerized model deployment using Docker + Kubernetes
Must have skill: LLM, GCP, GPU and Django
TopTech Talent is proud to be an equal opportunity workplace and is an affirmative action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, age, national origin, citizenship status, disability, protected veteran status, gender identity or any other factor protected by applicable federal, state, or local laws.
Job Title: AIML / LLM Engineer
Client: Major Financial Services
Location: Charlotte, NC (Hybrid: 3 days onsite, 2 days remote)
Compensation: W2 candidates only
Local preferred, but open to candidates willing to relocate
✅ Must-Have Skills:
• Experience with LLMs (Large Language Models) – e.g., LLaMA, Mistral
• Strong hands-on with GCP (Google Cloud Platform) and GPU-based model deployment
• Backend experience with Django framework
• API development expertise with FastAPI, Uvicorn, and Swagger
• 🎓 Education: Minimum Bachelor’s degree in Computer Science, IT, or related field with
🧠 Required Technical Experience (7–10 years total):
• Python and Apache Spark (PySpark) for data processing
• Kubernetes for container orchestration
• Apache Kafka for real-time data streaming
• Cloud-native architecture and scalable API development
• Optimization and deployment of AI models on GPU clusters using:
• TensorFlow Distributed
• PyTorch Distributed
• Horovod
• Managing and configuring NVIDIA GPUs and GCP (TPUs, GPU instances)
• Experience with distributed computing frameworks and parallel processing for large-scale deep learning or generative AI
💡 Nice-to-Haves:
Prior experience in financial services or banking
Exposure to MLOps pipelines
Familiarity with containerized model deployment using Docker + Kubernetes
Must have skill: LLM, GCP, GPU and Django
TopTech Talent is proud to be an equal opportunity workplace and is an affirmative action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, age, national origin, citizenship status, disability, protected veteran status, gender identity or any other factor protected by applicable federal, state, or local laws.