DevOps Engineer - AI Infrastructure
Job Title: DevOps Engineer – AI Infrastructure
Location: 100% Remote
Pay: Up to $80/hr.
Type: 6-month contract, potential for conversion to FTE
TL;DR: Seeking a DevOps Engineer who specifically understands the nuances of AI infrastructure and security. Skills include CI/CD, K8s, IaC with Terraform, and Azure/AWS.
Who We Are
Xcelocloud is a services partner & integration hub powering some of the largest partners in the world serving enterprise clients. We combine expert engineers, unique integrations, sales and delivery strategies, and proven processes to support our partners globally. Each segment closely aligns with the Xcelocloud difference: Expertise, Experience, and Execution.
We have built an ecosystem with carefully selected industry leaders across networking, security, and cloud segments. Our team maximizes our clients’ available resources while tailoring unique solutions to solve business challenges.
Xcelocloud is on a mission to empower large resellers to achieve more through services. Our culture is centered on ownership through customer experience, industry expertise, learning, critical thinking, and encouraging teams and leaders to bring their best each day to execute. In doing so, we create a modern approach that impacts partners and end-users around the world. You can help us to achieve our mission.
Job Description:
Xcelocloud is seeking a highly skilled AI Infrastructure DevOps Engineer to architect, manage, and optimize our cutting-edge AI infrastructure. The ideal candidate will possess advanced expertise in AI-specific security protocols, Kubernetes, monitoring, performance optimization, CI/CD automation, and infrastructure as code (Terraform). This role requires a versatile professional with exceptional technical acumen and leadership capabilities to drive cross-functional collaboration.
What You’ll Do:
AI Infrastructure Engineering:
Engineer and deploy scalable, secure infrastructures specifically tailored for AI model deployment, training, and management.
Implement and manage Kubernetes clusters optimized for AI workloads, ensuring high availability and performance.
Design and enforce security measures unique to AI systems, including data privacy, model integrity, and compliance with AI ethics standards.
Utilize Azure and AWS AI services to enhance the delivery and management of AI solutions.
Develop and automate CI/CD pipelines to streamline and accelerate deployment processes.
Utilize Terraform for infrastructure as code, ensuring consistent and reproducible deployments.
Technical Leadership:
Work with cross-functional teams including AI developers, data analysts, and security experts to design and implement AI solutions that meet client needs.
Collaborate with stakeholders to align AI infrastructure development with business objectives.
Drive technical discussions and strategy sessions focused on AI innovation and infrastructure improvement.
System Integrity and Security:
Ensure AI systems' security through best practices in access control, data protection, and compliance with AI-specific regulations.
Conduct regular security audits and performance assessments for AI infrastructure.
Documentation and Knowledge Sharing:
Document AI-specific system configurations, security protocols, and best practices.
Mentor team members on AI infrastructure management and security.
Required Experience:
Bachelor of Science in Information Technology, Computer Science, AI, or a related field.
1+ years of experience in managing AI infrastructure.
5+ years of DevOps Engineering experience.
Preferred Experience:
Experience with AI infrastructure (LLMs, deep learning models, vector databases, AI pipelines).
Strong expertise in AWS, GCP, or Azure (serverless, Kubernetes, cloud storage, etc.).
Experience with MLOps, AI model serving, and GPU-based computing.
Knowledge of Docker, Kubernetes, Terraform, and Ansible for deployment automation.
Experience with real-time data processing (Kafka, WebSockets, or similar).
Familiarity with Deepgram, Twilio, Microsoft Teams, and RingCentral integrations is a plus.
Hands-on experience in AI model optimization, logging, and scaling.
If this DevOps Engineering role is interesting to you, apply today!
Keywords: AI Infrastructure DevOps Engineer, AI Security, AI Model Deployment, Kubernetes for AI, AI Performance Optimization, AI Compliance, AI Infrastructure Management, AI Data Privacy, AI System Integrity, Azure AI Services, AWS AI Services, CI/CD for Artificial Intelligence, Infrastructure as Code, Terraform, Artificial Intelligence Monitoring, DevOps, AI Scalability, AI High Availability, AI Development, AI Infrastructure Documentation, AI System Security, DevOps for AI, Artificial Intelligence Solutions Architecture, AI Technical Leadership, AI Infrastructure Automation, AI Performance Tuning, AI Innovation, AI Best Practices, AI Workload Management, development operations, Kubernetes, cluster, container, containerize, IaC, script, automate, automation, automates, automated, Systems Engineer.
Job Type: Contract
Pay: Up to $80.00 per hour
Expected hours: 40 per week
Compensation Package:
1099 contract
Hourly pay
Schedule:
8-hour shift
Experience:
DevOps: 5 years (Required)
AIOps: 1 year (Preferred)
Kubernetes: 1 year (Required)
Terraform: 1 year (Required)
Work Location: Remote