Title:
AI Infrastructure / ML Infrastructure Engineer
Job Type:
Contract
Contract Length:
12 Months
Pay Range:
$50/hr – $175/hr
Start Date:
ASAP
Location:
Remote
About the Opportunity:
Our client, a leader in AI testing, is looking for a skilled AI Infrastructure / ML Infrastructure Engineer
to join their team for a 12-month engagement. This project involves provisioning, managing, and optimizing high-performance GPU clusters and infrastructure to support mission-critical AI applications. This is a high-impact role that requires a self-motivated professional who can hit the ground running and deliver results quickly.
Key Responsibilities & Deliverables:
This role is focused on the successful completion of specific tasks and deliverables. Your responsibilities will include:
- Provisioning and managing high-performance GPU clusters using Terraform or CloudFormation.
- Building and maintaining the internal "Model Hub" for versioning and deploying AI models across the company.
- Optimizing the networking and storage layers to support multi-node distributed training.
- Implementing autoscaling logic to manage inference costs while meeting peak user demand.
- Designing high-availability infrastructure for mission-critical AI applications.
We are looking for someone with a proven track record of successful contract engagements. The ideal candidate will have:
- 5+ years of experience in Cloud Infrastructure or DevOps.
- Deep expertise in AWS/Azure/GCP, Kubernetes (EKS/GKE), and GPU orchestration. This isn't a learning role—you need to be a subject matter expert.
- Demonstrated ability to work autonomously and manage your own time effectively to meet project goals.
- Experience with Terraform, Docker, and monitoring tools like Prometheus/Grafana.
- Strong communication skills to provide clear and concise status updates to the project team.





