We are looking for a Network and Cloud AI Engineer to design, deploy, and optimize our AI infrastructure across cloud and hybrid environments. This role combines strong networking fundamentals with hands-on experience in AI/ML deployment, cloud infrastructure, and data pipelines.
You will collaborate closely with data scientists, cloud architects, and DevOps engineers to deliver scalable, secure, and high-performance AI solutions.
Responsibilities
Infrastructure & Cloud
Design and manage cloud-native AI platforms (e.g., Databricks, Azure ML, SageMaker, GCP Vertex AI)
Optimize network performance for real-time and distributed AI workloads
Manage GPU/TPU infrastructure and scalable compute environments
Deploy models using containerization technologies (Docker, Kubernetes)
Networking & Security
Configure VPCs, load balancers, firewalls, and VPNs
Secure communication between edge devices, cloud, and data sources
Implement IAM policies, encryption, and access controls
AI & MLOps
Support AI workflows: data prep, training, deployment, and monitoring
Build and maintain ML pipelines using MLflow, Kubeflow, or CI/CD tools
Monitor models for performance, drift, and cost efficiency
Optimize AI models for edge, hybrid, and cloud deployment
Collaboration & Documentation
Work with data scientists and DevOps teams on model lifecycle
Document infrastructure, architecture, and processes
Participate in code reviews and architecture planning
Qualifications
Required
Bachelor's or Master's in Computer Science, Engineering, or related field
3-7 years of experience in cloud infrastructure, networking, or AI deployment
Experience with AWS, Azure, or GCP networking and AI tools
Proficiency with Docker and Kubernetes
Strong understanding of networking (VPCs, routing, firewalls, etc.)
Familiarity with ML tools like MLflow, SageMaker, or Azure ML
Programming in Python and scripting in Bash/PowerShell
Preferred
Certifications in Databricks, AWS ML, Azure AI, or GCP ML
Experience with Kafka, Pub/Sub, or other data streaming tools
Knowledge of edge model optimization (e.g., ONNX, TensorRT)
Experience in regulated domains (e.g., healthcare, finance, telecom)
Tools & Technologies
Cloud:
AWS, Azure, GCP
AI Platforms:
Databricks, MLflow, Kubeflow, SageMaker, Azure ML
Networking:
VPC, DNS, Load Balancers, VPN, Private Link
DevOps:
Terraform, Ansible, GitHub Actions, Jenkins
Languages:
Python, SQL, Bash
Benefits
Competitive salary with performance bonus
Flexible remote work arrangements
Certification and training sponsorship
Health, dental, and vision insurance
Learning and development budget
Note
: At this time, we are not seeking assistance from staffing or recruitment agencies.