Senior MLOps Engineer

Institute Of Foundation Models

نشرت قبل أكثر من 30 يومًا

الخبرة

4 - 10 سنوات

موقع العمل

Abu Dhabi - United Arab Emirates

التعليم

أي تخرج()

الجنسية

أي جنسية

جنس

غير مذكور

عدد الشواغر

1 عدد الشواغر

الوصف الوظيفي

الأدوار والمسؤوليات

The Role

As a Senior MLOps Engineer, you will design, build, and maintain robust ML(Machine Learning) infrastructure across training, inference, and deployment pipelines. You will take ownership of the model lifecycle from data ingestion to real-time serving and ensure our LLM and speech models are deployed efficiently, securely, and reproducibly in Kubernetes-based environments.
This position requires deep hands-on experience with Kubernetes (EKS), Helm, AWS cloud infrastructure, and modern MLOps toolchains (e.g., vLLM, SGLang, OpenWebUI, Weights & Biases, MLflow). Familiarity with speech/voice AI frameworks like ElevenLabs, Whisper, and RVC is also valuable.
Key Responsibilities
    • Design and manage scalable ML infrastructure on AWS using EKS, EC2, RDS, S3, and IAM-based access control.
    • Build and maintain Kubernetes deployments for LLM and TTS inference using Helm, ArgoCD, and Prometheus/Grafana monitoring.
    • Implement and optimize model serving pipelines using vLLM, SGLang, TensorRT, or similar frameworks for high-throughput inference.
    • Develop CI/CD and MLOps automation for data versioning, model validation, and deployment (GitHub Actions, Jenkins, or AWS CodePipeline).
    • Integrate OpenWebUI, Gradio, or similar UIs for user-facing model demos and internal evaluation tools.
    • Collaborate with ML researchers to productize models including TTS (e.g., ElevenLabs API), ASR (Whisper), and LLM-based chat systems.
    • Ensure observability, cost optimization, and reliability of cloud resources across multiple environments.
    • Contribute to internal tools for dataset curation, model monitoring, and retraining pipelines.
    • Maintain infrastructure-as-code using Terraform and Helm charts for reproducibility and governance.
    • Support real-time multimodal workloads (voice, text, vision) across inference clusters.
Academic Qualifications
    • 4+ years of experience in MLOps, DevOps, or Cloud Infrastructure Engineering for ML systems.
    • Strong proficiency in Kubernetes, Helm, and container orchestration.
    • Experience deploying ML models via vLLM, SGLang, TensorRT, or Ray Serve.
    • Proficiency with AWS services (EKS, EC2, S3, RDS, CloudWatch, IAM).
    • Solid experience with Python, Docker, Git, and CI/CD pipelines.
    • Strong understanding of model lifecycle management, data pipelines, and observability tools (Grafana, Prometheus, Loki).
    • Excellent collaboration skills with ML researchers and software engineers.
Professional Experience Preferred
    • Extensive Experience with vLLM, K8s, Elevenlabs, Whisper, Gradio/OpenWebUI, or custom TTS/ASR model hosting.
    • Familiarity with multi-GPU scheduling, NCCL optimization, and HPC cluster integration.
    • Knowledge of security, cost management, and network policy in multi-tenant Kubernetes clusters and cloudflare systems.
    • Prior work in LLM deployment, fine-tuning pipelines, or foundation model research.
    • Exposure to data governance and responsible AI operations in research or enterprise settings.

القطاع المهني للشركة

المجال الوظيفي / القسم

الكلمات الرئيسية

  • Senior MLOps Engineer

تنويه: نوكري غلف هو مجرد منصة لجمع الباحثين عن عمل وأصحاب العمل معا. وينصح المتقدمون بالبحث في حسن نية صاحب العمل المحتمل بشكل مستقل. نحن لا نؤيد أي طلبات لدفع الأموال وننصح بشدة ضد تبادل المعلومات الشخصية أو المصرفية ذات الصلة. نوصي أيضا زيارة نصائح أمنية للمزيد من المعلومات. إذا كنت تشك في أي احتيال أو سوء تصرف ، راسلنا عبر البريد الإلكتروني abuse@naukrigulf.com

وظائف مماثلة

مهندس أقدم

Phars Films

  • 4 - 9 سنوات
  • دبي , أبو ظبي , الشارقة - الإمارات العربية المتحدة

مهندس ديفأوبس

مهندس حلول (البيانات والذكاء الاصطناعي)

Confidential Company

  • 8 - 15 سنوات
  • أبوظبي - الإمارات العربية المتحدة

Senior DevOps Engineer (Arabic Speakers)

عرض الكل