Skip to content

AI Infra Engineer

Israel · Full-time · Intermediate

About The Position

About ScaleOps

ScaleOps, the leader in real-time automated cloud resource management, is revolutionizing how DevOps teams manage their cloud-native application infrastructures. Backed by venture capital and software industry titans, ScaleOps’ platform removes the organizational friction between application owners and DevOps teams by fully automating the resource management process to meet real-time demand. 

The ScaleOps platform dynamically manages the application’s resource allocation, eliminating the need for manual intervention. The result is improved application performance, 60%- 80% cloud cost savings, and a fully automated allocation process. 

With well over $210 million in backing, ScaleOps has seen tremendous business growth, attracting global industry leaders to its customer base. ScaleOps automatically manages the production environments of over 50 enterprises, including Adobe, Salseorce,Wiz, Docusign, EA (EA Sports), Coupa.

We're looking for an AI Infra Engineer who lives at the intersection of LLM systems and Kubernetes-native infrastructure. You'll be building the AI backbone of ScaleOps - from internal agentic platforms and LLM-powered automation to the inference infrastructure that powers our product and customers.

This is not a research role. You'll ship production systems, own reliability, and work closely with R&D, Product, and the field teams building the future of AI infrastructure automation.

What You Will Be Doing

  • Join a team of top-notch and skilled professionals building our core product offering.
  • Build a world-class Kubernetes resource management platform with a focus on GPU and AI workload optimization - improving the productivity of DevOps, ML Infra, and Platform teams across the world
  • Enjoy a high degree of autonomy and responsibility.

Requirements

  • A top-talent software engineer with significant hands-on experience in developing cloud-based applications and AI/ML infrastructure
  • 4+ years of industry experience in developing complex, technically demanding, and large-scale systems.
  • Solid understanding of cloud-native technologies: Kubernetes, containers, GPU node pools, and how AI workloads behave differently from CPU-bound services
  • Experience with LLM serving framework or agentic frameworks (LangGraph, LangChain, PydanticAI).
  • A degree in Computer Science from a top academic institute, or relevant experience as a Software Engineer in an IDF technological unit
  • Someone who enjoys taking ownership and collaborating with others to build a world-class product
  • You will be able to highly impact the company's future and work on the core product while collaborating with top-tier engineers and enjoying a vibrant culture of innovation.

Apply for this position