• Startup Job BoardStartup Job Board
  • Weekly Job DropWeekly Job Drop
  • Axira AIAxira AI
  • PricingPricing
  • Career ResourcesCareer Resources
  • Why Candidates Choose UsWhy Candidates Choose Us
  • Compensation CalculatorCompensation Calculator
  • Hiring ResourcesHiring Resources
  • Why Founders Choose UsWhy Founders Choose Us
  • Screen Candidates with AIScreen Candidates with AI
  • ArticlesArticles
logo of foundersarehiring
  • Startup Job Board
  • For Founders
  • For Candidates
  • Axira AI
  • Pricing
Submit Job
foundersarehiring wordmark

Connect directly with founders who are building the future.

For Job Seekers

  • Browse Jobs
  • Weekly Job Drop
  • Career Resources
  • Why Candidates Choose Us
  • Get Axira Certified
  • Compensation Calculator

For Founders

  • Post a Job
  • Pricing
  • Hiring Resources
  • Why Founders Choose Us
  • Screen Candidates with AI
  • Salary & Equity Trade-Offs

Company

  • About Us
  • Frequently asked questions
  • Contact Us
  • Articles

Join a high-growth startup or land remote work in tech, marketing, or product with FoundersAreHiring (FAH). We connect top talent directly with founders in AI, SaaS, fintech, edtech, and healthtech. All jobs are vetted, equity is transparent, and weekly curated drops cut the noise. Axira AI screening lets you prove skills and culture fit fast-private, recruiter-free, and with a shareable certificate.

© 2025 FoundersAreHiring. All rights reserved.

Build v0.15.0

Axira AI ™ is a trademark. Used under license by FoundersAreHiring.

Legal & PoliciesPrivacy PolicyTerms of ServiceCookie PolicyTakedown PolicyGuidelines

Systems Engineer - Advanced Orchestration

US / Remote (US)
C

Cedana

Startup
Category
Technical
Experience
8+ years
Type
Full-Time
Location
US / Remote (US)
Salary
$120,000 - $150,000
Job Description
Benefits & Culture
Flexible hours
Remote work
Paid time off
Fast-paced environment
Direct feedback
Growth opportunities
Core Values
Innovation
Collaboration
Excellence
Reliability
This goes straight to the founder
Share on: TwitterFacebookLinkedInHacker NewsReddit

No recruiters. Founders reply directly.

Founder Signals
Response Rate
85%
Avg Response Time
2 hours
Compensation Calculator
In startups, higher salary usually means lower equity.
Salary (affects equity %)
Link salary ↔ equity
$125,000
$120,000$150,000
Equity % (auto from trade-off model)
1.08%
0%5%
Projected Exit Value
$350,000,000
$10M$500M
Live Trade-off
Salary
Equity %
Est. Equity Value at Exit
$100,000
1.19%
$2,082,500
$125,000
1.08%
$1,884,167
$150,000
0.51%
$892,500
$175,000
0.51%
$892,500

Advanced Salary & Equity Calculator

Similar Jobs

Systems Engineer - Advanced Orchestration
Cedana
$120000 - $150000
Chief of Staff
SnapMagic
$130000 - $200000
Founding Full Stack Engineer
Text.ai
$100000 - $150000

Related Jobs

Machine Learning Intern Summer 2026
Bindwell
$6500 - $10000
Founding AI Engineer (Go, Typescript, Kubernetes, Docker)
Glasskube
$50000 - $75000
Machine Learning Engineer
Prosper AI
0
Machine Learning Scientist
UC San Diego
$90000 - $110000
Share on: TwitterFacebookLinkedInHacker NewsReddit

About the role

At Cedana, we are solving what many thought was impossible: the seamless, live migration of active CPU+GPU containers across global compute.

We're building the next generation of AI orchestration systems, founded on our pioneering work in checkpoint/restore technology. This isn't just an incremental improvement; it's a fundamental shift that makes distributed computing truly portable, elastic, and resilient across planet scale compute. This is an exceptionally difficult systems problem that requires a rare combination of kernel engineering, distributed systems design, and a relentless pursuit of perfection.

We’re backed by leading investors, including a co-founder of OpenAI, the former Chief Architect of Slack, the founding team of Meta AI, YC, Initialized Capital, and Garry Tan. To achieve our mission, we’re looking for brilliant systems engineers the kind who are obsessed with understanding how computing works from the silicon up. We’re looking for systems engineers who live deep in the container stack and understand Kubernetes beyond just the surface.

If you thrive on solving deep, complex problems in uncharted territory, we invite you to join us.

What You Will Do

As a core member of our engineering team, you will build and fortify the "magic" that powers our platform. You will operate across the entire compute stack, from the Linux kernel to our managed Kubernetes offering, to deliver a product that is both powerful and exceptionally reliable.

  1. Design and Build New Orchestration Primitives: Architect and implement core components of our system, leveraging our unique insights into checkpointing, virtualization, and container orchestration to create capabilities that don't exist anywhere else. Design and implement novel scheduling and resource management capabilities by integrating our core checkpoint/restore engine directly into the control planes of Kubernetes, SLURM, and other orchestrators.
  2. Engineer Unbreakable Reliability: Enhance the stability and performance of our entire system, from kernel-level interactions and hypervisor optimizations to our managed Kubernetes cloud platform. Dive deep into the Linux kernel, container runtimes, and hypervisors to ensure our live migration capability is bulletproof.
  3. Partner with Customers: Work directly with customers to solve their most complex infrastructure challenges, acting as a trusted technical partner and gathering insights that drive our product roadmap.
  4. Develop Sophisticated Tooling: Build and refine our internal observability and alerting infrastructure to proactively identify and resolve issues anywhere in the stack, ensuring our systems meet the highest standards of performance and availability.

Who You Are

You aren't a traditional full-stack developer. You are driven by a deep curiosity to understand every layer of the technology you work with. You have a track record of solving challenging problems in complex systems and a passion for building robust, high-performance infrastructure.

  1. A Systems Thinker: You have the intellectual bandwidth and desire to learn the full compute stack, from hardware and device drivers to the OS kernel, container runtimes, and distributed systems.
  2. A Creative Problem-Solver: You possess a history of tackling difficult technical challenges, perhaps in compilers, distributed systems, embedded systems, or highly available platforms.
  3. A Proven Collaborator: You have a demonstrated ability to work effectively with a team of high-caliber engineers to achieve ambitious goals.
  4. Intellectually Fearless: You are energized, not intimidated, by problems that have no known solutions. The prospect of building something that has never been built before is your primary motivator.

Required Experience

  1. Deep Understanding of Concurrency and Distributed Systems: Strong grasp of the theoretical and practical challenges of building distributed systems, including concurrency control, multi-threading, pre-emption, and resource contention. You can reason about race conditions, deadlocks, and consistency models from first principles.
  2. Mastery of Systems Programming: You have demonstrable, expert-level proficiency in C for kernel-level work and either Go or Rust for building high-performance, concurrent services. Python for integrating with existing orchestration frameworks. You are not just a user of these languages; you understand their memory models, concurrency primitives, and how they translate to machine code.
  3. Linux & Container Internals: You possess a fundamental understanding of Linux/UNIX (system libraries, services, networking, kernel/user-space interaction) and containerization tech (containerd/cri-o, runc, cgroups, namespaces, seccomp).
  4. Orchestrator Internals: Understanding of fairshare principles, including multifactor priority, fairshare decay, and QOS management.
  5. HPC & GPU Workloads: Deployed or managed GPU workloads under SLURM, with knowledge of workload isolation and accelerator resource accounting.
  6. Understanding of Networking: You understand how packets flow in Kubernetes, and have hacked around or deployed tooling like CNI, Cilium, and/or Istio.
  7. Production Experience and On-call Ready: You have hands-on experience scaling infrastructure, managing production-level Kubernetes clusters, and working with infrastructure-as-code tools like Helm and Terraform. You understand the importance of reliability and are familiar with being on-call. Our founders have extensive on-call experience and are committed to building a sane, sustainable rotation.

Bonus Points If You Have

  1. Contributed to open-source projects like Kubernetes, containerd, or the Linux kernel.
  2. Experience with virtualization in Kubernetes, like KubeVirt or Kata.
  3. Experience checkpointing and restoring jobs within SLURM (e.g., DMTCP, BLCR, CRIU).
  4. Experience writing SLURM plugins (e.g., sched, job_submit, prolog/epilog), or extending SLURM behavior via Lua or C.
  5. Worked on multi-cluster or federated SLURM setups.
  6. Built tooling to bridge SLURM and Kubernetes or run mixed workload environments.
  7. Contributed to open-source schedulers or job systems (SLURM, Flux, Torque, PBS, etc).
  8. Familiarity with HPC environments (SLURM, MPI, RDMA) or GPU-centric Kubernetes tooling (Kueue, KubeFlow, KServe).
  9. A passion for debugging weird kernel panics just as much as you enjoy writing elegant Go or Rust code.
  10. Experience leading teams or mentoring other engineers in a remote environment.
  11. Written your own container runtime!