← All open roles
Sieve logoSievesievedata.com

Forward Deployed Engineer

San Francisco, CAIn-personFull-Time1 - 3+ Years

About

About Sieve

Sieve works with frontier AI labs and enterprise customers on highly specific dataset problems, building custom algorithms, models, and data pipelines at scale. The company focuses on video, audio, and multimodal data processing for AI training and evaluation. The team is four people and moving fast to scale deployments across leading AI labs.

About the role

You'll own end-to-end dataset projects for customers, from untangling ambiguous requirements through shipping production systems that find, generate, filter, transform, evaluate, and package high-quality datasets. This is a high-agency role working directly with customers and internal teams, combining research prototypes with reliable production pipelines. You'll ship fast, move between different technical domains within each project, and own customer outcomes directly.

What you'll own

  • Work directly with customers to translate ambiguous dataset needs into concrete technical systems and delivery timelines
  • Build custom algorithms, models, and large-scale data pipelines spanning computer vision, audio processing, text processing, and metadata analysis
  • Move between research prototypes and production systems, using models and APIs creatively to solve customer problems
  • Break down customer-level goals into the models, heuristics, infrastructure, and QA steps needed to deliver
  • Optimize performance through pre/post-processing, parallelism, inference optimization, fine-tuning, and evaluation loops

Requirements

Must-have

  • Strong Python developer with hands-on experience building custom algorithms, model workflows, or large-scale data pipelines
  • Comfortable working directly with customers or external teams to translate ambiguous needs into technical systems
  • Deep intuition for dataset quality, filtering, labeling, evaluation, and edge cases
  • Able to move quickly between research prototypes and reliable production systems without creating brittle code
  • 1 to 3 years of experience shipping technical work in a startup or high-velocity environment

Nice-to-have

  • Experience building custom algorithms or ML workflows for production video, audio, or multimodal data
  • Hands-on work with large-scale data pipelines at scale
  • Background with PyTorch or similar ML frameworks in production
  • Active contributor to open source projects
  • Early hire experience at a startup

Benefits & perks

  • 401k
  • Full Health Insurance
  • Breakfast, Lunch, and Dinner covered
  • Choice of snacks
  • Ubers covered home
  • Competitive Equity

Interview process

  1. 1Application Review
  2. 2Initial Screen
  3. 3Technical Chat
  4. 4Chat
  5. 5On Site
  6. 6Offer
  7. 7Hired

Drop your CV for this role.

One PDF and your email. We read it, score your fit for this role at Sieve, and route the introduction through us.

How should we use your CV?

Free for engineers, always. By applying you agree to roles.cc holding your CV to match you. Sieve never sees your identity until you have agreed to an introduction.