Sievesievedata.com

Forward Deployed Engineer

San Francisco, CAIn-personFull-Time1 - 3+ Years

About

About Sieve

Sieve works with frontier AI labs and enterprise customers on highly specific dataset problems, building custom algorithms, models, and data pipelines at scale. The company focuses on video, audio, and multimodal data processing for AI training and evaluation. The team is four people and moving fast to scale deployments across leading AI labs.

About the role

You'll own end-to-end dataset projects for customers, from untangling ambiguous requirements through shipping production systems that find, generate, filter, transform, evaluate, and package high-quality datasets. This is a high-agency role working directly with customers and internal teams, combining research prototypes with reliable production pipelines. You'll ship fast, move between different technical domains within each project, and own customer outcomes directly.

What you'll own

Work directly with customers to translate ambiguous dataset needs into concrete technical systems and delivery timelines
Build custom algorithms, models, and large-scale data pipelines spanning computer vision, audio processing, text processing, and metadata analysis
Move between research prototypes and production systems, using models and APIs creatively to solve customer problems
Break down customer-level goals into the models, heuristics, infrastructure, and QA steps needed to deliver
Optimize performance through pre/post-processing, parallelism, inference optimization, fine-tuning, and evaluation loops

Requirements

Must-have

Strong Python developer with hands-on experience building custom algorithms, model workflows, or large-scale data pipelines
Comfortable working directly with customers or external teams to translate ambiguous needs into technical systems
Deep intuition for dataset quality, filtering, labeling, evaluation, and edge cases
Able to move quickly between research prototypes and reliable production systems without creating brittle code
1 to 3 years of experience shipping technical work in a startup or high-velocity environment

Nice-to-have

Experience building custom algorithms or ML workflows for production video, audio, or multimodal data
Hands-on work with large-scale data pipelines at scale
Background with PyTorch or similar ML frameworks in production
Active contributor to open source projects
Early hire experience at a startup

Benefits & perks

401k
Full Health Insurance
Breakfast, Lunch, and Dinner covered
Choice of snacks
Ubers covered home
Competitive Equity

Interview process

1Application Review
2Initial Screen
3Technical Chat
4Chat
5On Site
6Offer
7Hired

Drop your CV for this role.

One PDF and your email. We read it, score your fit for this role at Sieve, and route the introduction through us.