ML Engineer

At Cleanlab you’ll get to

Spend 100% of your time writing open-source ML, with guidance from MIT PhDs who are prominent in ML research. Get paid a Silicon Valley engineer salary but have the ability to work remotely from wherever you want. And the cherry on top? You get to work with a modern tech stack (the latest tooling and ML models) at a dynamic startup to pioneer the rapidly growing field of data-centric AI.

What we’re looking for

First and foremost, strong software engineering skills and experience productionizing your code. You should be comfortable with large software systems and love building applications.

Contributed to the open-source community for more than 4 years, either via your own open-source package or via substantial contributions to existing open-source repositories, written blog posts, and tutorials.

Experience working in ML: processing data, deploying models, etc. We do not require deep theoretical ML expertise, but practical experience in ML projects is desired.

Responsibilities

  • Develop and contribute novel data-centric AI algorithms to Cleanlab’s open-source projects, via high quality implementations and maintaining the highest coding standards.
  • Be a good member of the open-source community (address Github issues and queries, review PRs, write crisp documentation/tutorials, help integrate with other packages).
  • Work on applied ML projects with Cleanlab enterprise customers.
  • Collaborate with other engineers to build large-scale systems and help establish a strong engineering culture across the company.

Qualifications

We select candidates based on strengths, not on weaknesses. Experience with the following is highly recommended, but not required:

  • Python, NumPy
  • pandas, scikit-learn
  • PyTorch/PyTorch Lightning or TensorFlow + Keras

Bonus:

  • MLOps Experience with model deployment and monitoring (MLflow, Sagemaker, etc.)
  • Databases and ETL (Postgres, etc.)
  • Cleanlab or other data-centric AI tools
  • AutoGluon, H2O, or other AutoML tools
  • Tableau, DataBricks, Alteryx, Trifacta
  • Hugging Face, Snorkel, Weights and Biases, Gradio, Streamlit

Benefits

Working at Cleanlab is awesome! Beyond the opportunity to work at a well-funded (backed by Bain Capital Ventures) early stage AI tech company with an incredible, friendly founding team of MIT, Stanford, and Harvard graduates, all full-time employees receive the following:

  • $9,000 per year travel benefit
    • Travel enhances our empathy with different cultures and enables us to work together more effectively. It’s how we grow and learn: traveling is an essential part of what makes us human. At Cleanlab, every two months you will receive a $1500 reimbursable travel benefit (resets on Jan 1, March 1, May 1, July 1, Sep 1, Nov 1). This is a unique benefit that lets you work from Paris for a week in February, then take a backpacking trip in the Andes for a weekend in March. Cleanlab will cover the flight for your partner or friend, too, as long as you attend and its within the $1500 / two-month period. For remote employees, you can use this benefit to come work with us in Boston/SF from time to time (encouraged, but not required).
  • Premium health insurance
    • We provide a fantastic $4 (we cover the rest) health insurance option. We also provide a $0 deductible 100% coverage premium health care option for those who prefer the best health insurance.
  • Stipend for attending conferences to keep up with the latest innovations in ML and software.
  • Competitive salary (+ equity offering for certain roles), with regular opportunities for a raise if things are going well.

About Us

Prior to Cleanlab, our founders (3 ML PhDs from MIT) worked at OpenAI, Google, Microsoft, Amazon, AWS, Facebook AI Research (FAIR), Dropbox, Oculus, Palantir, NASA, General Electric, MIT Lincoln Laboratory, MIT, Harvard, and Stanford – at every place we worked we repeatedly encountered the same issue – AI solutions failed to work reliably on real-world, human-centric data due to label errors and poor data quality. So, we spent eight years of PhD research at MIT inventing a new field to solve this problem and after successful pilots with world-leading organizations, Cleanlab emerged.

Everything we do at Cleanlab is guided by our north star – to improve the world’s ML data more easily and quicker than any other solution – enabling AI systems to train more reliably on real-world, messy, error-prone data. We develop next-generation data-centric AI, open-source algorithms and provide no-code SaaS enterprise solutions to help individuals and teams at companies (across all industries) diagnose/fix issues in their datasets and produce more reliable ML models by providing clean labels for training.

Cleanlab is a well-funded early-stage startup that is rapidly growing to transform the future of data-centric AI. Some of Cleanlab’s early work (while the company was still in stealth-mode) has been featured in various media such as: Wired, MIT Technology Review, and VentureBeat.

While many companies can help store/manage data or develop ML models, there exist few solutions today to improve the quality of existing data, which is the core asset of the modern enterprise. This is where you come in. At Cleanlab, you’ll be able to take ownership of critical projects that pioneer the future of data-centric AI.

We are a remote-first company, with roughly half of our team located near Boston, MA (EST time) and the other half located near San Francisco, CA (PST time).

  • Read about the Cleanlab team here.
  • Read how Cleanlab went from MIT PhD research to tech used by Amazon, Google, etc here.
  • See what Google, Wells Fargo, and other Cleanlab users think here.

How to Apply