Manufacturing & Agriculture

Cleanlab ensures reliable data/models in industrial quality control applications (and more generally across enterprise computer vision, natural language processing, and statistical data mining). Use Cleanlab Studio to seamlessly handle mislabeled data, outliers/anomalies, drift, ambiguous instances, and other real-world data issues.
Hero Picture

Case StudyAutomatically Correcting Image Labels

14%
error improvement for ResNet computer vision model (without any change in modeling code)

Quote from Travis Tang, Data Scientist at Gojek:

I used an open-sourced library, cleanlab, to remove low-quality labels on an image dataset. The [ResNet] model trained on the dataset without low-quality data gained 4 percentage points of accuracy compared to the baseline model (trained on all data).

Gojek is an Indonesian on-demand multi-service platform and digital payment technology group based in Jakarta. Gojek was first established in Indonesia in 2009 as a call center to connect consumers to courier delivery and two-wheeled ride-hailing services.
Company Logo

Case StudyCleanlab Finds Errors in ImageNet

Label errors automatically detected by Cleanlab in ImageNet, the most famous image recognition dataset.

Browse other labeling errors detected by Cleanlab in famous ML benchmark datasets: labelerrors.com
Graph showing results achieved with Cleanlab on a real dataset

HOW CLEANLAB HELPS IMPROVE YOUR OUTPUT

Icon

Automatically identify and resolve data issues, and deploy robust ML models with a few clicks. Cleanlab Studio facilitates data-centric AI workflows in:

  • Agricultural applications: disease inspection, yield estimation, animal monitoring, as well as tasks involving grading and sorting.
  • Industrial quality control applications: ingredient inspection, process quality monitoring, assembly inspection, and defect detection.

Icon

Video on using Cleanlab Studio to find and fix incorrect labels for image data

Icon

Automatically detect outliers (anomalies) which may have an outsized impact on data-driven conclusions and should be handled with care.

Icon

Automatically detect low-quality images including those which are under/over-exposed, blurry, near duplicates, low-information, etc. Learn more.

Icon

Model images together with tabular (numeric, categorical) and text information.

Icon

Know which subset of the data is high-quality with confidence, and evaluate the quality of different data sources.

Icon

Effectively analyze data labeled by multiple annotators, and estimate which examples require additional review and which annotators are best/worst overall. Learn more.

Icon

Use our ActiveLab system (active learning with relabeling) to efficiently collect new labels for training accurate models.

Icon

Read more about why 2022 was the most exciting year in computer vision history and how Cleanlab fits into it.

Icon

Read more about why the foundations of AI are riddled with errors.