Manufacturing & Agriculture
Case StudyAutomatically Correcting Image Labels
I used an open-sourced library, cleanlab, to remove low-quality labels on an image dataset. The [ResNet] model trained on the dataset without low-quality data gained 4 percentage points of accuracy compared to the baseline model (trained on all data).
Case StudyCleanlab Finds Errors in ImageNet
HOW CLEANLAB HELPS IMPROVE YOUR OUTPUT
Automatically identify and resolve data issues, and deploy robust ML models with a few clicks. Cleanlab Studio facilitates data-centric AI workflows in:
- Agricultural applications: disease inspection, yield estimation, animal monitoring, as well as tasks involving grading and sorting.
- Industrial quality control applications: ingredient inspection, process quality monitoring, assembly inspection, and defect detection.
Video on using Cleanlab Studio to find and fix incorrect labels for image data
Automatically detect outliers (anomalies) which may have an outsized impact on data-driven conclusions and should be handled with care.
Automatically detect low-quality images including those which are under/over-exposed, blurry, near duplicates, low-information, etc. Learn more.
Model images together with tabular (numeric, categorical) and text information.
Know which subset of the data is high-quality with confidence, and evaluate the quality of different data sources.
Effectively analyze data labeled by multiple annotators, and estimate which examples require additional review and which annotators are best/worst overall. Learn more.
Use our ActiveLab system (active learning with relabeling) to efficiently collect new labels for training accurate models.
Read more about why 2022 was the most exciting year in computer vision history and how Cleanlab fits into it.
Read more about why the foundations of AI are riddled with errors.
Related applications
Data Entry, Management, and Curation
AI expert review of your data stores to find errors or incorrect labels.
Business Intelligence / Analytics
Correct data errors for more accurate analytics/modeling enabling better decisions.
Data Annotation & Crowdsourcing
Label data efficiently and accurately, understand annotator quality.