Cleanlab for E-Commerce and Retail
Case StudyDetecting product categorization errors and mitigating their business impact
Case StudyPing An Insurance
If the classifier is trained with these noisy images directly, its performance could be degraded. In view of this, we attempted to find label errors in the image dataset with an open source tool cleanlab, a framework powered by the theory of confident learning. Specifically, we trained multiple ResNet50 image classifiers to compute the predicted product category probabilities for all the training samples in a cross-validation manner. Then the cleanlab tool could utilize the matrix of predicted probabilities to find noisy samples, ordered by likelihood of being an error. We removed the top 10% noisy samples from the training set.
HOW CLEANLAB CAN HELP YOUR BUSINESS
Videos on using Cleanlab Studio to find and fix incorrect labels for:
- product reviews (text data)
- product categories (image data)
Detect errors in product descriptions/categorizations. Learn more.
Detect low-quality product images. Learn more.
Better estimate the true quality of product from noisy reviews.
Cleanlab Studio enables data-centric AI to build accurate ML models for messy real-world tabular or text data. You can effortlessly harness AutoML for various data types, including text, image, and tabular formats (Excel, CSV, Json), allowing you to focus on the most important aspect: the data. Learn more about Cleanlab:
Related applications
Customer Service
Reduce cost/time to improve customer service entity recognition and ...
Business Intelligence / Analytics
Correct data errors for more accurate analytics/modeling enabling better decisions.
Data Entry, Management, and Curation
AI expert review of your data stores to find errors or incorrect labels.
Content Moderation
Train more accurate content moderation models in less time.
Foundation and Large Language Models
Boost fine-tuning accuracy and reduce time spent
Data Annotation & Crowdsourcing
Label data efficiently and accurately, understand annotator quality.