Reading Enhanced Product Matching


Description

The Unitail-OCR consists of an open-set gallery and a testing suite that operate product matching inside. All known categories are registered in the gallery. In case of a query product, the matching algorithms find the top ranked category in the gallery. The gallery contains 1454 fine-grained and one-shot product categories. Among these products, 10709 text regions and 7565 legible text transcriptions (words) are annotated. This enables the gallery to act as the training source and the matching reference.

The testing suite contains four components
(1) 3012 products labeled with 18972 text regions for text detection.
(2) Among the pre-localized text regions, 13416 legible word-level transcriptions for text recognition.
(3) 10k product samples from the 1454 categories for general evaluation on product matching.
(4) From the 10k products, we select 2.4k fine-grained samples (visually similar for humans) for hard-example evaluation on product matching.

Downloads