Text Understanding in Visual Data
Text recognition (traditionally called OCR) is crucial for visual understanding and reasoning in many application scenarios.
We focus on Arbitrary-Shaped Text detection and Recognition and related Natual Language Processing tasks for understanding these text.
|
Z. Kuang, H. Sun, Z. Li, X. Yue, T.H. Lin, J. Chen, H. Wei, Y. Zhu, T. Gao, W. Zhang, K. Chen, W. Zhang and D. Lin.
MMOCR: A comprehensive toolbox for text detection, recognition and understanding.
Proc. ACM Multimedia (MM) Open Source Software Competition, 2021. [code]
An open-source toolbox for text detection, recognition and understanding.
|
Text Detection
|
X. Yue, Z. Kuang, Z. Zhang, Z. Chen, P. He, Y. Qiao and W. Zhang.
Boosting up scene text detectors with guided CNN.
British Machine Vision Conference (BMVC), 2018. (Oral presentation, acceptance rate: 6.5%) [paper]
A general framework for speeding up scene text detection. Demonstrated performance on two state-of-the-art methods, CTPN and EAST.
|
Text Recognition
Stats
Copyright © 2021 Wayne Zhang
|