Dimosthenis Karatzas, email@example.com
Computer Vision Center, Autonomous University of Barcelona
Written information in the world around us is a fundamental cue for a multitude of everyday tasks. From shopping at the supermarket to finding our destination in an unknown urban space, written text helps us perform many tasks that would otherwise be much more complex.
Computer vision systems on the other hand, have been practically illiterate for the first half century of their lifetime. Specific research on reading systems has been going on for decades, but the semantic information that image text conveys was not incorporated to higher-level computer vision tasks until very recently. This is gradually changing, afforded by the great success achieved in the field of scene text recognition in recent years.
Through this short interactive course, doctoral students will have a chance to reconcile with the state of the art in reading systems, especially scene text recognition, and explore how image text enables us to tackle new and exciting computer vision tasks such as fine-grained image classification, cross-modal retrieval, captioning and visual question answering.
November 2021 (exact date TBA)
More information about this course will be available soon