Literate models for computer vision: Combining vision, language and reading

bg-new

Literate models for computer vision: Combining vision, language and reading

Lecturer

Dimosthenis Karatzas, dimos@cvc.uab.es

Lluis Gomez,

Ernest Valveny,

Andres Mafla,

Ali Biten,

Ruben Perez,

Sergi Garcia,

Organizer/s

Computer Vision Center, Autonomous University of Barcelona

Content and organization

Written information in the world around us is a fundamental cue for a multitude of everyday tasks. From shopping at the supermarket to finding our destination in an unknown urban space, written text helps us perform many tasks that would otherwise be much more complex.

Computer vision systems on the other hand, have been practically illiterate for the first half century of their lifetime. Specific research on reading systems has been going on for decades, but the semantic information that image text conveys was not incorporated to higher-level computer vision tasks until very recently. This is gradually changing, afforded by the great success achieved in the field of scene text recognition in recent years.

Through this short interactive course, doctoral students will have a chance to reconcile with the state of the art in reading systems, especially scene text recognition, and explore how image text enables us to tackle new and exciting computer vision tasks such as fine-grained image classification, cross-modal retrieval, captioning and visual question answering.

Level

MSc / PhD

Course Duration

4 hours

Course Type

Short Course

Participation terms

Both AIDA and non-AIDA students are encouraged to participate in this short course.

If you are an AIDA Student* already, please:
Step (a): Register in the course by filing in the form at the Web site of the course.

AND

Step (b): Enroll in the same course in the AIDA system using the “Enroll on this Course” button below, so that this course enters your AIDA Certificate of Course Attendance.

If you are not an AIDA Student do only step (a).

*The International AI Doctoral Academy (AIDA) has 73 members, which are top AI Universities, Research centers and Industries: https://www.i-aida.org/

AIDA Students should have been registered in the AIDA system already (they are PhD students or PostDocs that belong only to the AIDA Members listed in this page: Members)

Language

English

Modality (online/in person):

Hybrid

Notes

The Course will take place in hybrid mode. Onsite attendance is possible, prior confirmation, at the Computer Vision Centre, Barcelona. There is limited availability for onsite attendants. If you are interested to attend onsite, please indicate so during registration and expect to receive a confirmation. Online link will be provided by the Lecturer after registration/enrollment. Basic deep learning knowledge is expected. To participate in the demo session you should be able to use Google CoLab and have basic understanding of PyTorch / TensorFlow.

Other short courses

10. 04. 2024 Go

Ethics & STICs

01. 03. 2024 Go

Computer Vision

24. 11. 2023 Go

Human Rights Toolbox

21. 02. 2023 Go

Computer Vision

11. 05. 2022 Go

Geometric learning

05. 04. 2022 Go

Computer Graphics

04. 04. 2022 Go

Bayesian Learning

02. 04. 2022 Go

Computer Graphics

31. 03. 2022 Go

Web of Data

28. 03. 2022 Go

Machine Learning

27. 03. 2022 Go

Machine Learning

02. 03. 2022 Go

Player Modeling

28. 02. 2022 Go

Player Modeling

21. 02. 2022 Go

Affective Computing

21. 02. 2022 Go

Machine Listening

21. 02. 2022 Go

Computer Vision

21. 02. 2022 Go

Computer Vision

21. 02. 2022 Go

Self-Driving Cars

21. 02. 2022 Go

Deep Learning

21. 02. 2022 Go

Deep Learning 2

09. 07. 2021 Go

Self-Driving Cars

09. 07. 2021 Go

Computer Vision

09. 07. 2021 Go

Deep Learning

17. 06. 2021 Go

Deep Learning School

17. 06. 2021 Go

Memory Network

02. 06. 2021 Go

Machine Listening

02. 06. 2021 Go

Affective Computing

02. 06. 2021 Go

Deep Learning 2

01. 06. 2021 Go

Computer Vision

Cookie Settings

A AIDA - AI Doctoral Academy may use cookies to remember your login data, collect statistics to optimize the functionality of the site and to perform marketing actions based on your interests.


These cookies are necessary to allow the main functionality of the website and are automatically activated when you use this website.
These cookies allow us to analyze the use of the website, so that we can measure and improve its performance.
Allow you to stay in touch with your social network, share content, send and post comments.

Required Cookies They allow you to personalize the commercial offers that are presented to you, directing them to your interests. They can be own or third party cookies. We warn you that, even if you do not accept these cookies, you will receive commercial offers, but without meeting your preferences.

Functional Cookies They offer a more personalized and complete experience, allow you to save preferences, show you content relevant to your taste and send you the alerts you have requested.

Advertising Cookies Allow you to stay in touch with your social network, share content, send and post comments.