
Machine Learning for Computer Vision
Code: 44774 ECTS Credits: 6| Degree | Type | Year |
|---|---|---|
| Computer Vision | OB | 1 |
Contact
- Name:
- Maria Vanrell Martorell
- Email:
- maria.vanrell@uab.cat
Teachers
- Ramon Baldrich Caselles
- Fernando Luis Vilariño Freire
- Dimosthenis Karatzas
- Pau Rodriguez Lopez
- Carlos Boned Riera
- Guillem Arias Bedmar
- Luis Gomez Bigorda
Teaching groups languages
You can view this information at the end of this document.
Prerequisites
- Degree in Engineering, Maths, Physics or similar.
- Programming Skills in Python.
Objectives and Contextualisation
Module Coordinator: Dr. Ramon Baldrich Caselles
The objective of this module is to introduce the Machine learning techniques for solving computer vision problems. Machine learning deals with the automatic analisys of large scale data. Nowadays it conforms the basics of many computer vision methods, specially those related to visual pattern recognition or classification, where 'patterns' encompasses images of world objects, scenes and video sequences of human actions, to name a few.
This module presents the foundations and most important techniques for the classification of visual patterns, mainly focusing on supervised methods. Also, related topics like image descriptors and dimensionality reduction are addressed. As much as possible, all these techniques are tried and assessed on a practical project concerning scene description from pictures, toghether with the standard metrics and procedures for performance evaluation like precision-recall curves and k-fold cross-validation.
The learning outcomes are:
(a) Distinguish the main types of ML techniques for computer vision: supervised vs. unsupervised, generative vs. discriminative, original feature space vs. feature vector kernelization.
(b) Know the strong and weak points of the different methods, in part learned while solving a real pattern classification problem.
(c) Being able to use existing method implementations and build them from scratch.
The module goes in depth in two main approches to introduce ML into the image classification problem. Using: a) handcrafted image description, b) data driven image description. On the first case the Bag of Words is used, on the second one, the Deep Learning approach. The DL content is developed extensively providing both, thoretical basis of the different parts of modern Neural Networs acrhitecutres, and best practices to apply it on real applications.
Learning Outcomes
- CA06 (Competence) Achieve the objectives of a project of vision carried out in a team.
- KA03 (Knowledge) Identify the computational learning methods that can be used based on the data to solve a problem of vision.
- KA10 (Knowledge) Select the best experimentation procedures to be designed for computational learning from training to evaluation.
- KA16 (Knowledge) Recognise the ethical, gender and environmental dimensions of systems of vision and their application.
- SA03 (Skill) Apply and evaluate computational learning techniques to solve a specific problem.
- SA13 (Skill) Calculate the carbon footprint for any experiment that requires training a deep neural network.
- SA14 (Skill) Detect bias in learning data sets which allow the construction of systems that are socially discriminatory to be avoided.
- SA17 (Skill) Prepare oral presentations that allow debate of the results of a project of vision.
Content
- Introduction to machine learning
- Experimental Setup
- Embeddings: SVM and Random Forest
- Introduction to Neural Networks
- Introduction to Deep Learning
- Convolutional Neural Networks
- Training: data pre-processing, initialization, gradient optimization
- Image Classification
- Understanding and visualizing CNNs
- Efficient methods for Deep Learning
Activities and Methodology
| Title | Hours | ECTS | Learning Outcomes |
|---|---|---|---|
| Type: Directed | |||
| Lecture sessions | 20 | 0.8 | CA06, KA03, KA10, KA16, SA03, SA13, SA14, SA17, CA06 |
| Type: Supervised | |||
| Project follow-up sessions | 8 | 0.32 | CA06, KA03, KA10, KA16, SA03, SA13, SA14, SA17, CA06 |
| Type: Autonomous | |||
| Homework | 113 | 4.52 | CA06, KA03, KA10, KA16, SA03, SA13, SA14, SA17, CA06 |
Supervised sessions: (Some of these sessions could be Synchronous on-line sessions)
- Lecture Sessions, where the lecturers will explain general contents about the topics. Some of them will be used to solve the problems.
Directed sessions:
- Project Sessions, where the problems and goals of the projects will be presented and discussed, students will interact with the project coordinator about problems and ideas on solving the project (approx. 1 hour/week)
- Presentation Session, where the students give an oral presentation about how they have solved the project and a demo of the results.
- Exam Session, where the students are evaluated individually. Knowledge achievements and problem-solving skills
Autonomous work:
- Student will autonomously study and work with the materials derived from the lectures.
- Student will work in groups to solve the problems of the projects with deliverables:
- Code
- Reports
- Oral presentations
Annotation: Within the schedule set by the centre or degree programme, 15 minutes of one class will be reserved for students to evaluate their lecturers and their courses or modules through questionnaires.
Assessment
Continous Assessment Activities
| Title | Weighting | Hours | ECTS | Learning Outcomes |
|---|---|---|---|---|
| Exam | 0.4 | 2.5 | 0.1 | KA03, KA10, KA16, SA03 |
| Project | 0.55 | 6 | 0.24 | CA06, KA03, KA10, KA16, SA03, SA13, SA14, SA17 |
| Session attendance | 0.05 | 0.5 | 0.02 | CA06, KA03, KA10, KA16, SA03, SA13, SA14, SA17 |
The final marks for this module will be computed with the following formula:
Final Mark = 0.4 x Exam + 0.55 x Project+ 0.05 x Attendance
where,
Exam: is the mark obtained in the Module Exam (must be >= 3).
Attendance: is the mark derived from the control of attendance at lectures (minimum 70%)
Projects: is the mark provided by the project coordinator based on the weekly follow-up of the project and deliverables (must be >= 5). All accordingly with specific criteria such as:
- Participation in discussion sessions and in team work (inter-member evaluations)
- Delivery of mandatory and optional exercises.
- Code development (style, comments, etc.)
- Report (justification of the decisions in your project development)
- Presentation (Talk and demonstrations on your project)
Only those students that fail (Final Mark < 5.0) can do a retake exam.
Bibliography
Journal papers:
- Barber, D. “Bayesian Reasoning and Machine Learning”. Cambridge University Press, 2012.
- Yoshua Bengio. “Learning Deep Architectures for AI”. Foundations and Trends in Machine Learning, Vol. 2, No. 1, 2009.
- Christopher J. C. Burges. “Dimension Reduction: A Guided Tour”. Foundations and Trends in Machine Learning, Vol. 2, No. 4, 2009.
- Christoph H. Lampert. “Kernel Methods in Computer Vision”. Foundations and Trends in Computer Graphics and Vision, Vol. 4, No. 3, 2008.
- Tinne Tuytelaars and Krystian Mikolajczyk. “Local Invariant Feature Detectors: A Survey”. Foundations and Trends in Computer Graphics and Vision, Vol. 3, No. 3, 2007.
Books:
- Ian Goodfellow, Yoshua Bengio and Aaron Courville. “Deep Learning”. 2016. Cambridge, MA, USA: The MIT Press. ISBN: 978-0262035613
- Mehryar Mohri, Afshin Rostamizadeh, and Ameet Talwalkar, “Foundations of Machine Learning”
MIT Press, 2012. http://www.cs.nyu.edu/~mohri/mlbook/ - Z.H. Zhou. Ensemble Methods: Foundations and Algorithms. Chapman & Hall/CRC, 2012.
Reports:
- Criminisi, A. and Shotton, J. and Konukoglu, E. “Decision Forests for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning”. Technical report MSR-TR-2011-114. Microsoft Research, 2011. http://research.microsoft.com/pubs/155552/decisionForests_MSR_TR_2011_114.pdf
Software
Groups and Languages
Please note that this information is provisional until 30 November 2025. You can check it through this link. To consult the language you will need to enter the CODE of the subject.
| Name | Group | Language | Semester | Turn |
|---|---|---|---|---|
| (PLABm) Practical laboratories (master) | 1 | English | first semester | morning-mixed |
| (PLABm) Practical laboratories (master) | 2 | English | first semester | morning-mixed |
| (TEm) Theory (master) | 1 | English | first semester | morning-mixed |