STATISTICAL INFERENCE AND LEARNING

Academic year
2023/2024 Syllabus of previous years
Official course title
STATISTICAL INFERENCE AND LEARNING
Course code
CM0471 (AF:398329 AR:215034)
Modality
On campus classes
ECTS credits
6
Degree level
Master's Degree Programme (DM270)
Educational sector code
SECS-S/01
Period
1st Semester
Course year
2
Where
VENEZIA
Moodle
Go to Moodle page
This course belongs to the educational activities of the Master in Computer Science that allow the student to acquire advanced instruments for data analysis and machine learning. The objective of the course is to develop statistical skills for the analysis of high dimensional data and for solving forecasting and classification problems occurring in a wide variety fields, including technological, scientific, biomedical, economic and business fields.
Regular and active participation in the teaching activities offered by the course and in independent research activities will enable students to:
1. (knowledge and understanding)
- know and understand advanced statistical learning methods for synthesis, prediction and classification
2. (applying knowledge and understanding)
- autonomously apply advanced statistical methods to synthetize information, make predictions and classifications even with data characterized by high-dimensionality
- autonomously use statistical software to analyse datasets characterized by high dimensionality
3. (making judgements)
- formulate autonomous judgements on the validity and feasibility of different statistical techniques and understand their impact on the results of the analyses
It is assumed that students have achieved the educational objectives of the Applied Probability for Computer Science course (https://www.unive.it/data/educazione/335487 ) even without having necessarily passed the exam. In particular, it is important that students are thoroughly familiar with the basic concepts of probability calculus, random variables, simulation techniques and the basic tools of statistical inference.
The course program includes presentation and discussion of the following topics:
1. linear prediction models
2. classification techniques
3. resampling methods
4. model selection and regularization
5. nonlinear models
Applications with R language (www.r-project.org) are an integral part of the course.
- James G, Witten D, Hastie T, Tibshirani R (2015). An Introduction to Statistical Learning. 6th version. Springer. Webpage http://www-bcf.usc.edu/~gareth/ISL/
- Further reading and materials distributed during the course through Moodle
The achievement of the course objectives is assessed through the oral discussion of a project agreed with the teacher. The project consists in the analysis of a data set using the methods learned in the course. The student is required to prepare a report describing the analyses and then discuss the report with the teacher.

Students will be evaluated in terms of
- quality of their statistical analyses
- correct use of the technical terminology
- correct conclusions
- quality of presentation (report)
- quality of the oral discussion

Rules:
1) if the student fails the exam, they can try another session with the *same* project. However, if the exam is failed again, then a *new* project must be considered for the subsequent exam sessions
2) if the student passes the exam but decides to decline the score, then a *new* project must be considered for the subsequent exam sessions
Conventional theoretical lectures complemented by exercises, discussion of case studies and computer labs. Teaching material prepared by the teacher will be distributed during the course through the Moodle platform. The statistical software used in the course is R (www.r-project.org).
English
oral
Definitive programme.
Last update of the programme: 13/02/2023