PREDICTIVE ANALYTICS
- Academic year
- 2024/2025 Syllabus of previous years
- Official course title
- ANALISI PREDITTIVA
- Course code
- CT0429 (AF:402027 AR:218249)
- Modality
- On campus classes
- ECTS credits
- 6
- Degree level
- Bachelor's Degree Programme
- Educational sector code
- SECS-S/01
- Period
- 1st Semester
- Course year
- 3
- Where
- VENEZIA
- Moodle
- Go to Moodle page
Contribution of the course to the overall degree programme goals
The course covers the main concepts in linear models and generalized linear models and possibly further extension of these modelling frameworks including time series analysis. The focus is placed on providing the main insights on the statistical/mathematical foundations of the models and on showing the effective implementation of the methods through the use of statistical software. This is achieved by a mixture of theory and reproducible code. Real data examples and case studies are also introduced.
Expected learning outcomes
Identify the most appropriate data analysis techniques for each problem and know how to apply the techniques for the analysis, design and solution of the problems.
Apply data processing techniques to real data using suitable statistical software
Be able to generate new ideas (creativity) and anticipate new situations, in the contexts of data analysis and decision making.
* Specific competences
Use classic results of inference and regression as a basis for advanced methods of prediction and classification.
Be able to interpret and present the results of a statistical analysis, in particular knowing how to traslate statistical concepts into common language and vice-versa.
Correctly identify the type of statistical problem corresponding to certain objectives and data, as well as the most appropriate methodologies to apply to the given objectives and data.
Know how to design specific data processing systems for a type of statistical problem (classification, estimation, prediction, etc.)
Use linear algebra knowledge for its application in methods for analysing data.
Pre-requirements
Calculus 1 and 2
Algebra
Probability and Statistics
Data Analysis
although it is not formally required to have passed the examination.
Contents
1.1 Course overview
1.2 What is predictive modeling?
1.3 General notation and background
2. Linear models I: simple and multiple linear model
2.1 Model formulation and least squares
2.2 Assumptions of the model
2.3 Inference for model parameters
2.4 Prediction
2.5 ANOVA
2.6 Model fit
3. Linear models II: model selection, extensions, and diagnostics
3.1 Model selection
3.2 Use of qualitative predictors
3.3 Nonlinear relationships
3.4 Model diagnostics
3.5 Potential critical issues in regression models
4. Generalized linear models
4.1 Model formulation and estimation
4.2 Inference for model parameters
4.3 Prediction
4.4 Deviance
4.5 Model selection
4.6 Model diagnostics
If time allows:
5. Time-series forecasting
5.1: elements of time series
5.2: auto-regressive models
5.3: forecasting by means exponential smoothing
The program might be slightly modified during the semester. Students are encouraged to actively request for the course to also cover specific statistical questions of interest.
Referral texts
Julian J. Faraway, 2016. Extending the Linear Model with R: Generalized Linear, Mixed Effects and Nonparametric Regression Models, Second Edition Chapman and Hall/CRC
Peter H. Westfall, Andrea L. Arias, Understanding Regression Analysis - A Conditional Distribution Approach, Chapman and Hall/CRC
James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2013. An Introduction to Statistical Learning. Springer
Assessment methods
1. the theoretical knowledge of the course topics,
2. the ability to apply them for solving real data problems.
3. the ability to use R and interpret its output to solve real data problems.
4. the ability to use the R software to present the results of a statistical data analysis.
Typically the grading will follow the following criteria:
A. grades between 18 and 22 will be assigned when there is evidence of
- comprehension of basic theoretical concepts underlying statistical predictive analysis;
- limited ability to interpret and present a statistical predictive analysis;
- limited ability to adapt the predictive analysis to the problem address in a specific real case;
B. grades between 23 and 26 will be assigned when there is evidence of
- comprehension of theoretical concepts underlying statistical predictive analysis beyond the basic ones;
- moderate ability to interpret and present a statistical predictive analysis;
- moderate ability to adapt the predictive analysis to the problem address in a specific real case;
C. grades between 27 and 30 will be assigned when there is evidence of
- good or very good comprehension of theoretical concepts underlying statistical predictive analysis;
- good or very good ability to interpret and present a statistical predictive analysis;
- good or very good ability to adapt the predictive analysis to the problem address in a specific real case;
D. the honors grade (lode) is assigned when a student shows particular ability in completing all assigned task with exceptional care and skill level