Intended learning outcomes
Students who successfully complete this course unit will be able to:
- Characterize the challenges of processing and analysing large volumes of data;
- Apply programming models and frameworks for data processing;
- Know and apply dimensionality reduction techniques to data sets;
- Know and apply sampling techniques;
- Know and apply techniques to manipulate streaming data;
- Know and apply large-scale data mining algorithms;
- Evaluate the quality of the models produced and the results obtained in the mining tasks;
- Understand existing solutions for data mining in different domains;
- Write technical reports and prepare technical presentations with comparative and detailed analysis of different approaches for a given problem.