Module Details

Module Code: SOFT8032
Title: Programming for Data Analytics
Long Title: Programming for Data Analytics
NFQ Level: Advanced
Valid From: Semester 1 - 2017/18 ( September 2017 )
Duration: 1 Semester
Credits: 5
Field of Study: 4814 - Computer Software
Module Delivered in: 4 programme(s)
Module Description: Data analytics is a set of techniques and processes that can be used to provide identify and analyze patterns from data. In this module the learner will be provided with the skills to import and manipulate various forms of data and perform exploratory data analysis. The learner will also be equipped with the skills to effectively visualize different aspects of data and perform basic classification and clustering techniques.
 
Learning Outcomes
On successful completion of this module the learner will be able to:
# Learning Outcome Description
LO1 Apply programming techniques to facilitate the importation, manipulation and cleaning of data.
LO2 Implement exploratory data analysis techniques and interpret results.
LO3 Choose and employ appropriate visualization techniques for depicting data.
LO4 Select and apply basic classification and clustering techniques to a range of datasets.
LO5 Evaluate the accuracy and interpret the results of classification algorithms.
Dependencies
Module Recommendations

This is prior learning (or a practical skill) that is strongly recommended before enrolment in this module. You may enrol in this module if you have not acquired the recommended learning but you will have considerable difficulty in passing (i.e. achieving the learning outcomes of) the module. While the prior learning is expressed as named MTU module(s) it also allows for learning (in another module or modules) which is equivalent to the learning specified in the named module(s).

Incompatible Modules
These are modules which have learning outcomes that are too similar to the learning outcomes of this module. You may not earn additional credit for the same learning and therefore you may not enrol in this module if you have successfully completed any modules in the incompatible list.
No incompatible modules listed
Co-requisite Modules
No Co-requisite modules listed
Requirements

This is prior learning (or a practical skill) that is mandatory before enrolment in this module is allowed. You may not enrol on this module if you have not acquired the learning specified in this section.

No requirements listed
 
Indicative Content
Introduction
Overview of the terminology and applications in the area of data science and analytics. Importance of data analytics in industry.
Data Extraction and Manipulation
Importing data from different sources in different formats. Applying data manipulation techniques such as reshaping, pivoting, array-based indexing, joining, cleaning and munging, grouping, aggregation.
Visualization and Exploratory Data Analysis
Overview of a range of visualization techniques such as histograms, scatter plots, heatmaps, clustered matrices, boxplots, regression plots. Critically evaluate and assess the applicability of different visualization techniques for a specific task. Obtaining basic summary statistics such as mean, median, standard deviation, variance and range. Using programming techniques to perform more advanced analysis such as multi-collinearity analysis.
Clustering Analysis
Introduction to the concept and motivation for clustering. Examine the operation of a standard clustering algorithm such as k-means. Case study illustrating the advantages and limitations of applying k-means to a dataset form a specific application domain.
Classification
Overview of the concept of classification and its role solving real-world problems. Introduction to a standard category of classification algorithm such as decision trees. Case study to illustrate the application of classification to a dataset. Overview of k-fold cross validation as an appropriate method of evaluation.
Module Content & Assessment
Assessment Breakdown%
Coursework100.00%

Assessments

Coursework
Assessment Type Open-book Examination % of Total Mark 20
Timing Week 6 Learning Outcomes 1,2
Assessment Description
Perform importation, cleaning and manipulation of a dataset and perform exploratory data analysis.
Assessment Type Project % of Total Mark 30
Timing Week 8 Learning Outcomes 1,2,3
Assessment Description
Complete a comprehensive analysis of a real-world dataset and produce a report documenting findings and incorporating appropriate visualizations.
Assessment Type Project % of Total Mark 50
Timing Sem End Learning Outcomes 3,4,5
Assessment Description
Select and apply appropriate classification techniques to a dataset from a specific application domain. Findings should be documented and supported with appropriate visualisations.
No End of Module Formal Examination
Reassessment Requirement
Coursework Only
This module is reassessed solely on the basis of re-submitted coursework. There is no repeat written examination.

The University reserves the right to alter the nature and timings of assessment

 

Module Workload

Workload: Full Time
Workload Type Contact Type Workload Description Frequency Average Weekly Learner Workload Hours
Lecture Contact Lecture delivering theory underpinning learning outcomes. Every Week 2.00 2
Lab Contact Practical computer-based lab supporting learning outcomes. Every Week 2.00 2
Independent Learning Non Contact Independent Student Learning. Every Week 3.00 3
Total Hours 7.00
Total Weekly Learner Workload 7.00
Total Weekly Contact Hours 4.00
Workload: Part Time
Workload Type Contact Type Workload Description Frequency Average Weekly Learner Workload Hours
Lab Contact Practical computer-based lab supporting learning outcomes. Every Week 2.00 2
Lecture Contact Lecture delivering theory underpinning learning outcomes. Every Week 2.00 2
Independent Learning Non Contact Independent Student Learning. Every Week 3.00 3
Total Hours 7.00
Total Weekly Learner Workload 7.00
Total Weekly Contact Hours 4.00
 
Module Resources
Recommended Book Resources
  • Wes McKinney. (2012), Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 1st. O'Reilly Media, [ISBN: 9781449319793].
  • John V. Guttag. (2016), Introduction to Computation and Programming Using Python: With Application to Understanding Data, 2nd. MIT Press, [ISBN: 9780262529624].
  • Joel Grus. (2015), Data Science from Scratch: First Principles with Python, 1st. O'Reilly Media, [ISBN: 9781491901427].
Supplementary Book Resources
  • Clinton W. Brownley. (2015), Foundations for Analytics with Python, 1st. O'Reilly Media, [ISBN: 9781491922538].
  • Fabio Nelli. (2015), Python Data Analytics: Data Analysis and Science using pandas, matplotlib and the Python Programming Language, 1st. Apress, [ISBN: 9781484209592].
This module does not have any article/paper resources
Other Resources
 
Module Delivered in
Programme Code Programme Semester Delivery
CR_ESMPR_8 Bachelor of Engineering (Honours) in Smart Product Engineering 8 Group Elective 2
CR_KSDEV_8 Bachelor of Science (Honours) in Software Development 5 Mandatory
CR_KDNET_8 Bachelor of Science (Honours) in Computer Systems 5 Elective
CR_KCOMP_7 Bachelor of Science in Software Development 5 Mandatory