Module Details
Module Code: |
SOFT8032 |
Title: |
Programming for Data Analytics
|
Long Title:
|
Programming for Data Analytics
|
NFQ Level: |
Advanced |
Valid From: |
Semester 1 - 2017/18 ( September 2017 ) |
Field of Study: |
4814 - Computer Software
|
Module Description: |
Data analytics is a set of techniques and processes that can be used to provide identify and analyze patterns from data. In this module the learner will be provided with the skills to import and manipulate various forms of data and perform exploratory data analysis. The learner will also be equipped with the skills to effectively visualize different aspects of data and perform basic classification and clustering techniques.
|
Learning Outcomes |
On successful completion of this module the learner will be able to: |
# |
Learning Outcome Description |
LO1 |
Apply programming techniques to facilitate the importation, manipulation and cleaning of data. |
LO2 |
Implement exploratory data analysis techniques and interpret results. |
LO3 |
Choose and employ appropriate visualization techniques for depicting data. |
LO4 |
Select and apply basic classification and clustering techniques to a range of datasets. |
LO5 |
Evaluate the accuracy and interpret the results of classification algorithms. |
Dependencies |
Module Recommendations
This is prior learning (or a practical skill) that is strongly recommended before enrolment in this module. You may enrol in this module if you have not acquired the recommended learning but you will have considerable difficulty in passing (i.e. achieving the learning outcomes of) the module. While the prior learning is expressed as named MTU module(s) it also allows for learning (in another module or modules) which is equivalent to the learning specified in the named module(s).
|
|
Incompatible Modules
These are modules which have learning outcomes that are too similar to the learning outcomes of this module. You may not earn additional credit for the same learning and therefore you may not enrol in this module if you have successfully completed any modules in the incompatible list.
|
No incompatible modules listed |
Co-requisite Modules
|
No Co-requisite modules listed |
Requirements
This is prior learning (or a practical skill) that is mandatory before enrolment in this module is allowed. You may not enrol on this module if you have not acquired the learning specified in this section.
|
No requirements listed |
Indicative Content |
Introduction
Overview of the terminology and applications in the area of data science and analytics. Importance of data analytics in industry.
|
Data Extraction and Manipulation
Importing data from different sources in different formats. Applying data manipulation techniques such as reshaping, pivoting, array-based indexing, joining, cleaning and munging, grouping, aggregation.
|
Visualization and Exploratory Data Analysis
Overview of a range of visualization techniques such as histograms, scatter plots, heatmaps, clustered matrices, boxplots, regression plots. Critically evaluate and assess the applicability of different visualization techniques for a specific task. Obtaining basic summary statistics such as mean, median, standard deviation, variance and range. Using programming techniques to perform more advanced analysis such as multi-collinearity analysis.
|
Clustering Analysis
Introduction to the concept and motivation for clustering. Examine the operation of a standard clustering algorithm such as k-means. Case study illustrating the advantages and limitations of applying k-means to a dataset form a specific application domain.
|
Classification
Overview of the concept of classification and its role solving real-world problems. Introduction to a standard category of classification algorithm such as decision trees. Case study to illustrate the application of classification to a dataset. Overview of k-fold cross validation as an appropriate method of evaluation.
|
Module Content & Assessment
|
Assessment Breakdown | % |
Coursework | 100.00% |
Assessments
No End of Module Formal Examination |
Reassessment Requirement |
Coursework Only
This module is reassessed solely on the basis of re-submitted coursework. There is no repeat written examination.
|
The University reserves the right to alter the nature and timings of assessment
Module Workload
Workload: Full Time |
Workload Type |
Contact Type |
Workload Description |
Frequency |
Average Weekly Learner Workload |
Hours |
Lecture |
Contact |
Lecture delivering theory underpinning learning outcomes. |
Every Week |
2.00 |
2 |
Lab |
Contact |
Practical computer-based lab supporting learning outcomes. |
Every Week |
2.00 |
2 |
Independent Learning |
Non Contact |
Independent Student Learning. |
Every Week |
3.00 |
3 |
Total Hours |
7.00 |
Total Weekly Learner Workload |
7.00 |
Total Weekly Contact Hours |
4.00 |
Workload: Part Time |
Workload Type |
Contact Type |
Workload Description |
Frequency |
Average Weekly Learner Workload |
Hours |
Lab |
Contact |
Practical computer-based lab supporting learning outcomes. |
Every Week |
2.00 |
2 |
Lecture |
Contact |
Lecture delivering theory underpinning learning outcomes. |
Every Week |
2.00 |
2 |
Independent Learning |
Non Contact |
Independent Student Learning. |
Every Week |
3.00 |
3 |
Total Hours |
7.00 |
Total Weekly Learner Workload |
7.00 |
Total Weekly Contact Hours |
4.00 |
Module Resources
|
Recommended Book Resources |
---|
-
Wes McKinney. (2012), Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 1st. O'Reilly Media, [ISBN: 9781449319793].
-
John V. Guttag. (2016), Introduction to Computation and Programming Using Python: With Application to Understanding Data, 2nd. MIT Press, [ISBN: 9780262529624].
-
Joel Grus. (2015), Data Science from Scratch: First Principles with Python, 1st. O'Reilly Media, [ISBN: 9781491901427].
| Supplementary Book Resources |
---|
-
Clinton W. Brownley. (2015), Foundations for Analytics with Python, 1st. O'Reilly Media, [ISBN: 9781491922538].
-
Fabio Nelli. (2015), Python Data Analytics: Data Analysis and Science using pandas, matplotlib and the Python Programming Language, 1st. Apress, [ISBN: 9781484209592].
| This module does not have any article/paper resources |
---|
Other Resources |
---|
-
Website, Python Documentation,
-
Website, Python Data Analysis Library,
-
Website, Numerical Python Library,
-
Website, Python Interactive Computing,
-
Website, Seaborn Visualization,
-
Website, Matplotlib Visualization,
|
|