Course Title: Practical Data Science
Part A: Course Overview
Course Title: Practical Data Science
Credit Points: 12.00
171H School of Science
|Sem 1 2017|
Course Coordinator: Associate Professor Falk Scholer
Course Coordinator Phone: +61 3 9925 9831
Course Coordinator Email: firstname.lastname@example.org
Course Coordinator Location: 14.9.22
Course Coordinator Availability: By appointment
Pre-requisite Courses and Assumed Knowledge and Capabilities
Required Prior Study (not enforced pre-requisite): Before starting this course, you should complete the Lab and Unix Induction programs. For more information, go to the following link: www.rmit.edu.au/compsci/orientation
The course gives you a set of practical skills for handling data that comes in a variety of formats and sizes, such as texts, spatial and time series data. These skills cover the data analysis lifecycle from initial access and acquisition, modeling, transformation, integration, querying, application of statistical learning and data mining methods, and presentation of results. This includes data wrangling, the process of converting raw data into a more useful form that can be subsequently analysed. The course is hands-on, using python, in the iPython interactive computing framework.
Objectives/Learning Outcomes/Capability Development
Program Learning Outcomes
This course contributes to the following Program Learning Outcomes for MC267Master of Data Science:
Enabling Knowledge: You will gain skills as you apply knowledge with creativity and initiative to new situations. In doing so, you will:
- Demonstrate mastery of a body of knowledge that includes recent developments in computer science and information technology;
- Understand and use appropriate and relevant, fundamental and applied mathematical and statistical knowledge, methodologies and modern computational tools;
- Recognise and use research principles and methods applicable to data science.
Critical Analysis: You will learn to accurately and objectively examine, and critically investigate computer science, information technology (IT) and statistical concepts, evidence, theories or situations, in particular to:
- Analyse and model complex requirements and constraints for the purpose of designing and implementing software artefacts and IT systems;
- Evaluate and compare designs of software artefacts and IT systems on the basis of organisational and user requirements;
- Bring together and flexibly apply knowledge to characterise, analyse and solve a wide range of statistical problems.
Problem Solving: Your capability to analyse complex problems and synthesise suitable solutions will be extended as you learn to:
- Design and implement software solutions that accommodate specified requirements and constraints, based on analysis or modelling or requirements specification;
- Apply an understanding of the balance between the complexity / accuracy of the mathematical / statistical models used and the timeliness of the delivery of the solution.
Course Learning Outcomes
On completion of this course you should be able to:
- Wrangle data including: selecting, uploading, cleaning up and transforming the data into a format suitable for a data science pipeline
- Extract an interpretation of data using exploratory data analysis
- Manipulate data by creating new features, reducing dimensionality, and by handling outliers in the data
- Apply simple machine learning tools to the data
- Visualise and plot graphical representations of data.
Overview of Learning Activities
You will learn about key concepts in lectures, classes or online, where you can engage with course material and the subject matter being illustrated through demonstrations and examples.
Tutorials, workshops and/or labs and/or group discussions (including online forums) focused on projects and problem solving will provide you practice in the application of theory and procedures. You will explore the concepts with teaching staff and other students, and receive feedback on your progress. You will develop an integrated understanding of the subject matter through private study by working through the course as presented in classes. Comprehensive learning materials will aid you in gaining practice at solving conceptual and technical problems.
This course includes 2 hours per week of lectures and 2 hours per week of tutorial/laboratory classes. To achieve high levels of academic results you are expected to spend on average an additional 6 hours per week on self-directed independent learning (reading, online activities and assignments).
Overview of Learning Resources
You will make extensive use of computer laboratories and relevant software provided by the School. You will be able to access course information and learning materials through myRMIT and may be provided with copies of additional materials in class or via email.
Overview of Assessment
This course has no hurdle requirements.
The assessment for this course comprises practical work involving the development of computer programs, a class test, and a final exam. For standard assessment details, relating to Computer Science and IT courses see: http://www.rmit.edu.au/csit/cgi
Assessment Task 1: Practical Assignment 1
This assessment task supports CLOs 1,2,5
Assessment Task 2: Practical Assignment 2
This assessment task supports CLOs 3,4,5
Assessment 3: Exam
This assessment supports CLOs 1,2,3,4,5