Course Title: Advanced Programming for Data Science

Part A: Course Overview

Credit Points: 12.00

Terms

Course Code   Campus                    Career         School                        Learning Mode   Teaching Period(s)
COSC2820      City Campus               Postgraduate   171H School of Science        Face-to-Face    Sem 2 2021
COSC2820      City Campus               Postgraduate   175H Computing Technologies   Face-to-Face    Sem 2 2022, Sem 1 2024, Sem 2 2024, Sem 1 2025
COSC3015      RMIT University Vietnam   Postgraduate   175H Computing Technologies   Face-to-Face    Viet3 2022, Viet2 2024

Course Coordinator: Prof Jeffrey Chan

Course Coordinator Phone: +61 9925

Course Coordinator Email: jeffrey.chan@rmit.edu.au


Pre-requisite Courses and Assumed Knowledge and Capabilities

Enforced Pre-Requisite Courses
Successful completion of the following course/s:

Note: it is a condition of enrolment at RMIT that you accept responsibility for ensuring that you have completed the prerequisite/s and agree to concurrently enrol in co-requisite courses before enrolling in a course.
For information go to RMIT Course Requisites webpage.

If you have completed prior studies at RMIT or another institution that developed the skills and knowledge covered in the above course/s you may be eligible to apply for credit transfer. Alternatively, if you have prior relevant work experience that developed the skills and knowledge covered in the above course/s you may be eligible for recognition of prior learning. Please follow the link for further information on how to apply for credit for prior study or experience.


Course Description

This is an advanced programming course, designed specifically for students who are interested in the field of Data Science.

This course teaches advanced programming concepts and techniques for data processing (e.g., data parsing, cleansing, and integration), enabling more complex data pre-processing and preparing data for downstream analysis. These include, for example, handling data stored in different formats (e.g., CSV, JSON, XML), handling bad and missing data, and integrating data from different sources. The course also introduces both fundamental and state-of-the-art techniques for text pre-processing, which convert raw natural language text into feature representations that can be used directly in downstream analysis. Finally, the course explores a simple web app development framework that enables students to deploy their data-driven applications online.

A Python environment will be used for implementation throughout the course. 
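
As an illustration of the kind of task described above, the following is a minimal sketch, not course material, of parsing records from CSV, JSON and XML, normalising missing values, and integrating the results. It uses only the Python standard library. The sample data, field names (`id`, `name`, `age`) and helper functions are hypothetical and chosen purely for demonstration.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical sample data: the same kind of record in three formats.
CSV_TEXT = "id,name,age\n1,Ana,34\n2,Binh,\n"
JSON_TEXT = '[{"id": 3, "name": "Chen", "age": 41}, {"id": 4, "name": "Dara", "age": null}]'
XML_TEXT = (
    "<people>"
    "<person id='5'><name>Eli</name><age>29</age></person>"
    "<person id='6'><name>Fay</name></person>"
    "</people>"
)


def to_int_or_none(value):
    """Convert a raw field to int, mapping missing or blank values to None."""
    if value is None:
        return None
    text = str(value).strip()
    return int(text) if text else None


def parse_csv(text):
    """Parse CSV text with a header row into a list of record dicts."""
    rows = csv.DictReader(io.StringIO(text))
    return [
        {"id": int(r["id"]), "name": r["name"], "age": to_int_or_none(r["age"])}
        for r in rows
    ]


def parse_json(text):
    """Parse a JSON array of objects into a list of record dicts."""
    return [
        {"id": int(r["id"]), "name": r["name"], "age": to_int_or_none(r.get("age"))}
        for r in json.loads(text)
    ]


def parse_xml(text):
    """Parse <person> elements into record dicts; a missing <age> becomes None."""
    root = ET.fromstring(text)
    return [
        {
            "id": int(p.get("id")),
            "name": p.findtext("name"),
            "age": to_int_or_none(p.findtext("age")),
        }
        for p in root.findall("person")
    ]


def integrate(*sources):
    """Merge record lists keyed by id; later sources overwrite earlier ones."""
    merged = {}
    for records in sources:
        for rec in records:
            merged[rec["id"]] = rec
    return [merged[key] for key in sorted(merged)]


records = integrate(parse_csv(CSV_TEXT), parse_json(JSON_TEXT), parse_xml(XML_TEXT))
missing_age = [r["name"] for r in records if r["age"] is None]
```

Here `records` holds six unified records and `missing_age` lists the names whose age was blank, null or absent in the source. In practice a library such as pandas would likely be used for this, but the standard-library version keeps each step explicit.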


Objectives/Learning Outcomes/Capability Development

Program Learning Outcomes

This course contributes to the program learning outcomes for the following program(s):

MC267 - Master of Data Science

PLO 1    Enabling Knowledge
PLO 2    Critical Analysis
PLO 3    Problem Solving
PLO 4    Communication
PLO 5    Team Work

For more information on the program learning outcomes for your program, please see the program guide.


Upon successful completion of this course, you will be able to:

  1. Programmatically parse data in the required format; 
  2. Programmatically identify and resolve data quality issues; 
  3. Programmatically integrate data from various sources for data enrichment; 
  4. Pre-process natural language text data to generate effective feature representations; 
  5. Document and maintain an editable transcript of the data pre-processing pipeline for professional reporting; 
  6. Build small to medium scale data-driven applications using a Web development framework. 
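
To make outcome 4 concrete, the sketch below shows one simple way to turn raw text into a feature representation: a bag-of-words count vector. It is an illustrative example only, not the method prescribed by the course. The stopword list, the regular-expression tokeniser and the function names are assumptions made for this example.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"the", "a", "is", "and", "of", "to"}


def tokenize(text):
    """Lowercase the text, keep alphabetic tokens, and drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def build_vocabulary(docs):
    """Collect the sorted set of distinct tokens across all documents."""
    return sorted({tok for doc in docs for tok in tokenize(doc)})


def vectorize(doc, vocabulary):
    """Return the count of each vocabulary term in the document."""
    counts = Counter(tokenize(doc))
    return [counts[term] for term in vocabulary]


docs = ["The data is clean.", "Clean the data and parse the data!"]
vocab = build_vocabulary(docs)
vectors = [vectorize(d, vocab) for d in docs]

print(vocab)    # ['clean', 'data', 'parse']
print(vectors)  # [[1, 1, 0], [1, 2, 1]]
```

Each document becomes a fixed-length numeric vector aligned to the shared vocabulary, which is the form most downstream models expect.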
 


Overview of Learning Activities

You will be actively engaged in a range of learning activities such as lectorials, tutorials, pre-recorded videos, activity notebooks, workshops, practicals, laboratories, seminars, project work, class discussion, individual and group activities. Workshops will focus on hands-on activities and problem solving, allowing exploration of concepts with teaching staff and other students, to provide feedback on progress and understanding. Delivery may be face to face, online or a mix of both.

You are encouraged to be proactive and self-directed in your learning, asking questions of your lecturer and/or peers and seeking out information as required, especially from the numerous sources available through the RMIT library, and through links and material specific to this course that are available through myRMIT Studies Course.


Overview of Learning Resources

RMIT will provide you with resources and tools for learning in this course through myRMIT Studies Course. You will make extensive use of computer laboratories and relevant software provided by the School and/or available for download onto private laptops/machines.

There are services available to support your learning through the University Library. The Library provides guides on academic referencing and subject specialist help as well as a range of study support services. For further information, please visit the Library page on the RMIT University website and the myRMIT student portal.


Overview of Assessment

This course has no hurdle requirements.

Assessment Task 1: In-class Coding Exercise
Weighting 15%
This assessment task supports CLOs 1, 2, 3

Assessment Task 2: Assignment 1
Weighting 20%
This assessment task supports CLOs 1, 2, 3, 5  

Assessment Task 3: Assignment 2
Weighting 35%
This assessment task supports CLOs 4, 5, 6

Assessment Task 4: Technical Interview
Weighting 30%
This assessment task supports CLOs 1, 2, 3, 4

If you have a long-term medical condition and/or disability it may be possible to negotiate to vary aspects of the learning or assessment methods. You can contact the program coordinator or Equitable Learning Services if you would like to find out more.