Course Title: Practical Data Science with Python

Part A: Course Overview

Course Title: Practical Data Science with Python

Credit Points: 12.00

Terms

Course Code

Campus

Career

School

Learning Mode

Teaching Period(s)

COSC2670

City Campus

Postgraduate

171H School of Science

Face-to-Face

Sem 1 2017,
Sem 1 2018,
Sem 1 2019,
Sem 1 2020,
Sem 1 2021,
Sem 2 2021

COSC2670

City Campus

Postgraduate

175H Computing Technologies

Face-to-Face

Sem 1 2022,
Sem 2 2022,
Sem 1 2023,
Sem 2 2023,
Sem 1 2024,
Sem 2 2024

COSC2999

RMIT University Vietnam

Postgraduate

175H Computing Technologies

Face-to-Face

Viet3 2022,
Viet3 2023,
Viet3 2024

Flexible Terms

Course Code

Campus

Career

School

Learning Mode

Teaching Period(s)

COSC2791

RMIT Online

Postgraduate

171H School of Science

Internet

JulDec2019 (KP6)

COSC2791

RMIT Online

Postgraduate

171H School of Science

Internet

JulDec2020 (KP4)

COSC2791

RMIT Online

Postgraduate

171H School of Science

Internet

JanJun2021 (KP2)

COSC2791

RMIT Online

Postgraduate

171H School of Science

Internet

JulDec2021 (KP6)

COSC2791

RMIT Online

Postgraduate

175H Computing Technologies

Internet

JanJun2022 (KP1),

JanJun2022 (KP3)

COSC2791

RMIT Online

Postgraduate

175H Computing Technologies

Internet

JulDec2022 (KP5)

COSC2791

RMIT Online

Postgraduate

175H Computing Technologies

Internet

JanJun2023 (KP1),

JanJun2023 (KP3)

COSC2791

RMIT Online

Postgraduate

175H Computing Technologies

Internet

JulDec2023 (KP5)

COSC2791

RMIT Online

Postgraduate

175H Computing Technologies

Internet

JanJun2024 (KP1),

JanJun2024 (KP3)

COSC2791

RMIT Online

Postgraduate

175H Computing Technologies

Internet

JulDec2024 (KP5)

Course Coordinator: A/Prof. Yongli Ren

Course Coordinator Phone: via email

Course Coordinator Email: yongli.ren@rmit.edu.au

Course Coordinator Availability: By appointment


Pre-requisite Courses and Assumed Knowledge and Capabilities

None


Course Description

The course gives you a set of practical skills for handling data that comes in a variety of formats and sizes, such as texts, spatial and time series data. These skills cover the data analysis lifecycle from initial access and acquisition, modelling, transformation, integration, querying, application of statistical learning and data mining methods, and presentation of results. This includes data wrangling, the process of converting raw data into a more useful form that can be subsequently analysed. The course is hands-on, using Python.


Objectives/Learning Outcomes/Capability Development

This course contributes to the following Program Learning Outcomes for MC267 Master of Data Science:

 

Enabling Knowledge: You will gain skills as you apply knowledge with creativity and initiative to new situations. In doing so, you will:

  • Demonstrate mastery of a body of knowledge that includes recent developments in computer science and information technology;
  • Understand and use appropriate and relevant, fundamental and applied mathematical and statistical knowledge, methodologies and modern computational tools;
  • Recognise and use research principles and methods applicable to data science.

 

Critical Analysis: You will learn to accurately and objectively examine, and critically investigate computer science, information technology (IT) and statistical concepts, evidence, theories or situations, in particular to:

  • Analyse and model complex requirements and constraints for the purpose of designing and implementing software artefacts and IT systems;
  • Evaluate and compare designs of software artefacts and IT systems on the basis of organisational and user requirements;
  • Bring together and flexibly apply knowledge to characterise, analyse and solve a wide range of statistical problems.

 

Problem Solving: Your capability to analyse complex problems and synthesise suitable solutions will be extended as you learn to:

  • Design and implement software solutions that accommodate specified requirements and constraints, based on analysis or modelling or requirements specification;
  • Apply an understanding of the balance between the complexity / accuracy of the mathematical / statistical models used and the timeliness of the delivery of the solution


On completion of this course you should be able to:

  1. Use industry and evidence-based tools and approaches to transform raw data into a format suitable for a data science pipeline
  2. Identify scenarios where a machine learning approach may support effective data analysis
  3. Generate an interpretation and visualisation of data using exploratory data analysis in Python
  4. Construct and document an experimental methodology for analysis of data
  5. Select appropriate models, and apply simple machine learning tools and feature selection strategy for a defined data science problem
  6. Apply professional standards to allow reproducibility of analysis


Overview of Learning Activities

You will learn about key concepts in pre-recorded lecture videos, where you can engage with course material and the subject matter being illustrated through demonstrations and examples.

Tutorials, workshops and/or labs and/or group discussions (including online forums) focused on projects and problem solving will provide you with the opportunity to practice the application of theory and procedures. You will explore the concepts with teaching staff and other students and receive feedback on your progress. You will develop an integrated understanding of the subject matter through private study by working through the course as presented in classes. Comprehensive learning materials will aid you in gaining practice at solving conceptual and technical problems. 

 

RMIT Online Overview of Leaning Activities

This course uses highly structured learning activities to guide your learning and prepare you to complete the assessment tasks. These activities consist of a combination of individual, peer-supported and facilitator-guided activities, and where possible project-led, with opportunities for regular feedback. 

Authentic and industry-relevant learning is critical to this course as you will be expected to critically evaluate current thinking and practice within data science applications. You will apply your thinking by producing relevant real-world assessment tasks and engage with scenarios and case studies.  

You will be expected to participate in class and group activities, as well as provide and receive peer feedback on drafts of work as social learning is an important component of this course.


Overview of Learning Resources

You will make extensive use of computer laboratories and relevant software provided by the RMIT University. You will be able to access course information and learning materials through myRMIT.

Lists of relevant reference texts[1], resources in the library and freely accessible Internet sites will be provided within the course materials.

[1] For example: Luca Massaron, Alberto Boschetti, Python Data Science Essentials - Learn the fundamentals of Data Science with Python, Packt Publishing, 2015,ISBN: 978-1785280429.

 

RMIT Online Overview of Learning Resources

The learning and teaching approaches used in this program may include webinars, problem-based learning and case studies. RMIT will provide you with resources and tools for learning in this course through our online systems.  

The activities and tasks are designed to facilitate the application of theory and encourage peer learning in a collaborative, open manner using online tools and interactive discussion forums. Assessment is integrated throughout the program to ensure that you graduate with a set of applicable skills and knowledge of blockchain fundamentals and applications.  

There are services available to support your learning via the RMIT University Library. The Library provides guides on academic referencing and subject specialist help as well as a range of study support services.  

RMIT Online provides support and equal opportunities for students with a disability, long-term illness and/or mental health condition and primary carers of individuals with a disability. If you need assistance, please speak to your Program Manager or contact the Equitable Learning Services (ELS).  

At RMIT you can apply for credit so your previous learning or experience counts toward your RMIT Online program. For further information on how to apply for credit, please click here.  

Please view the Assessment and Assessment Flexibility Policy for further information regarding applying for an extension, special consideration, equitable assessment arrangements and supplementary assessment. 


Overview of Assessment

The assessment for this course comprises practical, written, and presentation assignments, including data pre-processing, data analysis and data modelling. The assessment tasks involve the processing and analysis of various types of datasets, and the applications of various machine learning models. While this course will use machine learning tools, the focus of the assessment is on analysis, application and problem solving. Across all assessment tasks, students are required to demonstrate their knowledge of theoretical concepts and practical techniques, including identifying the appropriate techniques and applying them to new situations.

This course has no hurdle requirements.

On campus

Assessment Task 1: Weekly Quizzes
Weighting 10%
This assessment task supports CLOs 1, 2, 3, 4, 5, 6

Assessment Task 2: Practical Assignment (individual)  
Weighting 25% 
This assessment task supports CLOs 1, 3, 4, 6

Assessment Task 3: Practical and Written Assignment (individual)
Weighting 35%
This assessment task supports CLOs 1, 2, 3, 4, 5, 6

Assessment Task 4: Practical and Written Assignment (individual)
Weighting 30%
This assessment task supports CLOs 1, 2, 3, 4, 5, 6

 

RMIT Online Offering

You will be assessed on how well you meet the course learning outcomes and on your development against the program learning outcomes.

Assessment Task 1: Dataset preparation report  
Weighting 25%.  This assessment task supports CLOs 1, 3, 4, 6

Assessment Task 2: Data modelling report
Weighting 45%.  This assessment task supports CLOs 2, 3, 4, 5, 6

Assessment Task 3: Data modelling presentation
Weighting 30%.  This assessment task supports CLOs 2, 3, 5

 

Feedback will be given on all assessment tasks. 

If you have a long-term medical condition and/or disability it may be possible to negotiate to vary aspects of the learning or assessment methods. You can contact the program coordinator or Equitable Learning Services if you would like to find out more. 

Your course assessment conforms to RMIT assessment principles, regulations, policies, procedures and instructions.