Note: This unit is an archived version! See Overview tab for delivered versions.

INFO2150: Introduction to Health Data Science (2018 - Semester 2)

Download UoS Outline

Unit: INFO2150: Introduction to Health Data Science (6 CP)
Mode: Normal-Day
On Offer: Yes
Level: Intermediate
Faculty/School: School of Computer Science
Unit Coordinator/s: Dr Poon, Josiah
Session options: Semester 2
Versions for this Unit:
Campus: Camperdown/Darlington
Pre-Requisites: (INFO1003 OR INFO1903 OR INFO1103 OR INFO1110 OR INFO1910 OR DATA1002 OR DATA1902) AND (DATA1001 OR MATH1005 OR MATH1905 OR MATH1015).
Co-Requisites: DATA2001 OR DATA2901 OR ISYS2120 OR INFO2120 OR INFO2820 OR INFO1903.
Brief Handbook Description: Health organisations cannot function effectively without computer information systems. Clinical data are stored and distributed in different databases, different formats and different locations. It requires a lot of effort to create an integrated and clean-up version of data from multiple sources, This unit provides basic introduction to the process and knowledge to enable the analysis of health data. The unit will be of interest to students seeking the understanding of the various coding standards in health industry, data retrieval from databases, data linkage issue, cleaning and pre-processing steps, necessary statistical techniques and presentation of results.

It will be valuable to those who want to work as health-related occupations, such as health informatics analysts, healthcare administrators, medical and health services manager or research officers in hospitals, government health agencies and research organisations. Having said that, a good understanding of health data analysis is a useful asset to all students.
Assumed Knowledge: Basic knowledge of Entity Relationship Modelling, database technology and SQL
Lecturer/s: Dr Poon, Josiah
Dr Lam, Mary
Timetable: INFO2150 Timetable
Time Commitment:
# Activity Name Hours per Week Sessions per Week Weeks per Semester
1 Workshop 2.00 1 13
2 Independent Study 6.00 1 13

Attributes listed here represent the key course goals (see Course Map tab) designated for this unit. The list below describes how these attributes are developed through practice in the unit. See Learning Outcomes and Assessment tabs for details of how these attributes are assessed.

Attribute Development Method Attribute Developed
Students learn and practice the health data analysis processes. They have to design the pre-processing task sequence and choice of analytical methods. Design (Level 2)
Students are required them to apply relevant pre-processing techniques to clean up a set of given health data. Students should be capable to explain their choice of methods & tools used in the data analysis process. Engineering/IT Specialisation (Level 3)
Students are able to apply various statistical methods to analyse a clean-up data. Through their assignment, students are expected to articulate the rationale and any assumption behind their decision, as well as drawing conclusion and inference from the data. Maths/Science Methods and Tools (Level 3)
Students practice their written and oral communication skills through
the assessments. They need to articulate well the aim, issues arising from the data characteristics, the reasons behind decision choices. They should be able to discuss and draw insights from the results through their analytical work. It is expected the students can present their results in the most appropriate graphical and tablular format.
Communication (Level 3)
Students are given the context to work in team to integrate data analysis results, as well as the preparation of written report and verbal presentation. Project and Team Skills (Level 2)

For explanation of attributes and levels see Engineering & IT Graduate Outcomes Table 2018.

Learning outcomes are the key abilities and knowledge that will be assessed in this unit. They are listed according to the course goal supported by each. See Assessment Tab for details how each outcome is assessed.

Engineering/IT Specialisation (Level 3)
1. Student understands the role of data analysis in decision-making
2. Student understands the technical issues that are present in the collection of clinical data and the necessary of different pre-processing issues to clean-up a dataset for analysis.
Maths/Science Methods and Tools (Level 3)
3. Student can identify and select appropriate statistical techniques to summarise and analyse clinical data set, and able to articulate a justification of choice of methods.
4. Student can apply health concepts and terms to describe and analyse the role of a data analysis task.
Information Seeking (Level 3)
5. Student can identify explicit and implicit requirements for carrying out a health data analysis task to meet stakeholder purposes
Communication (Level 3)
6. Student can communicate the results produced by an analysis pipeline, in oral and written form, including meaningful diagrams
7. Student can communicate the process used to analyse health data set, and justify the techniques & methods used.
Design (Level 2)
8. Students can carry out (in guided stages) the whole design and implementation cycle of a simple health data analysis task
Assessment Methods:
# Name Group Weight Due Week Outcomes
1 Lab Activities No 20.00 Multiple Weeks 2, 3, 4,
2 Presentation Yes 10.00 Week 5 1, 2, 3, 4, 5, 6, 7, 8,
3 Project-M1 No 10.00 Week 7 1, 2, 3, 4, 5, 6, 7, 8,
4 Case Study (Statistics) No 10.00 Week 11 3, 4, 5,
5 Project-M2 No 15.00 Week 12 1, 2, 3, 4, 5, 6, 7, 8,
6 Quiz (in-class) No 25.00 Week 13 1, 2, 3, 4, 5, 6, 7, 8,
7 Health Text Analytic No 10.00 Week 13 1, 2, 3, 4, 7,
Assessment Description: Lab Activities: A student has to participate and engage in the activities in the workshop (from w2-w12). Each week`s work worths 2 mark.

Presentation of Privacy. It is a group activity when members have to present on a specific privacy/ethical issue related to health data.

Project. A dataset is given such that each student has to perform necessary tasks and analysis to arrive observation and to infer conclusion. The project is to be staged in two milestones to help students achieving a final executive report.

Case Study (Statistics): Using statistical instrument to analyse a given health case.

Health Text Analytic: Write a small script to extract information from a given dataset of health text for analysis.

Quiz: A quiz to assess a student`s understanding of knowledge content taught.
Grading:
Grade Type Description
Standards Based Assessment Final grades in this unit are awarded at levels of HD for High Distinction, DI (previously D) for Distinction, CR for Credit, PS (previously P) for Pass and FA (previously F) for Fail as defined by University of Sydney Assessment Policy. Details of the Assessment Policy are available on the Policies website at http://sydney.edu.au/policies . Standards for grades in individual assessment tasks and the summative method for obtaining a final mark in the unit will be set out in a marking guide supplied by the unit coordinator.
Minimum Pass Requirement It is a policy of the School of Computer Science that in order to pass this unit, a student must achieve at least 40% in the written examination. For subjects without a final exam, the 40% minimum requirement applies to the corresponding major assessment component specified by the lecturer. A student must also achieve an overall final mark of 50 or more. Any student not meeting these requirements may be given a maximum final mark of no more than 45 regardless of their average.

Note that the "Weeks" referred to in this Schedule are those of the official university semester calendar https://web.timetable.usyd.edu.au/calendar.jsp

Week Description
Week 1 Lecture: Introduction
Lecture: Presentation of Scientific Information
Week 2 Lecture: Understanding Data
Week 3 Lecture: (Video session) Data Preprocessing, HL7
Week 4 Lecture: Data Quality
Week 5 Lecture: Privacy and Ethics
Assessment Due: Presentation
Week 6 Lecture: Data Linkage
Week 7 Lecture: Recalling Statistics (1): Chi-square, T-Test
Assessment Due: Project-M1
Week 8 Lecture: Recalling Statistics (2): ANOVA, Correlation and Regression
Week 10 Lecture: Statistical Interpretation from Health
Week 11 Lecture: Extracting Health Information from Health Text
Assessment Due: Case Study (Statistics)
Week 12 Lecture: Data Mining Techniques: Clustering
Assessment Due: Project-M2
Week 13 Lecture: Quiz and Health Big-Data
Assessment Due: Quiz (in-class)
Assessment Due: Health Text Analytic

Course Relations

The following is a list of courses which have added this Unit to their structure.

Course Year(s) Offered
Advanced Computing / Science 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025
Advanced Computing / Science (Medical Science) 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025
Bachelor of Advanced Computing (Computational Data Science) 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025
Advanced Computing / Commerce 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025
Bachelor of Advanced Computing (Computational Data Science) - Mid-Year 2021, 2022, 2023, 2024, 2025
Bachelor of Advanced Computing (Computer Science) 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025
Bachelor of Advanced Computing (Cybersecurity) 2022, 2023, 2024, 2025
Bachelor of Advanced Computing (Computer Science) - Mid-Year 2021, 2022, 2023, 2024, 2025
Bachelor of Advanced Computing (Cybersecurity) - Mid-Year 2021, 2022, 2023, 2024, 2025
Bachelor of Advanced Computing (Information Systems) (not offered from 2022+) 2018, 2019, 2020, 2021
Bachelor of Advanced Computing (Software Development) 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025
Bachelor of Advanced Computing (Software Development) - Mid-Year 2021, 2022, 2023, 2024, 2025
Bachelor of Computer Science and Technology 2015, 2016, 2017, 2025
Biomedical Engineering (mid-year) 2016, 2017, 2018, 2019, 2020
Biomedical Engineering 2016, 2017, 2018, 2019, 2020
Software Engineering (mid-year) 2019, 2020
Software Engineering 2018, 2019, 2020

Course Goals

This unit contributes to the achievement of the following course goals:

Attribute Practiced Assessed
Engineering/IT Specialisation (Level 3) Yes 28.25%
Maths/Science Methods and Tools (Level 3) Yes 37%
Information Seeking (Level 3) No 11.5%
Communication (Level 3) Yes 18.5%
Design (Level 2) Yes 4.75%
Project and Team Skills (Level 2) Yes 0%
Professional Conduct (Level 2) No 0%

These goals are selected from Engineering & IT Graduate Outcomes Table 2018 which defines overall goals for courses where this unit is primarily offered. See Engineering & IT Graduate Outcomes Table 2018 for details of the attributes and levels to be developed in the course as a whole. Percentage figures alongside each course goal provide a rough indication of their relative weighting in assessment for this unit. Note that not all goals are necessarily part of assessment. Some may be more about practice activity. See Learning outcomes for details of what is assessed in relation to each goal and Assessment for details of how the outcome is assessed. See Attributes for details of practice provided for each goal.