QUT offers a diverse range of student topics for Honours, Masters and PhD study. Search to find a topic that interests you or propose your own research topic to a prospective QUT supervisor. You may also ask a prospective supervisor to help you identify or refine a research topic.
Found 28 matching student topics
Displaying 1–12 of 28 results
Gamified process-data cleaning
Despite the importance of data quality, it is often compromised. The majority of the time and energy in most data science projects is spent on data cleaning. Process-oriented data mining (process mining) is not an exception. A recent process mining survey shows that more than 60% of the time and effort is spent on data transformation and pre-processing. While, in most cases, the engagement of domain experts is required for accurate data cleaning, it is challenging to engage them in …
- Study level
- PhD, Master of Philosophy, Honours
- Faculty
- Faculty of Science
- School
- School of Information Systems
Praeclarus process-data quality framework
Praeclarus is an open-source software framework that aims to facilitate data pre-processing for process mining. Process mining is specialised data mining focusing on process-data. It is of high interest to industry, with the market doubling every two years (e.g., increasing from $550M in 2020 to $1B in 2022). This market increase has meant that big companies like Microsoft, SAP, and IBM are acquiring process mining vendors such is Minit, Signavio, and myInvenio.Recent process mining surveys show that more than 60% …
- Study level
- PhD, Master of Philosophy, Honours
- Faculty
- Faculty of Science
- School
- School of Information Systems
Information retrieval and coding methods for large scale bioinformatics
Advances in sequencing technologies over the past two decades have led to an explosion in the availability of genomic sequence data and an increasingly urgent need for scalable clustering and search facilities. One approach is to encode sequences as binary vectors in a high-dimensional space, simplifying the comparison and allowing it to be computed very rapidly using bit-level operations.Coupled with these ideas is the need to provide clustering methods and efficient indexing and lookup in response to search queries. One …
- Study level
- PhD, Master of Philosophy, Honours
- Faculty
- Faculty of Science
- School
- School of Computer Science
- Research centre(s)
- Centre for Data Science
Overcoming the challenges of sensitive data via synthetic data generation (case study)
In the 21st Century, there is an abundance of data, often containing insights that could benefit a number of stakeholders. However, despite this opportunity, it is often the case that the data is sensitive and can not be released by organisations or government agencies due to privacy concerns. One possible solution to the above dilemma is to instead carefully construct a 'twin' data set that contains similar information (and ideally, the same insights) as the original data set, but without …
- Study level
- Honours
- Faculty
- Faculty of Science
- School
- School of Mathematical Sciences
- Research centre(s)
- Centre for Data Science
Visualisation and sonification for genomic data sets
Successive revolutions in sequencing technology over the past two decades have led to an explosion in the availability of genomic data. Analysing biological datasets and identifying relationships within them is challenging - some of the process can be automated but interactive exploration offers a number of advantages, and supports serendipitous discovery.This project looks at visual analytics and sonification - the use of sound and musical encodings - to enhance our understanding of biological networks.
- Study level
- PhD, Master of Philosophy, Honours
- Faculty
- Faculty of Science
- School
- School of Computer Science
- Research centre(s)
- Centre for Data Science
Surprising genomes
Genomic sequencing has changed radically since the first public sequencing projects more than 25 years ago. The original human genome project cost more than two billion dollars; sequencing a human genome now costs as little as a thousand, and we may sequence whole viruses and bacteria as a matter of routine.The challenge now lies in rapidly analysing these genomes as they appear, and understanding quickly whether there is anything interesting in the new sequence to warrant further inquiry. This project …
- Study level
- PhD, Master of Philosophy, Honours
- Faculty
- Faculty of Science
- School
- School of Computer Science
- Research centre(s)
- Centre for Data Science
Security analysis of open-source software
Several open-source projects drive modern-day IT applications. However, some open-source projects get compromised by malicious attackers, who include malware to the code to compromise the security of the application users.This project will investigate approaches for securing the open-source software.
- Study level
- Honours
- Faculty
- Faculty of Science
- School
- School of Computer Science
Representation learning for anti-microbial resistance
This project is about using neural network models help us understand Anti-Microbial Resistance (AMR), a phenomenon in which bacteria adapt to reduce the effectiveness of antibiotics, usually through a process known as Lateral or Horizontal Gene Transfer - where genes are included in the organism from other sources.Our focus will be on learning compact vector representations of biological sequences known to be associated with AMR genes. By encoding DNA sequences in this way we can more rapidly identify AMR genes …
- Study level
- PhD, Master of Philosophy, Honours
- Faculty
- Faculty of Science
- School
- School of Computer Science
- Research centre(s)
- Centre for Data Science
Transport big data analytics: Imputing missing data
The missing data problem is often unavoidable for real-world data collection systems because of a variety of factors, such as sensor malfunctioning, maintenance work, transmission errors, and so on. Filling in missing information in a dataset is an important requirement for many machine-learning algorithms that require a complete dataset as input. Data imputation algorithms aim at filling the missing information in a dataset. Many missing data imputation techniques exist in the literature, with applications demonstrated on various types of datasets. …
- Study level
- PhD, Master of Philosophy, Honours
- Faculty
- Faculty of Engineering
- School
- School of Civil and Environmental Engineering
- Research centre(s)
- Centre for Data Science
Efficient parameter estimation for agent-based models of tumour growth
Cancer is an extremely heterogeneous disease, particularly at the cellular level. Cells within a single cancerous tumour undergo vastly different rates of proliferation based on their location and specific genetic mutations. Capturing this stochasticity in cell behaviour and its effect on tumour growth is challenging with a deterministic system, e.g. ordinary differential equations, however, is possible with an agent-based model (ABM). In an ABM, cells are modelled as individual agents that have a probability of proliferation and movement in each …
- Study level
- Master of Philosophy, Honours
- Faculty
- Faculty of Science
- School
- School of Mathematical Sciences
- Research centre(s)
- Centre for Data Science
Making the most of many models
In the age of Big Data, machine learning methods, and modern statistics the adage "all models are wrong but some are useful" has never been so true. This project will investigate data science approaches where more than one model makes sense for the data. Is it better to choose a single model or is there something to be gained from multiple models?This project will look at variable selection methods, penalised regression, Bayesian model averaging and conformal prediction. The research has …
- Study level
- Honours
- Faculty
- Faculty of Science
- School
- School of Mathematical Sciences
- Research centre(s)
- Centre for Data Science
Predicting good sleep using computer science: Can we use machine learning to find out 'what's the best bed?'
In the Westernised world a person typically spends one third of their life in bed, with more time spent sleeping in a bed than in any other single activity. Sleep amount and quality of sleep have a direct impact on mood, behaviour, motor skills and overall quality of life. Yet, despite how important restful sleep is for the body to maintain good health, there is a comparatively small amount of studies evaluating key multi-factorial determinants of restful sleep in non-pathological, …
- Study level
- PhD
- Faculty
- Faculty of Engineering
- School
- School of Mechanical, Medical and Process Engineering
- Research centre(s)
- Centre for Biomedical Technologies
Contact us
If you have questions about the best options for you, the application process, your research topic, finding a supervisor or anything else, get in touch with us today.