Skip to content

Machine learning for early warning systems in a clinical setting?

Dortmund, 23rd May 2024

Artificial intelligence (AI) has evolved into an indispensable health research tool in recent years, with researchers increasingly relying on machine learning (ML) to analyse the huge data sets involved. This sub-field of AI enables computers to learn from data and recognise patterns within it, allowing researchers to better understand complex relationships, such as those that exist between courses of disease and symptoms. However, clinical data is often multi-layered. Variable and interconnected data points, such as results from a series of blood tests, quickly push standard ML algorithms to their limits. That is why researchers from ISAS, the Otto von Guericke University of Magdeburg, Bielefeld University and the German Centre for Higher Education Research and Science Studies in Hanover, together with cooperation partners from the University of Leipzig Medical Centre and Greifswald University Hospital, are turning to an alternative ML method. Taking a model for predicting sepsis (blood poisoning) as an example, the team was able to show that not only the data itself, but also the connections between the data points provide important information to facilitate early diagnosis. 

Clinical practice is often a race against time. So, to be able to treat illnesses appropriately, it is essential to diagnose them early. This is exactly the case with sepsis (see info box). This life-threatening infection often progresses so rapidly that the risk of the patient dying increases by around eight percent with every hour that it goes untreated1. But in its early stages, imminent sepsis is frequently hard to detect. If sepsis is suspected, doctors can treat it initially with a broad-spectrum antibiotic, but to diagnose it specifically using a bacterial culture or by evaluating symptoms takes time. By the time the medical experts are certain which bacteria they need to treat and how, they are often battling what is already an advanced and difficult-to-treat inflammation.

SEPSIS

Sepsis, also known as blood poisoning, is a deadly disease that costs more lives every year than breast, prostate and bowel cancer combined2. The disease is caused by the immune system failing to contain a localised infection and allowing messengers and toxins to spread through the bloodstream. The body responds by sending out leukocytes (white blood cells) to fight the pathogens (causative organisms). This can cause the blood vessels to expand. Since the patient's blood pressure then falls, vital organs such as the lungs, kidneys or heart are no longer being supplied with enough oxygen via the blood and, in serious cases, may fail. The patient enters septic shock, which means their life is in acute danger.

Machine learning to facilitate early diagnosis

In future, machine learning (see info box) could help doctors to diagnose time-critical diseases at an early stage: “Using AI, we are able to look at blood counts and predict which patients may be at risk of developing sepsis, for example,” says Prof. Dr Robert Heyer, Head of the Multidimensional Omics Data Analysis junior research group at ISAS. He goes on to say that corresponding models do already exist: “But they reach their limits when asked to take complex time series into account.” Time series are made up of sequences of data points, which are collected at regular intervals. In a hospital setting they may contain information about changes to patients' vital signs such as blood levels, heart rate or blood glucose level, for example.

Portrait von Prof. Dr. Robert Heyer.

Our results underline how important data collected regularly from patients is in providing basic information for predictive models in the field of health research.

Prof. Dr Robert Heyer

The team of interdisciplinary researchers put a specific type of algorithm, called a graph neural network (GNN), to the test. These networks are particularly well suited to analysing data organised in the form of graphs – such as time series. In a graph, the data points (nodes) are connected to one another by “edges”, which represent the relationships between the nodes. This gives rise to complex network structures, in which the nodes take their own information into account as well as information about their neighbouring nodes. By following the links within the data network, GNNs decode how various factors affect one another and reveal hidden connections. “Our objective was to find out how suitable GNNs are for analysing complex clinical data and whether integrating time series improves the predictive accuracy of the models,” says Daniel Walke, doctoral candidate at the Otto von Guericke University of Magdeburg and lead author of the joint pre-print (pre-release of a scientific paper that has not yet been peer reviewed).

Portrait von Daniel Walke.

Daniel Walke is a PhD student in the Database and Software Engineering group at Otto von Guericke University Magdeburg.

© Privat.

MACHINE LEARNING

Machine learning (ML) is a discipline of artificial intelligence. With the help of ML, computers are trained to process data and previous experiences independently and adapt to them accordingly. An example of ML are artificial neural networks (ANNs), which are modelled on the human brain. They consist of artificial neurons, which are arranged in layers and connected to one another. These neurons process inputs, perform calculations and provide outputs. Training the network on sample data allows it to detect patterns and relationships so it can perform tasks such as making predictions or recognising patterns, for example. Graph neural networks (GNNs) are a particular kind of ANN. They can also take account of additional information from linked measurements, often referred to as “message passing”. To make their predictions and classifications, GNNs use the structure and relationships (edges) within a graph to understand how the data points (nodes) interact with and influence one another.

Time series improve predictive power 

The researchers based their work on a data set containing information about over 528,000 people, who were treated on the wards (with the exception of intensive care) at the University of Leipzig Medical Centre and Greifswald University Hospital between the years 2014 and 2021. Some of them suffered from sepsis during their time in hospital, while others did not. The researchers used the extensive data from Leipzig to first train their GNNs to retrospectively predict the probability of sepsis. Applying the GNNs to the data set from Leipzig, and another from Greifswald, showed similar results to those achieved using conventional ML algorithms and other types of neural network. However, applying GNNs to time series data, which incorporate test results for the same patients, gave significantly better results. Unlike previously, when similar test results from various patients were linked to one another, the nodes now represent full blood counts from just one person at different points in time. The researchers used the AUROC (Area Under the Receiver Operating Characteristic) curve to measure the reliability of the predictions. The closer the value is to 1, the better the model is performing. Heyer and his team were able to improve the AUROC values from under 0.88 to over 0.95 by integrating time series. “The fact that time series have such a big impact on the reliability of the predictions underlines how important data collected regularly from patients is in providing basic information for predictive models in the field of health research,” sums up Heyer.

Not a black box: medicine needs transparency

At the moment, how well GNNs can really be integrated into everyday medicine is still largely untested. Another challenge is this: “GNNs and other complex machine learning algorithms (e.g. XGBoost) are often treated as black-boxes limiting their interpretability and transparency which is essential for medical applications,” write Heyer and his fellow researchers in their paper. It was therefore important for the authors to understand exactly what the algorithms were basing their predictions on. “That's why we didn't just leave it as a black box. We tried to find out what the algorithms had learnt from the patient data. We wanted to know what factors were behind their predictions,” says Heyer. When it comes to sepsis, the analysis shows that, besides the varying number of white blood cells, the key factor is their interactions with other types of blood cell.

Advanced ML tools could potentially save countless lives – not only from sepsis, but from other diseases too. In future, GNN analyses of blood count data could help to diagnose thrombosis or leukaemia, for example.

Article Recommendation

Walke, D., Steinbach, D., Gibb, S., Kaiser, T., Saake, G., Ahrens, P., Broneske, D., Heyer, R.
(2023). Edges are all you need: Potential of Medical Time Series Analysis with Graph Neural Networks. PREPRINT (Version 1) available at Research Square: https://doi.org/10.21203/rs.3.rs-3573549/v1.

 

1 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8210984/
2 https://www.england.nhs.uk/blog/beating-sepsis-with-early-detection-and-prompt-treatment/


(Cheyenne Peters/ Ute Eberle)

Share

Further articles

21st October 2024

Between Progress and Footprint: Artificial Intelligence in Healthcare

The potential applications of artificial intelligence in healthcare are constantly increasing. However, with ever better and faster models, the question of sustainability is also coming to the fore. A group of international researchers, including Dr Jianxu Chen from ISAS's AMBIOM group, has been looking into the increased consumption of resources and possible solutions to the impending sustainability problem.

Dr. Jianxu Chen (links)  Prof. Dr. Yiyu Shi im Foyer des ISAS.
15th October 2024

Science Meets Art: Immersion in the Secret World of the Immune System

Diving into the smallest structures of the heart via microscope images? This was possible at the first Dortmund Science Night. "The secret world of the immune system", the immersive 3D experience by storyLab kiU at Fachhochschule Dortmund University of Applied Sciences and Arts and ISAS, gave visitors the chance to experience immune cells up close after a heart attack.

Das Foto zeigt (v.l.n.r.) Dr. Malte Roeßing, Dr. Ali Ata Tuz, Lara Janz, Flora Weber, Sara Regein und Cheyenne Peters vor dem Stand des ISAS bei der Science Night 2024.
20th September 2024

"Book a Scientist": Register Now for a Virtual Chat with Researchers

The format "Book a Scientist" offers those interested in science the opportunity to book individual appointments with researchers within the Leibniz Association. Among them are two ISAS researchers. Their topics are "overzealous" immune cells after a heart attack and animal experiments and their alternatives.

Book a Scientist.
10th September 2024

Fascinating Insights: Virtual World in the Classroom

What actually is reality? How do we perceive our environment? And how can augmented reality support researchers in their work? Along with class 3a of the Don Bosco elementary school in Bochum, two ISAS researchers explored these questions.

Marie steht im Klassenzimmer und trägt eine VR-Brille. In den Händen hält sie jeweils einen Controller.
29th August 2024

Protein interaction paralyses a young patient

Two mutations in the genetic material of a boy lead to the rare neuromuscular disease NEDHFBA. Investigations of the young patient's samples by researchers at Essen University Hospital and ISAS shed light on the previously unrecognized mechanisms behind the symptoms: Proteins influence the development of the rare disease.

Bild von Dr. Andreas Hentschel, Mitarbeiter in der Arbeitsgruppe Translationale Analytik.
16th August 2024

"We want to improve the fundamental understanding of disease models"

Why is it exciting to decode the lipid signature of individual cells? And which cell type is particularly suitable for allowing this process? Prof Dr Sven Heiles gives the answers in an interview.

Portrait von Jun.-Prof. Dr. Sven Heiles.
8th August 2024

"Fatty" signatures: How lipid patterns can be used as a diagnostic tool

Microglial cells, the immune cells of the central nervous system, are known for their different appearances. In order to demonstrate this heterogeneity for the first time at the lipid and thus metabolic level for individual cells, a team of researchers has set itself a special analysis goal: Decoding the unique and "fatty" signatures of individual cells.

Das Foto zeigt Jun.-Prof. Dr.-Ing. Sven Heiles und Doktorandin Chiahsin Chi am MALDI-Massenspektrometer.
2nd August 2024

Thrombocyte Proteome: Tracking down Life-threatening Events in the Bloodstream

Thrombocytes (blood platelets) can link up at lightning speed and block a vessel as a thrombus. Possible consequences include heart attacks or strokes. Scientists at ISAS are therefore looking for molecular markers for platelet activation. In future, this knowledge could help doctors recognise a thrombosis before it occurs and react accordingly.

Dr. Fiorella Solari.
5th July 2024

 Thyroid hormones: Timers for the heart?

The Collaborative Research Centre "Local Control of Thyroid Hormone Action - LOCOTACT" investigates the local control of the effects of thyroid hormones in organs such as the heart or liver. In order to find new therapeutic approaches for cardiovascular diseases, for example, researchers at ISAS want to know: How does the body control the transport, metabolism and mechanism of action of thyroid hormones in the heart?

Videograf André Zelck (rechts im Bild) begleitet die Doktorandin Stefanie Dörr (Kardiovaskuläre Pharmakologie) bei ihrer Arbeit am Echokardiographiegerät.