Subscribe to the Teradata Blog

Get the latest industry news, technology trends, and data science insights each week.



I consent that Teradata Corporation, as provider of this website, may occasionally send me Teradata Marketing Communications emails with information regarding products, data analytics, and event and webinar invitations. I understand that I may unsubscribe at any time by following the unsubscribe link at the bottom of any email I receive.

Your privacy is important. Your personal information will be collected, stored, and processed in accordance with the Teradata Global Privacy Policy.

Advanced Analytics for Coronavirus – Trends, Patterns, Predictions

Advanced Analytics for Coronavirus – Trends, Patterns, Predictions

Introduction

What happens when a virus-based disease spreads stealthily across the world, at an alarming spread rate, without sight of it ending anytime soon?
 
Relatively unheard of just a couple of months ago, the new coronavirus disease (COVID-19) that emerged from Wuhan has already triggered a global health emergency. This infection outbreak has since spread to more than 150 countries, affecting more than 170,000 cases, and resulting to more than 6,500 deaths. 
 
Globally, the rate of infection has grown at such an alarming rate that many countries have already declared state of emergency, with those hardest hit declaring lockdowns to prevent further spread. Current statistics show a spread rate double that of seasonal flu, and fatality rate more than 10 times of that for seasonal flu.
 

Challenges

The increasing global infection figures have shown that COVID-19 is spreading stealthily across the world. Asymptomatic carriers, scarcity of test kits and COVID-19 as a new pathogen complicates efforts in trying to contain the transmission. The increased infection cases will inevitably overwhelm the health system as doctors and nurses are overworked to keep up with new cases, as well as the number of hospital facilities especially intensive care units (ICU). Scientists have started work on finding treatments and vaccines to address the outbreak, however drug discovery is a lengthy process, which is likely to take at least a year.

The challenges posed by the coronavirus is wide-ranging: from early detection, containment and isolation, to mitigation and segmentation, and ultimately to the end of the disease. 
 

Data Science in Healthcare

We are dealing here with a new coronavirus, far more complex than its predecessors, and with a spread rate (called R0 pronounced as R-naught) almost double that of normal flu. Based on current statistics, SARS-CoV-2 virus which, causes the disease COVID-19, has an R0 of 2.2 which means a single infected person will infect 2.2 others on average. With the advent of data science and advanced analytics, we are also now able to access tons of data, collected from the various facets and phases of the virus and the disease. You can see this data in various forms and shapes in charts, graphs and tables, to help visualize the facts in simpler and more practical ways.
 
From detecting coronavirus through computerized tomography scans, to more intelligent contact tracing, to analytics-based cluster predictions, to developing powerful drugs that can tackle this coronavirus - big data and artificial intelligence (AI) play a broad role in our fight against COVID-19. Speed is of essence in restraining the outbreak. Contact tracing is imperative to ensure that anyone who has close contact with the infected patients are traced, isolated and monitored for symptoms. 
 
Spatial-temporal Analysis
 
Advancements in Geographic Information Systems (GIS) tools allow the spread of COVID-19 to be modelled through various spatial and temporal scales. Structured mathematical and statistical techniques, such as Bayesian inference methods facilitate the analysis of infectious disease incidence through time and space. A spatial-temporal mapping tool can allow user to visualize and scroll by infectious disease incidences through timeline.
 
Path Analysis 
 
Sankey diagram helps to visualize the flow of patients between various phases of patient journeys. The multidimensional visualizations help to understand the flow and category relationships of patients to specific outcomes. An example is illustrated in a Sankey diagram by WHO, showing patterns of COVID-19 disease progressions in China. The thickness of arrows is indicative of the proportion of patients who died or recovered.
 
Graph Analysis

The historical contagion is useful in understanding and predicting the spread among a local population during this COVID-19 outbreak. Identifying ‘super-spreaders’ or super-spreading events among the infected patients can be conveniently studied and visualized from a network perspective. The degree of interconnectedness between cases and infected clusters can be visualized through a network map. A founder of a Singapore’s coding academy had built a network map of COVID-19 cases organized by known locally transmitted clusters within Singapore. Each node represents an infected person and the edge represent the transmission of the contagion through a known contact.
Coronavirus predictor, coronavirus predictions, prediction of coronavirus
Figure 1: Anti-clockwise from top: Spatial-Temporal Analysis, Path Analysis and Graph Analysis.

Enter Telco Analytics

What single device is considered the most personal accessory that caters to people of (almost) all ages and demographies, from the youngest to the most elderly, all genders, societal levels, and ethnicities? And with it, carries nearly all personal, behavioural, mobility and social records of information. Enter the personal electronic devices (PEDs), ranging from smartphones, tablets, smartwatch, fit-trackers and other electronic accessories that people bring around with them.

In performing contact tracing for example, health agencies normally rely on combinations of personal declarations (of affected or suspected person), travel records, CCTVs and police discovery through investigations. These sources of information are not always easy or available to gather and synchronize. This is where PEDs come in handy. Taking mobile phones alone for example, it is estimated that every single person across the world has at least one phone, on average. There are also many countries where mobile phone penetration rate is more than 150%, which means a person has more than one device.

And what does the smartphone carry with you? A typical smartphone will have the owner’s personal details way beyond name, location both present and historical, mobility patterns (smartphone can intelligently know what is home and work location), browsing patterns of which sites or portals were visited, buying patterns, close circle of friends through record of most frequent contacts, social media accounts and interactions, and sentiments through posts and tweets. For a lot of countries, General Data Protection Regulation (GDPR) and other privacy laws prevent telcos from “sniffing” this goldmine of personal and social data, but the capability is always there. Imagine reducing contact tracing and cluster predictions from days or weeks, into a matter of minutes, when telcos are allowed to sniff all available information when national concern and state of health emergency requires it.

The role of big data and artificial intelligence becomes even more relevant in trying to contain the spread of the disease. More than just geolocation and mobility information, the billions of smartphones globally give both historical collection of data and real-time (or near real-time) snapshots that give better insights for analysis and more accurate predictions. More advanced algorithms and computational models can be developed to utilize all these megasets or supersets of collected information in order to get answers to questions, predict several scenarios under varied conditions, and come up with best recommendations and prescriptions.

Simple practical examples can include, but not limited to:
  • Quicker assessment of probability of exposure in a given area or cluster through cross-matching of smartphone locations of affected and suspected individuals
  • More accurate and detailed time-based historical location information of infected individuals and close circles (potential infections) to establish chains of transmission
  • Complementing CCTV and other video records to identify people in particular infection hotspots
  • Using database of medical prescriptions and purchase of medicines to trace individuals who are in risk of avoiding quarantine
  • Sniffing through social media tweets and posts for potential discovery of unrecorded cases

In Closing

Advanced analytics and AI can significantly accelerate the data processing required to get the insights, answers and recommendations to handle and address the COVID-19 pandemic - from days to minutes, and with better-than-expected accuracy. Telco analytics can take it a step further in providing deeper insights into people trends and patterns, making intelligent predictions about the disease possible. With responsible guidelines in place, and when it is a matter of national emergency, telco analytics, together with big data and AI, can definitely do things faster and more accurately in containing, mitigating, and eventually ending this global problem. Something to consider.
 
Portrait of Albert Nombres

(Author):
Albert Nombres

Albert Nombres - Senior Practice Partner leads Teradata’s telco practice for Asia Pacific and Japan. In his role, Albert works with telco customers and partners to help them leverage their data goldmine for network intelligence and operational efficiency, optimized 5G deployment, telco monetization for enterprise, and differentiated customer experiences. He brings more than 20 years of industry and consultative solution selling experience in the areas of mobile broadband, radio and core networks, transport, analytics, BSS, OSS, 5G, cloud, IoT and geospatial analytics having worked in UK and in Asia. View all posts by Albert Nombres
Portrait of Yam Guan Goh

(Author):
Yam Guan Goh

Yam Guan Goh has more than 15 years of extensive experience in solutioning, consultative selling and service delivery in the IT Healthcare industry. He has worked on multiple projects with healthcare providers to improve operational efficiency and reduce cost through IT transformations. His area of interests lies in leveraging healthcare informatics in solving real-world problems and staying in touch with the latest trends in Healthcare. Yam Guan holds a Double Masters in Industrial Engineering and Supply Chain Management, and is also a Certified Professional In Healthcare Information and Management Systems (CPHIMS).
  View all posts by Yam Guan Goh

Turn your complex data and analytics into answers with Teradata Vantage.

Contact us