Unlocking Data Streams

Created: 2021-03-17 12:36
Institution: Isaac Newton Institute for Mathematical Sciences
Description: Tuesday 16th March 2021
Background

Sequential streams of information are pervasive: things happen and are recorded. These streams can be regular, with all channels updating at once, as in sound. Alternatively, channels can update one at a time, or not at all, as events occur. An example is an electronic health record, which might capture a hospital admission, a blood test, or a continuing ECG measurement.

Managing this heterogeneous stream of data is a challenge. Often the order of events carries important information that links the behaviour of the channels together. In such cases, smoothing the data channel by channel, for example by binning, is damaging. A powerful unifying approach is to regard the data as the input and let it control a dynamical system. Different behaviours can then be distinguished by the different responses of the system. One can modify the system so that it is unaffected by aspects of the stream that are of little interest.
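As a minimal sketch of the idea of letting data control a dynamical system, the Python below drives a small linear system dY = A_1 Y dX^1 + A_2 Y dX^2 with a two-channel stream, using one Euler step per recorded update. The function name `drive`, the matrices `A1` and `A2`, and the two toy streams are illustrative assumptions, not part of the workshop material. The two streams contain the same updates in a different order, and the system responds differently to each.

```python
def drive(path, vector_fields, y0):
    """Euler scheme for dY = sum_k A_k Y dX^k along a piecewise-linear path.

    path: list of points, one coordinate per channel (hypothetical toy data).
    vector_fields: one square matrix A_k per channel.
    y0: initial state of the system.
    """
    y = list(y0)
    n = len(y)
    for prev, curr in zip(path, path[1:]):
        new_y = list(y)
        for k, a in enumerate(vector_fields):
            dx = curr[k] - prev[k]  # increment of channel k on this step
            for i in range(n):
                new_y[i] += dx * sum(a[i][j] * y[j] for j in range(n))
        y = new_y
    return y


# Two non-commuting matrices, chosen only for illustration.
A1 = [[0.0, 1.0], [0.0, 0.0]]
A2 = [[0.0, 0.0], [1.0, 0.0]]

# Same events, different order: channel 1 then channel 2, and the reverse.
first_then_second = [[0, 0], [1, 0], [1, 1]]
second_then_first = [[0, 0], [0, 1], [1, 1]]

response_a = drive(first_then_second, [A1, A2], [1.0, 0.0])
response_b = drive(second_then_first, [A1, A2], [1.0, 0.0])
print(response_a)  # [1.0, 1.0]
print(response_b)  # [2.0, 1.0]
```

Both streams start and end at the same point, so a channel-by-channel summary would not separate them, whereas the response of the driven system does.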

With five years of funding from UKRI, the DataSıg Programme set out to address this key challenge of data science: to better understand multimodal data streams. It sought to do this by developing mathematical descriptions of these streams using ‘rough path’ (RP) theory. RP theory allows direct capture of the order in which events happen. In many cases it can better model the effects of these data streams through a top-down signature description, which summarises the data effectively without exposing the individual data points to direct analysis.
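To make the signature description concrete, here is a minimal sketch in plain Python that computes the first two levels of the signature of a piecewise-linear path, combining segments with Chen's identity. The function name `signature_level2` and the toy streams are illustrative assumptions and are not taken from the DataSıg software. Level 1 records the total change in each channel, while level 2 is sensitive to the order in which the channels changed.

```python
def signature_level2(path):
    """Return (level1, level2) of the signature of a piecewise-linear path.

    level1[i]    = total increment of channel i.
    level2[i][j] = iterated integral of dX^i followed by dX^j.
    """
    d = len(path[0])
    level1 = [0.0] * d
    level2 = [[0.0] * d for _ in range(d)]
    for prev, curr in zip(path, path[1:]):
        step = [c - p for p, c in zip(prev, curr)]
        # Chen's identity: append one straight segment to the path so far.
        for i in range(d):
            for j in range(d):
                level2[i][j] += level1[i] * step[j] + 0.5 * step[i] * step[j]
        for i in range(d):
            level1[i] += step[i]
    return level1, level2


# Hypothetical two-channel streams: the same updates in a different order.
a_then_b = [[0, 0], [1, 0], [1, 1]]
b_then_a = [[0, 0], [0, 1], [1, 1]]

s1_ab, s2_ab = signature_level2(a_then_b)
s1_ba, s2_ba = signature_level2(b_then_a)

print(s1_ab == s1_ba)              # True: same total change per channel
print(s2_ab[0][1], s2_ba[0][1])    # 1.0 0.0: level 2 sees the order
```

The level-1 terms agree for the two streams, so only the level-2 cross terms reveal which channel moved first.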

The one-day workshop highlighted a number of exciting research activities and outlined some of the successful collaborations within the DataSıg Programme. Collaborations included:

Security and defence – action detection using signatures and computer vision to classify physical human actions from real-time data

Human-computer interfaces – such as translating handwriting on mobile devices

Astronomy – rough path models to aid the development of measurement instruments and processing techniques for astronomical telescopes

Mental health – development of a tool that analyses self-reported data (such as speech and mood information) in an automated way. This enables individuals to be positioned on spectrums and potentially gives better feedback to clinicians and clients.

Human disease – identifying the evolution of cancer cell lines and an early warning system for sepsis detection



Aims and Objectives

Our experience of the world is multimodal, and understanding multimodal data streams (complex sequences of data from different sources) is a key challenge for rough path theory and, more generally, for data science. This workshop therefore aimed to increase awareness of the research and applications being undertaken by the DataSıg team. The Programme sought to further develop signature-based mathematical tools for dealing with complex streamed data, and to connect with partners who have both the challenges and the capability to benefit from the methodology and achieve significant outcomes with it. The meeting was of interest to end-users from multiple settings, including industry, business, the public sector and clinical practice, who were interested in collaborating and who could:

Benefit from the development of useful open-source software tools that could be used in various machine learning environments

Have needs around interacting with complex, evolving real-world data, and want to easily tackle questions that involve a variety of different data.

Talks at this workshop highlighted state-of-the-art research and success stories. Presentations featured various examples of rough paths in action, understanding clouds (collections) of paths and their applications, log signatures and controlled differential equations, as well as applications and challenges from end-users' perspectives. The day was of relevance to multiple application areas and sectors, including engineering, agriculture, security, communications, human health and the social sciences.

A programme of this past event is now available. Please follow the link.
 

Media items

This collection contains 9 media items.

Note: some media items are not shown, because they are only visible to Raven users. To see these media items, you must log in.

A Data-Driven Market Simulator for Small Data Environments

   12 views

Horvath, B
Wiese, M
Tuesday, March 16, 2021 - 15:10 to 15:35

Collection: Unlocking Data Streams

Institution: Isaac Newton Institute for Mathematical Sciences

Created: Wed 17 Mar 2021


Decision Making with Lung Cancer Trees of Genetic Mutations

   5 views

Huebner, A
Salvi, C
Tuesday, March 16, 2021 - 12:10 to 12:35

Collection: Unlocking Data Streams

Institution: Isaac Newton Institute for Mathematical Sciences

Created: Wed 17 Mar 2021


Inference from Evolving Populations: Agriculture

   7 views

Lemercier, M
Tuesday, March 16, 2021 - 11:00 to 11:25

Collection: Unlocking Data Streams

Institution: Isaac Newton Institute for Mathematical Sciences

Created: Wed 17 Mar 2021


Inference from trees: Cybersecurity

   11 views

Thomas
Tuesday, March 16, 2021 - 11:45 to 12:10

Collection: Unlocking Data Streams

Institution: Isaac Newton Institute for Mathematical Sciences

Created: Wed 17 Mar 2021


Log Signatures and Neural Controlled Differential Equations

   5 views

Morrill, J
Tuesday, March 16, 2021 - 14:00 to 14:25

Collection: Unlocking Data Streams

Institution: Isaac Newton Institute for Mathematical Sciences

Created: Wed 17 Mar 2021


Neural Controlled Differential Equations

   8 views

Kidger, P
Tuesday, March 16, 2021 - 13:35 to 14:00

Collection: Unlocking Data Streams

Institution: Isaac Newton Institute for Mathematical Sciences

Created: Wed 17 Mar 2021


Path Signature Example Notebooks

   8 views

Foster, P
Tuesday, March 16, 2021 - 14:25 to 14:50

Collection: Unlocking Data Streams

Institution: Isaac Newton Institute for Mathematical Sciences

Created: Wed 17 Mar 2021