Project Details
Projekt Print View

SFB 1597:  Small Data

Subject Area Medicine
Biology
Computer Science, Systems and Electrical Engineering
Mathematics
Physics
Social and Behavioural Sciences
Term since 2023
Website Homepage
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 499552394
 
The recent progress in artificial intelligence has been facilitated by big data volumes and data-driven modeling approaches. In particular, this includes deep learning techniques, mainly developed in computer science. However, there is a much larger number of applications, particularly in biomedical settings, where data analysis has to be performed with a relatively small number of observations. In data-driven modeling, this can be addressed by transferring additional information. Alternatively, modeling can be based on imposing stronger structural assumptions, e.g., reflecting the input of biomedical experts, in knowledge-driven approaches, frequently developed in mathematics and statistics/systems modeling. Thus, approaches for small data challenges are currently scattered across different data science disciplines. For creating comprehensive solutions that fuse exciting emerging ideas, it is therefore necessary to integrate contributions from computer science, mathematics, and statistics/systems modeling. This methods development also needs input from application domains, such as biomedicine. Correspondingly, we have designed our CRC Initiative SmallData with a strong focus on developing an interdisciplinary methods framework. We focus on the key small data tasks of combining similar datasets and transferring information from additional sources, while taking into account and reducing uncertainty. This is reflected in the three key areas of SmallData, similarity, transfer, and uncertainty. In terms of methods, we focus on combining data-driven and knowledge-driven modeling approaches, e.g., based on neural networks and differential equations. Meta-learning and pre-training are two further components of our framework for transferring information on model parameters or tuning parameters between datasets. We have furthermore designed a Fusion Hub, which addresses overarching topics that link the three areas, by fusing methods from different disciplines. This includes approaches based on the concepts of attention and few-shot learning, which have recently been advanced in computer science. Development of theory will go hand in hand with tailoring methods to prototypical biomedical applications from forensic medicine, gene therapy, nephrology, psychiatry, radiology, and rare diseases. In addition, our integrated research training group will foster a shared language across disciplines. We have furthermore designed the SmallData Compendium as a web platform that will reflect our interdisciplinary methods framework for exchanging concepts and methods with the international community, and thus shaping the small data field.
DFG Programme Collaborative Research Centres

Current projects

Applicant Institution Albert-Ludwigs-Universität Freiburg
 
 

Additional Information

Textvergrößerung und Kontrastanpassung