Skip to content

Integrative Scalable Computing Laboratory

A research group at the Department of Information Technology, Uppsala Universtity.

  • Home
  • Projects
  • People
  • Publications
  • Teaching
  • Software
  • Recruitment
  • About us
  • Toggle search form

Machine learning-assisted analysis of stochastic biochemical reaction networks

Posted on October 15, 2018October 16, 2020 By Prashant Singh No Comments on Machine learning-assisted analysis of stochastic biochemical reaction networks

Biochemical reaction networks represent complex cellular regulatory mechanisms. These networks are typically analyzed using discrete stochastic simulation models. The models typically involve numerous reactions involving a large number of chemical species, governed by highly uncertain parameters.

Likelihood-free parameter inference

Given existing data pertaining to a biochemical reaction network, one is often interested in inferring the values of the model parameters that likely generated the data. The data itself may come from models simulated in the past, or physical experiments. Approximate Bayesian Computation (ABC) is a proven approach that effectively solves such parameter inference problems by using simulation models as a tool to find the region in the parameter space corresponding to least deviation from given data.

The rejection sampling algorithm forms the basis of the ABC framework. Samples are drawn from a specified prior distribution, and subsequently simulated. The simulated responses are compared to existing data by means of a distance function and appropriate summary statistics. Samples that result in distance function values below a specified tolerance threshold are accepted, and the rest rejected. The sampling algorithm proceeds until the desired number of accepted samples have been obtained. The inferred parameters are then reported as the mean parameter values corresponding to the accepted samples.

Design choices such as selection of distance functions, summary statistics and acquisition function for the inference process have a deep impact on the solution quality. Furthermore, increasing problem complexity often leads to impractically high inference times using rejection sampling.

Our research explores methods to accelerate high-quality parameter inference by leveraging state-of-the-art methods from the fields of computational biology, machine learning, optimization and statistics. Some of our active research topics include investigating intelligent construction of priors, methods for automated large-scale summary statistic selection,  and training fast local and global approximations or surrogate models of computationally expensive simulators.

Model exploration

The exploration of a system described by a non-linear, high-dimensional and stochastic computational model is a fundamental problem in all scientific disciplines relying on modeling and simulation.  In this project we are interested in the scenario where a modeler has no or very limited prior knowledge about what type of qualitative interesting behavior the model can display over the large parameter space. The tools we develop should help the modeler discover those behaviors with a small computational budget, and as little manual work as possible. By utilizing human-in-the-loop machine learning  we are developing a smart parameter sweep workflow. An example is shown in the image below, where a high-dimensional parameter sweep application is augmented with automated feature extraction and clustering, followed by training a model for classification based on user-defined labels (such as interesting or non-interesting realizations). With this model, the smart sweep application will learn to more efficiently explore areas of interestingness in the parameter space.  

ReserachSlider

Post navigation

Previous Post: MSc projects available in multicellular systems biology
Next Post: Highly Scalable Federated Machine Learning

More Related Articles

StochSS: Stochastic Simulation Service ReserachSlider
Highly Scalable Federated Machine Learning ReserachSlider
Scalable simulation of stochastic multicellular systems ReserachSlider
Hierarchical Analysis of Spatial and Temporal Data Data Science
Multiscale simulations of chemical kinetics ReserachSlider

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Data-and simulation-driven life science. Much of our work in eScience and applied ML has applications in life science, and in Systems Biology in particular. We aim to enable data-and simulation-driven scientific discovery.

HASTE - a cloud native framework for intelligent processing of image streams: http://haste.research.it.uu.se/

Follow us on twitter

Andreas HellanderFollow

Andreas Hellander
A_HellanderAndreas Hellander@A_Hellander·
11 May

Are you using StochSS? Please help us gather insights into what is working well and what can be improved by filling in this short survey https://forms.gle/mEqfASuUd3MDWuPS9

@LindaPetzold @briandrawert @mhucka

Reply on Twitter 1524477596930654214Retweet on Twitter 15244775969306542141Like on Twitter 15244775969306542141Twitter 1524477596930654214
A_HellanderAndreas Hellander@A_Hellander·
9 May

Apply to this PhD student position in the eSSENCE and SciLifeLab graduate school in data-intensive science!

This project is the intersection of cybersecurity and big data with main supervisor @sztoor.
https://www.uu.se/en/about-uu/join-us/details/?positionId=501061

Reply on Twitter 1523559501965987841Retweet on Twitter 1523559501965987841Like on Twitter 15235595019659878411Twitter 1523559501965987841
A_HellanderAndreas Hellander@A_Hellander·
28 Apr

PhD position in the eSSENCE/@scilifelab graduate school in data-intensive science with @cnettel: https://uu.se/en/about-uu/join-us/details/?positionId=501716 Apply and be part of a new interdiciplinary research effort @UU_University!

Reply on Twitter 1519564476856557568Retweet on Twitter 15195644768565575686Like on Twitter 151956447685655756810Twitter 1519564476856557568
A_HellanderAndreas Hellander@A_Hellander·
23 Apr

If you are a current user of StochSS please let us know your thoughts by filling out this brief user survey: https://forms.gle/3r836iph8gqFEpZX7

#systemsbiology #stochss @LindaPetzold @briandrawert @prashant_rsingh

Reply on Twitter 1517753867403993089Retweet on Twitter 15177538674039930892Like on Twitter 15177538674039930891Twitter 1517753867403993089
Retweet on TwitterAndreas Hellander Retweeted
AssistSweASSIST Sweden@AssistSwe·
14 Apr

Soon the partners in the ASSIST project will attend a workshop on federated learning arranged by Scaleout. Partners from different countries (Sweden, Belgium, Netherlands, Turkey) will contribute with nodes that train a segmentation network.

Reply on Twitter 1514516928823447553Retweet on Twitter 15145169288234475532Like on Twitter 15145169288234475531Twitter 1514516928823447553
Load More...

Decentralized AI, Federated Learning. One focus area of the group is development of methods and software to address decentralized and privacy-preserving AI. We are core contributors to the FEDn open source framework for scalable federated machine learning:

https://github.com/scaleoutsystems/fedn
Introduction to Federated Learning by Andreas Hellander
Join the discussion on Decentralized AI:

Scaleout Systems is a spin-out from ISCL on a mission to enable decentralized AI and federated learning to production.

https://www.scaleoutsystems.com/

Copyright © 2022 Integrative Scalable Computing Laboratory.

Powered by PressBook Blog WordPress theme