Our research in cloud computing and data engineering investigates means to better transport, process, store, and visualize large matrices, and other datasets. We study how (and how well) enterprise software such as Apache Spark can be repurposed for scientific computing applications, as well as develop our own frameworks. This work gives rise to optimization problems relating to resource allocation, container placement, and cloud-edge transport. This research finds application in the group’s core experimental work, as well as projects such as HASTE, where microscopy imaging is a key use case.