Skip to article frontmatterSkip to article content

Pan-Plasma: Plasma physics needs an open scientific python ecosystem following Pangeo

Pan-Plasma: Plasma physics needs an open scientific python ecosystem following Pangeo

Tom Nicholas, 29/06/2021

TLDR: The MCF and plasma physics community should copy the Pangeo project and its (xarray + dask-based) software ecosystem, because geosciences have already solved many of the software problems we still struggle with.

Problems to solve

My experience of software practices in fusion research has been that there are some common and widespread problems. In particular:

Geosciences have already solved these!

It turns out that another field of science has already found a good solution to many of these problems: the Pangeo project for Big Data Geoscience.

Whilst most if not all fields of science could benefit from efforts like Pangeo, the crossover is especially clear for fusion research, as geoscientists and plasma physicists have extremely similar problems computationally. Both fields deal primarily with datasets which are:

Pangeo is both a set of packages and a community, which aims to solve the problems I mentioned earlier.

How would this fit in with other fusion software efforts?

Whilst the majority of fusion researchers are using scattered codebases, there are at least three significant efforts to improve the software experience for plasma physicists. Each of these would benefit from a pangeo-like ecosystem.

Specific needs and vision

Certain packages and specifications would be particularly useful within a plasma ecosystem. Each of these suggestions either currently does not exist, or exists in multiple places unneccesarily, without a standardized interface. Several of these suggestions have clear analogies to existing geoscience libraries used in Pangeo.

Would immediately unlock powerful existing features

Whilst the full creation of a complete ecosystem might take a while, refactoring to use some of pangeo’s most popular packages would bring immediate benefits for many fusion researchers.

Conclusion

Successful software efforts in the geosciences have provided us plasma physicists with a blueprint for improving our software, workflows, user experience, and science. Realising these benefits in fusion research requires an explicit focus on modularity, re-using powerful domain-agnostic tools, and planning for big data challenges early.