147
Views
0
CrossRef citations to date
0
Altmetric
Meeting Report

The First European Spallation Source Data Management and Software Centre Summer School

ORCID Icon, , ORCID Icon, ORCID Icon, ORCID Icon, , ORCID Icon, , ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon show all

The role that data plays in neutron scattering is changing, more impetus is being placed on the use of complex data reduction and analysis [Citation1], as well as the importance of FAIR and open data [Citation2]. European Spallation Source (ESS) has recognised this and created the Data Management and Software Centre (DMSC), based in Copenhagen and linked to the ESS Lund site for fast data transfer. This makes ESS unique among neutron scattering facilities, coming online with data as a primary partner in user experiments, alongside the ­instrumentation itself.

The data from experiments at ESS will be processed by a neutron data pipeline including acquisition, reduction, analysis and cataloguing. Therefore, the aim of the DMSC Summer School was to provide an in-depth introduction to these. At the summer school the data acquisition step was replaced by simulations such that the students were introduced to the data pipeline illustrated in .

Modern computational practices, in particular the use of the Python programming language and Jupyter Notebooks [Citation3], are required to get the most from neutron scattering data. Therefore, the first day of the summer school focused on introducing the Python programming language and the Jupyter Notebook interface. This meant that all the students had a firm grounding in Python and were ready to work confidently on the material in later days. The first day ended with a presentation introducing FAIR data practices and discussing the role of FAIR in the ESS facility; these concepts would be consolidated throughout the school. Moreover, students were introduced to what encompasses a good proposal for beam time.

The second day was spent in Lund, first to visit the MAX IV synchrotron source, which is a close neighbour of ESS. After a tour of MAX IV, the students were introduced to the neutron instrumentation software McStas [Citation4–6]. At this point, the students were asked to select one of two paths for the summer school: small-angle neutron scattering (SANS) or backscattering quasi-elastic neutron scattering (QENS). The data that they simulated using McStas would be carried with them throughout the summer school, through reduction, analysis, and storage. In the end, the 14 students split approximately 50:50 between the two neutron scattering techniques. While at the ESS site, the students were also given a tour of the ESS facilities and participated in a poster session, where they were able to discuss their shared interests around neutron scattering data.

The following day, the students returned to the Copenhagen to continue working with their simulated data. The data that they had simulated was equivalent to raw detector data, therefore it was necessary to transform this into something that can be analysed with traditional methods. For this, the Scipp package [Citation7] was used, which enables informative visualisations and most importantly the ability to efficiently manipulate and histogram data. Using Scipp, the students were able to reduce their McStas data to a one-dimensional dataset of intensity as a function of either wavevector (SANS) or energy transfer (QENS). This included concepts of masking data, including normalisations, and processing multiple neutron pulses together.

With the data reduced, the following day, the students made use of the EasyScience framework [Citation8], which enables the analysis of experimental data using a model-­dependent approach. In this context, the SANS data were interpreted with a spherical model, while the QENS analysis made use of a simple Lorentzian function convolved with a Gaussian resolution function. Using EasyScience, the students were able to define functional forms for their models and optimise the parameters of interest. Beyond standard fitting, the students were introduced to Bayesians methods; Markov chain Monte Carlo and Nested Sampling (using the emcee [Citation9] and dynesty [Citation10] packages) to investigate the correlations in the model parameters and compare different analytical models, respectively.

The week ended with continuing the discussion introduced on the first day around the importance of data storage, cataloguing, and FAIR data, in this case using the SciCat [Citation11] metadata catalogue in use at ESS and multiple other photon and neutron sources. In this session, the students were asked to consider what constituted a “dataset” and tasked with using the Scitacean package [Citation12] to upload their compiled datasets to the ESS SciCat catalogue. This allowed students to reconsider, from their conceptions at the start of the week, what data means in terms of neutron scattering, following a week of working closely with data in various different forms.

Alongside the teaching, there was a range of social activities that the students were invited to participate in—with the aim to build a cohort of data-focused neutron scatterers. This started on the Sunday before the school began, with a visit to a board game café in central Copenhagen for the “Welcome Reception”. On the Wednesday evening, the students (and many of the teachers) participated in a programming competition, a game that is regularly played between staff of the DMSC, where they were tasked with writing a Python program, an “AI”, to compete in a Battle Royale style tournament, with one “AI” named as the winner. Finally, there was a summer school dinner on the Thursday evening, which took place within the Tivoli Gardens in central Copenhagen.

All in all, the first DMSC Summer School was successful in its aim to introduce members of the neutron scattering community to the growing importance of data in the field. The feedback from the participants has been extremely positive and given that the event itself was 300% oversubscribed, there is ambition to run it again in the future. Hopefully, we will be able to welcome you to Copenhagen in the future for another DMSC Summer School.

Figure 1. The data pipeline that students were introduced to at the summer school. The associated software is illustrated above the pipeline.

Figure 1. The data pipeline that students were introduced to at the summer school. The associated software is illustrated above the pipeline.

Figure 2. Photographs from the day at the ESS site in Lund, (left) the school students and teachers and (right) the poster session in the ESS atrium.

Figure 2. Photographs from the day at the ESS site in Lund, (left) the school students and teachers and (right) the poster session in the ESS atrium.

Acknowledgments

We would like to acknowledge the Carlsberg Foundation, the Danish Scattering Association (DanScatt), and the European Spallation Source for generous sponsorship of the event. We are also grateful to the Niels Bohr Institute of University of Copenhagen for hosting us.

References