Written evidence from The Royal Society (DTA 49)
Public Administration and Constitutional Affairs Committee
Data Transparency and Accountability: Covid 19 inquiry
1) The Royal Society is the national academy of science for the UK. Its Fellows include many of the world’s most distinguished scientists working across a broad range of disciplines in academia, industry, charities and the public sector. The Society draws on the expertise of the Fellowship to provide independent and authoritative advice to UK, European and international decision makers.
2) Expert Fellows and researchers funded by the Society are working alongside researchers around the world to tackle COVID-19. The Royal Society has convened two groups using data science methods to respond to the pandemic. Data Evaluation and Learning for Viral Epidemics (DELVE) is a multi-disciplinary group, supporting a data-driven approach to learning from the different approaches countries are taking to managing the pandemic, analysis to complement the evidence base informing the UK’s strategic response. Rapid Assistance in Modelling the Pandemic (RAMP) is an initiative to bring modelling expertise from a diverse range of disciplines to support the pandemic modelling community already working on Coronavirus (COVID-19). RAMP is designed to; provide support for existing research groups; create new models or insights that can be used to inform the work of the Government’s scientific advisors, through data science-based approaches; apply knowledge from related epidemiology domains; and triage incoming literature to ensure effective information flows. In addition, the Royal Society has established an internal group, Science for Emergency Tasking – Covid (SET-C) to rapidly analyse and synthesise evidence relevant to the pandemic response.
3) This response draws on experience and insight from across these three initiatives. It also draws on the Royal Society’s body of policy advice relating to data and digital technologies, which has highlighted the strategic and societal importance of well-governed access to data.
Did the Government have good enough data to make decisions in response to Covid-19, and how quickly was the Government able to gather new data?
4) There has been significant recent investment in surveillance initiatives in PHE (and devolved nation counterparts), the Office for National Statistics (ONS), Health Data Research UK (HDRUK) and the Joint Biosecurity Centre (JBC). However, a better coordinated and responsive surveillance system is required both to understand the nature of the pandemic and to respond in real time. Comprehensive, accessible national data on the proportion of people who have been infected with COVID in different groups sampled is currently limited. Better data is essential to understanding risk, depending on sex, ethnicity, age, occupation, socio-economic factors and comorbidities. Understanding these risks is vital to inform an effective response to protect those most vulnerable and drive down transmission. For example, DELVE’s work on nosocomial infections (infections acquired in hospital and health-care settings) highlighted important gaps in data: “ although surveillance systems and large-scale hospital-based studies have recently been set up, there remain gaps in availability of surveillance data on hospital-acquired infections, particularly of healthcare workers (including agency staff) and in nursing homes; important questions remain unanswered, including about the impact on Black, Asian and minority ethnic (BAME) health and social care workers.”
5) Disinvestment in both central and local public health departments mean that current systems are now fragmented. A high-quality modern system, with first-class and appropriate expertise, infrastructure and resources is needed for the public health system and NHS to be able to collect and use data to protect the public. In the immediate future, clear, mandated leadership is needed within the Department for Health and Social Care to enable the collection and connection of data from across the health system, building on important work carried out by Health Data Research UK and the Office for National Statistics (ONS) during the pandemic.
Was data for decision making sufficiently joined up across Departments?
6) There is a clear need to connect data across the different parts of the health system. Connected COVID-19 data systems across community, care institutions and the NHS are essential to understand where and how transmission is occurring and, for example, to identify the risk of importing and exporting cases across the health and social care system. Local and national data systems need to work in synergy. This is critical to local public health teams, and local authorities and hospitals, in the rapid identification, investigation and management of outbreaks.
7) Health data also need to be connectable to other forms of socio-economic data to better understand what is happening at the individual and population level in a way that protects personal or sensitive information to ensure individual’s privacy. From the point of view of the research community responding to the crisis, a critical weakness in the system was access to important social and behavioural datasets. These are the kinds of data routinely used by major corporations for marketing and business development but inaccessible for scientific research and for decision-making in relation to public policy. Examples include retail sales and loyalty cards, financial transactions, mobile telephone records, smart tickets for transport. These data have the ability to richly inform patterns of mobility, interaction and human contact (hence the likelihood of transmission in business as usual and various lockdown/ pandemic stages) but they were not available to access and exploit at suitable levels of geographical and temporal resolution.
8) There needs to be much more individual-based data available to scientists, public health professionals and political decision-makers regarding the way the density of populations in venues ranging from schools and supermarkets to different outlets in the hospitality sector, showing how people move. Hence, better connecting health and mobility data can have a major impact. There may have been such connections within government, but from the point of view of external researchers collecting and connecting such data was a challenge.
Was relevant data disseminated to key decision makers in: Central and local Government; other public services (like schools); businesses; and interested members of the public?
9) From the point of view of researchers in universities and industry, aiming to help support the COVID-19 response, there was a significant challenge in access data. This was especially the case for researchers with appropriate methodological skills who do not primarily work with health data. The Control of Patient Information (COPI) notice has been helpful, but access to patient data has been difficult to achieve because of the terms imposed by the Data Controllers. Accessing sensitive data has, of course, to be closely managed, following guidelines such as the Five Safes to protect patient privacy. It is however the case that the Trusted Research Environments have found it hard to meet the demand for their services, adding delays to the process. Access can be costly with approval processes for even anonymised data taking up to a year, according to a blog article published by the British Medical Journal which highlighted barriers to data access. Similar calls were made in the Proceedings of the National Academy of Sciences on the urgent need for international social and demographic data. Researchers working with the Royal Society and British Academy have struggled to get data disaggregated by ethnicity, region, age and sex – which is essential to understanding inequalities in excess death rates during the COVID pandemic.
10) European Union nations, which are also subject to GDPR, have had public, unrestricted access to crucial Covid-19 data.  At the click of a button, a UK scientist can see the daily flow of people from origin to destination between more than 3000 districts in Spain – a country that complies with the same data protection rules as the UK – but this data is not readily available for the UK. Valuable UK-based research was carried out over these datasets in the absence of UK data and the opportunity to build valuable knowledge was missed as a result of this absence of data.
Were key decisions (such as the “lock downs”) underpinned by good data and was data-led decision making timely, clear and transparently presented to the public?
11) DELVE work on Economic Aspects of the COVID-19 Crisis in the UK highlighted the importance of bringing together epidemiological and economic data to better understand the impact of non-pharmaceutical interventions.  As argued in this report:
12) The same DELVE report recommended that “UK government and supporting bodies continue pursuing one of the most important and urgent goals of obtaining more and more representative and fine-grained transactions data from UK financial institutions for the purpose of monitoring the impact of COVID-19 and associated policy interventions. Such data are valuable assets in the current situation, and the UK should be the vanguard in efforts to harness its power for effective and timely policymaking.”
13) The granularity and nature of data at a local level is also an important concern. The current lockdowns are based on crude data with respect to localities, as one of the key issues is movement of people between places and those movement patterns need to be taken account of. Consideration of the density of populations and how these vary over periods of hours and weeks can lead to a better understanding of the ‘temporal’ aspects of transmission: in other words, do infections spread faster during the evenings when people are out at pubs and restaurants; in work where there is sustained activity but social distancing? Temporal restrictions could then become a greater tool in the armoury of social distancing, building on current curfews. Much more could be done with better data and better understanding.
14) Both data, and modelling and simulation using that data, are important for decision making. Modelling and simulation capabilities demonstrated by the Royal Society RAMP initiative in partnership with the Alan Turing Institute shows the potential to inform lockdown decisions at a local scale, to project the evolution of cases through time, and most importantly to evaluate the impact of alternative lockdown scenarios on future R-numbers, incidence of cases and the disease burden.
Was data shared across the devolved administrations to enable mutually beneficial decision making?
15) We do not have a detailed overview of the extent of data sharing and mutual decision making across the devolved administrations, however it is important to note that this is an area that may benefit from greater coordination – for example for a long period, the devolved administrations had different criteria for what constitutes death from COVID-19, creating challenges in extracting a consistent measure.
Is there sufficient understanding of statistics by the public to understand the detailed information that is being regularly published during the pandemic? Equally, is there enough understanding among journalists and parliamentarians to enable reasonable presentation and interpretation of data for the public; and informed questions to be asked of the Government? What could be done to improve understanding and who could take responsibility for this?
16) The current crisis has tested public in trust in science internationally, including in the UK. Science and scientists have been under the spotlight and at the heart of political decision-taking like never before.
17) We have learned a lot amazingly fast. This is a new virus about which we knew nothing just a few short months ago. However, there is a lot that we still do not know.
18) At the frontiers of science, there is always uncertainty, and to pretend otherwise would be foolish. What science does is to try to gather evidence to reduce the uncertainty, but this happens only gradually as data are gathered and hypotheses tested and discarded until some idea of the truth emerges. But even those “truths” can fall by the wayside in the face of new and contradictory evidence. The entire process is based on honesty, openness and transparency, in which the evidence is published for all to see and argue about. This can, however, be a very confusing process for those observing from the outside, who may be unfamiliar with the process of science and the uncertainties that it operates within and therefore may be left unsure who, or what, to believe. Research suggests that there is a correlation between poor numerical literacy and susceptibility to mis-information about COVID-19. It is important to be open and honest with the public about what we know and the ongoing uncertainties.
19) It is right for government to take an evidence-based approach. However when talking about this, government ministers need to be clear that they are making the best decisions now based on the available evidence, but are also prepared to change tack later when further evidence comes to light. They must also make clear that science advice is only one of the things under consideration when they are making hard choices about how to respond to the pandemic. Also under consideration are economics, ways of implementation, broad consequences to society, and the need to take the public with them. This clarity, that it is not simply a case of ‘following the science’ is important for parliamentarians, the media and the public who wish to scrutinise government decisions. Alongside access to the scientific evidence, it will enable them to direct their questions more effectively.
20) During the pandemic to date, we have observed a rapid increase in the familiarity of the UK government, parliamentarians, the media and the public with complex statistical terms, such as the ‘R number’. Discussions of the latest scientific evidence are hosted on mainstream media. As noted above, this represents valuable transparency of the scientific process in action, but can also be confusing. While there may be an increase in scientific literacy, there may also be simply an increase in familiarity with scientific terms with little understanding of their meaning. There are a number of steps that can be taken to ensure that this higher profile is helpful:
How will the change in responsibility for government data impact future decision making?
21) The use of data, not just in and by government, but in partnership with scientists and academic research teams, should be a key part of long-term strategy. This needs to be driven across government and the Cabinet Office is well-placed to do so. Such partnerships would support not just the immediate response, but research to better understand the impact of COVID-19 and similar or related shocks in the future, and potentially build public trust given relatively high levels of confidence in science and universities. Such work could potentially be more actively promoted through partnerships between government, academics and business organisations possibly with more emphasis on the development of Data Trusts, or data sharing agreements, following on from the Hall-Pesenti review into AI. The National Data Strategy led by DCMS, and the Cabinet Office drive to improve government use of data, offer the opportunity to embed such valuable partnerships and must be closely coordinated. The Royal Society will be responding to the consultation on the National Data Strategy, expanding on the points made here.
 Royal Society DELVE initiative (2020) Scoping Report on Hospital and Health Care Acquisition of COVID-19 and its Control https://rs-delve.github.io/reports/2020/07/06/nosocomial-scoping-report.html
 ONS (2017) The ‘five safes’ – data privacy at ONS https://blog.ons.gov.uk/2017/01/27/the-five-safes-data-privacy-at-ons/
 The BMJ opinion (2020) Reducing barriers to data access for research in the public interest—lessons from covid-19 https://blogs.bmj.com/bmj/2020/07/06/reducing-barriers-to-data-access-for-research-in-the-public-interest-lessons-from-covid-19/
 COVID-19 Belgium Epidemiological Situation. Municipality Monitoring. https://datastudio.google.com/embed/reporting/c14a5cfc-cab7-4812-848c-0369173148ab/page/giyUB, accessed 15 October 2020
 Robert Koch-Institut (Germany) infections binned into medium sized geographical regions https://www.arcgis.com/home/item.html?id=f10774f1c63e40168479a1feb6c7ca74, accessed 15 October 2020
 Roozenbeek et al (2020). Susceptibility to misinformation about COVID-19 around the world https://royalsocietypublishing.org/doi/10.1098/rsos.201199