Dynamics of Data Science
Dynamics of Data Science
science skills
How can all sectors benefit
from data science talent?
Dynamics of data science skills: How can all
sectors benefit from data science talent?
Issued: May 2019 DES5847
ISBN: 978-1-78252-395-6
Contents
Foreword by Professor Andrew Blake FREng FRS 4
Executive summary 6
Introduction 17
Dynamics of data science: career paths and talent flows between sectors 39
Conclusion 85
Acknowledgements 88
Glossary 93
Data appendix 94
Dynamics of Data Science: How can all sectors benefit From data science talent? 3
FOREWORD
Foreword
In the 40 years since I began a doctorate in Data science and engineering are growing
artificial intelligence vision at the University fast and broadening in scope. No longer just
of Edinburgh, AI has changed out of all the preserve of highly technical STEM- or
recognition. Where once a costly computer finance-orientated roles in London, data
would spend hours in contemplation of one science increasingly pervades modern
image, now a mobile phone can track faces business, scientific endeavour and public
in real time, and is backed by the immense affairs. Children in primary school today will
Professor Andrew Blake
power of the cloud to search millions of enter the workforce in roles that don’t exist yet
FREng FRS documents, and recognise speech at because of the way data, and data-enabled
sometimes human-level performance. That technologies such as artificial intelligence,
sort of computational facility, combined are transforming the economy. This is leading
with the power of statistical analysis, also to a dramatic shift in the demand for data
gives us unprecedented analytical power science skills, nationwide and the need for
to understand large and complex data sets. data scientists working across all sectors. Our
This combination of capabilities, that we term analysis shows that over the last five-and-a-
data science, is effecting a revolution in the half years there has been a sharp rise in UK
way we do business, access knowledge, job-listings for ‘Data Scientists and Advanced
communicate, and understand the world. Analysts’ (+ 231%) driven predominately by
increased numbers of vacancies for Data
The Royal Society has encouraged the Scientists and Data Engineers.
development and use of science for
the benefit of humanity since 1660. We This report provides more evidence for the
commissioned this project, Dynamics of case the Royal Society has already made, in
Data Science Skills, because we share the Changing education: Creating the conditions
vision of the UK as a leading data science for a broad, balanced and connected
research nation with a sustainable flow of curriculum, to change post-16 education
expertise. We believe that data science can within the next ten years1. The report also
be an exciting and fulfilling career, that also builds upon findings and recommendations
addresses society’s needs. That requires of our work in data management (Data
the right higher education and training to management and use: governance in the
be made available. More broadly, users, 21st century, with the British Academy)2,
analysts and citizens of the future will need artificial intelligence (Machine Learning: the
to be comfortable with the application of data power and promise of computers that learn
science to societally pressing questions. This by example)3, and computing education
calls for data science skills to be thoroughly (After the Reboot: computing education
integrated into the school curriculum too. in UK schools)4.
1. Royal Society 2019. Jobs are changing, so should education. See https://royalsociety.org/~/media/policy/
Publications/2019/12-02-19-jobs-are-changing-so-should-education.pdf?la=en-GB (accessed 15 April 2019).
2. Royal Society. 2017 Data management and use: Governance in the 21st century. See https://royalsociety.org/-/
media/policy/projects/data-governance/data-management-governance.pdf (accessed 15 April 2019).
3. Royal Society. 2017 Machine learning: the power and promise of computers that learn by example. See https://
royalsociety.org/~/media/policy/projects/machine-learning/publications/machine-learning-report.pdf (accessed 15
April 2019).
4. Royal Society. 2017 After the reboot: computing education in UK schools. See https://royalsociety.org/~/media/
events/2018/11/computing-education-1-year-on/after-the-reboot-report.pdf (accessed 15 April 2019).
4 Dynamics of Data Science: How can all sectors benefit From data science talent?
FOREWORD
The case to upgrade data analysis education We also share examples of exciting or
and skills provision has also been made innovative models and mechanisms that are
by other organisations, most notably in the already in place around the country that could
‘Analytic Britain’ briefing that was published be spread more widely. Those examples are
by Nesta and Universities UK in 20155. available as a separate booklet, Dynamics
This briefing drew on two research reports of data science: models and mechanisms,
on the state of supply and demand for that institutions and individuals can use to
broad analytical skills in the UK to make read about opportunities and resources
recommendations spanning the whole in data science training and practice. A
analytical talent pipeline, including schools, further companion booklet Dynamics of
colleges, universities and the labour market data science: what do data professionals
and industry. We hope that our report chimes say about data science collects some of
with and reiterates the messages of Analytic the fascinating personal stories of career
Britain, whilst adding further evidence and paths that we encountered across academia,
new data to remedy skills shortages and industry, charities, and government. Our
ensure movement of talent. We also hope to interviewees include a recent apprentice, an
continue championing the good work that is international entrepreneur, a physician who is
already taking place across sectors. also a data scientist, self-taught researchers,
and data scientists working in finance, and
Our report is an extensive exploration of global development. The case studies
the current UK data science landscape. It include their reflections on their careers, their
looks at the demand for data professionals experiences of moving across sectors, their
(including data analysts, data engineers, observations about their role models, mentors,
and data scientists), how this has changed and professional communities, and their
in recent years, and how it varies across suggestions for improving the way things are
industrial sectors and UK regions. We use the done in data science.
analysis to identify four major areas for action:
developing foundational knowledge and skills; Finally, I would like to say that working on
advancing professional skills and nurturing this report has been a fascinating experience.
talent; enabling movement and sharing of The Royal Society policy staff have been a
talent; and widening access to data in a well- pleasure to work with, and have driven the
governed way. Within these areas for action programme with a sure touch. I also want
we identify priority needs and make some to thank my colleagues on the steering
recommendations for addressing them. group for their insights, and the dozens of
people who have contributed at roundtables
and workshops and with helpful comments
and contributions.
Andrew Blake
April 2019
5. M
ateos-Garcia J, Windsor G and Roseveare S. 2015 Analytic Britain: Securing the right skills for the data-driven
economy. See https://media.nesta.org.uk/documents/analytic_britain.pdf (accessed 15 April 2019).
Dynamics of Data Science: How can all sectors benefit From data science talent? 5
EXECUTIVE SUMMARY
Executive summary
The skills of data scientists and engineers are Foundational training in data skills begins
in high demand. They enable organisations at school, and there is an opportunity
to extract valuable insights from data, and and a need to consult broadly on a future
use them for substantial societal benefit. As curriculum that addresses the breadth of data
data analysis methodology grows in power, skills across mathematics and science, the
and the volume of data collected increases arts and humanities. Students need to be well
rapidly, the number and variety of roles in informed about the ever-widening range of
data science are also growing significantly. opportunities working with data. New hybrid
forms of education in data science at school
However, with major industry players hiring age and later, such as apprenticeships, are
many of the most experienced data scientists becoming more prevalent.
and AI researchers, media reports have
suggested that the natural flow of researchers Across all sectors, those with a foundation
from academia to industry may be reaching of skills and capabilities need opportunities
unsustainable levels6,7. Incentives available to deepen them and acquire new ones.
from industry compete hard with those offered Professional-level courses should be flexible
by academia, and large tech companies and responsive. Training may need to be
now even allow and encourage researchers industry-approved and accredited, and
to publish, once a particular advantage coordination is needed between industry
of working in universities. It is one further and universities. More informal mechanisms
factor in drawing talent away from even the such as online material are also needed to
strongest university research groups. allow people to (re)train through self-learning.
Universities will need more than ever to retain
There is considerable strength in UK data and create the trained staff to meet these
science in academic, industrial, charitable and teaching needs.
government sectors. We were able to draw on
the experience of representative institutions
to arrive at what we hope will be helpful
recommendations for employers, practitioners,
and decision-makers in the private, public and
university sectors. Research commissioned
for this report shows an increasing need for
people with data science skills, with a sharp
rise in demand for Data Scientists and Data
Engineers in the last five-and-a-half years.
The demand spans all sectors, with specialists
sought after everywhere from government
departments to technology start-ups. These
findings suggest that further skills gap analysis
is needed to quantify the number of employed
workers per opening.
6. S
ample I. 2017 ‘We can’t compete’: why universities are losing their best AI scientists. The Guardian. 1 November
2017. See https://www.theguardian.com/science/2017/nov/01/cant-compete-universities-losing-best-ai-scientists
(accessed 15 April 2019).
7. B
oland H. 2018 Britain faces an AI brain drain as tech giants raid top universities. The Telegraph. 2 September
2018. See https://www.telegraph.co.uk/technology/2018/09/02/britain-faces-artificial-intelligence-brain-drain/
(accessed 15 April 2019).
6 Dynamics of Data Science: How can all sectors benefit From data science talent?
EXECUTIVE SUMMARY
Data science particularly lends itself to The challenge for UK data science is to
movement of talent between sectors, reinforce the landscape by deploying the
including on shorter timescales. The ability UK’s substantial skills base flexibly, exploiting
to do this will be enhanced by recognising the strength of its institutions to generate
the value of cross-sectoral working and and nurture more talent, and mobilising its
braided careers (reciprocal arrangements valuable data resources ever more openly
that enable an individual to pursue dual, or and effectively.
even multiple, employment opportunities).
This requires each sector to broaden its We are still a long way from realising the full
criteria and incentives to recognise and potential of data-enabled technologies. This
welcome more diverse forms of experience, report highlights some of the challenges and
for example academics gaining experience opportunities of data science careers and
in the public sector or in start-ups. Attention some ways forward in addressing skills and
must be paid to the needs of small and mobility challenges.
medium size businesses, charities and the
public sector, addressing the challenge of
large multinationals that have resources
and appetite to absorb data science talent
at scale. Joint positions between sectors
have an important role to play in sharing
talent sustainably, and in promoting diversity
of experience.
Dynamics of Data Science: How can all sectors benefit From data science talent? 7
ABOUT THIS REPORT
8 Dynamics of Data Science: How can all sectors benefit From data science talent?
VISION
VISION
The UK is a leading data science research nation with a sustainable
flow of expertise. Diverse data science skills are integrated into curricula
in order to develop future users, developers and citizens. Data science
provides an exciting and fulfilling career choice. Data skills and appropriate
infrastructure are available across sectors. Data science is applied to
acheive broad societal benefit.
To achieve this vision, we focus on the These ideas are sketched out across the
four major areas for action: next pages and set out in more detail in the
following sections, along with priority needs,
• D
eveloping foundational knowledge
models and mechanisms, and recommended
and skills;
actions for addressing these needs.
• A
dvancing professional skills and
nurturing talent;
• E
nabling movement and sharing of
talent; and
• W
idening access to data in a well
governed way.
Dynamics of Data Science: How can all sectors benefit From data science talent? 9
RECOMMENDED ACTIONS
Need:
Building knowledge and skills from school level to degree level.
10 Dynamics of Data Science: How can all sectors benefit From data science talent?
RECOMMENDED ACTIONS
Need:
Widening access to data science education.
Dynamics of Data Science: How can all sectors benefit From data science talent? 11
RECOMMENDED ACTIONS
Need:
Developing skills in the workforce
12 Dynamics of Data Science: How can all sectors benefit From data science talent?
RECOMMENDED ACTIONS
Need
Creating the right research and
working culture for data science
Recommended action
Dynamics of Data Science: How can all sectors benefit From data science talent? 13
RECOMMENDED ACTIONS
Need Need
Enable movement through Recognising diverse
braided careers research outputs
14 Dynamics of Data Science: How can all sectors benefit From data science talent?
RECOMMENDED ACTIONS
Need
Establishing a coherent
approach to policy
Recommended action
Dynamics of Data Science: How can all sectors benefit From data science talent? 15
RECOMMENDED ACTIONS
Need Need
Opening data and providing Providing the computing power
secure access for use by the growing data
science community
Recommended action
Recommended action
Encourage data sharing where
possible Provide access to computing
Greater transparency of private sector data power
could help build public trust in the use of Improving the UK’s computing research
data and how it is used for decision-making infrastructure will better enable data scientists
purposes. The public sector could usefully to access the necessary computing power
consider how to widen access to its data, to release the value from data and address
including sharing data, and data challenges to research challenges, and will be vital for
researchers. Journal editors should normally the UK to remain competitive with other
ensure that data is being made available countries such as the US and China. BEIS
to other researchers in its original form, or and UKRI could usefully consider the need
via appropriate summary statistics where for continuing to improve access for data
sensitive personal information is involved. scientists working across all disciplines
The Royal Society has published a report on to high-power computing, and this could
Privacy Enhancing Technologies which sets helpfully be included as part of the UKRI
out how greater use of data could potentially Infrastructure Roadmap.
be enabled by PETs8.
Recommended action
8. T
he Royal Society. 2019 Protecting privacy in practice: The current use, development and limits of Privacy
Enhancing Technologies in data analysis. See https://royalsociety.org/-/media/policy/projects/privacy-enhancing-
technologies/privacy-enhancing-technologies-report.pdf (accessed 15 April 2019).
16 Dynamics of Data Science: How can all sectors benefit From data science talent?
INTRODUCTION
Introduction
Data science is a rapidly developing field, Definitions of data science have evolved
and in some ways a relatively new and over time, partly as a reflection of changes “A lot of what we
emerging discipline. Its development out in technology and data handling. In 1962, are doing in data
of different disciplines, and its potential John Tukey called for a reformation of science involves the
impact on society and the economy, requires academic statistics, although there was same computers,
a workforce with new skills. This section also a move to resist it. In The future of maths, stats, data
illustrates this critical moment and outlines data analysis, he pointed to the existence and computationally
our methodology for analysing how these of an as-yet unrecognised science, whose based research that
needs can be met in practice. subject of interest was learning from data, we have been doing
or ‘data analysis’12. Tukey worked between for as long as I have
Data science as a developing discipline academia and industry, and provides a been involved in
Making sense of data has a long history. notable precedent for ‘braided careers’ in research, with the
Historically, the notion of finding useful patterns data science, working jointly at Bell Labs and current trendy label
has been given a variety of names such as Princeton University Statistics department. for it.”
data mining, knowledge extraction, information Over the last 50 years, statisticians, data
harvesting and data pattern processing. It analysts and computer scientists have played Dr James Hetherington,
was performed by scientists, statisticians, a part in the invention and development of Director of Research
Engineering at the Alan
librarians, computer scientists and others9. For computational environments for data analysis13.
Turing Institute.
example, in 1854, John Snow's map of cholera
cases alerted him to the cause of the cholera A new workforce
outbreak. A cluster of dots located close to a In 2012, Harvard Business Review described
single water supply on a map changed how the modern day data scientist as “a high
we analyse and visualise data today10. ranking professional with the training and
curiosity to make discoveries in the world of
The term ‘data science’ emerged in the big data” 14. D J Patil, LinkedIn Chief Scientist
1960s to designate a new profession that and author of Data Scientist: the sexiest job
was expected to make sense of increasingly of the 21st century, explained that the focus
large stores of data. It wasn’t until the early of teams at LinkedIn was to work on data
2000s that the first data science journal applications that would have an immediate
was launched by the Committee on Data for and massive impact on the business. The
Science and Technology (CODATA) of the term that seemed to fit best was data
International Council for Science11. scientist: those who use data and science
to create something new.
9. Fayyad U et al. 1996 From Data Mining to Knowledge Discovery in Databases. American Association for Artificial
Intelligence 17, 37-54.
10. Rogers S. 2013 John Snow’s data journalism: the cholera map that changed the world. The Guardian. 15 March
2013. See https://www.theguardian.com/news/datablog/2013/mar/15/john-snow-cholera-map (accessed 27
November 2018).
11. In 2013, Forbes technology contributor Gil Press put together a timeline which traces the evolution of the term
‘data science’ and its use, attempts to define it, and related terms. Press G. 2013 A Very Short History Of Data
Science. Forbes. 28 May 2013. See https://www.forbes.com/sites/gilpress/2013/05/28/a-very-short-history-of-data-
science/#30ad96c655cf (accessed 27 November 2018).
12. Donoho D. 2017 50 Years of Data Science. Journal of Computational and Graphical Statistics 26:4, 745-766.
13. Donoho D. 2017 50 Years of Data Science. Journal of Computational and Graphical Statistics 26:4, 745-766.
14. D
avenport T and Pail DJ. 2012 Data Scientist: The Sexiest Job of the 21st Century. Harvard Business Review.
October 2012. See https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century (accessed 27
November 2018).
Dynamics of Data Science: How can all sectors benefit From data science talent? 17
INTRODUCTION
In 2007, Microsoft Professor Jim Gray coined However, there is not yet a consistent
the era of massive data the ‘Fourth Paradigm,’ definition of data science or the role of a data
stating that our capacity for collecting data scientist16. Data science can cover a range
has outstripped our present capacity to of activities from rapid analysis of real-time
analyse it and so our focus should be on data to long term evidence collection in the
developing people with the skills to make sciences. There is a wide variety of skills
sense of it15. under the label ‘data science’ and people
with relevant skills may associate with
other disciplines.
figure 1
15. Hey T, Tansley S and Tolle K. 2009. The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research.
16. P
ress G. 2013 A Very Short History Of Data Science. Forbes. 28 May 2013. See https://www.forbes.com/sites/
gilpress/2013/05/28/a-very-short-history-of-data-science/#30ad96c655cf (accessed 27 November 2018).
18 Dynamics of Data Science: How can all sectors benefit From data science talent?
INTRODUCTION
Background to this study: Why now? each year. In the current age, an increasing
Greater exchange, storage and use of data: volume of information is being collected “To be a good data
Commercial, industrial and academic uses from a greater range of sources and at scientist you really
of data have expanded considerably over greater speed than ever before17. According do need a hell of a
recent years. Changes to the volume, variety, to research from McKinsey, the volume of lot of skills”
and velocity of data collection have created data continues to double every three years
a potentially rich resource for the digital from digital platforms, wireless sensors and Chanuki Illuska Seresinhe,
economy. One estimate suggests that open mobile phones18. Senior Data Scientist,
Channel 4.
data could help create $3 trillion of value
figure 2
The greater exchange and use of data and the greater potential for efficiency and
innovation has led to the development of a unique, interdisciplinary workforce with
new and evolving skills.
Greater exchange,
storage and use of
accessible data
An interdisciplinary
workforce with new
and evolving skills
17. Manyika J et al. 2013 Open data: unlocking innovation and performance with liquid information. McKinsey Global
Institute. See http://www.mckinsey.com/business-functions/digital-mckinsey/our-insights/open-data-unlocking-
innovation-and-performance-with-liquid-information (accessed 27 November 2018).
18. Henke N et al. 2016 The age of analytics: Competing in a data-driven world. See https://www.mckinsey.com/
business-functions/mckinsey-analytics/our-insights/the-age-of-analytics-competing-in-a-data-driven-world
(accessed 27 November 2018).
Dynamics of Data Science: How can all sectors benefit From data science talent? 19
INTRODUCTION
Increasing analytical and computing power: The Royal Society has responded to this
“Whether people In the past, the complexity of big data moment in its recent reports Machine
are in the limited the effectiveness of existing methods learning: the power and promise of
meteorological of analysis. Now, data scientists have computers that learn by example,
office, in the unprecedented computing power at their which highlighted among a range of
City looking at disposal. In the commercial sector, analysis recommendations the need for data skills at
financial data, or of data has become central to achieving all levels – from foundational data skills to
in government competitive advantage in some sectors and machine learning expertise at the leading
statistics looking has led to new applications of predictive edge of research21.
at sociological analytics and machine learning to address
data, they have business problems19. The data scientists we This report identified the conditions
been confronted interviewed gave some examples of what this for enabling the UK to benefit from the
with questions and means for data science. opportunities presented by machine learning.
problems that go These included the need for an amenable
beyond traditional More decision making based on data data environment and to generate a healthy
notions of statistics science techniques: skills pipeline from school level through
or data analysis. In the public sector, combining data with intelligent users of machine learning to high
This has forced them analytics can, for example, allow a better level research. However, at the time of its
to grab this notion understanding of population needs and help publication there were media reports of a brain
of data science. To design and deliver services accordingly. drain from academia into industry. This led
me, that is what Better use of data can improve the design, the Royal Society to consider whether there
really justifies its efficiency and outcomes of services20. are aspects of the industry/academia/public
existence.” sector interface in data science that are unique
compared with other sciences.
Professor Graham
Cormode, Professor in Data management and use: governance for
Computer Science at the
the 21st century, produced with the British
University of Warwick.
Academy, highlighted the need for new ways
to govern new uses of data, including the
need for responsible sharing of data and
using data analysis for public benefit22. This
report addresses some of the ways that these
needs can be met in practice.
19. M
cKinsey Analytics. 2018 Analytics comes of age. See https://www.mckinsey.com/~/media/McKinsey/Business%20
Functions/McKinsey%20Analytics/Our%20Insights/Analytics%20comes%20of%20age/Analytics-comes-of-age.ashx
(accessed 29 November 2018).
20. Timmis S, Heselwood L and Harwich E. 2018 Sharing the benefits: How to use data effectively in the public sector.
See https://reform.uk/research/sharing-benefits-how-use-data-effectively-public-sector (accessed 27 November 2018).
21. Royal Society. 2017 Machine learning: the power and promise of computers that learn by example.
See https://royalsociety.org/~/media/policy/projects/machine-learning/publications/machine-learning-report.pdf
(accessed 15 April 2019).
22. Royal Society. 2017 Data management and use: Governance in the 21st century. See https://royalsociety.org/-/
media/policy/projects/data-governance/data-management-governance.pdf (accessed 15 April 2019).
20 Dynamics of Data Science: How can all sectors benefit From data science talent?
INTRODUCTION
Workshops Statistics
To explore this further, we held scoping The Royal Society commissioned Burning “The pace of
roundtables in 2018 to explore key questions Glass Technologies, an analytics software technology
with a range of stakeholders, including: company who provide real-time data on job evolution is ever
growth, skills in demand, and labour market increasing. I think
• w
hat draws data science talent to particular
trends to provide data and analyses on UK we should all be
sectors and organisations?
job adverts related to data science and doing everything
• h
ow can universities and the public sector analytics. This was in order to determine we can to enable
learn from industry’s success? demand trends, salary changes, location of people to be part
positions, skills and experience requirements. of that. I would
• w
hat kinds of models should we explore to
This analysis also sets out a taxonomy of the support training
enable a thriving landscape?
various, interconnected occupations related up of technical
• h
ow can we promote collaboration and to generation and use of data. individuals who
cross-sector careers? are open about
Interviews with data scientists from academia, intending to take
Participants highlighted a number of existing charities, private and public sectors careers in industry.
and potential new models to attract, educate To understand the drivers and blockers in I think that should
and retain data science talent, and ensure a more detail, a series of interviews was carried be a strategic
healthy research landscape. out with researchers working within the objective for a
broad field of data science and particularly country.”
We also explored the drivers for individual those who have worked across academia,
data scientists to pursue careers in different government and industry. They revealed a Dr Ilya Zheludev, Chief
complex network of factors that impacted Data Officer at Jasmine22.
sectors and to further interrogate the models
for upskilling, retaining and sharing data on career choices. Interviewees also spoke
science talent and determine drivers for the about the mechanisms that have allowed
movement of data scientists. Participants them to thrive in multiple roles across the
also discussed the application challenges landscape, or move between them, and
and conditions needed for success, such as highlighted where they perceived skills gaps
infrastructure and funding. and gave suggestions for how to fill them.
Dynamics of Data Science: How can all sectors benefit From data science talent? 21
INTRODUCTION
22 Dynamics of Data Science: How can all sectors benefit From data science talent?
Dynamics of Data Science: How can all sectors benefit From data science talent? 23
24 Dynamics of Data Science: How can all sectors benefit From data science talent?
Chapter one
Image © monsitj.
Dynamics of Data Science: How can all sectors benefit From data science talent? 25
CHAPTER ONE
26 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER ONE
25. Office for National Statistics. 2019 Labour market overview, UK Statistical bulletins. See https://www.ons.gov.
uk/employmentandlabourmarket/peopleinwork/employmentandemployeetypes/bulletins/uklabourmarket/
previousReleases (accessed 15 April 2019).
26. Djumalieva J & Sleeman C. 2018 Linking skills to occupations: Using big data to build a new occupational
taxonomy for the UK. See https://www.nesta.org.uk/blog/linking-skills-to-occupations-using-big-data-to-build-a-new-
occupational-taxonomy-for-the-uk/ (accessed 15 April 2019).
27. D
jumalieva J and Sleeman C. 2018 Making Sense of Skills. See http://data-viz.nesta.org.uk/skills-taxonomy/index.
html (accessed 15 April 2019).
28. ibid.
Dynamics of Data Science: How can all sectors benefit From data science talent? 27
CHAPTER ONE
Table 1
Framework categories showing increasing levels of analytical rigour across all Data Science
and Analytics (DSA) jobs.
28 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER ONE
BOX 1
Dynamics of Data Science: How can all sectors benefit From data science talent? 29
CHAPTER ONE
30 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER ONE
Regional demand (see Figures 3 and 4, in 2017/18. However, for Data Scientists and
and Appendix Table 2) Advanced Analyst job vacancies (fig 4), growth
Regional breakdowns show the dominance was larger (relative to the base amount) in
of London for Data Scientist and Advanced Northern Ireland (563%), the North West
Analysts, accounting for 58% of all postings (269%) and the East of England (250%).
Table 2
Job postings and salary information for data science and analytics jobs (including a detailed look at the Data Scientists
and Advanced Analysts category).
Dynamics of Data Science: How can all sectors benefit From data science talent? 31
CHAPTER ONE
Table 3
*8,157 job adverts included in this analysis. **27,033 job adverts included in this analysis.
32 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER ONE
Table 4
*Out of 8,157 job adverts included in this analysis. **Out of 27,033 job adverts included in this analysis.
Dynamics of Data Science: How can all sectors benefit From data science talent? 33
CHAPTER ONE
figure 3
Map showing the growth of all data job postings across the UK from 2013 to 2017 – 18.
2013 2017/2018
736,611 996,262 Scotland
2013 2017/2018
35% 36,388 41,117 13%
Growth since 2013
North East
2013 2017/2018
7,444 11,095 49%
North West
2013 2017/2018
Yorkshire and the Humber
45,286 61,871 37%
2013 2017/2018
37,885 47,014 24%
East Midlands
Northern Ireland 2013 2017/2018
2013 2017/2018 30,957 37,743 22%
4,987 11,902 139%
East of England
West Midlands 2013 2017/2018
2013 2017/2018 53,295 61,725 16%
41,771 86,563 107%
Home nations
Northern Ireland Scotland Wales England
2013 2017/2018 2013 2017/2018 2013 2017/2018 2013 2017/2018
4,987 11,902 139% 36,388 41,117 13% 9,198 14,507 58% 654,242 837,307 25%
*Regional figures do not add up to the UK total because job adverts that did not include location (eg remote working) have been excluded.
34 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER ONE
figure 4
Map showing the growth of Data Science and Advanced Analyst (DSAA) job postings across the UK from 2013 to 2017 –18.
2013 2017/2018
8,157 27,033 Scotland
2013 2017/2018
231% 528 1,114 111%
North East
2013 2017/2018
54 175 224%
North West
2013 2017/2018
Yorkshire and the Humber
320 1,182 269%
2013 2017/2018
256 746 191%
East Midlands
Northern Ireland
2013 2017/2018
2013 2017/2018 200 474 137%
46 305 563%
East of England
West Midlands
2013 2017/2018
2013 2017/2018
551 1,928 250%
256 748 192%
Wales
Greater London
2013 2017/2018
2013 2017/2018
121 216 79%
4,131 14,066 240%
Home nations
Northern Ireland Scotland Wales England
2013 2017/2018 2013 2017/2018 2013 2017/2018 2013 2017/2018
46 305 563% 528 1,114 111% 121 216 79% 7,130 22,527 216%
*Regional figures do not add up to the UK total because job adverts that did not include location (eg remote working) have been excluded.
Dynamics of Data Science: How can all sectors benefit From data science talent? 35
CHAPTER ONE
Discussion of the data and understanding The Data Skills Taskforce is currently looking
the skills gap at the gap between the supply of students
Since 2012, the data science industry has coming through with the required knowledge
moved extremely quickly. It is possible that and skills, and the demand required by
some of the jobs included in this analysis employers. No doubt data science techniques
have been relabelled as ‘Data Scientist’ or will themselves need to be applied to predict
associated roles, possibly leading to inflated future workforce needs.
or exaggerated growth rates. This is likely to
reflect wider changes in the language used BOX 2
to describe data science and data scientists.
The Data Skills Taskforce
It is very hard to estimate the true gap between
market demand and supply. Vacancies are not The Data Skills Taskforce, chaired
necessarily a good proxy for the number of by Accenture and the Alan Turing
data science jobs. One reason for this is that if Institute, sets an agenda for change to
job turnover increased, more vacancies will be inspire, educate and upskill data talent,
advertised even if there has been no change drawing on best practice from the UK’s
in the total number of people working as data leading institutions. The taskforce was
scientists. Additional analysis of the Labour established to review, promote and take
Force Survey could help to aid understanding forward key elements of Analytic Britain,
of the actual number of jobs. across schools, universities and the
labour market at large29. It comprises UK
There are also a number of challenges to businesses, data skills stakeholders and
understanding the supply of data science the Department for Digital, Culture, Media
graduates. The Higher Education Statistics and Sport.
Agency (HESA) codes that are used to
classify subjects studied do not include The main aims of the Data Skills
data science. In any case, only a very small Taskforce are to promote the importance
minority of data scientists will come from of data skills, highlight critical skills
data science courses. gaps and monitor progress against the
recommendations of Analytic Britain.
Ultimately, more needs to be done to The Data Skills Taskforce is building a
determine the existence, size and scale of the platform to help SMEs develop their data
skills gap. Further analysis is needed of the way science capabilities, working to quantify
statistics on higher education are captured, so the UK data skills gap, which is currently
as to understand better the challenges faced significant and growing, and considering
by employers. The lack of traditional labour whether a data science foundation course
market data for these roles has created an for all undergraduates is required.
information gap that is unhelpful to educators,
employers, and policymakers as they attempt
to build a workforce with the skills needed
across the landscape.
29. Mateos-Garcia J, Windsor G and Roseveare S. 2015 Analytic Britain: Securing the right skills for the data-driven
economy. See https://media.nesta.org.uk/documents/analytic_britain.pdf (accessed 15 April 2019).
36 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER ONE
Dynamics of Data Science: How can all sectors benefit From data science talent? 37
38 Dynamics of Data Science: How can all sectors benefit From data science talent?
Chapter two
Image
Image
Caption©goes
Laurence
here. Dutton.
Dynamics of Data Science: How can all sectors benefit From data science talent? 39
CHAPTER TWO
40 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER TWO
figure 5
Intellectual
freedom
Joint
Recognition
appointments
ACADEMIA
Access to Impact
data and on policy
computing and public
power services
Access
Salary
to data
Note: Some of these drivers are not the exclusive presure of any one sector but are highlighted here for illustrative purposes.
Dynamics of Data Science: How can all sectors benefit From data science talent? 41
CHAPTER TWO
figure 6
42 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER TWO
ACADEMIA: BLOCKERS
INDUSTRY: BLOCKERS
Dynamics of Data Science: How can all sectors benefit From data science talent? 43
CHAPTER TWO
30. Shead S. 2016 Apple is finally going to start publishing its AI research. Business Insider. See https://www.
businessinsider.com/apple-is-finally-going-to-start-publishing-its-artificial-intelligence-research-2016-12?utm_
source=feedly&utm_medium=webfeeds&r=US&IR=T (accessed 15 April 2019).
44 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER TWO
“ From the point of view of academic career “ The counterpoint is that mobility can be
development, my six years in Silicon harder in either direction if the gap is wider.
Valley were completely wasted. It does It is often said that it is difficult for people
not translate into anything tangible on from industry to move into academia if they
the academic career ladder. But, I suspect have not maintained a research profile.
that is part of why Cambridge University
decided to hire me. This kind of experience Equally, I think if you have been working
must be recognised. It did not lead to in an academic environment focused on
papers. It did not lead to cutting-edge stuff. publishing and you want to go to a more
But it led to loads of ideas.” industrial employer, one of the things you
will face is that they will say, ‘Before we
Dr Damon Wischik, Lecturer, Department of Computer get started, can you do some programming
Science and Technology, University of Cambridge. exercises or fairly hands-on stuff?’ For
someone who has been working on
equations and theory, it can actually be a bit
“ Once you have left academia and you of a wrench to reorient to those more hands-
are not publishing high‑quality papers, on expectations that organisations have.”
it is very difficult to go back. If there was
more funding for projects that required Professor Graham Cormode, Professor in Computer
Science at the University of Warwick.
one commercial partner and one academic
partner, that would be interesting.”
Chanuki Illushka Seresinhe, Lead Data Scientist at Popsa. “ This is a major data science/AI challenge
for government, innovative companies
and universities. In my experience, the
boundary between universities and
government is potentially difficult: the
timescales and measures of success are
very different on either side. But there are
various experiments, for example the Policy
Fellowships pioneered by the Centre for
Science and Policy, which show how the
two sides can learn from each other to the
enormous benefit of our society as a whole.”
Dynamics of Data Science: How can all sectors benefit From data science talent? 45
46 Dynamics of Data Science: How can all sectors benefit From data science talent?
Chapter three
Developing foundational
knowledge and skills
Image © jacoblund.
Dynamics of Data Science: How can all sectors benefit From data science talent? 47
CHAPTER THREE
Developing foundational
knowledge and skills
Education should provide a grounding Alongside the technical skills, interviewees in
“It is not common for to ensure that all young people develop this project also highlighted a range of other
my generation to underpinning data science knowledge core skills for working well with data including
be able to program. and skills. The data experts that we spoke adaptability, curiosity, empathy, problem-
There are loads of to highlighted a range of core skills and solving and story-telling which supports the
really great online disciplines that need to be developed early case for embedding data science in an array
resources, but I on including coding, computer science, of subjects.
think it can still be mathematics, machine learning, statistics,
difficult for people and more. There are pools of potential talent which
to see why it would could be reached to address local needs and
be useful.” Last year the Royal Society published a review there could be more courses, apprenticeships
of how data science skills are nurtured in and work placements outside of London
Dr Amy Nelson, Senior England's curriculum31. The review was written and the South East. Employer-led Trailblazer
Research Associate, UCL by a curriculum expert, Dr Vanessa Pittard, groups and public sector bodies should
Institute of Neurology.
and it identified some barriers to embedding further support and expand existing
data skills into the curriculum as well as some programmes, such as those run by the Office
opportunities for further development. for National Statistics and the BBC, and work
together to resolve knotty delivery issues.
The review found that at primary level, pupils
are likely to gain a reasonable introduction This chapter sets out two priority needs
to underpinning elements of data science if and offers specific recommendations for
they are taught well. However, at secondary developing foundational knowledge and skills
level, the lack of systematic progression in for data science. It also highlights successful
relevant aspects of computing is a major or innovative models and mechanisms
curriculum challenge. for developing these foundations and
addressing these needs.
Last year the Royal Society also conducted
a study identifying a range of initiatives that
seek to increase the proportion of young
people studying computing (particularly
girls)32. The study, co-funded by Microsoft and
Google, found that 54% of English schools
do not offer Computer Science GCSE, and
put recommendations in place to encourage
greater take up and increase teachers’
confidence. The report also found that barriers
exist around teacher confidence in applying
mathematics to domain questions, and in
conceptual and technical aspects of computing.
31. P
ittard V. 2017 The integration of data science in the primary and secondary curriculum. See https://royalsociety.
org/~/media/policy/Publications/2018/18-07-18-The%20Integration%20of%20Data%20Science%20in%20the%20
primary%20and%20secondary%20curriculum.pdf?la=en-GB (accessed 15 April 2019).
32. Royal Society. 2017 After the reboot: computing education in UK schools. See https://royalsociety.org/~/media/
events/2018/11/computing-education-1-year-on/after-the-reboot-report.pdf (accessed 15 April 2019).
48 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER THREE
Need:
Recommended action “In my apprenticeship
Widening access to data
there was a whole
science education Support teachers to teach unit dedicated to
Our work on the educational data science data skills how you work with
pipeline, discussions with data scientists and other people in an
Developing the resources, training and
analysis of job market data, showed that there office. You had to
support for teachers requires combined
is a need to follow up a number of priority learn to express your
effort from mathematics and computing
needs in order to develop the foundational opinion and convince
communities, businesses, and education
level knowledge and skills required for people of your ideas.
professionals, including the new National
professional roles working with data. These Apprenticeships
Centre for Computing Education and National
needs cover building core skills at school could be run across
Centre for Excellence in the Teaching of
level, supporting teachers, expanding different sectors
Mathematics, working with the support
apprenticeships and promoting outreach so that alumni can
and input of government departments and
initiatives to introduce young people to build professional
other partners. This can build on important
data skills. networks.”
work already carried out by the Department
for Education and others in this area, and Alexis Fernquest, Data
examples such as the Royal Geographical Scientist, Office for
Recommended action Society’s Data Skills in Geography National Statistics and
programme (see Models and mechanisms). former apprentice.
Data skills for everyone
At school level, data science knowledge and Issues around teacher confidence could be
skills would benefit from greater integration addressed through professional development
across the primary and secondary curriculum opportunities and more provision of detailed
to ensure that everyone leaves school with guidance. Subject specific knowledge
foundational data skills. Mathematics and could be delivered in collaboration with
computing communities, businesses and universities and professional bodies (eg
education professionals have a key role in subject associations). There are possibilities
ensuring that relevant insights are built into for incorporating real uses of data across the
the education curriculum and associated sciences and humanities. One area that is
enrichment activity in schools, ensuring relatively underplayed in the current curriculum
that data science skills are appropriately is developing investigation using GDS/GIS data
embedded across subjects and phases. which could be incorporated into humanities
subjects. Appropriate computing and data
infrastructure for schools will be crucial to
support the teaching of data skills.
Dynamics of Data Science: How can all sectors benefit From data science talent? 49
CHAPTER THREE
Need:
“I would like to see Recommended action
Widening access to data
more people with
developed skills Curricula fit for the future science education
in computational Looking further ahead, an analysis of the There is also a need to address the
statistics and future data science needs of students, underrepresentation of women in the talent
machine learning industry and academia should be undertaken pipeline. Across the mathematical sciences,
coming out of to inform future curriculum development. 37.1% of university students were female in
undergraduate Curriculum content groups should consider 2018. Computer science was the subject with
degrees that are the place of data science within the curricula the widest gender gap across all degree
not computer they are developing now, ahead of the next levels in 2018 (82.8% male). This was most
science and physics. curriculum review. In addition to the relevant pronounced at the undergraduate levels:
For example, the areas of mathematics, computer science and just 15.1% of first degree undergraduates
social sciences, data literacy, the ethical and social implications were female. Forbes data from a US study
life sciences and of machine learning should be included within of technical education provider, General
environmental teaching activities in related fields, such as Assembly, found that in data science
sciences.” Personal, Social and Health Education. programmes/boot camps in the US female
participation lags with 35.3% female students
Dr James Hetherington,
Post-16 curriculum change within the next ten enrolled in a five month period (2016/17)34.
Director of Research
Engineering. years will be vital to ensure young people
leave education with the broad and balanced
In the UK, just 4% of the UK tech industry is
range of skills they will need to flourish in
from a black, Asian and minority ethnic (BAME)
a changing world of work33. A review into
background, compared with 11% in the working
post-16 learning should consider the many
population as a whole, at the last census. With
ways in which the post-16 curriculum could
low diversity in academic programmes and
be improved and the factors which affect the
industry bootcamps, it is unlikely that diversity
options open to young people.
gaps will close in the near future without
significant investment35.
At university level, further consideration is
needed about how universities can teach
data science effectively, as a developing
and interdisciplinary discipline.
33. Royal Society 2019. Jobs are changing, so should education. See https://royalsociety.org/~/media/policy/
Publications/2019/12-02-19-jobs-are-changing-so-should-education.pdf?la=en-GB (accessed 15 April 2019).
34. Pricenomics. 2017 The Data Science Diversity Gap. Forbes. 28 September 2017. See https://www.forbes.com/sites/
priceonomics/2017/09/28/the-data-science-diversity-gap/#56ccb88c5f58 (accessed 15 April 2019)
35. Douglas R. 2018 Lack of diversity increases risk of tech product failures. Financial Times. 14 November 2018.
See https://www.ft.com/content/0ef656a8-cd8a-11e8-8d0b-a6539b949662 (accessed 16 April 2019).
50 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER THREE
Dynamics of Data Science: How can all sectors benefit From data science talent? 51
CHAPTER THREE
36. Poluck M. 2017 ONS opens £10 million hub to exploit digital data. UK authority. 27 March 2017. See http://www.
ukauthority.com/data4good/entry/7009/ons-opens-%C2%A310-million-hub-to-exploit-digital-data# (accessed 18
April 2018).
37. C
hu B. 2015 Bean review tells ONS to be more “intellectually curious”. Independent. 3 December 2015. https://
www.independent.co.uk/news/business/news/bean-review-tells-ons-to-be-more-intellectually-curious-a6758226.
html (accessed 18 April 2018).
52 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER THREE
Mechanism Mechanism
Alternative degree pathways. Outreach programmes. “Q-Step academics
have enriched social
Model Model science teaching
Q-Step, Nuffield Foundation, ESRC and Code First: Girls by integrating
HEFCE (now Office for Students (OfS)) Code First: Girls supports young adult and data skills into
The Q-Step programme, launched 2013/14 working age women to develop further their courses. Most
aims to address the quantitative skills gap in personal and professional skills. Code students are now
social sciences in the UK. With an initial six- First: Girls runs free coding courses; it also handling data in
year £19.5 million investment from the Nuffield connects women to a community of other R. It demonstrates
Foundation, the ESRC and HEFCE, around talented and like-minded women and that what we refer
60 new academic staff were employed companies who can support and accompany to as STEM skills
to use innovative approaches to embed them through their professional development. don’t just come
quantitative methods and data analyses into Code First: Girls helps companies train their from taking STEM
social science teaching. There are currently people, recruit new people, and develop their degrees.”
18 universities across the UK funded to talent management policies and processes so
deliver specialist undergraduate programmes, they don’t miss out on female tech talent. Dr Simon Gallacher, Head
including new courses, work placements and of Student Programmes at
the Nuffield Foundation.
pathways to postgraduate study.
Dynamics of Data Science: How can all sectors benefit From data science talent? 53
54 Dynamics of Data Science: How can all sectors benefit From data science talent?
Chapter four
Advancing professional
skills and nurturing talent
Image © FS-Stock.
Dynamics of Data Science: How can all sectors benefit From data science talent? 55
CHAPTER FOUR
Advancing professional
skills and nurturing talent
Data science is fast-moving and needs New technologies are changing the roles
“There are four main innovative mechanisms to develop that data scientists are performing and
technical skills that advanced skills quickly. When people are in this is having an impact on the depth and
a data scientist employment, there are a number of needs to breadth of skills that are being sought after.
needs: statistics, be met in enabling them to keep their data New technologies are performing roles that
machine learning, science skills up to date and to ensure that previous required highly skilled employees,
data and computer they can work most effectively. These include: making people with more junior data
science. What is qualifications increasingly effective.
• d
eveloping teams and a workplace culture
in short supply is that enables data scientists to make the
having all four of However, the commercial sector could
best contribution in their sector;
them together, plus usefully do more to help train and develop
the soft skills.” • e
nabling individuals to keep their skills employees with the right skillsets. Employers
up to date whether they are data science have a role in upskilling the workforce by
Milton Luaces, Senior specialists or people in a range of training existing employees, particularly those
Manager at Accenture – at risk of losing their jobs through automation,
professions who increasingly need to
Applied Intelligence.
work with data; and can work with universities to co-produce
training. Higher education institutions
• s upporting career changes to meet
with good industry links play a key role in
individual, organisation and regional needs.
developing appropriate professional training.
By working in collaboration with employers
In the longer term, problem solving,
they can potentially address regional skills
resilience, and continuous learning will also
gaps and productivity needs.
be necessary to enable people to adapt to
change, particularly as technology changes
“Industry can play a slightly bigger role in
jobs and the opportunities to collect more
being involved in the training of people,
and more data continue to grow.
regardless of what age or stage they are
at. At the moment, it is an employer and
There is a need to develop researchers
employee relationship. But I can see it
and employees with good data engineering
being a lot more collaborative and I have
skills who have knowledge and experience
come across institutions that are starting to
of handling ‘big data,’ huge unstructured
recognise this. They are moving towards a
datasets, data sources and/or real-time
mindset of upskilling people, exposing them
data. Across all sectors there are massive,
to their company, giving them a grounding
high volume and high velocity datasets that
and capabilities to help solve the problems
need to be stored, processed and analysed
that particular institution faces. I was
in real time, and this requires the creation
sponsored by a company during my PhD
and maintenance of infrastructure for big
and I would encourage more institutions
data. Typically this is work that would be
to do that.”
undertaken by data engineers, but currently
it is often done by data scientists because Ilya Zheludev, Chief Data Officer for
there are not enough data engineers to take Jasmine 22.
on these roles.
56 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER FOUR
In the public sector, collaboration is already Whilst Massive Open Online Courses
helping to upskill the workforce. At the Office (MOOCs) suffer from high dropout rates “I started off
for National Statistics there are Joint PhD/MSc and irregular provision, they also enable with Tableau,
projects, or placements, for example the joint people to undertake independent study storytelling and
Turing-Campus PhD programme. The ONS is and upskilling alongside other more formal data visualisation.
also involved with several of the Centres for training programmes. On the side, I was
Doctoral Training in big data, and runs CPD starting to learn
courses for analysts – typically short intensive Other informal mechanisms include peer R and Python for
courses linked to specific needs. learning such as forums and meet-ups, more of the deep
competitions and clusters. One example is dive work on
Beyond this, specialist skills are often Kaggle, an education platform that also holds the data.”
developed through self-learning programmes. competitions39. The aim of meet-ups is to
An increasing proportion of job adverts are promote free, open, dissemination of data Kevin Koene, former
calling for software skills such as Hadoop, science knowledge. They encourage data Junior Data Scientist,
Python and R. Informal learning and the open The One campaign.
science peer-to-peer learning and sharing,
source movement are enabling people to collaboration among data scientists and data
develop these skills in non-traditional ways. start-ups. They also promote open source
data science tools40.
In 2017, Bernard Marr, writing for Forbes, put
together a list of free online data science Some meet-ups focus on underrepresented
courses with links and endorsements from groups, such as the London-based ‘Inspiring
well-established institutions38: For example: Women in Data Science’, which has almost
• D
ata Science Specialization (Johns Hopkins 1,000 members and runs several events
University and Coursera): One of the longest- throughout the year41. In addition, there are
established online data science courses; and other groups that focus on specialisms within
data science including Women in ML, Black
• D
ata Science Essentials (EdX and Microsoft):
in AI, Queer in AI, LatinX in AI, the AI Club for
Part of the Microsoft Professional Program
Gender Minorities, RLadies and PyLadies, an
Certificate in Data Science;
international mentorship group for women
• B
ecome a Data Scientist (Dataquest with who code in Python with chapters in London,
endorsements from Uber, Amazon and Edinburgh and Dublin.
Spotify): Independent online training
provider offering three pathways (analyst, Community knowledge-sharing events such
scientist and engineer); and as panel/roundtable events on specific hot
• D
ata Mining Course (KDNuggets): Well- topics also bring together data scientists
known business and data science website to discuss, share and take advantage of
that has compiled its own free data-mining community knowledge.
syllabus.
38. Marr B. 2017 The 9 Best Free Online Big Data And Data Science Courses. Forbes. 6 June 2017. See
https://www.forbes.com/sites/bernardmarr/2017/06/06/the-9-best-free-online-big-data-and-data-science-
courses/#466bc8543cdb (accessed 15 April 2019)
39. Kaggle (no date) Kaggle is the place to do data science projects. See https://www.kaggle.com/ (accessed 15 April 2019).
40. Meetup (no date) Inspiring Women in Data Science. See https://www.meetup.com/IWDSuk/ (accessed 15 April 2019).
41. S
ekinah T. 2017 Info on Women in Data groups. DataIQ. 6 December 2019. See https://www.dataiq.co.uk/blog/info-
women-data-groups (accessed 15 April 2019).
Dynamics of Data Science: How can all sectors benefit From data science talent? 57
CHAPTER FOUR
42. Royal Society. 2017 Machine learning: the power and promise of computers that learn by example.
See https://royalsociety.org/~/media/policy/projects/machine-learning/publications/machine-learning-report.pdf
(accessed 15 April 2019).
58 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER FOUR
43. Research Software Engineers Association. 2019 Welcome to the UK Research Software Engineer Association See
https://rse.ac.uk/ (accessed 15 April 2019).
Dynamics of Data Science: How can all sectors benefit From data science talent? 59
CHAPTER FOUR
To address these priority needs, mechanisms organisations bring their real-world problems
“At the Office for including enrichment and fellowship to be tackled by small groups of highly
National Statistics, schemes, capability-building programmes talented, carefully selected researchers,
they have a training and informal/ peer-to-peer mechanisms such with a diversity of thought. Researchers
programme and as online courses, meet-ups and forums can brainstorm and engineer data science
they were willing to prove effective. The following models are solutions, presenting their work at the end
develop me for the illustrations of how they can work in practice. of the week. Organisations get to quickly
next role. They also prototype possible solutions to their data
encourage us to Mechanism science challenges, and researchers get an
move around every Enrichment and fellowship schemes opportunity to put knowledge into practice
two years to get and go beyond individual fields of research
experience working Model to solve real-world problems. Knowledge is
on different types Alan Turing Institute, Enrichment Scheme exchanged among groups, and participants
of analysis. I noticed and Data Study Groups from both academia and the organisations
the investment in One of the major goals of the Alan Turing posing challenges rapidly learn new skills
people and I feel Institute is to train new generations of data during the week – from how to work in
proud of the work science and artificial intelligence leaders with secure analysis environments to learning new
I do.” the necessary breadth and depth of technical data science methods and techniques, and
and ethical skills to match the UK’s growing tools for doing data science collaboratively
Alexis Fernquest, Data industrial and societal needs. The Enrichment in groups.
Scientist at the Office Scheme offers students currently enrolled on
for National Statistics.
a doctoral programme at a UK university the Mechanism
opportunity to join its research body. Doctoral Graduate development programmes.
students typically in their second or third year
of study can undertake a 6, 9 or 12 month Model
placement at the Institute’s headquarters Office for National Statistics advanced
in London. Joining a community of more training
than 400 senior academics, early career For the more advanced level training (eg PhD
researchers and PhD students, enrichment level), partnerships were made between the
students have the opportunity to boost their ONS, the Alan Turing Institute and several
skills and experience, enrich their research universities. The ONS has built relationships
and make new collaborations during their with many departments of the UK government
time at the Alan Turing Institute. which have set up Data Science ‘hubs’,
allowing them to regularly exchange and
The Data Study Groups are five-day communicate on the needs and skills
‘collaborative hackathons’, which bring needed. This should improve the flow of data
together organisations from industry, science talent across the UK.
government and the third sector, with
talented multi-disciplinary researchers
from academia. At each event, several
60 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER FOUR
Model Model
Faculty Fellowship Pivigo data science training
The Faculty Fellowship (formerly ASI Data Pivigo is a data science marketplace and
Science Fellowship) exists to ensure that the training company based in London. It helps
brightest academics get a chance to immerse organisations to innovate through data
themselves in working life, learn about science by connecting them with their own
artificial intelligence (AI) in business and help community of data scientists. Its Science
build the future of operational AI. to Data Science programme trains and
graduates some of the world’s top scientific
Since its founding in 2014, Faculty has PhD talent in data science, with three
trained and transitioned over 250 PhD STEM programmes each year. Pivigo runs a five-
graduates into data scientist roles in industry. week programme at its London campus or
Taking place three times a year (January, online. It works with large multinationals,
May and September), the Fellowship is highly charities, SMEs, and start-ups to help
competitive and receives applications from 5 learners gain practical experience with data
to 10% of the UK's physics, mathematics and science technologies and technical skills in a
engineering postgraduate research students. commercial environment. The scheme offers
In part, this is because alumni go on to students the opportunity to boost their skills,
work for big names like Google, DeepMind, grow their networks and work alongside
Facebook and Deliveroo. researchers.
44. Government Digital Service, Government Office for Science and Office for National Statistics. 2019. Introduction
to the Data Science Accelerator programme. See https://www.gov.uk/government/publications/data-science-
accelerator-programme/introduction-to-the-data-science-accelerator (accessed 15 April 2019).
Dynamics of Data Science: How can all sectors benefit From data science talent? 61
CHAPTER FOUR
45. Roberts S. 2018 Data Team Wins Future Policy Network Award. Government Digital Service blog. 19 February 2018.
See https://gds.blog.gov.uk/2018/02/19/data-team-wins-future-policy-network-award/ (accessed 15 April 2019).
46. Government Digital Service, Government Office for Science and Office for National Statistics. 2019. Previous
Data Science Accelerator projects. See https://www.gov.uk/government/publications/data-science-accelerator-
programme/previous-data-science-accelerator-projects (accessed 15 April 2019).
62 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER FOUR
Dynamics of Data Science: How can all sectors benefit From data science talent? 63
64 Dynamics of Data Science: How can all sectors benefit From data science talent?
Chapter five
Enabling movement
and sharing of talent
Image © baona.
Dynamics of Data Science: How can all sectors benefit From data science talent? 65
CHAPTER FIVE
Enabling movement
and sharing of talent
To perform at their best, scientists and Universities and the public sector in particular
“I can see a engineers need the right environment in need to work to create a culture that nurtures
transition from a which to work, one based on freedom to and retains data science talent. This can
lot of individual collaborate and access to different kinds of include building and supporting cross-
contributor work, resource. Typically, industry and academia disciplinary data science teams, potentially
especially in the offer different experiences as working disrupting traditional organisational structures.
ingesting of data environments. Industry tends to offer access
(cleaning it, making to funding and to large amounts of data in Big internet companies are adding to pressure
it available to use real time. Academia tends to offer freedom to on universities, which are already struggling
in a downstream explore and easy routes to cross-fertilisation to retain professors and other employees.
process, for with other subjects. The best research may For sectors facing particular challenges
example modelling), well be done using both kinds of resource, with retention, there is a need to offer more
to distribution of and without forcing a long-term choice incentives and address common barriers.
labour by specialism, of sector upon the most talented people. One factor impacting on retention is that
and some of that Nowhere is this more urgent than in data UK universities are seeing more and more
labour is computer science, which is an area of explosive growth interest in their intellectual property from
driven and not and rapid change that is attracting talent from big tech companies47.
human driven.” around the globe.
“Large institutions are slowly starting to
Ilya Zheludev, Chief Data There are a number of ways to make understand how they can build IP and then
Officer for Jasmine22. moving between sectors a natural part retain it and monetise it for their own needs. If
of the data science career path. This is the UK government is interested in furthering
important on a variety of timescales from the narrative of commercialisation of IP, it is
career move to week-long study through natural progression that individuals working in
a secondment/internship. academia could be encouraged or could wish
to participate actively in commercialisation
In particular, there is a need to address of knowledge. I have definitely seen a trend
the problem of the ‘one-way door’ out where young people are interested in going
of academia which makes it difficult for into academia and being part of a university
researchers to return after spending time in that allows them to retain IP of anything they
industry or government, and one enabler to uncover themselves. That builds frameworks
this movement is the ability of researchers around allowing those individuals to turn that
to keep publishing when they are outside into commercial operations. I think that is a
of academia. great step forward.”
Ilya Zheludev, Chief Data Officer for
Jasmine 22.
47. C
arey S. 2016 How UK universities are leveraging their intellectual property to benefit from the tech boom.
Techworld. 8 November 2016. See https://www.techworld.com/data/how-uk-universities-are-leveraging-their-ip-
benefit-from-tech-boom-3648861/ (accessed 15 April 2019).
66 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER FIVE
Recommended action
Commercialise research
The ways that universities encourage and
support researchers in commercialising
research and building spin-outs can
influence researchers’ abilities to hold
joint appointments between industry
and academia. Universities may wish to
consider their strategies for research
commercialisation and policies on intellectual
property in order to build an environment that
supports cross-sector roles more freely.
Dynamics of Data Science: How can all sectors benefit From data science talent? 67
CHAPTER FIVE
68 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER FIVE
Dynamics of Data Science: How can all sectors benefit From data science talent? 69
CHAPTER FIVE
Model Model
UKRI Future Leaders Fellowships Royal Society Entrepreneur in Residence
The Future Leaders Fellowship (FLF) is UKRI’s scheme
flagship talent scheme, which aims to develop The Royal Society Entrepreneur in Residence
the next generation of research and innovation (EiR) scheme aims to increase the knowledge
leaders in the UK. It will recruit and retain and awareness in UK universities of
rising stars by attracting the brightest and cutting-edge industrial science, research
best from at home and across the world. The and innovation49. The scheme provides
FLF scheme will provide long-term funding opportunities for outstanding industrial
for each Fellow (up to £1.2 million over an scientists and entrepreneurs to spend time
initial four years, with an option to extend to working in a university to expose university
seven), allowing them to tackle difficult and staff and students to state-of-the-art industrial
novel challenges. A total of 550 FLFs will be research and development, and the scientific
awarded from 2019/20 to 2021/22 across challenges faced by industry. The scheme
six separate rounds, marking a significant also allows universities to gain from expert
investment to grow the UK’s research and advice aimed at promoting innovation and
innovation base. Although the FLF scheme is the translation of research by universities,
not prescriptive of the research and innovation and build confidence and understanding of
areas it supports, and Fellowships will be business and entrepreneurship among staff
awarded on a competitive basis, there are and students.
several features that should make it attractive
to those working in data science and AI. Model
Whereas most existing Fellowship schemes Royal Society Industry Fellowship scheme
fund only academic researchers, FLF also The Royal Society Industry Fellowship is
supports individuals in industry, as well as a paid secondment scheme for academic
those working at the interface of academia, scientists who want to work on a collaborative
industry and the public sector, encouraging project with industry and for scientists in
a new paradigm in career path that is mobile industry who want to work on a collaborative
across all three. Operating across the breadth project with an academic organisation50.
of UKRI will allow Fellows to take the most Providing a basic salary for the researcher
cross-cutting and interdisciplinary approaches and a contribution towards research costs,
to research and innovation. The open remit the Fellowship aims to enhance knowledge
of the call allows for Fellowships to be held transfer in science and technology between
across a spectrum – from those with a those in industry and those in academia in
background in AI wishing to apply their skills the UK. The scheme supports researcher-
to a wide range of disciplines and challenges, mobility and has run for over 30 years,
to those who are from different disciplinary bridging industry and academia for hundreds
backgrounds, where AI could make a of scientists.
transformational contribution to that discipline
or where that discipline could be brought to
bear on the development of AI.
70 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER FIVE
Model Model
Royal Commission for the Exhibition of 1851 Uber AI Residency (US)
The Royal Commission for the Exhibition of Established in 2018, the Uber AI Residency
1851 awards three-year research fellowships is a 12-month training programme for recent
to early career postdoc scientists or college and Master’s graduates, professionals
engineers of exceptional promise51. The who are looking to reinforce their AI skills,
Fellowship, which was founded in 1891 and and those with quantitative skills and interest
has initiated the careers of thirteen Nobel in becoming an AI researcher at Uber AI Labs
laureates to date, is open to all nationalities or Uber Advanced Technologies Group (ATG).
and fields of science, including physical or Uber AI Residents have the opportunity to
biological sciences, mathematics, applied pursue interests across academic and applied
science, and any branch of engineering. research. Uber is committed to an open and
The Commission also awards Industrial inclusive research mission that benefits the
Fellowships to encourage industry – community at large, including contributing
academia collaboration at doctoral research papers to top conferences and taking part in
level, Industrial Design Studentships for open-source projects.
postgraduates and, in partnership with the
Royal Academy of Engineerig, graduate Mechanism
Enterprise Fellowships for entrepreneurs. Approaches to commercialisation
Mechanism Model
Residencies IP free zone, Department of
Computer Science and Technology,
Model the University of Cambridge and beyond
Microsoft AI Residency program The IP free zone at the University of
(US/UK) Cambridge is part of a more general
The Microsoft AI Residency program is a framework set up by the former Head of
12-month role designed to advance a career Department of Computer Science and
in machine learning research and engineering. Technology, Professor Andy Hopper FRS. The
The goal of the AI Residency is to help strategy has been to minimise barriers to the
residents become creative and productive formation of new companies while aligning
AI researchers, scientists and engineers. incentives for staff and students, avoiding IP
Residencies are open to BSc, MSc, and PhD issues, providing mentoring, being helpful in
graduates with substantial coursework in, but every possible way, and not picking winners.
not limited to: computer science, electrical Furthermore, this has been a cradle-to-
engineering, data science, mathematics, grave approach ranging from undergraduate
physics, economics, human – computer lectures to the maintenance of an industrial
interaction, and computational biology. business club beyond the department. A
total of 270 companies have been formed by
staff and students (including Raspberry Pi),
of which 50% are active with revenues of $1
billion, and 18% sold for over $40 billion.
51. Royal Commission for the Exhibition of 1851 (no date). Our awards. See https://www.royalcommission1851.org/
(accessed 15 April 2019).
Dynamics of Data Science: How can all sectors benefit From data science talent? 71
CHAPTER FIVE
72 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER FIVE
Dynamics of Data Science: How can all sectors benefit From data science talent? 73
74 Dynamics of Data Science: How can all sectors benefit From data science talent?
Chapter six
Image © CasarsaGuru.
Dynamics of Data Science: How can all sectors benefit From data science talent? 75
CHAPTER SIX
76 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER SIX
Need: Opening up data and providing The public sector needs to work out how to
secure access. widen access to give university researchers “We have amazing
To get the best value from data for the access to better public data. Continuing to data in the NHS and
widest range of organisations means that ensure that data generated by charity-funded that is definitely
opening data in a secure and well-governed and publicly funded research are open by a resource that is
way should enable societal benefit to be default will be critical in supporting wider uses worth staying for.”
accessed most easily. of research data. Journals should normally
insist, as a condition of publication, on data Dr Amy Nelson, Senior
being made available to other researchers in Research Associate at UCL
Institute of Neurology and
Recommended action their original form, or via appropriate summary
a junior doctor.
statistics where personal information is
Encourage data sharing involved. Of course such a policy has
where possible additional potential benefits of enhancing
Greater transparency of private sector data reproducibility in research and increasing
could help build public trust in the use of data transparency of decision making, in both the
and how their data is used for decision-making public and private sectors.
purposes. The public sector could usefully
consider how to widen access to its data, The Royal Society has recently published
including sharing data, and data challenges to the report, Protecting privacy in practice,
researchers. Journal editors should normally which sets out how use of government data
ensure that data are being made available to could be enabled by Privacy Enchanting
other researchers in their original form, or via Technologies (PETs).
appropriate summary statistics where sensitive
personal information is involved.
Recommended action
Standardisation of open data can enable the
sharing of data and challenges, providing an Donate data science talent
amenable data environment for researchers.
There is value in enabling data scientists to
However, the majority of data held by
donate their time to applying data science
organisations is not open, and unlikely to be
to societal challenges. For example, through
so due to personal disclosure and commercial
probono project work along the lines of
restrictions. In areas where there are datasets
DataKind UK, RSS Statisticians for Society
unsuitable for general release, further progress
and hackathons.
in supporting access to public sector data
could be driven by creating policy frameworks
or agreements which make data available to
specific users under clear and binding legal
constraints to safeguard their use, and set out
acceptable uses. Government should further
consider the form and function of such new
models of data sharing.
Dynamics of Data Science: How can all sectors benefit From data science talent? 77
CHAPTER SIX
78 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER SIX
52. Royal Statistical Society. 2019 Statisticians for Society. See http://www.rss.org.uk/RSS/Get_involved/Statisticians_
for_Society/RSS/Get_involved/Statisticians_for_Society.aspx?hkey=c7977c58-1558-495a-9e5a-e99d64ea9cfd
(accessed 15 April 2019).
Dynamics of Data Science: How can all sectors benefit From data science talent? 79
CHAPTER SIX
Mechanism Model
Data stores UK Data Service
The UK Data Service is the UK’s only
Model nationally funded research infrastructure
The London Datastore for the curation and provision of access
The London Datastore is a free and open data- to social science data and its practices,
sharing portal where anyone can access data especially around secure access to data and
relating to the capital53. It is one of the Greater data curation, have been influential across
London Authority’s (GLA) flagship projects the world. Funded by the Economic and
and is a platform through which many of the Social Research Council (ESRC) to meet the
Smart London Plan objectives are delivered. data needs of researchers, students and
Researchers are encouraged to visualise or teachers from all sectors, its unique collection
build apps from the data available on the site. of social science data resources includes
major UK government-sponsored surveys,
Model cross-national surveys, longitudinal studies,
ONS API UK census data, international aggregate,
The Office for National Statistics API business data, and qualitative data. It brings
makes datasets and other data available together several important past investments
programmatically, allowing researchers to including the Economic and Social Data
filter datasets and directly access specific Service, Question Bank, Qualidata and
data points. Census Programme.
53. London Datastore 2019. Welcome to the Datastore. See https://data.london.gov.uk/ (accessed 15 April 2019).
80 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER SIX
BOX 2 Mechanism
Data stores
Open Data Skills Framework
(The ODI) Model
Transport for London API
Open data is a relatively new field. Its Transport for London (TFL) is the local public body
potential is being realised increasingly as responsible for public transport in London54. Every
it is slowly integrated into the strategies of year, TfL ensures the transportation of 1.37 billion
organisations. Those working with open people, with a network length of 402 km, which
data or on open data initiatives often is equivalent to 83.6 million km travelled per
have to learn the skills as they go. There year55. Over the past ten years TfL has made a
is no clear language to describe the significant amount of data accessible to the public
knowledge and skills of those working free of charge, including timetables, service status
with open data; nothing with which to and disruption information. This has allowed
benchmark a single individual’s expertise, the market to develop exponentially with the
and point to where they are in their introduction of new products and services. TfL is
learning journey. now considered as a leader in publishing open
data through APIs, the Cloud, the internet and
This is why the Open Data Institute has across its physical network. It has created over
created the open data skills framework, 700 jobs and brought £14 million per year in GVA,
a simple, three-tier framework that enabling development of UK’s skills in data.
describes the knowledge and skills of
anyone interacting with open data, from Mechanism
beginner through to expert level56. Offices of Data Analytics
54. Transport for London 2019. Facts & figures. See https://tfl.gov.uk/corporate/about-tfl/what-we-do/london-
underground/facts-and-figures (accessed 15 April 2019).
55. Deloitte 2017. Assessing the value of TfL’s open data and digital partnerships. See http://content.tfl.gov.uk/deloitte-
report-tfl-open-data.pdf (accessed 15 April 2019).
56. Open Data Institute. 2016 Open Data Skills Framework. See https://theodi.org/article/open-data-skills-framework/
(accessed 15 April 2019).
57. E
aton M and Bertocin C 2018. State of Offices of Data Analytics (ODA) in the UK. See: https://media.nesta.org.uk/
documents/State_of_Offices_of_Data_Analytics_ODA_in_the_UK_WEB_v5.pdf (accessed 15 April 2019).
Dynamics of Data Science: How can all sectors benefit From data science talent? 81
CHAPTER SIX
Mechanism: Model
Data Centres / Institutes National Innovation Centre for Data (NICD)
The National Innovation Centre for Data
Model (NICD) is a unique new facility that delivers
Health Data Research UK (HDR UK) data analytics skills into industry and the
Health Data Research UK is uniting the public sector by exploiting the knowledge and
UK’s health data to make discoveries that expertise currently locked within universities.
improve people’s lives. By bringing together A flexible rolling programme of collaborative
the sharpest scientific minds, and providing projects focused on organisations’ specific
safe and secure access to rich health data, challenges and opportunities will transfer
it aims to better understand diseases and practical data skills into the workforce of
discover new ways to prevent, treat and cure those organisations. These projects will be
them58. Its vision is for large-scale data and supported by a range of related activities,
advanced analytics to benefit every patient including awareness-raising events, themed
interaction, clinical trial, biomedical discovery business and technical seminars and technical
and improve public health. To achieve this, training courses. As a result of engagement
HDR UK is leading an ambitious training and with the Centre, organisations will be able to
talent programme, and will create a cohort increase their productivity by optimising their
and network of thousands of health data existing operations, and to grow by launching
scientists spanning all career stages, from new data-driven products and services.
school-leaver to senior research manager
and international opinion leaders. The UK
has a rich and diverse scientific talent base,
thanks to the strength of the NHS, its academic
institutions and innovative scientific and digital
industries. HDR UK plans to harness this, bring
on board international peers, to create an
intelligent cohort of health data scientists that
will dramatically change medical research,
and open up new, faster, smarter pathways to
patient care.
58. Medical Research Council. 2019. Health Data Research Institute. See https://mrc.ukri.org/about/institutes-units-
centres/uk-institute-for-health-and-biomedical-informatics-research/ (accessed 15 April 2019).
82 Dynamics of Data Science: How can all sectors benefit From data science talent?
CHAPTER SIX
Dynamics of Data Science: How can all sectors benefit From data science talent? 83
84 Dynamics of Data Science: How can all sectors benefit From data science talent?
Conclusion
Image © PeopleImages.
Dynamics of Data Science: How can all sectors benefit From data science talent? 85
CONCLUSION
Conclusion
Data professionals are in high demand from a vision for the sharing of data science
employers. Over the last five-and-a-half years, talent across all sectors. By identifying four
there has been a sharp rise in UK job-listings major areas of action with recommendations
for ‘Data Scientists and Advanced Analysts’ for addressing priority needs across the
(+231%) driven predominately by increased data science talent pipeline, from school to
numbers of vacancies for Data Scientists advanced professionals, we are hopeful that
(+1287%) and Data Engineers (+452%) we can achieve our vision for the UK to be a
leading data science research nation with a
This report has focused on Data Scientists and sustainable flow of expertise and a healthy
Advanced Analysts at the top end of analytical data science skills landscape.
rigour because this is where demand has
grown the most. However, the data shows This report also sets out a wide variety of
interesting results for data professionals across existing models and mechanisms that could
the spectrum. Moreover, our findings are likely be used more widely, from fellowship schemes
to underestimate the demand for data skills as to data stores, that represent good practice or
many jobs are not advertised online. Further innovation in supporting data science career
analysis is needed to quantify the number of development and mobility. The report features
employed workers per opening. several contributions from data scientists from
a range of backgrounds, organisations and
There is a clear need for collaborative, roles sharing their career experiences: these
sustainable mechanisms to develop data talent are also available as a separate publication,
in academia, and the charity, private and public Dynamics of Data Scientists: what data
sectors, and to allow data scientists to move professionals say about data science.
across these sectors. This report promotes
86 Dynamics of Data Science: How can all sectors benefit From data science talent?
Appendices
Image
Caption goes here.
Dynamics of Data Science: How can all sectors benefit From data science talent? 87
APPENDICES
Appendix 1:
Acknowledgements
Thank you to Will Markow and Jonathan Coutinho at Burning Glass Technologies for
providing the labour market data and advice with the analysis, the project steering group
for providing advice and guidance, the reviewers and all of those involved in the workshops
and case study interviews.
Sherry Coutu CBE, serial entrepreneur, former CEO, angel investor and non-executive director
Matthew Fryer, VP, Chief Data Science Officer, Hotels.com
Dr Robert Hercock, Chief Research Scientist in the British Telecommunications Security
Research Practice
Professor Marta Kwiatkowska FRS, Department of Computer Science, Oxford University
and Professorial Fellow of Trinity College
Professor Emma McCoy FRSS FIMA, Vice-Dean (Education), Faculty of Natural Science,
Imperial College London
Dr Tom Smith, Managing Director of the Data Science Campus, Office for National Statistics
88 Dynamics of Data Science: How can all sectors benefit From data science talent?
APPENDICES
Staff from across the Royal Society contributed to the development of the project
Dr Frances Bird, Policy Adviser
Previous Royal Society staff who contributed to the development of the project
Lara Gardellini, Programme Coordinator
Dynamics of Data Science: How can all sectors benefit From data science talent? 89
APPENDICES
Reviewers
This report has been reviewed by an independent panel of experts. The Review Panel
members were not asked to endorse the conclusions or recommendations of the report, but
to act as independent referees of its technical content and presentation. The Royal Society
gratefully acknowledges the contribution of the reviewers.
Reviewers
Professor Dame Wendy Hall, DBE, FREng, FRS, Regius Professor of Computer Science at the
University of Southampton, UK
Professor Frank Kelly CBE, FRS, Professor of the Mathematics of Systems at the Statistical
Laboratory, University of Cambridge, UK
Professor Tom McLeish FRSC, FRS, Professor of Natural Philosophy, University of York
Dame Jil Matheson DCB FAcSS, former National Statistician of the United Kingdom
Hetan Shah, Executive Director of the Royal Statistical Society
Sir Bernard Silverman FRS, former Chief Scientific Adviser at the Home Office
Expert reader
Dr Jonathan Shaw, Technical Specialist in the Economics Department at the Financial
Conduct Authority and Research Fellow at the Institute for Fiscal Studies
90 Dynamics of Data Science: How can all sectors benefit From data science talent?
APPENDICES
Dynamics of Data Science: How can all sectors benefit From data science talent? 91
APPENDICES
Workshop participants
The Royal Society would like to thank all those who contributed to the development of this
project through submission of evidence and attendance at the following workshops.
92 Dynamics of Data Science: How can all sectors benefit From data science talent?
APPENDICES
Appendix 2:
Glossary
Algorithm: A set of rules a computer follows Java: A programming language.
to solve a problem.
Machine learning: A set of rules that allows
Artificial intelligence (AI): An umbrella term systems to learn directly from examples, data
for the science of making machines smart. and experience.
Dynamics of Data Science: How can all sectors benefit From data science talent? 93
APPENDICES
Appendix 3:
Data
This appendix includes additional data tables, explanations about the Burning Glass
Technologies methodology and other findings from the data.
Understanding the Burning Glass Skills are grouped for ease of analysis of
Technologies methodology: Skills clusters broader talent requirements. Burning Glass
Skills clusters are groupings of related skills. Technologies skills hierarchy and grouping
Burning Glass Technologies has developed has been created using a combination
a taxonomy of more than 500 skills clusters of hierarchical clustering algorithms and
by grouping skills that often travel together assessment of skills similarity based on job
in job postings. Clustering has been focused postings. Using a variety of distance metrics
on the most frequently occurring skills across including Cosine, Dice, Jaccard, and others,
a range of industries, in addition to skills in the similarity of all skills combinations is
emerging areas. Burning Glass Technologies determined. Depending on how close two
labour market analysts used the following skills cluster are together based on the
three criteria to group skills: similarity measures, they get assigned to the
same skills cluster. The final step included
• r elated skills eg the skill cluster ‘Statistical
manual reviews of the clusters to resolve
Software’ includes skills such as R, SAS,
any unclear cases.
and SPSS;
• s kills that travel together eg the skill cluster Missing salary information
‘Administrative Support’, includes skills such The number of postings and the average
as meeting planning/facilitation, calendar salary have been calculated for the year 2013
management, travel arrangements, and and the last four full quarters (July 1, 2017 –
appointment setting; and June 30, 2018). However, a large proportion
of postings have no salary attached so these
• s kills that are trained together eg the skill
results should be seen as indicative rather
cluster ‘Lean Manufacturing’ includes skills
than definitive. Intriguingly the salary for the
such as Kanban, Kaizen, Six Sigma, and
‘Data Scientists’ occupation showed only a
Lean Six Sigma.
small increase, despite the huge increase
in postings.
94 Dynamics of Data Science: How can all sectors benefit From data science talent?
APPENDICES
Qualification levels
Qualification levels requested for the 'Data
Scientists and Advanced Analysts' category
have broadly increased. In 2013, 34% of
such postings required Level 6 (first degree)
or Level 7 (MSc or upwards) skills, but by
2017/18 this had increased to 42%. This was
most acutely seen in the ‘Data Scientist’
occupation, where half of all postings
required Level 6 or 7.
Dynamics of Data Science: How can all sectors benefit From data science talent? 95
APPENDICES
Appendix Table 1
The classification system used by Burning Glass Technologies to group data science
and analytics jobs in order of increasing analytical rigour (left to right).
96 Dynamics of Data Science: How can all sectors benefit From data science talent?
APPENDICES
KEY
Data Scientists and
Data Systems Developers Data Analysts Advanced Analysts Framework category
Dynamics of Data Science: How can all sectors benefit From data science talent? 97
APPENDICES
Appendix Table 2
Regional data and Data Science and Advanced Analytics (DSAA) jobs.
98 Dynamics of Data Science: How can all sectors benefit From data science talent?
APPENDICES
Appendix Table 3
*Out of 8,157 job adverts included in this analysis. **Out of 27,033 job adverts included in this analysis.
Dynamics of Data Science: How can all sectors benefit From data science talent? 99
APPENDICES
Appendix Table 4
*Out of 8,157 job adverts included in this analysis. **Out of 27,033 job adverts included in this analysis.
100 Dynamics of Data Science: How can all sectors benefit From data science talent?
Dynamics of Data Science: How can all sectors benefit From data science talent? 101
102 Dynamics of Data Science: How can all sectors benefit From data science talent?
The Royal Society is a self-governing Fellowship of many
of the world’s most distinguished scientists drawn from all
areas of science, engineering, and medicine. The Society’s
fundamental purpose, as it has been since its foundation
in 1660, is to recognise, promote, and support excellence
in science and to encourage the development and use of
science for the benefit of humanity.
9 781782 523956
ISBN: 978-1-78252-395-6
Issued: May 2019 DES5847