Human Resource Management in The Age of Big Data: September 2016
Human Resource Management in The Age of Big Data: September 2016
Human Resource Management in The Age of Big Data: September 2016
net/publication/308608900
CITATION READS
1 2,087
1 author:
Benjamin Erb
Ulm University
32 PUBLICATIONS 74 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Benjamin Erb on 25 September 2016.
Benjamin Erb
This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license,
Title Image
http://negativespace.co/photos/skygarden/
HUMAN RESOURCES & RECRUITING IN THE AGE OF BIG DATA 3
Abstract
Big data and data analytics enable novel kinds of data-driven decision making
and management strategies in human resources. Approaches such as people analytics
have already been implemented by corporations and replace traditional procedures for
recruiting, hiring, and staff development. Due to the disruptive force of automation and
the momentum of artificial intelligence in other fields, we argue that human resource
management will increasingly be transformed and reshaped by these developments.
There are arguments that heavily data-driven approaches are beneficial for both the
workforce and companies and that they are even necessary for the work environment of
the future. On the other hand, there are also cautions against such rule of algorithms
due to privacy implications and ethical issues.
HUMAN RESOURCES & RECRUITING IN THE AGE OF BIG DATA 4
Contents
Introduction 5
Background 5
Big Data, Data Mining & Machine Learning . . . . . . . . . . . . . . . . . 5
People Analytics: Data Analytics in Human Resources . . . . . . . . . . . 6
Conclusion 10
HUMAN RESOURCES & RECRUITING IN THE AGE OF BIG DATA 5
Introduction
Background
The term big data describes a shift of paradigms for data processing that emerged
when storage capacities had been exceeded to an extent where traditional mechanisms
became inappropriate (McAfee, Brynjolfsson, Davenport, Patil, & Barton, 2012). Big
data is often defined using multiple dimensions of data (Laney, 2001): volume (i.e.,
the quantity of data), variety (i.e., the structure of data), and velocity (i.e., growth
of data). Recently, the three dimensions have been complemented with additional
properties which represent challenges for (big) data processing: variability (i.e., varying
consistency of various data) and veracity (i.e., varying quality of various data).
Big data has been enabled mainly by a number of advances in computer science
(e.g., distributed computing and data mining), information technology (e.g., cloud
computing and virtualization), and data storage technology (e.g., inexpensive mass
storage devices). For instance, Google’s MapReduce paradigm (Dean & Ghemawat,
2008) provides a scalable approach for computations on Petabytes of data.
HUMAN RESOURCES & RECRUITING IN THE AGE OF BIG DATA 6
Apart from the pure scale of data, big data has been accompanied by a number of
advances in data analytics, also known as knowledge discovery, data mining, and to
some extent, artificial intelligence. The application of statistical methods to large-scale
data sets aims for the extraction of information and the identification of insights (Chen,
Chiang, & Storey, 2012). This includes a lot of different techniques and methods (Han,
Kamber, & Pei, 2011; Leskovec, Rajaraman, & Ullman, 2014), such as text mining
(analyzing textual data; e.g., sentiment analysis of user comments), clustering (grouping
of similar items; e.g., detecting customer groups), classification (categorizing novel
data into known structures; e.g., spam detection), real-time analytics (low-latency
processing; e.g., high-frequency trading), or anomaly detection (the identification of
outliers in data patterns; e.g., credit card fraud detection).
Data mining techniques are also used to create, feed and tune predictive models.
The case of a Target customer who has been identified as being pregnant due to
slight changes in her buying behavior (Hill, 2012) is a textbook example for predictive
analyses and major implications thereof. Machine learning represents another approach
for devising models in data analysis by feeding data as input and generating predictive
models like artificial neural networks.
The thorough success of big data and data analytics in many business sectors also
reached HR since the early 2010s, mainly fueled by the availability of widely collected
HR data as well as strong expectations regarding valuable predictions for HR. Data
to be considered for such analyses range from detailed candidate profiles and various
individual performance indicators to complex behavioral data like mail communication
and interaction patterns (Peck, 2013). Especially the fusion of different data sets,
e.g., the identification of many-sided, multifaceted prediction variable sets, provides
extremely relevant insights and differs from simpler, traditional regression analyses.
So-called people analytics covers data analytics and predictive analysis of the entire
workforce and in all stages of employment (Waber, 2013). This includes predictions in
terms of skill analysis, selection and hiring of employees, talent identification, attrition
HUMAN RESOURCES & RECRUITING IN THE AGE OF BIG DATA 7
retention problems (Sullivan, 2013). Similar to digital recruiting and hiring processes,
solutions for people analytics require higher initial expenditures, but provide scalable
mechanisms and cost-effective solutions in the long run (de Romree et al., 2016).
In a data-driven HR economy, employees receive a more accurate placing as well
as fair opportunities according to their potential (Peck, 2013). Another common
argument that favors data-driven HR over traditional HR is the neutrality of data and
algorithms, compared to humans. Even the most experienced HR managers cannot
entirely set aside their exposure to a wide range of human biases. When selection and
hiring algorithms are configured to focus solely on the key requirements of applicants,
the algorithms yield teams more diverse and functioning than human recruiters would
choose (de Romree et al., 2016; Peck, 2013). People analytics is also a valuable tool
for supporting change management (Waber, 2013). Furthermore, predictive analytics
known from the medical sector also can be used for the early detection and prevention
of work-related conditions, such as occupational burnouts (Waber, 2013).
A smaller countermovement opposes the idea of bias-free algorithms and brings the
role of algorithms into question. Their arguments are not specific to people analytics,
but apply to other aspects of big data related to human behavior, predictive analytics,
and computational decision making.
While algorithms are per se free of any biases, their programmers are not. Hidden
assumptions can very well influence the actual behavior of the algorithm compared
to the intended and designated behavior. A more imminent threat is the latent
information hidden in the data used as input. For predictive analytics, algorithms
rely on complex models to project future states. These models are typically built and
generated using data from the past. If the input data is subject to a bias, this bias
inevitably becomes part of the model. Even worse, the predictions will then again be
subject to the underlying bias. A substantial machine bias has been investigated by
Angwin, Larson, Mattu, and Kirchner (2016) for crime prediction, where algorithms
that are already in use today showed a consistent bias against minorities.
HUMAN RESOURCES & RECRUITING IN THE AGE OF BIG DATA 9
A deep neural network is a machine learning approach where a program learns from
vast amounts of training data by building complex artificial neural networks. This
approach has recently achieved unprecedented success in certain problem domains
previously not solvable for computers, e.g., recognizing complex patterns or playing the
board game Go (Mnih et al., 2015). A downside of this approach is the general absence
of a human comprehensible model representing result of the learning process. When
such an algorithm is fed with 100,000 candidate profiles and their respective hiring
outcomes, once trained, the algorithm can then be used to predict the outcome of
new candidates. However, the rationale of the decision — while being 100% reasonable
(based on the training data) — is not necessarily obvious to any human. It is the sole
outcome of a highly connected network of weighted, artificial neurons reacting to a
stimulus (i.e., the profile data). As opposed to other algorithmic approaches, the
results of a machine learning mechanism may be even more prone to latent biases in
the data sets, as the repercussions of biases become less obvious to humans.
Morozov (2013) provides a comprehensive perspective on the implications of
algorithmic decision making and argues that a rule by algorithm, also called algocracy
represents a serious threat to democracy. Especially the usage of algorithms for
policing increasingly affects our everyday lives (Hickman, 2013). Data sovereignty and
the prerogative of interpretation are among the primary issues (Danaher, 2014), but it
is primarily the untransparent, and closed character of decision making processes based
on data mining, that represents the real menace (Morozov, 2013). Recent examples
of potential consequences became apparent in stock market fluctuations caused by
algorithmic trading or the manipulation of public opinion by corporate algorithms
that modify and filter social networking contents.
Apart from a dystopian fear of the rule of algorithms, people analytics also
raises several issues regarding privacy, ethics, and lawfulness. Data is the fuel of data
analytics, and in the case of people analytics, such data is always highly personal (Peck,
2013). So there is a thin line between detailed, individual performance metrics collection
HUMAN RESOURCES & RECRUITING IN THE AGE OF BIG DATA 10
Conclusion
References
Garvin, D. A. (2013). How google sold its engineers on management. Harvard Business
Review, 91 (12).
Han, J., Kamber, M., & Pei, J. (2011). Data mining: concepts and techniques. Elsevier.
Hickman, L. (2013, July 1). How algorithms rule the world. The Guardian. Retrieved
from https://www.theguardian.com/science/2013/jul/01/how-algorithms-rule-
world-nsa
Hill, K. (2012). How target figured out a teen girl was pregnant before her father did.
Retrieved from http://www.forbes.com/sites/kashmirhill/2012/02/16/how-
target-figured-out-a-teen-girl-was-pregnant-before-her-father-did/
Ignatius, D. (2016, August 11). The brave new world of robots and lost jobs. The
Washington Post. Retrieved from https://www.washingtonpost.com/opinions/
the-brave-new-world-of-robots-and-lost-jobs
Introna, L. D. (2000, December). Workplace surveillance, privacy and distributive
justice. SIGCAS Comput. Soc. 30 (4), 33–39. doi:10.1145/572260.572267
Laney, D. (2001). 3D data management: controlling data volume, velocity, and variety.
META Group.
Leskovec, J., Rajaraman, A., & Ullman, J. D. (2014). Mining of massive datasets.
Cambridge University Press.
McAfee, A., Brynjolfsson, E., Davenport, T. H., Patil, D., & Barton, D. (2012). Big
data. Harvard Business Review, 90 (10), 61–67.
Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J., Bellemare, M. G., . . .
Ostrovski, G., et al. (2015). Human-level control through deep reinforcement
learning. Nature, 518 (7540), 529–533.
Morozov, E. (2013). The real privacy problem. MIT Technology Review, 116 (6), 32–43.
Retrieved from https://www.technologyreview.com/s/520426/the-real-privacy-
problem/
Peck, D. (2013). They’re watching you at work. The Atlantic, 312 (5), 72–84. Retrieved
from http://www.theatlantic.com/magazine/print/2013/12/theyre-watching-
you-at-work/354681
HUMAN RESOURCES & RECRUITING IN THE AGE OF BIG DATA 14