Iphone in Vivo: Video Analysis of Mobile Device Use: April 2013
Iphone in Vivo: Video Analysis of Mobile Device Use: April 2013
Iphone in Vivo: Video Analysis of Mobile Device Use: April 2013
net/publication/266653790
CITATIONS READS
50 305
3 authors, including:
Moira Mcgregor
Stockholm University
18 PUBLICATIONS 263 CITATIONS
SEE PROFILE
All content following this page was uploaded by Moira Mcgregor on 23 September 2017.
1031
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
design, but in terms of the embedding of device use in the diary-method examined issues surrounding cost and plan
world of co-present others. usage, relatively neglected for research since then [26].
PREVIOUS WORK Focusing specifically on smartphone use, three studies are
The massive growth in mobile phone use, and the more revealing. The first is Barkhuus and Polichar’s study of
recent explosion in mobile device functionality, has been smartphone use [1], where they document the importance of
one of the most notable changes to our modern world. As smartphones as combining multiple functionality in a
Merchant puts it “Mobile phones have rapidly been relatively ‘messy’ way. While this work documents
absorbed into the fabric of our day-to-day lives”, [22]. A revealing aspects of use, its reliance on interviews and
range of statistics can be drawn upon to argue for the diaries means that its description of use is indirect. There is
importance of mobile phones as a computing device. a lack of description of actual cases of smartphone use.
Smartphones outsell PCs, and recent surveys have reported
that mobile phones are used for longer than all other In contrast, Böhmer et al. [3] used logging of application
computers combined [17]. Tablet use is also one of the launches on the Android platform to present a detailed view
quickest growing, new, multi-purpose technology forms, on which applications are used on smartphones, alongside
followed by the growth in ebook readers and other devices an analysis of time and length of use. An interesting
with a similar form factor. Indeed with laptop computers concept introduced in this work is ‘application chains’ -
outselling desktops, one might even say that the majority of where users switch from one application to another in a
computing is already ‘mobile’. ‘chain’ of activity. While they report that the majority of
sessions consist of only one application launch, 32% of
While smartphones can be relatively simple to demarcate, sessions have two or more applications. Although this paper
the question of what devices are ‘mobile’, and what gives us an important view on what applications are being
mobility means for research can be harder to define. One launched, there is relatively little information on what is
approach is to see mobility in terms of the mobility of users then being done with the applications themselves.
themselves. For Brown and O’Hara [6], ‘mobility’ is a
concept that has analytic value as a worker’s concern. This Techniques of device logging such as this have much in
research documents how ‘mobility’ is something that has to common with experience sampling, such as MyExperience
be dealt with in terms of managing place - when and where and Momento [7, 9]. While these systems pioneered using
work will be done, who will be there, and what artefacts are instrumentation and user alerts on mobile devices to collect
need to be deployed to make places ‘work-place-able’. In a data about use and broader user interaction, they have not
similar way, Luff and Heath [20] discuss mobility yet focus been used as part of studying modern mobile device use.
instead on the manipulation of artefacts in collaborative One exception is the work of Church and her collaborators
settings. Here, their interest is on what they term ‘micro [8], which has focused on mobile search. This research
mobility’: the ways in which information - particularly makes extensive use of experience sampling (prompted
paper documents - come to be deployed in face to face through SMS messages), and web-based surveys of mobile
interactional settings. The dynamic is between the co- search. This work is particularly strong in understanding the
present revelation and obscuring of information. context and motivation behind mobile search. For example,
reporting on the importance of conversations as a prompt
Alternatively, the predominant user interface paradigm built for web search. Yet it also suffers from analysing reported,
into different devices can be seen as providing a definition rather than actual, use.
of mobility. Perhaps the most notable recent change has
been the growth in finger-based interaction with smaller Previous research thus leaves much about modern mobile
mobile devices. In terms of user interaction, devices can be device use unanswered. In particular, there is little in the
divided between ‘touch’ systems (predominately iOS and way of understanding the details of usage of particular
Android) and keyboard and pointer based systems applications and what shapes usage beyond simply what
(predominately Windows and OSX). With a market in flux applications are used when. For example, what prompts the
this distinction is open to revision - although it does use of particular applications at particular times and
highlight a fairly distinct class of touchscreen devices with locations? What purposes are different applications put to?
distinct forms of use. As we will argue, this demarcation of How might application use be influenced by the setting,
mobile devices as touch systems also reveals that we have what role does task have in use, and how might smartphone
relatively little understanding of the differences in how use influence action more broadly? To be answered, these
these touch systems are used. sorts of questions demand a different method.
USING VIDEO TO STUDY MOBILE DEVICE USE
A number of classic studies have documented the use of
As we can see, the predominant methods in studying mobile
mobile phones, however this work mainly discusses the use
device use have been either post-hoc interviews, diary,
of conventional mobile phone features, such as text
logging or experience sampling based. While these methods
messages or phone calls. Weilenmann discusses shared
have considerable strengths they face two immediate
phone use [32], Murtagh describes mobile phone use in
challenges. The first is a lack of of coverage. Interviews
public [24], and Taylor [2] documents some of the broader
take place after use, diary reports are made just after use,
interactions around mobile phone use amongst teenagers
and experience sampling depends upon sampling situations
being some of the most notable. Palen’s voicemail-based
of use. These reports are not likely to capture every
1032
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
1033
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
The participating smartphone users were recruited from services via the websites instead. Prominent in the
advertisements in local cafes, and adverts on visitor smartphone usage captured was the use of the internet to
websites. We recruited a mix of five groups of locals and browse for information, and the iPhone’s maps application
five groups of visitors, volunteering to let us record them to find and navigate on foot and public transport.
during a ‘typical’ day out. Four of our individuals were
1. Visit to ferry and music concert
female, and one male, and two of our couples were male/
2. Walk through the city to collect bike from repair shop,
female, and three all female. Our participants were recruited
followed by cycling to music concert.
with the request they take a ‘typical day out in Stockholm’.
3. Lunch followed by shopping trip
All the participants were fluent in English, and we asked
them to interact in English - although some participants 4. Visit to botanical gardens
switched to other languages for parts of their trip. 5. Visit to church and walk across the Stockholm
6. Visit to local town outside city
Our screen capture app was written for the iPhone. Due to a 7. Visit to Reimersholm and then to the city hall
range of issues, seven of the ten participants made use of an 8. Visit to Old Town to watch changing of the guard
iPhone supplied by ourselves. When running the trial, two 9. Visit to food hall
participants carried Android phones which could not have 10. Shopping trip and visit to gardens
their screens recorded. Another three participants requested Figure 2: list of activities
that we did not install any software on their phone and two
participants did not have data access enabled on their Multiple involvements: Divergent and convergent
phone. For these reasons, only three of our participants had One of the most prevalent features of the video recordings
our recording software installed on their own phone. For the is the extent to which device use takes place alongside
remaining seven participants we instead supplied them with involvements with co-present activities and other people.
our own iPhone with the recording software pre-installed. These interaction can be characterised as ‘multi-modal’, in
For the participants that went out carrying our device, they that interaction takes place with the device but also through
nearly exclusively used our device which was recorded. In spoken conversation, or ‘talk-in-interaction’, [27]. What is
one case a group of two participants used both devices - one also clear is that the device use itself shapes the interactions
using their own phone, and one our iPhone. In two that unfold. If mobile device use takes place with others
additional cases text messages and phone calls were who are co-present, device use is frequently brought into
conducted on the users’ own device, but the supplied the interaction - what Mondada [23] characterised as
recording device was used for their other smartphone use. ‘convergent’ interaction. In contrast we can identify
To record their interactions we equipped participants with ‘divergent’ interactions [23] where the use of the device is
small GoPro portable cameras worn over their neck in map- kept relatively separate from interactions with others.
bags. Audio was recorded with a small external microphone In figure 3, one tourist is talking to a researcher while on
attached to the outside of the bag. We combined the screen the subway. She pauses in conversation before moving on
recordings, with the multiple camera angles to produce to interacting with her phone to read some details about the
composite videos of each participant’s trip. These ranged in section of Stockholm she is visiting. She returns to the
length, but were limited by the battery life of the GoPro conversation by reading out a name from the website she
cameras (around four hours). has found, and then returns to reading. In this clip the
The analysis of the data grows out of ethnomethodology reading of the webpage takes place mainly without
and conversations analysis. Our goal was not to provide a interaction with the researcher who is sitting in the seat
statistical breakdown of the uses of the mobile device. In opposite. Moreover, the visitor ‘reading’ is something that
many ways this is already well catered for in the existing is visible in their orientation to the device. In the short
log-based literature. Rather our goal was to better exchange with their companion, the user lifts their head and
understand the details of particular situations of use. In reads aloud, keeping her gaze on the phone. In this clip the
individual and group data analysis sessions we surveyed the involvement of the participant with the conversation and
journeys, editing the 24 hours of video down to 205 one to reading are kept separate - her attention, gaze, and body
two minute clips for focused analysis, of which 12 were compartment used to keep apart her engagement with the
transcribed and analysed in depth. Our goal in this analysis device from her participation with her companion.
was to understand both in a broad sense how mobile device In studies of talk and interaction one recurrent concept is
use was organised, but also how specific incidents of use that of ‘multi-activity’ settings where participants are
unfolded, and what contingencies users brought to bear. engaged in multiple streams of activity simultaneously [11,
RESULTS 15]. One well researched example of this is in driving and
While obviously the device usage we recorded was conversation - where the activity of driving plays a role in
fashioned to an extent by our recruitment, our participants regulating the flow of conversation. Drivers and passengers
conducted a range of different visits (figure 2). In doing so regulate their talk around the commitments of the driver to
they also used a wide range of applications including: Yelp, safe and competent driving. In her analysis of car travel
Facebook, Instagram, Google Maps, Phone, Web browsing Mondada refers to cases of ‘convergent’ or ‘divergent’
and SMS. Users provided with our own phone did not use interaction.
the iPhone’s Facebook or mail clients, but accessed these
1034
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
Figure 3: Talk and device use - divergent Figure 4: Talk and device use - convergent
That is, settings where the participants converge in mutual The participant reads out a spelling, at the same time as
orientation, or alternatively, diverge from that common typing it in. This spelling is confirmed by the companion,
interaction. Co-participants: “can mutually align [...] and but at this point Google makes a suggested autocompletion
display shared attention towards common objects; they can of the incorrect spelling (from saa to saami). The tourist
dis-align as well, in not attending to the other’s talk.” [23] then reads out the suggested spelling emphasising an
As she explains, even in cases of ‘divergent’ attention, elongated syllable. On conducting the search a conversation
parties are still engaged with each other and can time ensues about the search before a dismissal of one of the
utterances and actions with respect to each other. Figure 3 is result (“big teeth”). Clearly the search and typing are
one example of ‘divergent’ talk and device use, in that the arranged in concert with the conversation that is taking
participant keeps each involvement relatively separate. The place. The search responds to and is a source for the
two or so minutes spent reading the webpage are spent conversation. While in this case the device screen does not
mostly without talking to their companion. seem to be directly available to the second conversationalist
the participant reads out aspects of the search (“sami”,
In contrast, in figure 4 the search and the conversation are “lapland people”, “big teeth”). In this way although her
more closely related. Here device use and interaction with focus of attention is on the device, her involvement in the
others is convergent, in that the results of the search at conversation continues. The use of the device is part of the
different points are brought into the conversation. same activity as the conversation.
Interaction here takes place between the researcher and the
participant about the Sami people and a singer. This At a smaller scale of interaction - with specific user
conversation prompts a web search. As the search is interface events, we can see in a similar way how the device
conducted ("Is Joni Mitchell sami?") the conversation does use can converge with interaction. Figure 5 is an example of
not stop, but rather the search provides topical resources for where a user-interface gesture is also an interactional
the conversation. Particularly interesting is the way in gesture. Here the user is talking about an area of the map
which the spelling of sami is approached. and as she does this, the participant puts two fingers at each
1035
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
A: i just check [2.0 ] which road do we take Occasioned search and search chains
B: yes The three data extracts above display interactional features
A: eh yes we will see i think we actually where of mobile device use. They are also examples of
is it where are we information search. In figure 3 the participant reads
A: hh [*] e:r [*] e [*]
information about areas of Stockholm that they might go
and visit. In figure 4, a web search is initiated around Joni
Mitchell. Lastly, in figure 5, the participant checks where
they are and what route they need to take to go through a
forest that is nearby. As we mentioned earlier, much of the
mobile device use we captured can broadly be characterised
[Pinches to zoom in on the blue dot] as ‘information search’ of some sort. Since our participants
were making visits to the city this provided a context to
B: ah ha much of the searching for information. For example,
participants searched for information about what they could
[A moves map over to
do next, the area they were exploring, public transport
the right, revealing
the forest and hiding information about how to get to particular destinations, or
the suggested route] information about local cafes, restaurants or shops.
As a time and attention-consuming activity, search is
A: because I want to
something which requires - at least some of the time - focal
go through the
forest[*]---------- attention. Some search we would refer to as pre-meditated
search, that is search which takes place before an activity to
Figure 5: double duty gestures be part of planning or organising that activity. In earlier
work (pre-smartphone) on tourists this was characterised as
side of the area referred to, and pinches outwards to zoom
"pre-visiting" [4], getting information about a place before
in. The pinch enlarges the area of the map where the
the actual visit from printed and online sources.
referred to point is, as well as highlighting the point
between the fingers. While the finger movement is away What was striking about the information searching in our
from the desired destination, it does prevent the fingers study was how it was conducted on the way to or even as
from obscuring the gestured target. part of the activity itself; information would be gathered on
the way to a particular part of the city, or even on arrival at
We would describe this as a form of 'double duty' [30]. In
the attraction or area of interest. Figure 3 is an example of
this case the highlighting of an item worked in combination
this, in that the participant is on public transport on the way
with a user interface gesture. What it also underlines is that
to the city, and they start to read through a list of ‘city
interface actions frequently come to have a role to play in
highlights’ from the Time Out website. The use of resources
the conversation too. At times in our videos, interaction
in the environment is one reason why search may be
with the mobile device was done as a solitary task - either
delayed until the last minute. As you move through an
by a visitor on their own, or on a device turned away from
environment you gain information about that place, about
those co-present. Yet devices were also very frequently
the setting, your own mood and what a place is like. Any in-
shared and seen by others. For example, at the end of figure
situ search can then draw directly upon that information,
5 the participant scrolls the map to the right, revealing the
whereas a search conducted earlier could be premature or
forest as she does so. Her reference to the forest is then
waste time. A change in the weather, mood and the like,
easily understandable as ‘this forest’ (she also later points).
might affect the planned activity and any subsequent
As has been extensively documented in studies of searching that might be undertaken. We are reminded of
interaction around fixed displays, ‘awareness’ of what is March’s description of decision making in organisations,
going on amongst co-present interactionists extends to co- where decisions are postponed until as late as possible to
present objects and devices in this case, the screen and map allow for the collection of maximum amounts of
which is being manipulated [14]. Using the device, when it information [21].
is visible to someone else, produces actions that are
This last-minute searching that we observed we titled
‘oriented-to phenomena’ [12]. That is, those who see the
‘occasioned searches’, where searching is triggered by the
screen can be expected by others to have seen the action
environment or local events. For example, one visitor
and to understand what that action means. Indeed, they
(travelling on their own) walks past a postal museum. He
often comment on, correct or repair the on screen actions of
searches for information on his iPhone about the museum,
the device user. Broadly speaking, much of our mobile
reads the short summary and then walks into the museum.
device use shares this ‘double duty’ quality - with usage
After picking up an information sheet (which he quickly
oriented to the device itself (to get it to do what the user
discards) and a quick browse of the bookshop he returns to
desires) but also as something that is seen by others. This is
walking along the street. In this case the physical
an interactional role of gesture that goes alongside the user
surroundings prompt an online search, which informed the
interface role.
visit. The search was occasioned by the circumstances. In a
related way, in figure 4, this search was occasioned by the
1036
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
1037
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
1038
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
deviations from the route, since these will not initially be ordinary use because of how that device fits with
rendered. Figure 8 shows one example of a pedestrian interaction itself. Luff and Heath describe ‘micro-mobility’
walking in the wrong direction, which is seemingly - the ways in which a device would fit with physical bodies
confirmed by the map. Yet after 18 seconds it is revealed and interactional commitments.
that the map has incorrectly plotted their moving position.
Developing this notion further we can see that the form of
When a route was deviated from, at least it was often not a mobility here goes beyond simple portability. This
large challenge to navigate in the general direction of the mobility-in-interaction encompasses the broader range of
suggested route, or to navigate back to the route. The blue commitments and activities that we undertake in our daily
line on the map was still useful even if it was not literally lives. That mobility is the way in which a drink can be
followed (see, for example, figure 5). It is clear that much ordered and that transaction interleaved with looking
of the design of the Maps is optimised for driving. This something up on a map. Mobility is what allows
perhaps is wise since the majority of journeys in the US are information to be looked up as one walks past a museum,
taken by car. Yet for pedestrians this means that the map immediately seeing electronic sources as one walks into the
tool, while useful, does have shortcomings. actual physical museum.
DISCUSSION It may be the case that the value of mobile devices come
We found that the recording methods described here less from their explicit portability, and instead from how we
provide a unique and powerful view on interaction around can weave themselves into our ordinary actions and
mobile devices. The data we have included highlights a interactions. That is, mobility not simply in terms of
range of different aspects of device use. The news here physical size but mobility in vivo. Mobile devices fit with
includes the ‘double duty’ of interactional gestures, the our lives and actions in ways that conventional technology
ways in which information search is occasioned by local never could. We would argue that mobility in vivo is a topic
activity, and features of using the map such as ‘walking the that still rewards renewed research interest.
blue dot’ This would have been impossible to find without a
CONCLUSIONS
method that supported this sort of detailed analysis.
The data here reveals a world of device use that using
There are of course some limitations. As with any complex previous methods is hard to access. It reveals how
device the variety of applications and use are determined interaction with smartphones such as the iPhone in vivo is
largely by the user and their situation. Our data then is threaded through other activities. We documented the
obviously somewhat constrained in that it focuses on usage differences between convergent and divergent mobility,
on a particular type of day (non-work), in one city. Our how gestures on the device are seen by others, the ways in
reliance on providing a device for the majority of our data which searching for information is arranged, and lastly we
collection also constrains considerably the type of talked at length about how the map is used. Our focus
application and device usage we collected (alongside our throughout this analysis has been on understanding the
reliance on one model of mobile device). We have no relationships between device actions, interactions, physical
examples of game playing, for example, even though this is actions, movement and conversation.
one of the most popular categories of applications. With the
In closing we would argue that the method we have
reliance on mobile cameras, the perspectives and view that
provided here gives us considerable potential to move to a
is recorded can at times inadequate for analysis. Yet this is
better understanding of device use, and of course how these
balanced by the inclusion of a clear uninterrupted recording
devices might be better designed. We have resisted detailed
of the onscreen activity on the mobile phone. This makes
design suggestions, instead attempting to build a more
available the ‘workplace objects’ for analysis in a way not
substantive understanding of what it is to use a mobile
previously possible; something that would be harder to
device. Indeed, it may be that the touch and surface
access with traditional ethnographic methods. In turn the
computing literature is too focused on developing the latest
use of portable and wearable cameras supports collecting
interaction technique to engage properly with situations of
data on interactions that would have been impossible to
mobile use. Device use is dependent upon and threaded into
record using fixed cameras.We have also developed a
what goes on around us. It may be that rather than pushing
version of the software that records audio alongside screen
us away from the world around us, our mobile devices are
interactions, and we hope to capture the device camera as
instead just another thread in the complex tapestry of
well. We intend to make this software available to other
everyday interaction.
researchers, to assist in future studies.
ACKNOWLEDGEMENTS
Mobility in vivo
We thank Marek Bell for writing the recording software, the
A second concept that our data casts light on is mobility
helpful comments on the paper by the anonymous reviewers
itself. While mobility is a foundation for much technology
and our colleagues at mobile-life. The research was made
design, it has had surprisingly little analytic attention within
possible by a grant from the Swedish Governmental Agency
HCI. Our research builds on the work of Luff and Heath - in
for Innovation Systems to the Mobile Life VinnExcellence
particular the move away from mobility as mere portability,
Center, in partnership with Ericsson, Microsoft, Nokia,
to ‘mobility in interaction’ and the myriad ways in which a
IKEA and the City of Stockholm.
particular device may or may not fit with situations of
1039
Session: Reflecting on Phones CHI 2013: Changing Perspectives, Paris, France
1040