Towards A Personal Automatic Music Playlist Genera
Towards A Personal Automatic Music Playlist Genera
Towards A Personal Automatic Music Playlist Genera
net/publication/228866304
CITATIONS READS
18 2,029
4 authors:
All content following this page was uploaded by Eugene Dermot Coyle on 02 June 2014.
Abstract. Large music collections afford the listener flexibility in the form of choice, which enables the listener to choose the
appropriate piece of music to enhance or complement their listening scenario on-demand. However, bundled with such a large music
collection is the daunting task of manually searching through each entry in the collection to find the appropriate song required by the
listener. This often leaves the listener frustrated when trying to select songs from a large music collection. In this paper, an overview
of existing methods for automatically generating a playlist is discussed. This discussion outlines advantages and disadvantages
associated with such implementations.
The paper then highlights the need for contextual and environmental information, which ultimately defines the listener’s listening
scenario. Environmental features, such as location, activity, temperature, lighting and weather have great potential as meta-data.
Here, the key processes of a basic system are outlined, in which the extracted music features and captured contextual data are
analysed to create a personalised automatic playlist generator for large music collections.
-1-
Towards a Personal Automatic Music Playlist Generation Algorithm: The Need for Contextual Information
songs that are similar to that song. Figure 1 outlines the From Figure 2, Listener A has a lot in common with Listener B
Content-Based Playlist generation procedure. in terms of their music collections, compared to Listener C. As a
As shown in Figure 1, the user is required to specify a seed song result, the Collaborative-Filtering System will recommend songs
and the number of songs required in the playlist. The seed song from Listeners B collection to listener A and recommend songs
represents the type of music that the listener wants to listen to. from Listeners A collection to listener B. Nothing is
The system then filters the music collection based on similarity recommended from Listeners C collection to either Listener A
to the seed song. A similarity song space is hence created from or Listener B. The system assumes that since Listener A and
which a playlist is generated. Listener B have so much music in common that their
preferences must be the same, i.e. they have the same musical
taste. Hence, Listener A would enjoy Listener B’s music
collection and vice-versa.
-2-
Towards a Personal Automatic Music Playlist Generation Algorithm: The Need for Contextual Information
generation system, it is ideal for the system to consider the 3.1. Choosing Appropriate Environmental Features
listeners mood. Measuring such a parameter directly from the It is required to identify and categorise environmental features
listener borders on impossibility. However, with the that may affect a listener’s mood or music selection process. As
establishment of attitude theory in the 1930’s, strong links have an example of how environmental features can affect mood, an
been forged between an individuals environment and attitude, investigation into to the effect of lighting on office workers [9]
which in turn defines mood and behaviour [6]. The experience discovered that natural lighting reduces stress and promotes a
of an individual in the outside world reflects how they feel on general sense of good being compared to artificial lighting. It is
the inside [5]. also suggested that the type of lighting combined with the
With such strong defined theoritical links between an intensity of the artificial light may determine the level of
individuals environemnt and their behaviour, it may be possible negative effects experienced. In addition, it has also been
to reduce the need to infer mood from a listener in order to documented that the weather appears to influence mood and
create an automatic playlist of songs to suit that mood. Such an productivity [10].
approach may circumvent problems raised by Tolos et al. in [7], To commence, it is proposed to consider seven environemnetal
such as defining a set of moods that is relatively unabigous, features. These features are 1) Time and Date, 2) Weather, 3)
widely accepted and useful for the average user. Lighting Conditions, 4) Humidity Conditions, 5) Temperature
It is proposed to design a system that will monitor a listeners Conditions, 6) Noise Conditions and 7) the Listener’s Activity.
environment and observe their choices in music selection.
Analogous to a basic input/output black box system, given the 3.2. Capturing Environment Data
inputs (environmental features) and the outputs (the selected A brief outline on how each of the environmental features may
songs) one is required to reconstruct the transfer process, i.e the be captured is given in this chapter.
listeners mood or behaviour, Figure 4.
. 3.2.1. Time and Date
It is possible to capture time and date using the system clock of
the proposed music player, which is a PC based device. With the
availability of time and date, it is possible to expand the analysis
to include day of the week (Monday, Tuesday, .... ), month of
the year (Janruary, February, .... ), time of the day (morning,
afternoon, .... ) and season (Winter, Summer, Autumn and
Spring).
3.2.2. Weather
It is proposed to obtain weather data through an available
METAR service online due to its strict and compact data format.
METAR data is available from all airports and it is regularly
Figure 4: A black box approach, given the inputs and the updated on a thirty minute schedule.
outputs, one is required to reconstruct the transfer process.
3.2.3. Lighting, Temperature, Humidity and Noise Conditions
It should be noted that this process is not equivelent to With the use of an appropriate sensing device, lighting,
extracting the musical mood or the musical emotional state of a temperature, humidity and noise conditions can be monitered
composition as implemented by Liu et al. [8]. With the aid of and captured. An array of hardware devices exist to capture such
music theory, Liu et al. showed with the implementation of an parameters. Hardware considerations are discussed further in
algorithm, that a composition may occupy a particular emotional Chapter 5.
space, musically speaking.
However, each individual listener subjectively interperates the 3.2.4. Activity
emotional or mood space of a composition based on their It is proposed to determine a listeners activity in two forms,
experience. Hence, the same composition is capable of causing a these are 1) social scheduling and 2) using an accelerometer.
diverse array of emotions within the listening community. Such Social scheduling is based upon calender events which involves
an example of this is a composition by Carl Orff titled Carmina taking advantage of predictable behavour such as working
Burana. A classical enthusiast and non-horror movie viewer schedules, travelling schedules, excercising schedules and
may recognise and perceive the piece in the appropriate classical relaxing schedules. Further information on a listener’s activity
context it was written in. However, the non-classical lover and may be captured electronically with the use of an accelerometer,
horror movie fanatic will recognise the piece as the theme tune such as the E-LIS3L02AS4 from STMICROELECTRONICS.
to the horror movie ‘The Omen’. In this case, hearing the music An accelerometer is capable of measuring a listener’s physical
piece out of context may induce a sense of fear, uneasiness and movement, such as walking, running and jumping.
terror for the listener. This is because the listener only associates
this theme with horror. To summerise, environmental parameters have a significant
The long term consistancy and reliability of using envionmental affect on mood and hence influences the music selection
data in the selection process of automatic playlist generation is process. Therefore, environmental features have great potential
founded upon the habitual qualities of human nature. Covey as meta-data to allow the listener greater flexibility when
explains, that an individual’s action and re-action is pre- searching or accessing a music collection. In addition,
conditioned by their environment [5]. Also, as outlined by environmental features may provide a valuable source of
Ostrom in [6], an individuals attitude, which is a description of information for an automatic playlist generation algorithm in the
their behaviour and formed through experience in their generation of playlists to suit a listener’s mood.
environemnt, operates to make the individuals world predictable
and orderly.
-3-
Towards a Personal Automatic Music Playlist Generation Algorithm: The Need for Contextual Information
-4-
Towards a Personal Automatic Music Playlist Generation Algorithm: The Need for Contextual Information
in two parts, 1) the Learning Process and 2) the Operational 4.4.2. Operational Process
Process. The trained system, Figure 7, operates without the need for the
listener’s interaction, since the intelligent engine functions under
4.4.1. The Learning Process the same set of parameters and definitions in which it was
Figure 6 overviews the learning process of the proposed playlist trained. When the listener requires to listen to music, the trained
generation system with integrated environmental meta-data. As system will analyse the current listening environment through a
the listener selects the required music manually, the system sensor array. Based upon the systems previous training, the
monitors all events in the background which involves the captured environmental data is matched and assigned to
capturing of environmental features. These environmental appropriate music features. The music collection is then filtered
features are then added to the chosen songs as new meta-data. according to the required features and a song selection algorithm
As different songs are selected, the system identifies and generates the appropriate playlist.
quantifies how each selected song is similar and how they differ.
MIR algorithms then use this information to find other songs 5. Hardware Considerations
deemed musically similar to the chosen songs within the bounds It is important that the development and test hardware platform
of a similarity threshold. Once identified, these songs are also for the proposed music player is mobile, unobtrusive and does
tagged with the same environmental meta-data. The not contribute to the listener’s mood. In addition, the music
identification process is based upon existing meta-data and device must have appropriate storage capacities and processing
music features extracted from each song as previously described power. As a result, a small form factor PC such as the Samsung
in Section 4.2. All results are then catalogued within the system Q1b – Ultra Mobile PC is currently being used as a testbed. This
for future reference. device is portable and highly inter-connectable with the
availability of onboard LAN, WLAN, Bluetooth and USB
services. The system uses a 7” touch screen and operates under
Windows XP Tablet Edition or Windows Vista rather than a
scaled version of Windows such as PocketPC.
A hardware consideration for sensors to allow the capture of
environmental data includes the HOBO U12 data logger which
is currently being used. This device is compact, portable and
self-contains temperature, light and humidity sensors. The unit
also has an available external data channel which allows the
connection of an external noise level sensor. The data logger is
accessible through a standard USB connection and is compatible
with the Keyspan USB Server allowing access via Ethernet or
WiFi.
To detect and monitor a listener’s movement, an Olimex
accelerometer (MOD-MMA7260Q) is currently being used.
This is a 3-axis device and is pre-mounted on a development
board which includes the appropriate support ICs. A mini-USB
connection is required to interface with the accelerometer.
Figure 6: Learning Process of the Proposed Playlist Generator.
-5-
Towards a Personal Automatic Music Playlist Generation Algorithm: The Need for Contextual Information
and may provide a valuable resource in the automatic generation [15] G. Husain, W. F. Thompson and E. G. Schellenberg.
of music playlists. Effects of Musical Tempo and Mode on Arousal, Mood
and Spatial Abilities. Music Perception, Vol. 20, No. 2,
151-171, (2002).
-6-