Prerequisites for Affective Signal Processing (ASP) - Part II

Marjolein Van Der Zwaag; Joyce Westerink; Egon L. van den Broek

Prerequisites for Affective Signal Processing (ASP) - Part II

Marjolein Van Der Zwaag

Joyce Westerink

Egon L. van den Broek

2010

visibility

…

description

8 pages

link

1 file

Although emotions are embraced by science, their recognition has not reached a satisfying level. Through a concise overview of affect, its signals, features, and classification methods, we provide understanding for the problems encountered. Next, we identify the prerequisites for successful Affective Signal Processing: validation (e.g., mapping of constructs on signals), triangulation, a physiology-driven approach, and contributions of the signal processing community. Using these directives, a critical analysis of a real-world case is provided. This illustrates that the prerequisites can become a valuable guide for Affective Signal Processing.

PREREQUISITES FOR AFFECTIVE SIGNAL PROCESSING (ASP) Egon L. van den Broek∗ Center for Telematics and Information Technology, University of Twente, P.O. Box 217, 7500 AE Enschede, The Netherlands vandenbroek@acm.org Joris H. Janssen∗ , Joyce H.D.M. Westerink User Experience Group, Philips Research Europe, High Tech Campus 34, 5656 AE Eindhoven, The Netherlands {joris.h.janssen,joyce.westerink}@philips.com Jennifer A. Healey Intel Corporation, Corporate Technology Group, 3600 Juliette Lane SC12-319 Santa Clara CA 95054, USA jennifer.healey@intel.com Keywords: Affective Signal Processing (ASP), Emotion, Validation, Physiology-driven, Triangulation Abstract: Although emotions are embraced by science, their recognition has not reached a satisfying level. Through a concise overview of affect, its signals, features, and classification methods, we provide understanding for the problems encountered. Next, we identify the prerequisites for successful Affective Signal Processing: validation (e.g., mapping of constructs on signals), triangulation, a physiology-driven approach, and contributions of the signal processing community. Using these directives, a critical analysis of a real-world case is provided. This illustrates that the prerequisites can become a valuable guide for Affective Signal Processing. When dealing with people, let us remember that we are not dealing with creatures of logic; we are dealing with creatures of emotion ... (Dale Carnegie, 1936; p. 41) Dale Carnegie (1888-1955) 1 1 INTRODUCING EMOTIONS While a century ago emotions were considered as too spiritual and human’s health was solely explained in physical (e.g., injuries) and physiological terms (e.g., bacteria, viruses), it is now generally acknowledged that emotions have their impact on health and illness. It has been shown that emotions influence our cognitive and social functioning as well as our cardiovascular system (Schuler and O’Brien, 1997) and, as a consequence, can even either shorten or prolong life (Frederickson et al., 2000). ∗ Joris H. Janssen and Egon L. van den Broek did equally contribute to this article; hence, they are shared first authors. 1 For the interested reader, we refer to “Historical foundations of social effectiveness? Dale Carnegie’s principles” (Duke and Novicevic, 2008), which illustrates the timeless significance of Carnegie’s work. Medicine’s interest in emotions was followed by that of Artificial Intelligence (AI), which envisioned that emotions both lead the path to true AI and enhance the communication between man and machine (or even environment) (Picard, 1997; Minsky, 2006). This expresses the intrinsic need for automated sensing of emotions. Often this is done through speech or face analysis; see Cowie et al. (2001) and Zeng et al. (2009) for reviews. Alternatively, physiological signals are used to identify emotions; see also Table 1 and 2, Box 1, and (Westerink et al., 2008a). This paper discusses the last approach, which we will denote as Affective Signal Processing (ASP). Physiological signals have the advantage that they are free from social masking and have the potential of being measured by non-invasive sensors; e.g., (Samboa et al., 2009; Westerink et al., 2008a), making them suited for a wide range of applications. In contrast, recognizing facial expressions is notoriously problematic and speech is often either absent or suffers from severe distortions in many in real-world applications (Cowie et al., 2001; Zeng et al., 2009). In the next section, we provide both an overview and a review of ASP. After that, we introduce four prerequisites for successful ASP. Moreover, a real world case will be analyzed, using these prerequisites (Box 1). We finish the paper with a brief conclusion in which we denote some applications of ASP. Table 1: An overview of common physiological signals and features used in ASP. Physiology (source) Features Unit Remark Cardiovascular activity through ECG or BVP (Berntson et al., 1997) Heart rate (HR) SD IBIs RMSSD IBIs LF power (0.05Hz - 0.15Hz) HF power (0.15HZ - 0.40Hz) VLF power ( < 0.05Hz) LF/HF Pulse Transit Time (PTT) Mean, SD SCL Nr of SCRs SCR amplitude SCR 1/2 recovery time SCR rise time Mean, SD temp Respiration rate Amplitude Resps Mean, SD corrugator supercilii Mean, SD zygomaticus major Mean, SD upper trapezius Mean, SD inter-blink interval beats / min s s ms2 ms2 ms2 HRV index HRV index Sympathetic activity Parasympathetic activity Electrodermal Activity (EDA) (Boucsein, 1992) Skin temperature (ST) Respiration (Grossman and Taylor, 2007) Muscle activity through EMG (Reaz et al., 2006) (Westerink et al., 2008b) ms µS µS s s oC µV µV µV ms Tonic Sympathetic Activity Rate Phasic Activity Phasic Activity Frowning Smiling Notes. SD: Standard deviation; RMSSD: Root Mean Sum of Square Differences; IBI: Inter-beat interval; LF: Low frequency; HF: High frequency; VLF: Very Low Frequency; SCL: Skin Conductance Level; SCR: Skin Conductance Response; ECG: Electrocardiogram; EMG: Electromyogram; BVP: Blood volume pulse. 2 AFFECTIVE SIGNAL PROCESSING (ASP) A broad range of affective signals are used in affective sciences. When processing such signals some general issues have to be taken in consideration: 1) Affective signals are typically derived through noninvasive methods to determine changes in physiology and, as such, are indirect measures. Hence, a delay between the actual change in emotional state and the recorded change in signal has to be taken into account. 2) Physiological sensors are unreliable: they contain movement artifacts and are sensitive to differences in bodily position. 3) Most sensors are obtrusive, preventing their integration in real world applications. 4) Affective signals are influenced by (the interaction among) a variety of factors (Cacioppo and Tassinary, 1990). Some of these sources are located internally (e.g., a thought) and some are among the broad range of possible external factors (e.g., a signal outside). This makes affective signals inherently noisy, which is most prominent in real world research. 5) Physiological changes can evolve in a matter of milliseconds, seconds, minutes or even longer. Some changes hold for only a brief moment, while others can even be permanent. Although seldom reported, the expected time windows of change are of interest. In particular since changes can add to each other, even when having a different origin. 6) Humans are no linear time (translation or shift) invariant systems (Boucsein, 1992), they habituate. This increases the complexity of ASP substantially, since most signal processing techniques rely on this assumption. 7) Affective signals have large individual differences. This calls for methods and models tailored to the individual. It has been shown that personal approaches increase the performance of ASP (Bailenson et al., 2008). 2.1 Classification of Affective Signals To enable processing of the signals, in most cases comprehensive sets of features have to be identified for each affective signal. To extract these features, the affective signals are processed in the time (e.g., statistical moments (Westerink et al., 2008b)), frequency (e.g., Fourier), time-frequency (e.g., Wavelets), or power domain (e.g., Periodogram and Autoregression). In Table 1, we provide a brief overview of the signals most often applied, including their best known features, with reference to their physiological source. The features obtained from the affective signals (see Table 1) are fed to pattern recognition methods, which can be classified as: template matching, syn- tactic or structural matching, and statistical classification; e.g., artificial neural networks (ANN). The former two are not or seldom used in affective signal processing; most ASP schemes use the latter. Statistical pattern recognition distinguishes supervised and unsupervised (e.g., clustering) pattern recognition; i.e., respectively, with or without a set of (labeled) training data. With unsupervised pattern recognition, the distance/similarity measure used and the algorithm applied to generate the clusters are key elements. Supervised pattern recognition relies on learning from a set of examples (training set). Statistical pattern recognition uses input features, a discriminant function (or network function for ANN) to recognize the features, and an error criterion in its classification process. In the field of ASP, several studies have been conducted, using a broad range of signals, features, and classifiers; see Table 2 for an overview. Nonetheless, both the recognition performance and the number of emotions that the classifiers were able to discriminate are disappointing. Moreover, comparing the different studies is problematic because of the different settings the research was applied in, ranging from controlled lab studies to real world testing, the type of emotion triggers used, the number of target states to be discriminated, and the signals and features employed. This illustrates the need for a set of prerequisites for ASP. 3 PREREQUISITES 3.1 Validity In the pursuit to trigger emotions in a more or less controlled manner, a range of methods have been applied: actors, images (IAPS), sounds (e.g., music), (fragments of) movies (Westerink et al., 2008b), speech (Van den Broek, 2004), commercials (Hazlett and Hazlett, 1999; Poels and Dewitte, 2006), games, agents / serious gaming / virtual reality (Slater et al., 2006; Westerink et al., 2008a), reliving of emotions (Van den Broek, 2004), and real world experiences (Healey and Picard, 2005); see also Box 1. However, how to know which of these methods actually triggered participants’ true emotions? This is a typical concern of validity, which is a crucial issue for ASP. Validity can be best obtained through four approaches: content, criteria-related, construct, and ecological validation, which we will discuss in relation to ASP; in addition, see Box 1. Content validity refers to a) The agreement of experts on the domain of interest; e.g., limited to a specific application or group of patients; b) The degree to which a feature (or its parameters) of a given signal represents a construct; and c) The degree to which a set of features (or their parameters) of a given set of signals adequately represents all facets of the domain. For instance, employing only skin conductance level (SCL) for ASP will lead to a weak content validity when trying to measure emotion, as SCL is known to relate to the arousal component of an emotion, but not to the valence component. However, when trying to measure only emotional arousal, measuring only SCL may form strong content validity; see also Box 1. Criteria-related validity handles the quality of the translation from the preferred measurement to an alternative, rather than to what extend the measurement represents a construct. Emotions are preferably measured at the moment they occur; however, measurements before (predictive) or after (postdictive) the particular event are sometimes more feasible; e.g., through subjective questionnaires. The quality of these translations are referred to as predictive or postdictive validity. A third form of criteria-related validity is concurrent validity: a metric for the reliability of measurements applied in relation to the preferred standard. For instance, the more affective states are discriminated the higher the concurrent validity. A construct validation process aims to develop a nomological network (i.e., a ground truth), or possibly an ontology or semantic network, build around the construct of interest. Such a network requires theoretically grounded, observable, operational definitions of all constructs and the relations between them. Such a network aims to provide a verifiable theoretical framework. The lack of such a network is one of the most pregnant problems ASP is coping with; e.g., see Box 1. A frequently occurring mistake is that emotions are denoted, where moods (i.e., longer object-unrelated affective states with very different physiology) are meant. This is very relevant for ASP, as it is known that moods are accompanied by very different physiological patterns than emotions are (Gendolla and Brinkman, 2005). Ecological validity refers to the influence of the context on measurements. We identify two issues: 1) Natural affective events are sparse, which makes it hard to let participants cycle through a range of affective states in a limited time frame and 2) The affective signals that occur are easily contaminated by contextual factors; so, using a similar context as the intended ASP application for initial learning is of vital importance; see also Box 1. Although understandable from a measurement-feasibility perspective, emotion measurements are often done in controlled laboratory settings. This makes results poorly generalizable to realworld applications. Hence, the need for unobtrusive Table 2: A summary of 18 studies that have tried to infer affect from physiological signals. Information source Signals Part Fea Sinha & Parsons, 1996 Picard et al., 2001 M C ,E ,R ,M 27 1 18 40 Scheirer et al., 2002 Nasoz et al., 2003 Takahashi, 2003 Haag et al., 2004 C ,E C ,E ,S C ,E ,B C ,E ,S ,M ,R 24 31 12 1 5 3 18 13 Kim et al., 2004 Lisetti & Nasoz, 2004 C ,E ,S C ,E ,S 175 29 Wagner et al., 2005 C ,E ,R ,M 1 32 Yoo et al., 2005 Choi & Woo, 2005 Healey & Picard, 2005 Rani et al., 2006 C ,E 6 E C ,E ,R ,M 9 C ,E ,S ,M ,P 15 5 3 22 46 Zhai & Barreto, 2006 Leon et al., 2007 Liu et al., 2008 Katsis et al., 2008 Yannakakis & Hallam, 2008 Kim & André, 2009 C ,E ,S ,P C ,E C ,E ,S ,M C ,E ,M ,R C ,E C ,E ,M ,R 11 5 35 15 20 110 32 8 6 10 72 3 Sel / Red SFS, Fisher Viterbi SFS, Fisher PCA Fisher DBI ANOVA SBS Classifiers Target Result LDA LDA 2 emotions 8 emotions 86% 81% HMM kNN, LDA SVM MLP 2 frustrations 6 emotions 6 emotions val / aro SVM kNN, LDA, MLP kNN, LDA, MLP MLP MLP LDA kNN, SVM, RT, BN SVM AANN SVM SVM, ANFIS SVM, MLP LDA, EMDC 3 emotions 6 emotions 64% 69% 42% 90% 97% 78% 84% 4 emotions 92% 4 emotions 4 emotions 3 stress levels 3 emotions 80% 75% 97% 86% 2 stress levels 3 emotions 3 affect states 4 affect states 2 fun levels 4 emotions 90% 71% 83% 79% 70% 70/95% Notes. Part: the number of participants; Fea: the number of features; Sel / Red: Algorithms used for selection or reduction of features; C : Cardiovascular activity; E : Electrodermal activity; R : Respiration; M : Electromyogram; B : Electroencephalogram; S : Skin temperature; P : Pupil Diameter; MLP: MultiLayer Perceptron; HMM: Hidden Markov Model; RT: Regression Tree; BN: Bayesian Network; AANN: Auto-Associative Neural Network; SVM: Support Vector Machine; LDA: Linear Discriminant Analysis; kNN: k Nearest Neighbors; ANFIS: Adaptive Neuro-Fuzzy Inference System; DBI: Davies-Bouldin Index; PCA: Principal Component Analysis; SFS: Sequential Forward Selection; SBS: Sequential Backward Selection; EMDC: Emotion-specific Multilevel Dichotomous Classification. sensors and methods to make longitudinal real-world studies possible is pressing. 3.2 Triangulation We propose to adopt the principle of Triangulation on ASP, as applied in social sciences and humancomputer interaction. Heath (2001) defines triangulation as “the strategy of using multiple operationalizations of constructs to help separate the construct under consideration from other irrelevancies in the operationalization”. Using this strategy provides several advantages: 1. Distinct signals can be used to validate each other; 2. Extrapolations can be made based on multiple data sets, providing more certainty. In turn, corrections can be made to errors in a result set that clearly defy from other results; and 3. More solid ground, or even a ground truth (see also Section 3.1), is obtained for the interpretation of signals, as multiple perspectives are used. Triangulation was, for example, successfully employed by Bailenson et al. (2008) and Healey and Picard (2005) . As (one of) the first, Bailenson et al. (2008) has shown that using both physiological signals and facial expressions leads to better ASP than using one of them; see also Section 1. Hence, we advise to record three affective signals, or have at least three features derived from them, for each construct under investigation, in well controlled research. Moreover, qualitative and subjective measures should accompany the signals (e.g., questionnaires, video recordings, interviews, and Likert scales); e.g., see (Hazlett and Hazlett, 1999; Healey and Picard, 2005; Slater et al., 2006; Van den Broek, 2004; Westerink et al., 2008a). Please consult also Section 3.1 on this topic. Box 1: Application of the prerequisites in practice One of the first large-scale real-world cases in which ASP is applied is the driving application of Healey and Picard (2005). They applied ASP on ECG, EMG, EDA, and respiration to determine the stress of 24 participants, during at least 50 minutes real world driving, completed by questionnaires and video recordings. Healey and Picard (2005) were able to distinguish between three stress levels, using five minute time windows. In addition, through ASP, they developed a continuous stress metric. In this box, we apply our ASP prerequisites for a critical analysis of this case; each prerequisite is denoted separately in relation to the case. Validation Section 3.1 denotes content, criteria-related, construct, and ecological validation. The continuous stress metric developed by Healey and Picard predicts three situations: sitting still, driving on the highway, and driving in town. These situations are likely to be confounded by movement. Movement, however, is not necessarily correlated with stress level. And even when movement reflects a more excited state, it does not always come with a negative bias. As affective signals are also influenced by movement, this makes it uncertain whether the actually measured construct is movement, stress, or a combination of both. This, together with a weak definition, leads to a limited content validity. For the continuous stress metric employed, Healey and Picard show a very high inter-observer reliability. Moreover, this stress metric has a very high temporal resolution. Nonetheless, as the authors elaborate, this was in some cases not enough to deal with the short latencies of some physiological signals. In addition, subjective stress reports could not be done during driving, so they were conducted after the drive. As the drives were quite long, this might have lead to a decrease in postdictive validity. However, they were congruent with averages of the continuous stress metric. Taken together, the criteria-related validity of the measurements is high. Construct validation refers to a nomological framework built around the construct of interest. They nicely describe their stress hypothesis in terms of the activity of the sympathetic and parasympathetic nervous systems. However, a description of the relation between most of the physiological signals and these dimensions is only elaborated to a limited extent. Moreover, the HRV measurement is questionable as it was not corrected for respiration (Grossman and Taylor, 2007). All in all, the construct validity is poor. The ecological validity of Healey and Picard (2005) is excellent: the research results can be generalized to driving in general, most likely to all driving situations. One could argue that the results are hard to generalize to other domains of application. This is indeed a valid observation; however, the authors do (correctly) not claim to want so. Triangulation Healey and Picard (2005) use multiple signals for the construct under investigation, complemented by subjective self-reports and observer scores. Healey and Picard correlated different physiological signals to see to what extent they describe the same construct and used self-reports and observer scores to validate the physiological changes and, hence, created a higher validity; see also Section 3.2. Hence, they successfully adopted the principle of triangulation. Physiology-driven approach Instead of continuously inferring the stress level of the driver, it might be sufficient to express a high stress level in terms of one or a few physiological signals. For instance, (Healey and Picard, 2005) show that EDA is strongly correlated with their continuous stress metric. So, using EDA as a stress scale might be sufficient for actual applications. By setting some thresholds in the EDA, it can also be used for music selection or distraction management (e.g., cell phones), as the authors propose. This reduces the amount of sensors and computing power needed, making it more feasible for practical application; see also Section 3.3. Signal processing contributions Healey and Picard (2005) state that “Each signal was sampled at a rate appropriate for capturing the information contained in the signal . . . ”. Such an ill specified statement is in line with other research where similar statements are made or the subject is ignored completely. However, Healey and Picard (2005) continue with “ . . . constrained by the sampling rates available . . . ”, which explains at least partly the reported sample frequencies; i.e., ECG: 496 Hz; EDA and respiration: 31 Hz; and EMG: 15.5 Hz. These are not reported previously as standard frequencies, if reported at all. Tailored filters as proposed in the current paper are absent. The only filter defined is the 0.5 seconds averaging filter. Why this is applied is not reported. Possibly, more filters were applied; however, their specifications are omitted. On the one hand, this makes it hard to reproduce the research; on the other hand, it illustrates the lack of attention for filtering. On several occasions throughout the paper, comparable work on aircraft pilots was mentioned. It would be of interest to compare the rich set of data Healey and Picard (2005) gathered with other data sets. Then, the robustness of the variety of signals, their features and parameters could be fully accessed. To extend from a few single initiatives to a more general practice, a benchmark should be set up; see also Section 3.4. Conclusion The work of Healey and Picard (2005) is already a showcase for triangulation. Their results illustrate that their case is per excellence also suitable for a physiologically driven approach. An additional advantage of this direction is that it can be further explored even without the preferred thorough theoretical framework. This would ease the way of their work to possible application areas. Moreover, the applicability of the processing schemes proposed by Healey and Picard (2005) could be verified with future benchmarks and the robustness of their approach could be strengthened through the usage of tailored filters and ASP techniques. Hence, using the framework introduced in this paper, the case discussed could be substantially improved, even post-hoc, although its foundation is already good and unique in its kind. 3.3 A physiology-driven approach 4 CONCLUSION A final prerequisite stems from the idea that ASP can never be entirely based on psychological changes. As discussed in Section 2, there are many factors outside one’s affective state that contaminate affective signals. Beside validation and triangulation, another way to deal with this is to employ a more physiologydriven perspective (Tractinsky, 2004). Instead of expressing the goals of ASP directly in terms of affective states, they can often be stated in terms of the affective signals themselves (Slater et al., 2006). For instance, instead of inferring an air-traffic controller’s or driver’s stress level, thresholding SCL might be sufficient; see also Box 1. Note that there always remains an interpretation in affective states. Then, the use of syntactic or structural pattern recognition for ASP should be (further) explored. Its hierarchical approach to simplifying complex patterns in affective signals is expected to be valuable for ASP. This paper provided both an overview and a review of ASP and explained the lack of success of ASP; see Section 2 and Tables 1 and 2. Next, in Section 3, four prerequisites are introduced from which ASP is expected to benefit significantly: validation, the principle of triangulation, and a physiologicaldriven approach, and contributions of the signal processing community. In addition, a real world case is discussed in Box 1, which illustrated the use of the proposed prerequisites. With the guidelines provided and the future’s progress ahead, we envision embedding of ASP in various professional and consumer settings, as a key factor of our every day life. A broad range of probes have been developed over the years (Westerink et al., 2008a), which illustrate the feasibility of embedding ASP in various settings. Let us briefly denote three of them: 1) For more than a decade, ASP is already applied to determine the impact of advertisements on people (Hazlett and Hazlett, 1999); for a review on emotion measurement in advertising, see (Poels and Dewitte, 2006). 2) Almost half a century ago, with the development of Eliza (Weizenbaum, 1966) both the possible implications of AI (for medicine) and the limitations of (classic) AI became apparent. Among many others, Liu et al. (2008), Slater et al. (2006), and Van den Broek (2004) denote how ASP can help AI to mature and to be of real value in this field. 3) ASP has often been applied with pilots and in automotive industry (Healey and Picard, 2005; Westerink et al., 2008a), as is also denoted in Box 1. Healey and Picard (2005) also provide a brief overview of literature on this topic. With Ambient Intelligence evolving, (wireless) sensor networks becoming more mature, and with prerequisites that enable the exploitation of ASP’s full potential, empathic machines should come within reach. And would it not be an appealing idea to live in an empathic surrounding that adapts to your mood and emotions, which can calm you or help you to concentrate when required? 3.4 Contributions of Signal Processing The majority of research on ASP is conducted by psychology, physiology, medicine, human-computer interaction, or artificial intelligence. Hence, true signal processing expertise is often missing; see also Box 1. In particular, expertise from biomedical signal processing could significantly contribute to ASP’s progress. We will now address a triplet of issues: The development of filters tailored to the specifications of ASP sensors and to ASP’s applications could significantly boost the performance of ASP. Affective signals are mostly sampled in a discrete fashion. The required AD conversion, however, can distort the signal; i.e., aliasing. For all possible signals, with all possible amplifiers and sensors, it is recommended to determine the relation between sample frequency and signal loss / distortion. So far, this has not been done and guidelines are provided founded on weak assumptions; see also Box 1. Then, for all signals, also the Nyquist frequency could be defined. A benchmark should be founded with verified affective signals. This would enable objective performance measurements of signal processing and pattern recognition techniques. The principle of triangulation (see Section 3.2) could be applied using it and the generic applicability of techniques could be tested. Moreover, it could be used for concurrent validation (see Section 3.1); e.g., through comparing different signals or apparatus that can substitute each other. ACKNOWLEDGMENTS The authors would like to thank Marjolein van der Zwaag and Stijn de Waele for their comments on an earlier version of this paper. Furthermore, we would like to thank the anonymous reviewers, who provided us the opportunity to improve this paper. REFERENCES Bailenson, J. N., Pontikakis, E. D., Mauss, I. B., Gross, J. J., Jabon, M. E., Hutcherson, C. A., Nass, C., and John, O. (2008). Real-time classification of evoked emotions using facial feature tracking and physiological responses. International Journal of Human-Computer Studies, 66(5):303–317. Berntson, G. G., Bigger, J. T., Eckberg, D. L., Grossman, P., Kaufmann, P. G., Malik, M., Nagaraja, H. N., Porges, S. W., Saul, J. P., Stone, P. H., and van der Molen, M. W. (1997). Heart rate variability: Origins, methods, and interpretive caveats. Psychophysiology, 34(6):623–648. Boucsein, W. (1992). Electrodermal activity. New York, NY, USA: Plenum Press. Cacioppo, J. and Tassinary, L. (1990). Inferring psychological significance from physiological signals. American Psychologist, 45(1):16–28. Choi, A. and Woo, W. (2005). Physiological sensing and feature extraction for emotion recognition by exploiting acupuncture spots. Lecture Notes in Computer Science (Affective Computing and Intelligent Interaction), 3784:590–597. Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., and Taylor, J. G. (2001). Emotion recognition in human– computer interaction. IEEE Signal Processing Magazine, 18(1):32–80. Duke, A. and Novicevic, M. M. (2008). Historical foundations of social effectiveness? Dale Carnegie’s principles. Social Influences, 3(2):132–142. Frederickson, B. L., Manusco, R. A., Branigan, C., and Tugade, M. M. (2000). The undoing effect of positive emotions. Motivation and Emotion, 24(4):237–257. Gendolla, G. H. E. and Brinkman, K. (2005). The role of mood states in self-regulation: Effects on action preferences and resource mobilization. European Psychologist, 10(3):187–198. Grossman, P. and Taylor, E. W. (2007). Toward understanding respiratory sinus arrhythmia: Relations to cardiac vagal tone, evolution and biobehavioral functions. Biological Psychology, 74(2):263–285. Haag, A., Goronzy, S., Schaich, P., and Williams, J. (2004). Emotion recognition using bio-sensors: First steps towards an automatic system. Lecture notes in computer science (Affective Dialogue Systems), 3068:36–48. Hazlett, R. L. and Hazlett, S. Y. (1999). Emotional response to television commercials: Facial EMG vs. self-report. Journal of Advertising Research, 39(2):7–23. Healey, J. A. and Picard, R. W. (2005). Detecting stress during real-world driving tasks using physiological sensors. IEEE Transactions on Intelligent Transportation Systems, 6(2):156–166. Heath, L. (2001). Triangulation: Methodology, pages 15901–15906. Elsevier Science Ltd.: Oxford, UK, 1 edition. ISBN: 978-0-08-043076-8. Katsis, C. D., Katertsidis, N., Ganiatsas, G., and Fotiadis, D. I. (2008). Toward emotion recognition in car-racing drivers: A biosignal processing approach. IEEE Transactions on Systems, Man, and Cybernetics–Part A: Systems and Humans, 38(3):502–512. Kim, J. and André, E. (2008). Emotion recognition based on physiological changes in music listening. IEEE Transactions on Pattern Analysis Machine Intelligence, 30(12):2067–2083. Kim, K. H., Bang, S. W., and Kim, S. R. (2004). Emotion recognition system using short-term monitoring of physiological signals. Medical & Biological Engineering & Computing, 42(3):419– 427. Leon, E., Clarke, G., Callaghan, V., and Sepulveda, F. (2007). A user-independent real-time emotion recognition system for software agents in domestic environments. Engineering Applications of Artificial Intelligence, 20(3):337–345. Lisetti, C. L. and Nasoz, F. (2004). Using noninvasive wearable computers to recognize human emotions from physiological signals. EURASIP Journal on Applied Signal Processing, 2004(11):1672–1687. Liu, C., Conn, K., Sarkar, N., and Stone, W. (2008). Physiology-based affect recognition for computer-assisted intervention of children with Autism Spectrum Disorder. International Journal of Human-Computer Studies, 66(9):662– 677. Minsky, M. (2006). The Emotion Machine: Commonsense Thinking, Artificial Intelligence, and the Future of the Human Mind. New York, NY, USA: Simon & Schuster. Nasoz, F., Alvarez, K., Lisetti, C. L., and Finkelstein, N. (2003). Emotion recognition from physiological signals for presence technologies. International Journal of Cognition, Technology and Work, 6:4–14. Picard, R. W. (1997). Affective Computing. Boston MA, USA: MIT Press. sue on affective computing. Computers, 16(4):751–757. Interacting with Picard, R. W., Vyzas, E., and Healey, J. (2001). Toward machine emotional intelligence: Analysis of affective physiological state. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(10):1175–1191. Van den Broek, E. L. (2004). Emotional Prosody Measurement (EPM): A voice-based evaluation method for psychological therapy effectiveness. Studies in Health Technology and Informatics (Medical and Care Compunetics), 103:118–125. Poels, K. and Dewitte, S. (2006). How to capture the heart? reviewing 20 years of emotion measurement in advertising. Journal of Advertising Research, 46(1):18–37. Wagner, J., Kim, J., and André, E. (2005). From physiological signals to emotions: Implementing and comparing selected methods for feature extraction and classification. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME). Rani, P., Liu, C., Sarkar, N., and Vanman, E. (2006). An empirical study of machine learning techniques for affect recognition in human-robot interaction. Pattern Analysis & Applications, 9(1):58–69. Reaz, M. B. I., Hussain, M. S., and Mohd-Yasin, F. (2006). Techniques of EMG signal analysis: detection, processing, classification and applications. Biological Procedures Online, 8(1):11– 35. Samboa, H., Silva, F., Silva, H., and ao, R. F. (2009). PLUX – Biosignals Aquisition and Processing. URL: http://www.plux.info [Last accessed on January 21, 2009]. Scheirer, J., Fernandez, R., Klein, J., and Picard, R. W. (2002). Frustrating the user on purpose: a step toward building an affective computer. Interacting with Computers, 14(2):93–118. Schuler, J. L. H. and O’Brien, W. H. (1997). Cardiovascular recovery from stress and hypertension factors: A meta-analytic view. Psychophysiology, 34:649–659. Sinha, R. and Parsons, O. A. (1996). Multivariate response patterning of fear. Cognition and Emotion, 10(2):173–198. Slater, M., Guger, C., Edlinger, G., Leeb, R., Pfurtscheller, G., Antley, A., Garau, M., Brogni, A., and Friedman, D. (2006). Analysis of physiological responses to a social situation in an immersive virtual environment. Presence: Teleoperators and Virtual Environments, 15(5):553– 569. Takahashi, K. (2003). Remarks on emotion recognition from bio-potential signals. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, volume 2, pages 1655–1659, Palmerston North, New Zealand. Tractinsky, N. (2004). Tools over solutions? comments on interacting with computers special is- Weizenbaum, J. (1966). ELIZA – a computer program for the study of natural language communication between man and machine. Communications of the ACM, 9(1):36–45. Westerink, J. H. D. M., Ouwerkerk, M., overbeek, T., Pasveer, W. F., and de Ruyter, B. (2008a). Probing Experiences: From Academic Research to Commercial Propositions, volume 8 of Philips Research Book Series. Springer: Dordrecht, The Netherlands. Westerink, J. H. D. M., van den Broek, E. L., Schut, M. H., van Herk, J., and Tuinenbreijer, K. (2008b). Computing emotion awareness through galvanic skin response and facial electromyography, volume 8 of Philips Research Book Series, chapter 14 (Part II: Probing in order to Feed Back), pages 137–150. Springer: Dordrecht, The Netherlands. Yannakakis, G. N. and Hallam, J. (2008). Entertainment modeling through physiology in physical play. International Journal of Human-Computer Studies, 66(10):741–755. Yoo, S. K., Lee, C. K., Park, J. Y., Kim, N. H., Lee, B. C., and Jeong, K. S. (2005). Neural network based emotion estimation using heart rate variability and skin resistance. Lecture Notes in Computer Science (Advances in Natural Computation), 3610:818–824. Zeng, Z., Pantic, M., Roisman, G. I., and Huang, T. S. (2009). A survey of affect recognition methods: Audio, visual, and spontaneous expressions. IEEE Transactions on Pattern Analysis Machine Intelligence, 31(1):39–58. Zhai, J. and Barreto, A. (2006). Stress detection in computer users through noninvasive monitoring of physiological signals. Biomedical Science Instrumentation, 42:495–500.

Log In

Prerequisites for Affective Signal Processing (ASP) - Part II

Related papers

Related topics