Papers by Riccardo Guidotti
Lecture Notes in Computer Science, Dec 31, 2022
arXiv (Cornell University), Feb 6, 2018
Springer eBooks, Aug 2, 2019
Geoinformatica, Mar 22, 2022
The massive and increasing availability of mobility data enables the study and prediction of human mobility behavior and activities at various levels. In this paper, we tackle the problem of predicting the crash risk of a car driver in the long term. This is a very challenging task, requiring deep knowledge of both the driver and their surroundings, yet it has several useful applications to public safety (e.g. by coaching high-risk drivers) and the insurance market (e.g. by adapting pricing to risk). We model each user with a data-driven approach based on a network representation of the user's mobility. In addition, we represent the areas in which users move through the definition of a wide set of city indicators that capture different aspects of the city. These indicators are based on human mobility and are automatically computed from a set of different data sources, including mobility traces and road networks. Through these city indicators we develop a geographical transfer learning approach for the crash risk task, so that we can build effective predictive models for another area where labeled data is not available. Empirical results over real datasets show the superiority of our solution.
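The geographical transfer step described above can be sketched as a nearest-neighbor match over indicator vectors: an unlabeled target area borrows the crash-risk model of the labeled source area whose city indicators are most similar. The indicator names and values below are illustrative stand-ins, not figures from the paper.

```python
import numpy as np

# Hypothetical city-indicator vectors (e.g. average trip length, road-network
# density, stop entropy); names and values are made up for illustration.
indicators = {
    "city_A": np.array([0.8, 0.3, 0.5]),
    "city_B": np.array([0.2, 0.9, 0.4]),
}

def nearest_source(target_vec, source_indicators):
    """Pick the labeled source city whose indicator vector is closest
    (Euclidean distance) to the unlabeled target city's vector; the
    target would then reuse that city's trained predictive model."""
    return min(source_indicators,
               key=lambda c: np.linalg.norm(source_indicators[c] - target_vec))

target = np.array([0.7, 0.35, 0.55])       # indicators of an unlabeled area
print(nearest_source(target, indicators))  # -> city_A
```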
Lecture Notes in Computer Science, 2016
ACM Transactions on Knowledge Discovery from Data
The growing availability of time series data has increased the usage of classifiers for this data type. Unfortunately, state-of-the-art time series classifiers are black-box models and, therefore, not usable in critical domains such as healthcare or finance, where explainability can be a crucial requirement. This paper presents a framework to explain the predictions of any black-box classifier for univariate and multivariate time series. The provided explanation is composed of three parts. First, a saliency map highlights the most important parts of the time series for the classification. Second, an instance-based explanation exemplifies the black box's decision by providing a set of prototypical and counterfactual time series. Third, a factual and counterfactual rule-based explanation reveals the reasons for the classification through logical conditions based on subsequences that must, or must not, be contained in the time series. Experiments and benchmarks show that the proposed…
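The subsequence-based conditions in the third kind of explanation can be illustrated with a minimal sketch: a rule such as "the series contains a subsequence close to pattern s" holds if some sliding window of the series lies within a distance threshold of s. The pattern, series, and threshold below are toy values, not taken from the paper.

```python
import numpy as np

def contains_subsequence(ts, pattern, thr):
    """Shapelet-style factual condition: True if some window of `ts` is
    within Euclidean distance `thr` of the candidate subsequence `pattern`."""
    m = len(pattern)
    dists = [np.linalg.norm(ts[i:i + m] - pattern)
             for i in range(len(ts) - m + 1)]
    return bool(min(dists) <= thr)

ts = np.array([0., 0., 1., 2., 1., 0., 0.])
spike = np.array([1., 2., 1.])                 # illustrative subsequence
print(contains_subsequence(ts, spike, 0.5))    # factual rule holds -> True
flat = np.zeros(7)
print(contains_subsequence(flat, spike, 0.5))  # "must not contain" -> False
```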
Lecture Notes in Computer Science, 2022
Data Mining and Knowledge Discovery, Nov 14, 2022
The pervasive adoption of Artificial Intelligence (AI) models in the modern information society requires counterbalancing the growing decision power delegated to AI models with risk assessment methodologies. In this paper, we consider the risk of discriminatory decisions and review approaches for discovering discrimination and for designing fair AI models. We highlight the tight relations between discrimination discovery and explainable AI, with the latter being a more general approach for understanding the behavior of black boxes. SUMMARY: 1. AI risks. – 2. Discrimination discovery and fairness in AI. – 3. Explainable AI. – 4. Closing the gap. – 5. Conclusion.
Classifying cities and other geographical units is a classical task in urban geography, typically carried out through manual analysis of specific characteristics of the area. The primary objective of this paper is to contribute to this process through the definition of a wide set of city indicators that capture different aspects of the city, mainly based on human mobility and automatically computed from a set of data sources, including mobility traces and road networks. The secondary objective is to show that such a set of characteristics is indeed rich enough to support a simple task of geographical transfer learning, namely identifying which groups of geographical areas can share a basic traffic prediction model. The experiments show that similarity in terms of our city indicators also means better transferability of predictive models, opening the way to the development of more sophisticated solutions that leverage city indicators.
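The grouping task — deciding which areas are similar enough to share one traffic model — can be sketched as greedy grouping over indicator vectors (a rough stand-in for the clustering an actual system would use). Area names, vectors, and the threshold are hypothetical.

```python
import numpy as np

# Illustrative indicator vectors for five areas (values are made up).
areas = {"A": [0.10, 0.20], "B": [0.12, 0.22], "C": [0.80, 0.90],
         "D": [0.82, 0.88], "E": [0.50, 0.10]}

def share_groups(areas, thr=0.1):
    """Greedily group areas whose indicator vectors lie within `thr` of some
    member of an existing group; areas in the same group could plausibly
    share a basic traffic prediction model."""
    vecs = {n: np.array(v) for n, v in areas.items()}
    groups = []
    for n in vecs:
        for g in groups:
            if any(np.linalg.norm(vecs[n] - vecs[m]) <= thr for m in g):
                g.append(n)
                break
        else:
            groups.append([n])
    return groups

print(share_groups(areas))  # -> [['A', 'B'], ['C', 'D'], ['E']]
```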
As Nietzsche once wrote, "Without music, life would be a mistake" (Twilight of the Idols, 1889). The music we listen to reflects our personality and our way of approaching life. To foster self-awareness, we devised a Personal Listening Data Model that allows for capturing individual music preferences and patterns of music consumption. We applied our model to 30k users of Last.Fm for whom we collected both friendship ties and multiple listenings. Starting from such rich data we performed an analysis whose final aim was twofold: (i) capture, and characterize, the individual dimension of music consumption in order to identify clusters of like-minded Last.Fm users; (ii) analyze if, and how, such clusters relate to the social structure expressed by the users in the service. Do there exist individuals having similar Personal Listening Data Models? If so, are they directly connected in the social graph, or do they belong to the same community?
Emerging Italian bands chase success in the footsteps of popular artists by playing rhythmic, danceable, and happy songs. This finding comes from a study of the Italian music scene and of how the new generation of musicians relates to the tradition of their country. By analyzing Spotify data we investigated the peculiarities of regional music and placed emerging bands within the musical movements defined by already successful artists. The proposed approach and the results obtained are a first attempt to outline rules suggesting which features are important for increasing popularity in the Italian music scene.
The large availability of mobility data allows studying human behavior and human activities. However, this massive and raw amount of data generally lacks any detailed semantics or useful categorization. Annotations of the locations where the users stop may be helpful in a number of contexts, including user modeling and profiling, urban planning, and activity recommendations, and can even lead to a deeper understanding of the mobility evolution of an urban area. In this paper, we foster the expressive power of individual mobility networks, a data model describing users' behavior, by defining a data-driven procedure for location annotation. The procedure considers individual, collective, and contextual features for turning locations into annotated ones. The annotated locations are highly expressive, which allows generalizing individual mobility networks and makes them comparable across different users. The results of our study on a dataset of trucks moving in Greece show that the…
ArXiv, 2018
Black box systems for automated decision making, often based on machine learning over (big) data, map a user's features into a class or a score without exposing the reasons why. This is problematic not only for lack of transparency, but also for possible biases hidden in the algorithms, due to human prejudices and collection artifacts hidden in the training data, which may lead to unfair or wrong decisions. We introduce the local-to-global framework for black box explanation, a novel approach with promising early results, which paves the road for a wide spectrum of future developments along three dimensions: (i) the language for expressing explanations in terms of highly expressive logic-based rules, with a statistical and causal interpretation; (ii) the inference of local explanations aimed at revealing the logic of the decision adopted for a specific instance by querying and auditing the black box in the vicinity of the target instance; (iii) the bottom-up generalization of the…
Explainable AI Within the Digital Transformation and Cyber Physical Systems, 2021
Machine Learning and Knowledge Discovery in Databases, 2020
Artificial Intelligence systems often adopt machine learning models encoding complex algorithms with potentially unknown behavior. As the application of these "black box" models grows, it is our responsibility to understand their inner workings and translate them into human-understandable explanations. To this end, we propose a rule-based model-agnostic explanation method that follows a local-to-global schema: it generalizes a global explanation summarizing the decision logic of a black box starting from the local explanations of single predicted instances. We define a scoring system based on a rule relevance score to extract global explanations from a set of local explanations in the form of decision rules. Experiments on several datasets and black boxes show the stability and low complexity of the global explanations provided by the proposed solution in comparison with baselines and state-of-the-art global explainers.
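The local-to-global aggregation can be sketched as scoring candidate decision rules (extracted from local explanations) by how well they generalize over a dataset, then keeping the best-scoring ones as the global explanation. The coverage-times-accuracy score below is an illustrative stand-in for the paper's rule relevance score, and the records and rules are toy data.

```python
records = [  # (feature dict, black-box label) -- toy audit data
    ({"age": 25, "income": 20}, "deny"),
    ({"age": 40, "income": 80}, "grant"),
    ({"age": 55, "income": 90}, "grant"),
    ({"age": 30, "income": 15}, "deny"),
]

rules = [  # (predicate, predicted label) from hypothetical local explanations
    (lambda x: x["income"] >= 50, "grant"),
    (lambda x: x["age"] < 35, "deny"),
]

def relevance(rule, records):
    """Score a rule by coverage * accuracy over the audited records
    (an illustrative stand-in for the paper's rule relevance score)."""
    pred, label = rule
    covered = [(x, y) for x, y in records if pred(x)]
    if not covered:
        return 0.0
    coverage = len(covered) / len(records)
    accuracy = sum(y == label for _, y in covered) / len(covered)
    return coverage * accuracy

scores = [relevance(r, records) for r in rules]
print(scores)  # -> [0.5, 0.5]
```

A global explainer would rank all candidate rules by this score and retain a small, non-redundant subset as the global explanation.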
2020 21st IEEE International Conference on Mobile Data Management (MDM), 2020
The massive and increasing availability of mobility data enables the study and prediction of human mobility behavior and activities at various levels. In this paper, we address the problem of building a data-driven model for predicting car drivers' risk of experiencing a crash in the long-term future, for instance, in the next four weeks. Since the raw mobility data, although potentially large, typically lacks any explicit semantics or clear structure to help understand and predict such rare and difficult-to-grasp events, our work proposes to build concise representations of individual mobility that highlight mobility habits, driving behaviors and other factors deemed relevant for assessing the propensity to be involved in car accidents. The suggested approach is mainly based on a network representation of users' mobility, called Individual Mobility Networks, jointly with the analysis of descriptive features of the user's driving behavior related to driving style (e.g., accelerations) and characteristics of the mobility in the neighborhood visited by the user. The paper presents a large experimentation over a real dataset, showing comparative performance against baselines and competitors, and a study of some typical risk factors in the areas under analysis through the adoption of state-of-the-art model explanation techniques. Preliminary results show the effectiveness and usability of the proposed predictive approach.
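The core data structure, an Individual Mobility Network, can be sketched minimally as a weighted directed graph whose nodes are a user's stop locations and whose edge weights count observed trips between consecutive stops. The stop sequence below is hypothetical, and a real pipeline would first detect stops from raw GPS traces.

```python
from collections import Counter

def mobility_network(stops):
    """Build a toy Individual Mobility Network: nodes are visited locations,
    directed edges count trips between consecutive stops; self-loops
    (consecutive records at the same location) are dropped."""
    edges = Counter(zip(stops, stops[1:]))
    return {e: w for e, w in edges.items() if e[0] != e[1]}

stops = ["home", "work", "home", "gym", "home", "work"]
print(mobility_network(stops))
# -> {('home', 'work'): 2, ('work', 'home'): 1,
#     ('home', 'gym'): 1, ('gym', 'home'): 1}
```

Habit-related features (e.g., how dominant the most frequent trip is) can then be read off the edge weights and fed to a risk classifier.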
Advances in Knowledge Discovery and Data Mining, 2019
Proceedings of the AAAI Conference on Artificial Intelligence, 2020
We present an approach to explain the decisions of black box image classifiers through synthetic exemplars and counter-exemplars learned in the latent feature space. Our explanation method exploits the latent representations learned through an adversarial autoencoder for generating a synthetic neighborhood of the image for which an explanation is required. A decision tree is trained on a set of images represented in the latent space, and its decision rules are used to generate exemplar images showing how the original image can be modified to stay within its class. Counterfactual rules are used to generate counter-exemplars showing how the original image can "morph" into another class. The explanation also comprises a saliency map highlighting the areas that contribute to its classification, and areas that push it into another class. A wide and deep experimental evaluation proves that the proposed method outperforms existing explainers in terms of fidelity, relevance, coherence, and s…
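The neighborhood-generation step can be sketched in isolation: perturb the instance's latent code, query the classifier on each neighbor, and split the neighbors into exemplars (same class) and counter-exemplars (different class). The classifier here is a toy stand-in operating directly on the latent space; the actual method decodes latent points through an adversarial autoencoder before querying an image classifier, and fits a decision tree on the labeled neighborhood.

```python
import numpy as np

rng = np.random.default_rng(0)

def black_box(z):
    """Toy stand-in classifier on the latent space (the real method queries
    an image classifier through the autoencoder's decoder)."""
    return int(z.sum() > 0)

def latent_neighborhood(z, n=200, scale=0.5):
    """Sample a synthetic Gaussian neighborhood around latent point z."""
    return z + rng.normal(0.0, scale, size=(n, z.size))

z = np.array([0.4, 0.3])        # latent code of the instance to explain
cls = black_box(z)
neigh = latent_neighborhood(z)
labels = np.array([black_box(p) for p in neigh])
exemplars = neigh[labels == cls]          # same-class neighbors
counter_exemplars = neigh[labels != cls]  # neighbors crossing the boundary
print(len(exemplars) + len(counter_exemplars))  # -> 200
```

Decoding the two sets back to image space would yield the exemplar and counter-exemplar images, and a surrogate decision tree fit on `(neigh, labels)` would supply the factual and counterfactual rules.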