POMDPs
Recent papers in POMDPs
In this work, we propose a multimodal interaction framework for robust human-multirobot communication in outdoor environments. In these scenarios, several human or environmental factors can cause errors, noise and wrong interpretations of... more
The Partially Observable Markov Decision Process (POMDP) is a widely used model of the interaction between an agent and its environment under state uncertainty. Since the agent does not observe the environment state, its uncertainty is... more
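The state uncertainty this entry refers to is usually tracked as a belief, a probability distribution over states that is updated by Bayes' rule after each action and observation. A minimal sketch of that update (a generic textbook illustration with made-up model numbers, not the method of any paper listed here):

```python
import numpy as np

def belief_update(b, a, o, T, Z):
    """Bayesian belief update for a discrete POMDP.

    b : current belief over states, shape (S,)
    a : action index
    o : observation index
    T : transition probs, T[a, s, s2] = P(s2 | s, a), shape (A, S, S)
    Z : observation probs, Z[a, s2, o] = P(o | s2, a), shape (A, S, O)
    """
    # Predict: push the belief through the transition model.
    predicted = b @ T[a]                 # shape (S,)
    # Correct: weight by the likelihood of the observation received.
    unnorm = Z[a, :, o] * predicted
    norm = unnorm.sum()
    if norm == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return unnorm / norm

# Two-state example: a "listen" action that leaves the state unchanged
# and yields an 85%-accurate observation of it.
T = np.array([[[1.0, 0.0], [0.0, 1.0]]])
Z = np.array([[[0.85, 0.15], [0.15, 0.85]]])
b = np.array([0.5, 0.5])
b1 = belief_update(b, a=0, o=0, T=T, Z=Z)  # belief shifts toward state 0
```

Starting from a uniform belief, one accurate observation moves the posterior to (0.85, 0.15); repeated observations sharpen it further.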
Partially Observable Markov Decision Processes (POMDPs) provide an efficient way to model real-world sequential decision making processes. Motivated by the problem of maintenance and inspection of a group of infrastructure components with... more
Decision making is a ubiquitous problem that arises whenever one faces several possible choices. The problem becomes all the more complex when the decisions, or actions, must be taken sequentially. In... more
Addressing current challenges in research on disruptive mood dysregulation disorder (DMDD), this study aims to compare executive function in children with DMDD, children with attention-deficit/hyperactivity disorder (ADHD), and children... more
Recent work in the behavioural sciences has begun to overturn the long-held belief that human decision making is irrational, suboptimal and subject to biases. This turn to the rational suggests that human decision making may be a better... more
As agent technology becomes increasingly prevalent, coordination in mixed agent-human environments becomes a key issue. Agent-human coordination is becoming even more important in real life situations, where uncertainty and... more
We propose a novel approach to developing a tractable affective dialogue model for probabilistic frame-based dialogue systems. The affective dialogue model, based on Partially Observable Markov Decision Process (POMDP) and Dynamic... more
Planning to see: A hierarchical approach to planning visual actions on a robot using POMDPs. This journal article describes a novel approach that enables a mobile robot to autonomously tailor vision-based sensing and information processing... more
This paper proposes a novel hierarchical representation of POMDPs that for the first time is amenable to real-time solution. It will be referred to in this paper as the Robot Navigation-Hierarchical POMDP (RN-HPOMDP). The RN-HPOMDP is... more
We analyze a single-item periodic-review inventory system with random yield and finite capacity operating in a random environment. The primary objective is to extend the model of Gallego and Hu (2004) to the more general case when the... more
Objective: To develop a scale for emotional regulation using item response theory. Method: Eighteen Swanson Nolan and Pelham (SNAP-IV) items that loaded on an emotional dysregulation factor were submitted to Rasch analysis. After... more
Methods of deep machine learning make it possible to reuse low-level representations efficiently for generating more abstract high-level representations. Originally, deep learning was applied passively (e.g., for classification purposes).... more
We formalize decision-making problems in robotics and automated control using continuous MDPs and actions that take place over continuous time intervals. We then approximate the continuous MDP using finer and finer discretizations. Doing... more
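The discretization idea in this entry can be illustrated with a toy example (a generic sketch with invented dynamics, not the paper's construction): bin a continuous 1-D state space into a finite grid, then solve the induced finite MDP with standard value iteration.

```python
import numpy as np

def discretize(x, lo, hi, n):
    """Map a continuous state x in [lo, hi] to one of n bin indices."""
    i = int((x - lo) / (hi - lo) * n)
    return min(max(i, 0), n - 1)

def value_iteration(P, R, gamma=0.95, tol=1e-8):
    """P[a, s, s2] = transition probs; R[a, s] = expected rewards."""
    A, S, _ = P.shape
    V = np.zeros(S)
    while True:
        Q = R + gamma * P @ V            # shape (A, S)
        V_new = Q.max(axis=0)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new

# Tiny induced MDP: 4 bins, 2 actions (step left / step right),
# reward only for occupying the rightmost bin.
S, A = 4, 2
P = np.zeros((A, S, S))
for s in range(S):
    P[0, s, max(s - 1, 0)] = 1.0         # action 0 moves toward bin 0
    P[1, s, min(s + 1, S - 1)] = 1.0     # action 1 moves toward bin S-1
R = np.zeros((A, S))
R[:, S - 1] = 1.0
V = value_iteration(P, R)                # values increase toward the goal bin
```

Refining the grid (larger n) shrinks the approximation error at the cost of a larger finite MDP, which is the trade-off the discretization scheme has to manage.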
Bayesian learning methods have recently been shown to provide an elegant solution to the exploration-exploitation trade-off in reinforcement learning. However most investigations of Bayesian reinfo...
High dimensionality of belief space in Partially Observable Markov Decision Processes (POMDPs) is one of the major causes that severely restricts the applicability of this model. Previous studies have demonstrated that the dimensionality... more
Recent research has shown that effective dialogue management can be achieved through the Partially Observable Markov Decision Process (POMDP) framework. However past research on POMDP-based dialogue systems usually assumed the parameters... more
Identifying an object of interest, grasping it, and handing it over are key capabilities of collaborative robots. In this context we propose a fast, supervised learning framework for learning associations between human hand gestures and... more
A common approach to the control problem in partially observable environments is to perform a direct search in policy space, as defined over some set of features of history. In this paper we consider predictive features, whose values are... more
The problem of developing good policies for partially observable Markov decision problems (POMDPs) remains one of the most challenging areas of research in stochastic planning. One line of research in this area involves the use of... more
One of the main challenges when it comes to designing a dialogue system is error handling. The Automatic Speech Recognition (ASR) technology is not perfect and the user may use words or expressions that are unknown to the system... more
Active Object Recognition (AOR) has been approached as an unsupervised learning problem, in which optimal trajectories for object inspection are not known and are to be discovered by reducing label uncertainty measures or training with... more
Existing algorithms for discrete partially observable Markov decision processes can at best solve problems of a few thousand states due to two important sources of intractability: the curse of dimensionality and the policy space... more
We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic finite state controllers, combining several advantages of... more
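The core subroutine behind controller-based methods like this one is evaluating a fixed stochastic finite-state controller (FSC) on the POMDP, which reduces to solving a linear system over (node, state) pairs. A hedged sketch of that evaluation step, with made-up model numbers (not the paper's bounded policy iteration itself):

```python
import numpy as np

def evaluate_fsc(psi, eta, T, Z, R, gamma=0.95):
    """Solve V(n, s) for a stochastic finite-state controller.

    psi[n, a]        : P(a | node n)
    eta[n, a, o, n2] : P(next node n2 | n, a, o)
    T[a, s, s2], Z[a, s2, o], R[a, s] : POMDP model
    Returns V with shape (N, S).
    """
    N, A = psi.shape
    S = T.shape[1]
    # Build the linear system (I - gamma * M) v = r over (n, s) pairs.
    M = np.zeros((N * S, N * S))
    r = np.zeros(N * S)
    for n in range(N):
        for s in range(S):
            i = n * S + s
            for a in range(A):
                r[i] += psi[n, a] * R[a, s]
                for s2 in range(S):
                    for o in range(Z.shape[2]):
                        p = psi[n, a] * T[a, s, s2] * Z[a, s2, o]
                        for n2 in range(N):
                            M[i, n2 * S + s2] += p * eta[n, a, o, n2]
    v = np.linalg.solve(np.eye(N * S) - gamma * M, r)
    return v.reshape(N, S)

# Sanity check: a one-node, one-action controller with reward 1
# everywhere should value every state at 1 / (1 - gamma) = 20.
psi = np.ones((1, 1))
eta = np.ones((1, 1, 1, 1))
T = np.array([[[0.9, 0.1], [0.1, 0.9]]])
Z = np.ones((1, 2, 1))
R = np.ones((1, 2))
V = evaluate_fsc(psi, eta, T, Z, R)
```

A policy-iteration scheme then alternates this evaluation with local improvements to psi and eta under a bound on the number of controller nodes.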
We consider the problem of belief-state monitoring for the purposes of implementing a policy for a partially-observable Markov decision process (POMDP), specifically how one might approximate the belief state. Other schemes for belief state... more
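One widely used family of belief-approximation schemes (a generic illustration, not the scheme this paper proposes) replaces the exact distribution with a set of sampled state particles that are propagated, reweighted, and resampled at each step:

```python
import random

def particle_update(particles, a, o, sample_next, obs_prob, rng=random):
    """One particle-filter monitoring step.

    particles   : list of sampled states
    sample_next : (s, a) -> sampled successor state
    obs_prob    : (s2, a, o) -> P(o | s2, a)
    """
    # Propagate each particle through the transition model.
    proposed = [sample_next(s, a) for s in particles]
    # Weight each propagated particle by the observation likelihood.
    weights = [obs_prob(s2, a, o) for s2 in proposed]
    total = sum(weights)
    if total == 0:
        return proposed  # degenerate case: fall back to the prediction
    # Resample with replacement, proportionally to the weights.
    return rng.choices(proposed, weights=weights, k=len(proposed))

# Two-state toy model: static state, 90%-accurate observation.
rng = random.Random(0)
parts = [0, 1] * 50                       # uniform prior, 100 particles
parts = particle_update(
    parts, a=0, o=0,
    sample_next=lambda s, a: s,           # state never changes
    obs_prob=lambda s2, a, o: 0.9 if s2 == o else 0.1,
    rng=rng,
)
# The fraction of particles in state 0 is now typically close to 0.9.
```

The particle count controls the accuracy/cost trade-off, which is exactly the knob such monitoring schemes tune.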
We consider the problem of learning the behavior of a POMDP (Partially Observable Markov Decision Process) with deterministic actions and observations. This is a challenging problem due to the fact that the observations can only partially... more
There is a tendency in decision-making research to treat uncertainty only as a problem to be overcome. But it is also a feature that can be leveraged, particularly in social interaction. Comparing the behavior of profitable and... more
In human-robot interactive scenarios, communication and collaboration during task execution are crucial issues. Since human behavior is unpredictable and ambiguous, an interactive robotic system must continuously interpret intentions... more
My PhD was a period full of excitement, of intense learning on both the scientific and personal level. At the same time, there were many hard moments, where I had real doubts on its success. There are many people without whom I couldn't... more
Attracted by their easy-to-use interfaces and captivating benefits, conversational systems have been widely embraced by many individuals and organizations as side-by-side digital co-workers. They enable the understanding of user needs,... more
POMDPs.jl is an open-source framework for solving Markov decision processes (MDPs) and partially observable MDPs (POMDPs). POMDPs.jl allows users to specify sequential decision making problems with minimal effort without sacrificing the... more
Motion planning under uncertainty is essential to autonomous robots. Over the past decade, the scalability of such planners has advanced substantially. Despite these advances, the problem remains difficult for systems with non-linear... more
Low-cost navigation solutions for indoor environments have a variety of real-world applications ranging from emergency evacuation to mobility aids for people with disabilities. Challenges for indoor navigation include robust localization... more
Bayesian approaches provide a principled solution to the exploration-exploitation trade-off in Reinforcement Learning. Typical approaches, however, either assume a fully observable environment or scale poorly. This work introduces the... more
This Dagstuhl Seminar also stood as the 11th European Workshop on Reinforcement Learning (EWRL11). Reinforcement learning gains more and more attention each year, as can be seen at the various conferences (ECML, ICML, IJCAI, ...).... more
- by Peter Auer
Abstract—A mobile robot must have the ability to build a representation of its environment and the objects in it. To build a three-dimensional (3D) model of a physical object, several scans must be taken at different locations.... more
In this paper the approach of using a partially observable Markov model for games with dynamical difficulty adjustment is introduced. This approach leads implicitly to a strategy which balances gathering information about the player... more
Brain-imaging technology has boosted the quantification of neurobiological phenomena underlying human mental operations and their disturbances. Since its inception, drawing inference on neurophysiological effects hinged on classical... more
Others can have a different perception of the world than ours. Understanding this divergence is an ability, known as perspective taking in developmental psychology, that humans exploit in daily social interactions. A recent trend in... more
Many robotic projects use simulation as a faster and easier way to develop, evaluate and validate software components compared with on-board real world settings. In the human-robot interaction field, some recent works have attempted to... more
Automatically generating solutions to general multi-robot coordination problems with communication limitations is challenging, but crucial in many domains. As one way to address this problem, we describe a probabilistic framework for... more
Online, sample-based planning algorithms for POMDPs have shown great promise in scaling to problems with large state spaces, but they become intractable for large action and observation spaces. This is particularly problematic in... more
With rapid profusion of video data, automated surveillance and intrusion detection is becoming closer to reality. In order to provide timely responses while limiting false alarms, an intrusion detection system must balance resources... more
Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for decentralized decision making under uncertainty. However, they typically model a problem at a low level of granularity, where each agent's... more
POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their high computational complexity, however, presents an important research challenge. One way... more
In recent years, there has been much debate regarding the most appropriate diagnostic classification of children exhibiting emotion dysregulation in the form of irritability and severe temper outbursts. Most recently, this has resulted in... more