POMDP
8 Followers
Recent papers in POMDP
This is an overview of partially observable Markov decision processes (POMDPs). We describe POMDP value and policy iteration as well as gradient ascent algorithms. The emphasis is on solution methods that work directly in the space of... more
Today within the AAMAS community, we see at least four competing approaches to building multiagent systems: beliefdesire-intention (BDI), distributed constraint optimization (DCOP), distributed POMDPs, and auctions or game-theoretic... more
Partially Observable Markov Decision Processes (POMDPs) have been demonstrated empirically to be good models for robust spoken dialogue design. This chapter shows that such models are also very appropriate for designing affective dialogue... more
A long-standing goal of AI is to enable robots to plan in the face of uncertain and incomplete information, and to handle task failure intelligently. This paper shows how to achieve this. There are two central ideas. The first idea is to... more
In a spoken dialog system, determining which action a machine should take in a given situation is a difficult problem because automatic speech recognition is unreliable and hence the state of the conversation can never be known with... more
In most of the papers on inventory models operating in a random environment, the state of the environment in each period is assumed to be fully observed with perfect information. However, this assumption is not realistic in most real-life... more
A new approach to solve the sensor control problem is proposed, formulated based on multi-object Bayes filtering in the partially observable Markov decision process (POMDP) context, where the multi-object states are assumed to be random... more
We demonstrate the In Situ testbed, a system that aids in evaluating computational models of learning, including artificial neural networks. The testbed models contingencies of reinforcement using an extension of Mechner's (1959)... more
This paper considers a scenario in which a secondary user (SU) opportunistically accesses a channel allocated to some primary network (PN) that switches between idle and active states in a time-slotted manner. At the beginning of each... more
We consider the problem of joint network coding and packet scheduling for multimedia transmission from the Access Point (AP) to multiple receivers in 802.11 networks. The state of receivers is described by a hidden Markov model and the AP... more
This extended abstract discusses various approaches to the constraining of Partially Observable Markov Decision Processes (POMDPs) using social norms and logical assertions in a dynamic logic framework. Whereas the exploitation of... more
Today within the AAMAS community, we see at least four competing approaches to building multiagent systems: beliefdesire-intention (BDI), distributed constraint optimization (DCOP), distributed POMDPs, and auctions or game-theoretic... more
Today within the AAMAS community, we see at least four competing approaches to building multiagent systems: beliefdesire-intention (BDI), distributed constraint optimization (DCOP), distributed POMDPs, and auctions or game-theoretic... more
Interactions between humans and service robots are more natural when emotions can be synthesized in those robots. Cognitive appraisal theory of emotions provides a theoretical basis for designing artificial emotion generation systems for... more
We present a framework for sensor actuation and control in sentient spaces, in which sensors are used to observe a physical phenomena. We focus on sentient spaces that enable pervasive computing applications, such as smart video... more
... Mürsel Yildiz, Ahmet Cihat Toker, Fikret Sivrikaya, Seyit Ahmet Camtepe, Sahin Albayrak ... assume certain implementation conventions and can not be applied to all products [7]. Similarly, delay time between scheduled and actual... more
This extended abstract discusses various approaches to the constraining of Partially Observable Markov Decision Processes (POMDPs) using social norms and logical assertions in a dynamic logic framework. Whereas the exploitation of... more
ABSTRACT Dynamic Spectrum Access is a key capability of Cognitive Radio (CR) networks to increase the efficiency in the use of the available spectrum resources. In this respect, this paper focuses on the spectrum selection problem when a... more
We consider the problem of joint network coding and packet scheduling for multimedia transmission from the Access Point (AP) to multiple receivers in 802.11 networks. The state of receivers is described by a hidden Markov model and the AP... more
The context is sensor control for multi-object Bayes filtering in the framework of partially observed Markov decision processes. The current information state is represented by the multi-object probability density function (PDF), while... more