Abstract
In the field of artificial intelligence, learning automaton (LA) is a self-adaptive decision-maker which plays an important role in reinforcement learning (RL). Games of learning automata are stochastic games with incomplete information that have received frequent usage. Traditional learning automata schemes using in games are parameter-based schemes which exist a tunable parameter (stepsize) changing with different environments. In this paper, we proposed Bayesian method-based parameter-free learning automata (BPFLA) for two-player stochastic games with incomplete information. The parameter-free property indicates that a set of parameters in the scheme can be universally applicable for all configurations of games. Besides, simulation results demonstrate that BPFLA has much faster convergence rate than traditional schemes using games of learning automata with equal or higher accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An introduction, vol. 1, No. 1. Cambridge, MIT press (1998)
Narendra, K.S., Thathachar, M.A.: Learning Automata: An Introduction. Courier Corporation (2012)
Thomas, L.C.: Games, Theory And Applications. Courier Corporation (2012)
Wang, H., et al.: Reinforcement learning for constrained energy trading games with incomplete information. IEEE Trans. Cybern. 47(10), 3404–3416 (2017)
Fu, K.S., Li, T.J.: Formulation of learning automata and automata games. Inf. Sci. 1(3), 237–256 (1969)
Lakshmivarahan, S., Narendra, K.S.: Learning algorithms for two-person zero-sum stochastic games with incomplete information: a unified approach. SIAM J. Control. Optim. 20(4), 541–552 (1982)
Tilak, O., Ryan, M., Mukhopadhyay, S.: Decentralized indirect methods for learning automata games. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 41(5), 1213–1223 (2011)
Ge, H., et al.: A parameter-free gradient Bayesian two-action learning automaton scheme. In: Proceedings of the 2015 International Conference on Communications, Signal Processing, and Systems. Springer, Berlin (2016)
Ge, H.: A parameter-free learning automaton scheme (2017). arXiv:1711.10111
Gupta, A.K., Nadarajah, S. (eds.): Handbook of Beta Distribution and its Applications. CRC Press, Boca Raton (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ding, H., Di, C., Shenghong, L. (2020). Bayesian Method-Based Learning Automata for Two-Player Stochastic Games with Incomplete Information. In: Liang, Q., Liu, X., Na, Z., Wang, W., Mu, J., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2018. Lecture Notes in Electrical Engineering, vol 517. Springer, Singapore. https://doi.org/10.1007/978-981-13-6508-9_4
Download citation
DOI: https://doi.org/10.1007/978-981-13-6508-9_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6507-2
Online ISBN: 978-981-13-6508-9
eBook Packages: EngineeringEngineering (R0)