A reinforcement-learning approach for admission control in distributed network service systems

Lu, Xiaonong; Yin, Baoqun; Zhang, Haipeng

doi:10.1007/s10878-014-9820-3

A reinforcement-learning approach for admission control in distributed network service systems

Published: 03 December 2014

Volume 31, pages 1241–1268, (2016)
Cite this article

Journal of Combinatorial Optimization Aims and scope Submit manuscript

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

In the distributed network service systems such as streaming-media systems and resource-sharing systems with multiple service nodes, admission control (AC) technology is an essential way to enhance performance. Model-based optimization approaches are good ways to be applied to analyze and solve the optimal AC policy. However, due to “the curse of dimensionality”, computing such policy for practical systems is a rather difficult task. In this paper, we consider a general model of the distributed network service systems, and address the problem of designing an optimal AC policy. An analytical model is presented for the system with fixed parameters based on semi-Markov decision process (SMDP). We design an event-driven AC policy, and the stationary randomized policy is taken as the policy structure. To solve the SMDP, both the state aggregation approach and the reinforcement-learning (RL) method with online policy optimization algorithm are applied. Then, we extend the problem by considering the system with time-varying parameters, where the arrival rates of requests at each service node may change over time. In view of this situation, an AC policy switching mechanism is presented. This mechanism allows the system to decide whether to adjust its AC policy according to the policy switching rule. And in order to maximize the gain of system, that is, to obtain the optimal AC policy switching rule, another RL-based algorithm is applied. To assess the effectiveness of SMDP-based AC policy and policy switching mechanism for the system, numerical experiments are presented. We compare the performance of optimal policies obtained by the solutions of proposed methods with other classical AC policies. The simulation results illustrate that higher performance and computational efficiency could be achieved by using the SMDP model and RL-based algorithms proposed in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

€32.70 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (France)

Instant access to the full article PDF.

Institutional subscriptions

Learning optimal admission control in partially observable queueing networks

Article 29 June 2024

Event-based optimization for admission control in distributed service system

Article 25 September 2015

Optimal admission control in queues with multiple customer classes and abandonments

Article 31 January 2025

References

Abundo M, Cardellini V, Presti FL (2012) Admission control policies for a multi-class QoS-aware service oriented architecture. ACM SIGMETRICS Perform Eval Rev 39(4):89–98
Article Google Scholar
Altman E, Jimenez T, Koole G (2001) On optimal call admission control in a resource-sharing system. IEEE Trans Commun 49(9):1569–1668
Article MATH Google Scholar
Cao X (2005a) A basic formula for online policy gradient algorithms. IEEE Trans Autom Control 50(5):696–699
Article MathSciNet Google Scholar
Cao X (2005b) Basic ideas for event-based optimization of Markov systems. Discr Event Dyn Syst Theory Appl 15(2):169–197
Article MathSciNet MATH Google Scholar
Chen W, Shih C (2012) Architecture of portable electronic medical records system integrated with streaming media. J Med Syst 36(1):25–31
Article Google Scholar
Gosavi A (2004) Reinforcement learning for long-run average cost. Eur J Oper Res 155:657–674
Article MathSciNet MATH Google Scholar
Gosavi A (2011) Target-sensitive control of Markov and semi-Markov processes. Int J Control Autom Syst 9(5):941–951
Article MathSciNet Google Scholar
Huang Y, Fu T, Chiu D, Lui J, Huang C (2008) Challenges, design and analysis of a large-scale p2p-vod system. In: ACM SIGCOMM, pp 375–388
Janssen J (1999) Semi-Markov models: theory and applications. Springer, New York
Book MATH Google Scholar
Janssen J, Manca R (2005) Applied semi-Markov processes. Springer, New York
MATH Google Scholar
Li J, Yang J, Xi H (2009) Dynamic threshold based admission control policy for video-on-demand systems. J Chin Comput Syst 3(3):551–554
Google Scholar
Li Y, Cao F (2013) A basic formula for performance gradient estimation of semi-Markov decision processes. Eur J Oper Res 224:333–339
Article MathSciNet MATH Google Scholar
Lin F, Yin B, Huang J, Wu X (2012) Admission control with elastic QoS for video-on-demand systems. Int J Autom Comput 9(5):467–473
Article Google Scholar
Lu X, Yin B, Zhang H, Ling Q (2012) Admission control scheme for distributed service systems based on model and prediction. In: Chinese Control Conference, pp 5518–5523
Lu X, Yin B, Zhang H (2014) Switching-pomdp based admission control policies for service systems with distributed architecture. In: IEEE ICNSC, pp 209–214
Mundur P, Simon R, Sood A (2004) End-to-end analysis of distributed video-on-demand systems. IEEE Trans Multimed 6(1):129–141
Article Google Scholar
Mundur P, Sood A, Simon R (2005) Class-based access control for distributed video-on-demand systems. IEEE Trans Circ Syst Video Technol 15(7):844–853
Article Google Scholar
Ni J, Tstang D, Tatikonda S, Bensaou B (2007) Optimal and structured call admission control policies for resource-sharing systems. IEEE Trans Commun 55(1):158–170
Article Google Scholar
Singh S, Tadic V, Doucet A (2007) A policy gradient method for semi-Markov decision processes with application to call admission control. Eur J Oper Res 178:808–818
Article MathSciNet MATH Google Scholar
Thng I, Luo X (2004) A robust m/m/1/k scheme for providing hand-off dropping QoS in multi-service mobile networks. Wirel Netw 10(3):301–309
Article Google Scholar
Xia Z, Hao W, Yen I, Li P (2005) Architecture of portable electronic medical records system integrated with streaming media. IEEE Trans Parallel Distrib Syst 16(12):1143–1153
Article Google Scholar
Yin B, Lu S, Guo D (2011) Analysis of admission control in p2p-based media delivery network based on POMDP. Int J Innov Comput Inf Control 7(7B):4411–4422
Google Scholar
Zhang F, Sun W (2012) P2p streaming media technology in the remote education system. Adv Mater Res 433:4893–4897
Article Google Scholar
Zhang H, Yin B, Lu X (2013) A novel dynamic model for streaming service system. In: IEEE ICSESS, pp 326–329
Zhi Y, Zhu Z, Ma X, Wang B (2006) Client-class based admission control for distributed video-on-demand system. In: International Conference on Digital Object Identifier, pp 1–4
Zhou Y, Chiu D, Lui J (2011) A simple model for chunk scheduling strategies in p2p streaming. IEEE/ACM Trans Netw 19(1):42–54
Article Google Scholar
Zimmerman R, Fu K (2003) Comprehensive statistical admission control for streaming media servers. In: ACM Multimedia Conference, pp 75–85

Download references

Acknowledgments

This work is supported in part by the National Natural Science Foundation of China under Grant Nos. 61174124, 61233003, in part by Research Fund for the Doctoral Program of Higher Education of China under Grant No. 20123402110029 and in part by Natural Science Research Program of the Anhui High Education Bureau of China under Grant No. KJ2012A286.

Author information

Authors and Affiliations

Department of Automation, University of Science and Technology of China, Hefei, 230027, China
Xiaonong Lu, Baoqun Yin & Haipeng Zhang

Authors

Xiaonong Lu
View author publications
You can also search for this author in PubMed Google Scholar
Baoqun Yin
View author publications
You can also search for this author in PubMed Google Scholar
Haipeng Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaonong Lu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lu, X., Yin, B. & Zhang, H. A reinforcement-learning approach for admission control in distributed network service systems. J Comb Optim 31, 1241–1268 (2016). https://doi.org/10.1007/s10878-014-9820-3

Download citation

Published: 03 December 2014
Issue Date: April 2016
DOI: https://doi.org/10.1007/s10878-014-9820-3

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

€32.70 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (France)

Instant access to the full article PDF.

Institutional subscriptions

A reinforcement-learning approach for admission control in distributed network service systems

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Learning optimal admission control in partially observable queueing networks

Event-based optimization for admission control in distributed service system

Optimal admission control in queues with multiple customer classes and abandonments

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

A reinforcement-learning approach for admission control in distributed network service systems

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Learning optimal admission control in partially observable queueing networks

Event-based optimization for admission control in distributed service system

Optimal admission control in queues with multiple customer classes and abandonments

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation