Cloud Networking
Cloud Networking
Cloud Networking
2
Cloud Computing Era
• Cloud computing is Internet-based computing,
whereby shared resources, software and
information are provided to computers and other
devices on-demand, like the electricity grid.
3
*source : Wikimedia Commons – Cloud Computing
Why Cloud Computing?
• Elastic resources
– Expand and contract resources
– Pay-per-use
– Infrastructure on demand
• Multi-tenancy
– Multiple independent users
– Security and resource isolation
– Divide the cost of the (shared) infrastructure
• Simplify app deployment & management
– Common programming model across mobile, browser,
client, server, cloud
4
Microsoft’s Cloud Platform
Finished Services
Dev Tools
& Solutions
Building Block
Services
Cloud
Infrastructure
Services
Global
Foundation
Services
*Source: http://research.microsoft.com/pubs/102318/Location-based%20service%20on%20the%20Cloud.pptx
Virtualization
……
Aggregator Aggregator Aggregator
… …
Worker Worker Worker Worker Worker
7
Outline
1 Introduction
8
Applications inside Data Centers
….
…. …. ….
Congestion
• Query [1KB-100KB]
- Delay-sensitive
• Short messages [100KB-1MB] - Large in number
- Few bytes amount
(Coordination, Control state)
10
Typical Sources of Performance Degradation
in Data Center Networks [4]
a) Incast : many flows go through the same port within a short interval The
buffer space get exhausted packets of some flows dropped miss
deadline
b) Queue buildup : even with no packets are dropped short flows
experience increased latency queued behind packets from the large flow
c) Buffer Pressure: when shallow buffered switch (shared memory) is used
short flows on one port to be impacted by activity of long flows on other
ports.
11
Outline
1 Introduction
12
TCP Congestion Control
• Designed to address Internet congestion problem
– Window-based (AIMD) adjustment of sending rates.
– Assume packet losses network congestion
– many variants: Tahoe, Reno, Vegas, Cubic, Westwood, ..
• Router assistance to TCP
– Random Early Detection (RED) : measures congestion based
on weighted moving average of queue length and either
drop/mark probabilistically
– Explicit Congestion Notification (ECN) : is used for conveying
congestion information to the senders
• Clean-slate approach
– eXplicit Congestion Control (XCP): Congestion Window +
Feedback (in ACKs)
Differences Between DCN and
Internet/WAN
Characteristic Internet/WAN DCN
14
Outline
1 Introduction
15
Data Center Transport
Requirements
1. High Burst Tolerance
– Incast due to Partition/Aggregate is common.
2. Low Latency
– Short flows, queries
3. High Throughput
– Continuous data updates, large file transfers
16
Existing Solutions
1. Sender-Based :
– Mirco-seconds MinRTO [3] and DCTCP [4]
2. Receiver-Based :
– ICTCP [10] and PAC [11]
3. Switch-Assisted :
– PFabric [7] and Cutting-Payload [12]
4. Deadline-Aware :
– D3 [5] , D2TCP [8] and PDQ [9]
17
Data Center TCP (DCTCP)
Sender 1
ECN = Explicit Congestion Notification
ECN Mark (1
bit) Receiver
Sender 2
18
Two Key Ideas
1. React in proportion to the extent of congestion, not
to its presence.
Reduces variance in sending rates, lowering queuing
requirements.
1 2 3
21
Outline
1 Introduction
22
Our Work
• Simple yet efficient switch-assisted solution
• No modification to the TCP sender or receiver
stack.
• Solution that fits in regardless of TCP flavor.
• Appealing to public cloud operators.
• Incremental deployment is possible.
• IQM [12] at Globecom15
• RWNDQ [13,14] at Cloudnet15 and IPCCC15
23
TCP Flow Control is the answer
Flow Control is part of all TCP flavors
Data Data
Sender Receiver
ACK ACK
25
IQM Algorithm
Switch side (Continuously monitor incoming SYN/FIN):
– If extra traffic > “limit” raise incast flag.
– Set TCP RWND=1 MSS during incast epoch.
– Disable window rewriting when the queue drops back to
“Save thr”.
Switch Port
Limit Safe Thr
Data Data
ACK ACK
Bottleneck
Switch
27
Sample - Experimental Analysis
Small Scale Testbed using Open vSwitch
Scenario depicting 150 elephants against 30 Mice.
Mice Goal: Low Latency and low variance.
Elephants Goal: High and enough throughput
28
RWNDQ - Two Key Ideas
1. Switch egress port toward destination is a receiver of the data.
Buffer occupancy change over time
Buffer occupancy reflects level of congestion.
Locality of number of ongoing flow information.
29
RWNDQ Algorithm
Switch side (Local window proportional to queue occupancy):
– Increase receive window when below the target.
– Decrease when we are above the queue target.
– Slow start to initially reach target fast.
Switch Port
Queue Target
Data Data
ACK ACK
31
Conclusion
• DCN congestion is a hot research topic
– Business needs and service agreements
– Quality of service (QoS)
• DCN congestion control is a necessity
– Incast is a very serious and frequent problem.
– Employing an efficient packet queueing-
scheduling to preserve small switch buffers
– Meeting deadlines either by achieving low
latency or building a deadline-aware
networking architecture.
32
Future Research Directions
• Leveraging functionalities of SDN
• Stability analysis and study.
• Handling persistent TCP connections.
• Adapting to varying initial congestion
window.
• Bandwidth allocation in Multi-tenant
datacenter with QoS constraints.
33
THANKS!
QUESTIONS ARE WELCOMED
34
References
1. V. Jacobson. Congestion avoidance and control. ACM SIGCOMM Computer Communication
Review, 18:314-329, 1988.
2. J Dean and S Ghemawat. MapReduce : Simplied Data Processing on Large Clusters.
Communications of the ACM, 51:1-13, 2008.
3. Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Andersen, Gregory R.
Ganger, Garth A. Gibson, and Brian Mueller. Safe and effective fine-grained TCP retransmissions
for datacenter communication. ACM SIGCOMM Computer Communication Review, 39:303, 2009.
4. Mohammad Alizadeh, Albert Greenberg, David A. Maltz, Jitendra Padhye, Parveen Patel, Balaji
Prabhakar, Sudipta Sengupta, and Murari Sridharan. Data center TCP (DCTCP). ACM SIGCOMM
Computer Communication Review, 40:63, 2010.
5. Christo Wilson, Hitesh Ballani, Thomas Karagiannis, and Ant Rowstron. Better Never than Late:
Meeting Deadlines in Datacenter Networks. In Proc. ACM Conference on Communications
Architectures, Protocols and Applications (SIGCOMM'11), pages 50{61, 2011.
6. Theophilus Benson, Aditya Akella, and David a. Maltz. Network traffic characteristics of data
centers in the wild. In Proceedings of the 10th ACM SIGCOMM page 267, 2010.
7. Mohammad Alizadeh, Shuang Yang, Sachin Katti, Nick McKeown, Balaji Prabhakar, and Scott
Shenker. Deconstructing datacenter packet transport. Proceedings of the 11th ACM Workshop on
Hot Topics in Networks - HotNets-XI, pages 133-138, 2012.
8. Balajee Vamanan, Jahangir Hasan, and T.N. Vijaykumar. Deadline-aware data center tcp (d2tcp). In
Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies,
Architectures, and Protocols for Computer Communication, SIGCOMM '12, pages 115{126, New
York, NY, USA, 2012. ACM.
35
References
8. Chi-Yao Hong, Matthew Caesar, and P. Brighten Godfrey. Finishing flows quickly with
preemptive scheduling. In Proceedings of the ACM SIGCOMM 2012 conference on
Applications, technologies, architectures, and protocols for computer communication -
SIGCOMM '12, page 127, New York, New York, USA, August 2012. ACM Press.
9. Haitao Wu, Zhenqian Feng, Chuanxiong Guo, and Yongguang Zhang. ICTCP: Incast
congestion control for TCP in data-center networks. IEEE/ACM Transactions on Networking,
21:345{358, 2013.
10. Wei Bai, Kai Chen, Haitao Wu, Wuwei Lan, and Yangming Zhao. PAC: Taming TCP Incast
congestion Using Proactive ACK Control. In IEEE International Conference on Network
Protocols, 2014.
11. Peng Cheng, Fengyuan Ren, Ran Shu, and Chuang Lin. Catch the Whole Lot in an Action:
Rapid Precise Packet Loss Notication in Data Center. In Proceedings of the 11th USENIX
Symposium on Networked Systems Design and Implementation (NSDI 14), pages 17{28,
2014.
12. Ahmed M. Abdelmoniem and Brahim Bensaou, “Incast-Aware Switch-Assisted TCP
Congestion Control for Data Centers”, IEEE Global Communications Conference (Globecom
15).
13. Ahmed M. Abdelmoniem and Brahim Bensaou, “Reconciling Mice and Elephants in Data
Center Networks”, IEEE Cloud Networking Conference (Cloudnet 15).
14. Ahmed M. Abdelmoniem and Brahim Bensaou, “ Efficient Switch-Assisted Congestion
Control for Data Centers: an Implementation and Evaluation”, IEEE Performance Computing
and Communications Conference (IPCCC15).
36