Abstract
Sorting is a fundamental operation used in many applications, including signal processing and data center acceleration. It is generally implemented on a CPU or GPU, where it takes several cycles to complete. Further performance gains are possible through hardware acceleration on an FPGA or ASIC. Improving performance and reducing power consumption are the primary goals of research on hardware-accelerated sorting. Bubble sort, bitonic sort and odd–even sort are well suited to hardware implementation and are widely discussed in the research literature. The literature also shows sustained efforts to make these sorting techniques more modular and lower in power, which is required to design large-scale sorters for data center applications. In this paper, we investigate the application of generic, structured low-power techniques, namely clock gating and multi-Vth, to the design of low-power sorters. The bubble sort, bitonic sort and odd–even sort designs are reworked to reduce their power using clock gating and multi-Vth. The implementation results show that clock gating reduces the dynamic power consumption of the sorters by 47.5% without much impact on performance. Further performance improvement is achieved by adopting multi-Vth libraries, without compromising the dynamic power reduction achieved through clock gating. The power reduction obtained is comparable with that of state-of-the-art low-power sorters, which are more complex in design. The proposed sorters are implemented, and results are presented, using Saed90nm standard cell libraries.
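As a rough software illustration of one of the sorting networks named above, the following Python sketch models an odd–even transposition sorter as alternating stages of compare-exchange units. It is not the authors' hardware design. The function name and the per-stage swap count are assumptions introduced here for illustration; the swap count is only a loose proxy for how many units change their stored values in a stage, which is the kind of activity a gating scheme might exploit.

```python
def odd_even_transposition_sort(values):
    """Sort a list with an odd-even transposition network (software model).

    Returns the sorted list and, for each stage, the number of
    compare-exchange units that actually swapped their inputs.
    Hypothetical helper for illustration; not taken from the paper.
    """
    data = list(values)
    n = len(data)
    swaps_per_stage = []
    for stage in range(n):  # n stages are sufficient for n inputs
        # Even stages compare pairs (0,1), (2,3), ...;
        # odd stages compare pairs (1,2), (3,4), ...
        start = stage % 2
        swaps = 0
        for i in range(start, n - 1, 2):
            if data[i] > data[i + 1]:
                data[i], data[i + 1] = data[i + 1], data[i]
                swaps += 1
        swaps_per_stage.append(swaps)
    return data, swaps_per_stage


if __name__ == "__main__":
    result, activity = odd_even_transposition_sort([5, 1, 4, 2, 3])
    print(result)    # sorted values
    print(activity)  # swaps performed in each stage
```

In this model, stages with few or no swaps correspond to units whose outputs equal their inputs. Whether and how such idle units are gated in the proposed sorters is not specified in the abstract.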
Ethics declarations
Conflict of Interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
This article is part of the topical collection “Smart and Connected Electronic Systems” guest edited by Amlan Ganguly, Selcuk Kose, Amit M. Joshi, and Vineet Sahula.
About this article
Cite this article
Preethi, P., Mohan, K.G., Kumar K., S. et al. Sorter Design with Structured Low Power Techniques. SN COMPUT. SCI. 4, 129 (2023). https://doi.org/10.1007/s42979-022-01546-7
DOI: https://doi.org/10.1007/s42979-022-01546-7