Accelerating software radios by means of SIMD Instructions. A case for the AVX2 and AVX512 Extensions
Abstract
Current computer architecture trends are moving towards parallelization by means of node replication and data parallelization, which optimize the execution speed of a given application. Increasing the number of nodes is constrained by the hardware platform in use; however, effective data parallelization techniques can improve processing speeds by leveraging existing resources of the platform. This paper presents the AVX2 and AVX512 instruction addition to several kernels in the VOLK library. We discuss the capabilities of the new extensions and their interaction with the VOLK library. Finally, we show profiling results of the speed enhancements added to the library for AVX capable machines.
- I grant gnuradio.org a perpetual, non-exclusive license to distribute this article.
- I certify that I have the right to grant this license.
- I understand that submissions cannot be completely removed once accepted.
- I understand that gnuradio.org reserves the right to reclassify or reject any submission.