Background: The enhancement of real-world speech signals is still a challenging task to eliminate noises, namely reverberation, background, street, and babble noises. Recently learned methods like dictionary learning have become increasingly popular and showed promising results in speech enhancement. The K-means Singular Value Decomposition (KSVD) algorithm is best suited for dictionary learning among many sparse representation algorithms. Moreover, the orthogonal matching pursuit (OMP) based algorithm used for signal recovery is given. The orthogonal matching pursuit (OMP) based algorithm for signal recovery gives the best enhancement results. On the other hand, FPGAs and ASICs are widely used to accelerate speech enhancement applications. FPGAs are commonly used in healthcare and consumer applications, where speech enhancement plays a crucial role.
Methods: This paper proposes a modified KSVD algorithm that can easily be implemented onto hardware platforms like FPGAs and ASICS. Instead of using the double-precision arithmetic for the singular value decomposition part of the KSVD algorithm, we proposed to use CORDIC (Coordinate Rotation Digital Computer) based QR decomposition and QR-based singular value decomposition in dictionary learning.
Results: The proposed KSVD algorithm is optimal with the CORDIC algorithm that can reduce by 7-8 times the processing time.
Conclusion: The finding indicates that the proposed work is best suited to FPGA or ASIC platforms.
Keywords: K-singular value decomposition, FPGA, ASIC, CORDIC, dictionary learning, sparse representation.