SlideShare a Scribd company logo
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
The Implementation of the Improved OMP for AIC Reconstruction
Based on Parallel Index Selection
Abstract:
Sparse signal recovery becomes extremely challenging for a variety of real-time
applications. In this paper, we improve the orthogonal matching pursuit (OMP) algorithm
based on parallel correlation indices selection mechanism in each iteration and
Goldschmidt algorithm. Simulation results show that the improved OMP algorithm with a
reduced number of iterations and low hardware complexity of matrix operations has
higher success rate and recovery signal-to-noise-ratio (RSNR) for sparse signal recovery.
This paper presents an efficient complex valued system hardware architecture of the
recovery algorithm for analog-to-information structure based on compressive sensing.
The proposed architecture is implemented and validated on the Xilinx Virtex6 field-
programmable gate array (FPGA) for signal reconstruction with N = 1024, K = 36, and M
= 256. The implementation results showed that the improved OMP algorithm achieved a
higher RSNR of 31.04 dB compared with the original OMP algorithm. This synthesized
design consumes a few percentages of the hardware resources of the FPGA chip with the
clock frequency of 135.4 MHZ and reconstruction time of 170 µs, which is faster than the
existing design.
Software Implementation:
 Modelsim
 Xilinx 14.2
Existing System:
Sensing and processing information have traditionally relied on the Shannon sampling
theorem, one of the central tenets of digital signal processing. However, in many
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
applications, including digital image, video cameras, and high-speed analog-to-digital
converters (ADCs), increasing the sampling rate is very expensive. The recent theory of
compressive sensing (CS), is presented to capture and represent compressible signals at a
rate significantly below the Nyquist rate. This is the new sampling architecture is
proposed aimed at implementing sub-Nyquist signal acquisition system such as analog-
to-information converters (AICs). In this structure, the high-dimensional signal is
projected into the low-dimensional space with an incoherent measurement matrix by
exploiting the sparsity of the signal. The signal is then reconstructed from these
projections using an optimization process.
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
At present, there are different structures of the AIC model including the random
demodulator structure proposed, related to hardware implementation. However, these
applications do not have a complete realization of the system and mainly focus on the
front-end analogy information processing hardware implementation. The back-end
digitization and reconstruction are carried out using external units such as oscilloscopes,
Agilent Infiniium 54855A, Tektronix oscilloscope, and 8-bit oscilloscopes. The collected
information is sent to the PC for reconstruction. Therefore, the current research to
achieve reconstruction of the signal is mostly in the CPU, GPU, or DSP platform.
However, offline processing results in excessive I/O data-rates and storage requirements.
More importantly, it prohibits timely (or real-time) decisions based on the recovered
information and prevents the use of adaptive sensing strategies. Hence, it is necessary to
design dedicated hardware architectures for the very large-scale integration (VLSI) or
field-programmable gate arrays (FPGAs) connected with the analog front-end hardware
for real-time processing. Recently, various algorithms have been proposed for
reconstructing signals from the compressively sensed samples. As efficient methods to
calculate a matrix decomposition exists, orthogonal matching pursuit (OMP) algorithm is
suitable for VLSI implementation, mostly due to the regular structure of the least-
squares(LSs) optimization.
In recent years, many structures of OMP algorithm hardware implementations have been
proposed. However, few designs can present a low-complexity and high-speed hardware
implementation. An implementation of OMP algorithm with an FPGA has been
proposed. However, the design uses a 128-length vector for sparsity of 5. The
architecture contains higher cycle period due to the path delay in the dot product of
matrix. Stanislaus and Mohsenin, have proposed and improved the hardware architecture
of OMP algorithm based on QR decomposition. This design finds the inverse of a matrix
involving square root units. A more efficient approach is to use the modified Cholesky
decomposition which avoids the use of square roots. Numerous works related to the
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
acceleration of these operations individually have been proposed. In the design presented,
the matrix inversion is based on CORDIC divider with latency and a sequential execution
of several parts of matrix multiplication. Inversion based on Newton–Raphson iteration,
but the two multiplication modules in this algorithm are also
Fig. 1. Complete structure of signal recovery Chky-mat, which is Cholesky decomposition and is used to
get the inversion matrix
executed sequentially. Besides, existing designs are mainly implemented with the real-
valued system. They are mostly aimed at the hardware implementation of the OMP
algorithm, but not for the actual applications. The processed signals are usually sparse in
time domain and do not need to be converted to frequency domain processing. However,
a complex-valued system of reconstruction algorithm is required for practical
applications because many man-made or natural signals are sparse in an appropriate
orthonormal basis (such as Fourier basis). The sensing matrix of the OMP algorithm is
usually complex-valued matrix, so the proposed design in this paper is a complex-valued
system for practical applications. Furthermore, we find the high computational
complexity of matrix operations based on OMP is a major concern on achieving real-time
reconstruction of compressively sensed signal.
In this paper, we mainly proposed the complex-valued system of the signal reconstruction
for the CS-based AIC structure. Additionally, we reduce the computational complexity of
matrix operations and number of iterations to improve the OMP algorithm. We have
developed a MATLAB code of the algorithm and compared the success rate and recovery
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
signal to-noise-ratio (RSNR) of input signals recovered correctly base on different
sparsities by verification. The improved OMP algorithm has higher success rate and
RSNR than the OMP algorithm. In order to validate the improved OMP algorithm
efficiently, a complete signal recovery structure using SIMULINK for AIC model based
on CS is built in this paper. Besides, we design VLSI architecture of the improved OMP
algorithm with real time, low power, and low complexity requirements. Additionally, we
propose an efficient matrix inversion design based on the Goldschmidt algorithm.
Compared with the inversion method of the Newton–Raphson, the design of the inversion
circuit based on the Goldschmidt algorithm can improve the parallelism of the operation
and reduce the quantization error of each iteration.
Disadvantages:
 Number of iterations is higher
 Hardware complexity is higher
Proposed System:
VLSI implementation of the improved OMP algorithm
In this paper, the proposed structure for the implementation of the improved algorithm
with the degree of sparsity K = 36, the measurements M = 256, and the length of the
original signal N = 1024. In hardware implementation, we completed the complex-valued
system of the improved algorithm. The whole can be divided into real part and imaginary
part with data precision of 24 bits.
The hardware implementation of OMP algorithm is mainly divided into three units as
shown in Fig. 2: 1) parallel complex multiplication; 2) matrix inversion; and 3) signal
estimation and residual calculation. In addition, it has dual-port RAMs and control
circuit. The measurement matrix and the measurements vector y are stored in the RAMs,
respectively. In order to increase the throughput, the correlation is computed on the
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
parallel complex multiplication. The most relevant two columns are found by comparison
circuit (sorting algorithm unit) and extract them according to the associated indices
Fig. 2. Architecture of the improved OMP algorithm
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
λmax = {λ(i1,i2)} from RAM of the matrix . We just save the addresses of two columns
to the new RAM (index unit) and update it after each iteration. The control unit in this
design operations on the RAM units by generating a read or write address, and generates
enable signals (en) for some computing units to initiate their operations.
Parallel Complex-Valued Multiplication
In this design, the computationally most expensive operation is the matrix-vector product
corresponding to the real and imaginary parts, which correlates the measurements
with the dictionary elements. In order to increase the throughput, the correlation is
computed on parallel complex-valued multiply accumulators. In this design, the real part
and the imaginary part have the same multiplication operation, we need to get the real
part and imaginary part of the inner product. One complex-valued multiply add unit can
be divided into three real-valued multipliers and three adders. We can compare the square
sum of the real part and the imaginary part to find the index of the maximum value of the
inner product. Note that there is no need the extra square root operation in this unit to get
the mode value of inner product, because the maximum value of the square sum will have
the maximum square root value.
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
Fig. 2. Structure of the sorting algorithm
Compared to real-valued system, the extra operations are multiplication of imaginary part
and square sum in the complex-valued system. This requires the coefficients of the
measurements matrix as the parallel inputs. Therefore, we need to store the matrix into a
RAM where we put a column of the matrix into the same address, and ensure that all the
inputs required by the multiplication circuit are given in one clock cycle.
During the first N cycles of each iteration for the computation of this unit, N columns of
are retrieved from the memory unit in serial order and fed to this unit. Finally, the result
is sent to the comparison circuit to obtain the indexes of the relevant columns. In order to
improve the operating speed of the circuit, the pipeline circuit model is used. The input
data of the sorting unit is positive and serial and we can scan sequentially the input data.
In this design, we only select the two maximum values to find two most correlated
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
columns. Two registers (reg1 and reg2) are required to store the two most relevant values,
the maximum and the second largest value.
First, we compare the input data to the maximum value in numerical comparison unit and
then control the data distributor. If the input data is larger than the maximum value, we
can replace the secondary register (reg2) with the maximum value and store input data
into the first register (reg1). If the input data is smaller than the maximum value and
larger than the second largest value, we can replace secondary register (reg2) with the
input data. The rest of the situation remains unchanged. The internal structure of the
sorting system has been shown in Fig. 3.
Advantages
 Number of iterations is reduced
 Hardware complexity is higher
References
[1] H. Mamaghanian, N. Khaled, D. Atienza, and P. Vandergheynst, “Design and exploration of low-
power analog to information conversion based on compressed sensing,” IEEE J. Emerg. Sel. Topics
Circuits Syst., vol. 2, no. 3, pp. 493–501, Sep. 2012.
[2] E. J. Candès, J. Romberg, and T. Tao, “Robust uncertainty principles: Exact signal reconstruction
from highly incomplete frequency information,” IEEE Trans. Inf. Theory, vol. 52, no. 2, pp. 489–509,
Feb. 2006.
[3] D. L. Donoho, “Compressed sensing,” IEEE Trans. Inf. Theory, vol. 52, no. 4, pp. 1289–1306, Apr.
2006.
[4] P. Daponte, L. De Vito, S. Rapuano, and I. Tudosa, “Analog-toinformation converters in the
wideband RF measurement for aerospace applications: Current situation and perspectives,” IEEE
Instrum. Meas. Mag., vol. 20, no. 1, pp. 20–28, Feb. 2017.
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
[5] J. Laska, S. Kirolos, M. Duarte, T. Ragheb, R. Baraniuk, and Y. Massoud, “Theory and
implementation of an analog-to-information converter using random demodulation,” in Proc. IEEE Int.
Symp. Circuits Syst., May 2007, pp. 1959–1962.
[6] A. Dutta and R. K. Mangang, “Analog to information converter based on random demodulation,” in
Proc. Int. Conf. Electron. Design, 2015, pp. 105–109.
[7] S. Kirolos et al., “Analog-to-information conversion via random demodulation,” in Proc. IEEE
Dallas Circuits Syst. Workshop (DCAS), Oct. 2006, pp. 71–74.
[8] J. A. Tropp, J. N. Laska, M. F. Duarte, J. K. Romberg, and R. G. Baraniuk, “Beyond Nyquist:
Efficient sampling of sparse bandlimited signals,” IEEE Trans. Inf. Theory, vol. 56, no. 1, pp. 520–544,
Jan. 2010.
[9] D. Bao, P. Daponte, L. De Vito, and S. Rapuano, “Frequency-domain characterization of random
demodulation analog-to-information converters,” Acta IMEKO, vol. 4, no. 1, pp. 111–120, Feb. 2015.
[10] D. Gangopadhyay, E. G. Allstot, A. M. R. Dixon, K. Natarajan, S. Gupta, and D. J. Allstot,
“Compressed sensing analog front-end for bio-sensor applications,” IEEE J. Solid-State Circuits, vol.
49, no. 2, pp. 426–438, Feb. 2014.
[11] T. R. Braun, “An evaluation of GPU acceleration for sparse reconstruction,” Proc. SPIE, vol. 7697,
no. 15, pp. 769715-1–769715-10, Apr. 2010.
[12] J.-W. Jhang and Y.-H. Huang, “A high-SNR projection-based atom selection OMP processor for
compressive sensing,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 24, no. 12, pp. 3477–
3488, Dec. 2016.
[13] J. D. Blanchard, M. Cermak, D. Hanle, Y. Jing, “Greedy algorithms for joint sparse recovery,”
IEEE Trans. Signal Process., vol. 62, no. 7, pp. 1694–1704, Apr. 2014.
[14] Y.-M. Lin, Y. Chen, N.-S. Huang, and A.-Y. Wu, “Low-complexity stochastic gradient pursuit
algorithm and architecture for robust compressive sensing reconstruction,” IEEE Trans. Signal Process.,
vol. 65, no. 3, pp. 638–650, Feb. 2017.
NXFEE INNOVATION
(SEMICONDUCTOR IP &PRODUCT DEVELOPMENT)
(ISO : 9001:2015Certified Company),
# 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam,
Pondicherry– 605004, India.
Buy Project on Online :www.nxfee.com | contact : +91 9789443203 |
email : nxfee.innovation@gmail.com
_________________________________________________________________
[15] A. Septimus and R. Steinberg, “Compressive sampling hardware reconstruction,” in Proc. IEEE Int.
Symp. Circuits Syst., May/Jun. 2010, pp. 3316–3319

More Related Content

What's hot (17)

PDF
Corrosion Detection Using A.I : A Comparison of Standard Computer Vision Tech...
csandit
 
PDF
Mixed Precision Training Review
LEE HOSEONG
 
PDF
Spine net learning scale permuted backbone for recognition and localization
Devansh16
 
PPT
Motion estimation overview
Yoss Cohen
 
PDF
Low Power High-Performance Computing on the BeagleBoard Platform
a3labdsp
 
PDF
Visual odometry & slam utilizing indoor structured environments
NAVER Engineering
 
PDF
Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...
SBGC
 
PDF
IRJET - A Speculative Approximate Adder for Error Recovery Unit
IRJET Journal
 
PDF
IRJET- A Review of Approximate Adders for Energy-Efficient Digital Signal Pro...
IRJET Journal
 
PDF
Review On Different Feature Extraction Algorithms
IRJET Journal
 
PDF
IRJET- Accuracy Configurable Adder
IRJET Journal
 
PDF
Image Processing IEEE 2015 Projects
Vijay Karan
 
PDF
Matlab Project Titles, 2009 2010 Ncct Final Year Projects
ncct
 
PDF
Dsp Project Titles, 2009 2010 Ncct Final Year Projects
ncct
 
PDF
MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
LEE HOSEONG
 
PDF
IRJET- FPGA Implementation of Low Power Configurable Adder for Approximate Co...
IRJET Journal
 
PDF
Parallelization of the LBG Vector Quantization Algorithm for Shared Memory Sy...
CSCJournals
 
Corrosion Detection Using A.I : A Comparison of Standard Computer Vision Tech...
csandit
 
Mixed Precision Training Review
LEE HOSEONG
 
Spine net learning scale permuted backbone for recognition and localization
Devansh16
 
Motion estimation overview
Yoss Cohen
 
Low Power High-Performance Computing on the BeagleBoard Platform
a3labdsp
 
Visual odometry & slam utilizing indoor structured environments
NAVER Engineering
 
Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...
SBGC
 
IRJET - A Speculative Approximate Adder for Error Recovery Unit
IRJET Journal
 
IRJET- A Review of Approximate Adders for Energy-Efficient Digital Signal Pro...
IRJET Journal
 
Review On Different Feature Extraction Algorithms
IRJET Journal
 
IRJET- Accuracy Configurable Adder
IRJET Journal
 
Image Processing IEEE 2015 Projects
Vijay Karan
 
Matlab Project Titles, 2009 2010 Ncct Final Year Projects
ncct
 
Dsp Project Titles, 2009 2010 Ncct Final Year Projects
ncct
 
MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
LEE HOSEONG
 
IRJET- FPGA Implementation of Low Power Configurable Adder for Approximate Co...
IRJET Journal
 
Parallelization of the LBG Vector Quantization Algorithm for Shared Memory Sy...
CSCJournals
 

Similar to The implementation of the improved omp for aic reconstruction based on parallel index selection (20)

PDF
A review of optimisation and least-square problem methods on field programmab...
nooriasukmaningtyas
 
PPTX
Signal Recovery with Orthogonal Matching Pursuit Algorithm
JuanAFigueroa6
 
PDF
A Study on Image Reconfiguration Algorithm of Compressed Sensing
TELKOMNIKA JOURNAL
 
PDF
IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...
IRJET Journal
 
PDF
Performance analysis of compressive sensing recovery algorithms for image pr...
IJECEIAES
 
PDF
Algorithms for Sparse Signal Recovery in Compressed Sensing
Aqib Ejaz
 
PDF
Titan X Research Paper
Jennifer Wood
 
PDF
Image reconstruction through compressive sampling matching pursuit and curvel...
IJECEIAES
 
PDF
P044067476
IJERA Editor
 
PPTX
Application of Fourier Transformation
ManishKumar3747
 
PDF
IRJET- Reconstruction of Sparse Signals(Speech) Using Compressive Sensing
IRJET Journal
 
PDF
G234247
inventionjournals
 
PDF
Step by step process of uploading presentation videos
Hoopeer Hoopeer
 
PDF
Ieee projects 2012 2013 - Digital Image Processing
K Sundaresh Ka
 
PDF
08039246
Thilip Kumar
 
PPTX
A generalized algorithm and reconfigurable architecture for efficient and sca...
I3E Technologies
 
PPTX
Sensor Technology Lecture 6 [Autosaved].pptx
SachinKallurkar
 
PDF
VLSI IEEE Transaction 2018 - IEEE Transaction
Nxfee Innovation
 
PDF
Performance of Matching Algorithmsfor Signal Approximation
iosrjce
 
PDF
AN IMPROVED ECG SIGNAL ACQUISITION SYSTEM THROUGH CMOS TECHNOLOGY
BHAVANA KONERU
 
A review of optimisation and least-square problem methods on field programmab...
nooriasukmaningtyas
 
Signal Recovery with Orthogonal Matching Pursuit Algorithm
JuanAFigueroa6
 
A Study on Image Reconfiguration Algorithm of Compressed Sensing
TELKOMNIKA JOURNAL
 
IRJET- Compressed Sensing based Modified Orthogonal Matching Pursuit in DTTV ...
IRJET Journal
 
Performance analysis of compressive sensing recovery algorithms for image pr...
IJECEIAES
 
Algorithms for Sparse Signal Recovery in Compressed Sensing
Aqib Ejaz
 
Titan X Research Paper
Jennifer Wood
 
Image reconstruction through compressive sampling matching pursuit and curvel...
IJECEIAES
 
P044067476
IJERA Editor
 
Application of Fourier Transformation
ManishKumar3747
 
IRJET- Reconstruction of Sparse Signals(Speech) Using Compressive Sensing
IRJET Journal
 
Step by step process of uploading presentation videos
Hoopeer Hoopeer
 
Ieee projects 2012 2013 - Digital Image Processing
K Sundaresh Ka
 
08039246
Thilip Kumar
 
A generalized algorithm and reconfigurable architecture for efficient and sca...
I3E Technologies
 
Sensor Technology Lecture 6 [Autosaved].pptx
SachinKallurkar
 
VLSI IEEE Transaction 2018 - IEEE Transaction
Nxfee Innovation
 
Performance of Matching Algorithmsfor Signal Approximation
iosrjce
 
AN IMPROVED ECG SIGNAL ACQUISITION SYSTEM THROUGH CMOS TECHNOLOGY
BHAVANA KONERU
 
Ad

More from Nxfee Innovation (20)

DOCX
Noise insensitive pll using a gate-voltage-boosted source-follower regulator ...
Nxfee Innovation
 
DOCX
An efficient fault tolerance design for integer parallel matrix vector
Nxfee Innovation
 
PDF
Vector processing aware advanced clock-gating techniques for low-power fused ...
Nxfee Innovation
 
PDF
Securing the present block cipher against combined side channel analysis and ...
Nxfee Innovation
 
PDF
Multilevel half rate phase detector for clock and data recovery circuits
Nxfee Innovation
 
PDF
Low complexity methodology for complex square-root computation
Nxfee Innovation
 
PDF
Feedback based low-power soft-error-tolerant design for dual-modular redundancy
Nxfee Innovation
 
PDF
Fast neural network training on fpga using quasi newton optimization method
Nxfee Innovation
 
PDF
Efficient fpga mapping of pipeline sdf fft cores
Nxfee Innovation
 
PDF
Design of an area efficient million-bit integer multiplier using double modul...
Nxfee Innovation
 
PDF
Design and fpga implementation of a reconfigurable digital down converter for...
Nxfee Innovation
 
PDF
Combating data leakage trojans in commercial and asic applications with time ...
Nxfee Innovation
 
PDF
Approximate sum of-products designs based on distributed arithmetic
Nxfee Innovation
 
PDF
Approximate hybrid high radix encoding for energy efficient inexact multipliers
Nxfee Innovation
 
PDF
Analysis and design of cost effective, high-throughput ldpc decoders
Nxfee Innovation
 
PDF
An energy efficient programmable many core accelerator for personalized biome...
Nxfee Innovation
 
PDF
Algorithm and vlsi architecture design of proportionate type lms adaptive fil...
Nxfee Innovation
 
PDF
A reconfigurable ldpc decoder optimized applications
Nxfee Innovation
 
PDF
A high accuracy programmable pulse generator with a 10-ps timing resolution
Nxfee Innovation
 
PDF
A flexible wildcard pattern matching accelerator via simultaneous discrete fi...
Nxfee Innovation
 
Noise insensitive pll using a gate-voltage-boosted source-follower regulator ...
Nxfee Innovation
 
An efficient fault tolerance design for integer parallel matrix vector
Nxfee Innovation
 
Vector processing aware advanced clock-gating techniques for low-power fused ...
Nxfee Innovation
 
Securing the present block cipher against combined side channel analysis and ...
Nxfee Innovation
 
Multilevel half rate phase detector for clock and data recovery circuits
Nxfee Innovation
 
Low complexity methodology for complex square-root computation
Nxfee Innovation
 
Feedback based low-power soft-error-tolerant design for dual-modular redundancy
Nxfee Innovation
 
Fast neural network training on fpga using quasi newton optimization method
Nxfee Innovation
 
Efficient fpga mapping of pipeline sdf fft cores
Nxfee Innovation
 
Design of an area efficient million-bit integer multiplier using double modul...
Nxfee Innovation
 
Design and fpga implementation of a reconfigurable digital down converter for...
Nxfee Innovation
 
Combating data leakage trojans in commercial and asic applications with time ...
Nxfee Innovation
 
Approximate sum of-products designs based on distributed arithmetic
Nxfee Innovation
 
Approximate hybrid high radix encoding for energy efficient inexact multipliers
Nxfee Innovation
 
Analysis and design of cost effective, high-throughput ldpc decoders
Nxfee Innovation
 
An energy efficient programmable many core accelerator for personalized biome...
Nxfee Innovation
 
Algorithm and vlsi architecture design of proportionate type lms adaptive fil...
Nxfee Innovation
 
A reconfigurable ldpc decoder optimized applications
Nxfee Innovation
 
A high accuracy programmable pulse generator with a 10-ps timing resolution
Nxfee Innovation
 
A flexible wildcard pattern matching accelerator via simultaneous discrete fi...
Nxfee Innovation
 
Ad

Recently uploaded (20)

PPTX
Pharmaceuticals and fine chemicals.pptxx
jaypa242004
 
PDF
Water Design_Manual_2005. KENYA FOR WASTER SUPPLY AND SEWERAGE
DancanNgutuku
 
PPTX
EC3551-Transmission lines Demo class .pptx
Mahalakshmiprasannag
 
PPTX
The Role of Information Technology in Environmental Protectio....pptx
nallamillisriram
 
PDF
UNIT-4-FEEDBACK AMPLIFIERS AND OSCILLATORS (1).pdf
Sridhar191373
 
PPTX
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
PPTX
Break Statement in Programming with 6 Real Examples
manojpoojary2004
 
PPTX
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
PDF
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
PPTX
REINFORCEMENT AS CONSTRUCTION MATERIALS.pptx
mohaiminulhaquesami
 
PDF
Statistical Data Analysis Using SPSS Software
shrikrishna kesharwani
 
PPTX
site survey architecture student B.arch.
sri02032006
 
DOCX
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
PDF
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
PPTX
NEUROMOROPHIC nu iajwojeieheueueueu.pptx
knkoodalingam39
 
PPT
inherently safer design for engineering.ppt
DhavalShah616893
 
PPTX
Thermal runway and thermal stability.pptx
godow93766
 
PDF
Zilliz Cloud Demo for performance and scale
Zilliz
 
PDF
Book.pdf01_Intro.ppt algorithm for preperation stu used
archu26
 
PPTX
drones for disaster prevention response.pptx
NawrasShatnawi1
 
Pharmaceuticals and fine chemicals.pptxx
jaypa242004
 
Water Design_Manual_2005. KENYA FOR WASTER SUPPLY AND SEWERAGE
DancanNgutuku
 
EC3551-Transmission lines Demo class .pptx
Mahalakshmiprasannag
 
The Role of Information Technology in Environmental Protectio....pptx
nallamillisriram
 
UNIT-4-FEEDBACK AMPLIFIERS AND OSCILLATORS (1).pdf
Sridhar191373
 
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
Break Statement in Programming with 6 Real Examples
manojpoojary2004
 
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
REINFORCEMENT AS CONSTRUCTION MATERIALS.pptx
mohaiminulhaquesami
 
Statistical Data Analysis Using SPSS Software
shrikrishna kesharwani
 
site survey architecture student B.arch.
sri02032006
 
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
NEUROMOROPHIC nu iajwojeieheueueueu.pptx
knkoodalingam39
 
inherently safer design for engineering.ppt
DhavalShah616893
 
Thermal runway and thermal stability.pptx
godow93766
 
Zilliz Cloud Demo for performance and scale
Zilliz
 
Book.pdf01_Intro.ppt algorithm for preperation stu used
archu26
 
drones for disaster prevention response.pptx
NawrasShatnawi1
 

The implementation of the improved omp for aic reconstruction based on parallel index selection

  • 1. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ The Implementation of the Improved OMP for AIC Reconstruction Based on Parallel Index Selection Abstract: Sparse signal recovery becomes extremely challenging for a variety of real-time applications. In this paper, we improve the orthogonal matching pursuit (OMP) algorithm based on parallel correlation indices selection mechanism in each iteration and Goldschmidt algorithm. Simulation results show that the improved OMP algorithm with a reduced number of iterations and low hardware complexity of matrix operations has higher success rate and recovery signal-to-noise-ratio (RSNR) for sparse signal recovery. This paper presents an efficient complex valued system hardware architecture of the recovery algorithm for analog-to-information structure based on compressive sensing. The proposed architecture is implemented and validated on the Xilinx Virtex6 field- programmable gate array (FPGA) for signal reconstruction with N = 1024, K = 36, and M = 256. The implementation results showed that the improved OMP algorithm achieved a higher RSNR of 31.04 dB compared with the original OMP algorithm. This synthesized design consumes a few percentages of the hardware resources of the FPGA chip with the clock frequency of 135.4 MHZ and reconstruction time of 170 µs, which is faster than the existing design. Software Implementation:  Modelsim  Xilinx 14.2 Existing System: Sensing and processing information have traditionally relied on the Shannon sampling theorem, one of the central tenets of digital signal processing. However, in many
  • 2. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ applications, including digital image, video cameras, and high-speed analog-to-digital converters (ADCs), increasing the sampling rate is very expensive. The recent theory of compressive sensing (CS), is presented to capture and represent compressible signals at a rate significantly below the Nyquist rate. This is the new sampling architecture is proposed aimed at implementing sub-Nyquist signal acquisition system such as analog- to-information converters (AICs). In this structure, the high-dimensional signal is projected into the low-dimensional space with an incoherent measurement matrix by exploiting the sparsity of the signal. The signal is then reconstructed from these projections using an optimization process.
  • 3. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ At present, there are different structures of the AIC model including the random demodulator structure proposed, related to hardware implementation. However, these applications do not have a complete realization of the system and mainly focus on the front-end analogy information processing hardware implementation. The back-end digitization and reconstruction are carried out using external units such as oscilloscopes, Agilent Infiniium 54855A, Tektronix oscilloscope, and 8-bit oscilloscopes. The collected information is sent to the PC for reconstruction. Therefore, the current research to achieve reconstruction of the signal is mostly in the CPU, GPU, or DSP platform. However, offline processing results in excessive I/O data-rates and storage requirements. More importantly, it prohibits timely (or real-time) decisions based on the recovered information and prevents the use of adaptive sensing strategies. Hence, it is necessary to design dedicated hardware architectures for the very large-scale integration (VLSI) or field-programmable gate arrays (FPGAs) connected with the analog front-end hardware for real-time processing. Recently, various algorithms have been proposed for reconstructing signals from the compressively sensed samples. As efficient methods to calculate a matrix decomposition exists, orthogonal matching pursuit (OMP) algorithm is suitable for VLSI implementation, mostly due to the regular structure of the least- squares(LSs) optimization. In recent years, many structures of OMP algorithm hardware implementations have been proposed. However, few designs can present a low-complexity and high-speed hardware implementation. An implementation of OMP algorithm with an FPGA has been proposed. However, the design uses a 128-length vector for sparsity of 5. The architecture contains higher cycle period due to the path delay in the dot product of matrix. Stanislaus and Mohsenin, have proposed and improved the hardware architecture of OMP algorithm based on QR decomposition. This design finds the inverse of a matrix involving square root units. A more efficient approach is to use the modified Cholesky decomposition which avoids the use of square roots. Numerous works related to the
  • 4. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ acceleration of these operations individually have been proposed. In the design presented, the matrix inversion is based on CORDIC divider with latency and a sequential execution of several parts of matrix multiplication. Inversion based on Newton–Raphson iteration, but the two multiplication modules in this algorithm are also Fig. 1. Complete structure of signal recovery Chky-mat, which is Cholesky decomposition and is used to get the inversion matrix executed sequentially. Besides, existing designs are mainly implemented with the real- valued system. They are mostly aimed at the hardware implementation of the OMP algorithm, but not for the actual applications. The processed signals are usually sparse in time domain and do not need to be converted to frequency domain processing. However, a complex-valued system of reconstruction algorithm is required for practical applications because many man-made or natural signals are sparse in an appropriate orthonormal basis (such as Fourier basis). The sensing matrix of the OMP algorithm is usually complex-valued matrix, so the proposed design in this paper is a complex-valued system for practical applications. Furthermore, we find the high computational complexity of matrix operations based on OMP is a major concern on achieving real-time reconstruction of compressively sensed signal. In this paper, we mainly proposed the complex-valued system of the signal reconstruction for the CS-based AIC structure. Additionally, we reduce the computational complexity of matrix operations and number of iterations to improve the OMP algorithm. We have developed a MATLAB code of the algorithm and compared the success rate and recovery
  • 5. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ signal to-noise-ratio (RSNR) of input signals recovered correctly base on different sparsities by verification. The improved OMP algorithm has higher success rate and RSNR than the OMP algorithm. In order to validate the improved OMP algorithm efficiently, a complete signal recovery structure using SIMULINK for AIC model based on CS is built in this paper. Besides, we design VLSI architecture of the improved OMP algorithm with real time, low power, and low complexity requirements. Additionally, we propose an efficient matrix inversion design based on the Goldschmidt algorithm. Compared with the inversion method of the Newton–Raphson, the design of the inversion circuit based on the Goldschmidt algorithm can improve the parallelism of the operation and reduce the quantization error of each iteration. Disadvantages:  Number of iterations is higher  Hardware complexity is higher Proposed System: VLSI implementation of the improved OMP algorithm In this paper, the proposed structure for the implementation of the improved algorithm with the degree of sparsity K = 36, the measurements M = 256, and the length of the original signal N = 1024. In hardware implementation, we completed the complex-valued system of the improved algorithm. The whole can be divided into real part and imaginary part with data precision of 24 bits. The hardware implementation of OMP algorithm is mainly divided into three units as shown in Fig. 2: 1) parallel complex multiplication; 2) matrix inversion; and 3) signal estimation and residual calculation. In addition, it has dual-port RAMs and control circuit. The measurement matrix and the measurements vector y are stored in the RAMs, respectively. In order to increase the throughput, the correlation is computed on the
  • 6. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ parallel complex multiplication. The most relevant two columns are found by comparison circuit (sorting algorithm unit) and extract them according to the associated indices Fig. 2. Architecture of the improved OMP algorithm
  • 7. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ λmax = {λ(i1,i2)} from RAM of the matrix . We just save the addresses of two columns to the new RAM (index unit) and update it after each iteration. The control unit in this design operations on the RAM units by generating a read or write address, and generates enable signals (en) for some computing units to initiate their operations. Parallel Complex-Valued Multiplication In this design, the computationally most expensive operation is the matrix-vector product corresponding to the real and imaginary parts, which correlates the measurements with the dictionary elements. In order to increase the throughput, the correlation is computed on parallel complex-valued multiply accumulators. In this design, the real part and the imaginary part have the same multiplication operation, we need to get the real part and imaginary part of the inner product. One complex-valued multiply add unit can be divided into three real-valued multipliers and three adders. We can compare the square sum of the real part and the imaginary part to find the index of the maximum value of the inner product. Note that there is no need the extra square root operation in this unit to get the mode value of inner product, because the maximum value of the square sum will have the maximum square root value.
  • 8. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ Fig. 2. Structure of the sorting algorithm Compared to real-valued system, the extra operations are multiplication of imaginary part and square sum in the complex-valued system. This requires the coefficients of the measurements matrix as the parallel inputs. Therefore, we need to store the matrix into a RAM where we put a column of the matrix into the same address, and ensure that all the inputs required by the multiplication circuit are given in one clock cycle. During the first N cycles of each iteration for the computation of this unit, N columns of are retrieved from the memory unit in serial order and fed to this unit. Finally, the result is sent to the comparison circuit to obtain the indexes of the relevant columns. In order to improve the operating speed of the circuit, the pipeline circuit model is used. The input data of the sorting unit is positive and serial and we can scan sequentially the input data. In this design, we only select the two maximum values to find two most correlated
  • 9. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ columns. Two registers (reg1 and reg2) are required to store the two most relevant values, the maximum and the second largest value. First, we compare the input data to the maximum value in numerical comparison unit and then control the data distributor. If the input data is larger than the maximum value, we can replace the secondary register (reg2) with the maximum value and store input data into the first register (reg1). If the input data is smaller than the maximum value and larger than the second largest value, we can replace secondary register (reg2) with the input data. The rest of the situation remains unchanged. The internal structure of the sorting system has been shown in Fig. 3. Advantages  Number of iterations is reduced  Hardware complexity is higher References [1] H. Mamaghanian, N. Khaled, D. Atienza, and P. Vandergheynst, “Design and exploration of low- power analog to information conversion based on compressed sensing,” IEEE J. Emerg. Sel. Topics Circuits Syst., vol. 2, no. 3, pp. 493–501, Sep. 2012. [2] E. J. Candès, J. Romberg, and T. Tao, “Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information,” IEEE Trans. Inf. Theory, vol. 52, no. 2, pp. 489–509, Feb. 2006. [3] D. L. Donoho, “Compressed sensing,” IEEE Trans. Inf. Theory, vol. 52, no. 4, pp. 1289–1306, Apr. 2006. [4] P. Daponte, L. De Vito, S. Rapuano, and I. Tudosa, “Analog-toinformation converters in the wideband RF measurement for aerospace applications: Current situation and perspectives,” IEEE Instrum. Meas. Mag., vol. 20, no. 1, pp. 20–28, Feb. 2017.
  • 10. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ [5] J. Laska, S. Kirolos, M. Duarte, T. Ragheb, R. Baraniuk, and Y. Massoud, “Theory and implementation of an analog-to-information converter using random demodulation,” in Proc. IEEE Int. Symp. Circuits Syst., May 2007, pp. 1959–1962. [6] A. Dutta and R. K. Mangang, “Analog to information converter based on random demodulation,” in Proc. Int. Conf. Electron. Design, 2015, pp. 105–109. [7] S. Kirolos et al., “Analog-to-information conversion via random demodulation,” in Proc. IEEE Dallas Circuits Syst. Workshop (DCAS), Oct. 2006, pp. 71–74. [8] J. A. Tropp, J. N. Laska, M. F. Duarte, J. K. Romberg, and R. G. Baraniuk, “Beyond Nyquist: Efficient sampling of sparse bandlimited signals,” IEEE Trans. Inf. Theory, vol. 56, no. 1, pp. 520–544, Jan. 2010. [9] D. Bao, P. Daponte, L. De Vito, and S. Rapuano, “Frequency-domain characterization of random demodulation analog-to-information converters,” Acta IMEKO, vol. 4, no. 1, pp. 111–120, Feb. 2015. [10] D. Gangopadhyay, E. G. Allstot, A. M. R. Dixon, K. Natarajan, S. Gupta, and D. J. Allstot, “Compressed sensing analog front-end for bio-sensor applications,” IEEE J. Solid-State Circuits, vol. 49, no. 2, pp. 426–438, Feb. 2014. [11] T. R. Braun, “An evaluation of GPU acceleration for sparse reconstruction,” Proc. SPIE, vol. 7697, no. 15, pp. 769715-1–769715-10, Apr. 2010. [12] J.-W. Jhang and Y.-H. Huang, “A high-SNR projection-based atom selection OMP processor for compressive sensing,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 24, no. 12, pp. 3477– 3488, Dec. 2016. [13] J. D. Blanchard, M. Cermak, D. Hanle, Y. Jing, “Greedy algorithms for joint sparse recovery,” IEEE Trans. Signal Process., vol. 62, no. 7, pp. 1694–1704, Apr. 2014. [14] Y.-M. Lin, Y. Chen, N.-S. Huang, and A.-Y. Wu, “Low-complexity stochastic gradient pursuit algorithm and architecture for robust compressive sensing reconstruction,” IEEE Trans. Signal Process., vol. 65, no. 3, pp. 638–650, Feb. 2017.
  • 11. NXFEE INNOVATION (SEMICONDUCTOR IP &PRODUCT DEVELOPMENT) (ISO : 9001:2015Certified Company), # 45, Vivekanandar Street, Dhevan kandappa Mudaliar nagar, Nainarmandapam, Pondicherry– 605004, India. Buy Project on Online :www.nxfee.com | contact : +91 9789443203 | email : [email protected] _________________________________________________________________ [15] A. Septimus and R. Steinberg, “Compressive sampling hardware reconstruction,” in Proc. IEEE Int. Symp. Circuits Syst., May/Jun. 2010, pp. 3316–3319