Real-Time Spaceborne Synthetic Aperture Radar Float-Point Imaging System Using Optimized Mapping Methodology and a Multi-Node Parallel Accelerating Technique
<p>Flowchart of the chirp scaling (CS) algorithm.</p> "> Figure 2
<p>Hierarchical decomposition mapping flow.</p> "> Figure 3
<p>Field Programmable Gate Array (FPGA)-based architecture.</p> "> Figure 4
<p>Programmable element (PE) and PE array architecture.</p> "> Figure 5
<p>The relative errors between hardware and software computation.</p> "> Figure 6
<p>Point target imaging result with no window.</p> "> Figure 7
<p>Cross-mapping method.</p> "> Figure 8
<p>Block diagram of the multi-node parallel system architecture.</p> "> Figure 9
<p>The time-sequence of multi-node processing parallel processing.</p> "> Figure 10
<p>Two-dimensional data read mode.</p> "> Figure 11
<p>Actual scene imaging result of the system hardware.</p> "> Figure 12
<p>(<b>a</b>) Photographs of the board and (<b>b</b>) the prototype single machine.</p> "> Figure 13
<p>The continuous imaging results of 12 images.</p> ">
Abstract
:1. Introduction
- Optimized mapping methodology for single-chip integration. The CS algorithm can be decomposed in to two parts. For the nonlinear part, nonlinear–operation linearization method, logic-memory optimum allocation, and hierarchical reconfiguration structure are proposed to reduce the complexity and time consumption. Two-dimensional bandwidth dynamic balance technology is introduced to achieve a good real-time performance between the linear part and transpose operations.
- Multi-node parallel accelerating technique for strong real-time requirements. By analyzing the spaceborne SAR real-time imaging processing conditions, this paper presents a parallel accelerating architecture consisting of a master node and multiple independent processing nodes with high processing performance, high real-time performance, and linear scalability features.
2. Chirp Scaling (CS) Algorithm Review
3. Single-Node Field Programmable Gate Array (FPGA)-Based Architecture
3.1. Hardware Design Methodology
3.2. Nonlinear Part Mapping Strategy
3.2.1. Hierarchical Decomposition Mapping Strategy and Hardware-Based Optimization Methods
- Reasonable storage planning is introduced before implementation of the architecture design. The parameters maintained during one imaging process can be stored in registers, and the - and -related parameters should be stored in DPRAM and accessed in a time multiplexing mode.
- The trigonometric functions, such as sin, cos, and arccos, are approximated by the CORDIC method, which is achieved by the addition iteration. Next, whole operations can be unified in addition (subtraction), multiplication and division. Assuming the computation of one addition as the benchmark T, one multiplication is approximately 4 T and one division is approximately 10 T. Therefore, the proposed design reduces the utilization of logical resources by using dividers, rooting, and CORDIC operational cores in a time multiplexing mode. Corresponding to Figure 2, one divider and rooting are instituted for each level, and one CORDIC is used for the first level.
3.2.2. Reconfigurable Implementation Structure
3.2.3. Accuracy and Time-Consumption Analysis
3.3. Linear Part Mapping Strategy
3.3.1. Data Access Pattern
- Divide the NA × NR raw-data matrix into M × N sub-matrices of Na × Nr order. To improve the mapping performance of the sub-matrix, the number of sub-matrix factors should be equal to the contained elements of each DDR row. The addressing mapping rules of raw-data matrix A(x, y) to sub-matrix S(a, b) can be described as follows:
- Map each sub-matrix into the three-dimensional storage array of the DDR according to the cross-mapping method. The data processing is on a bank-by-bank to rank-by-rank basis. Four factors of two lines of sub-matrices as a group are cross-mapped to the DDR row space.
3.3.2. Two-Dimensional Bandwidth Timing Analysis
4. Real-Time System Establishment
- High performance: The proposed design works on addressing three critical requirements of high-performance spaceborne processing for SAR imaging: high computational throughput, large amount of memory, and high-speed data interconnect throughout the communication chain. Multiple processing nodes, in which instances featuring the same pipeline and available hardware resource, with corresponding independent mass storage DDR, are introduced to satisfy the first two requirements. Since the imaging procedure of each processing node is relatively independent, data reading or writing operation cannot occur at same time. Two unidirectional buses (read only and write only, processing node view) are designed not only for the high-speed data interconnects requirement, but also to avoid the bus conflict.
- Linear scalability: The standalone master node integrates the main I/O interfaces. All of the data distribution and switching control operations are conducted by the master node. The master node is responsible for optimizing I/O requests inside or outside of the system to disallow the pipeline processing nodes that lack new input data. In addition, each processing node has its own ID. In other words, the master node can distribute raw data to the processing nodes by polling the ID list. The telemetry and tele-control signal is responsible for spawning and shutting down each processing node. Modular and ID-based design allows the system architecture to have the scalability of hardware architecture; that is, the system can add (or reduce) the functionality or processing power by adding (or deleting) some of the modules.
- Real-time performance: the definition of real-time is the requirement of system that the operations, such as data transmission and computation, prescribed to be performed must be completed within a certain interval time, i.e., accumulation time of one granularity. The accumulation time can be described as follows:
5. Realization of the Multi-Node Prototype Platform
5.1. Single-Node Parallel Processing Analysis
5.2. Single-Node Imaging Result Analysis
5.3. Multi-Node System Architecture
- The data simulator sends the raw data of the original image to the master node via quadrature channels. The transmitted raw data are pixel-by-pixel 8: 4 compressed, and the frequency of each channel is 100 MHz.
- Through uncompressing, filter, and direct current removing operations, the 8-bit fixed data are sent to corresponding processing nodes. Each of the quadrature channels is 11 bit × 100 MHz, including 8 bits for raw data transform, 1 bit for clock, and 2 bits for processing the node chosen ID. As noted above, since these procedures can be considered to occur in real-time, the data input time Tac is approximately 2.7 s.
- After image processing, the result is sent back to the master node and then transmitted to the PC. The time delay of 8-bit imaging result Tout is approximately 2.7 s.
6. Conclusions
Acknowledgments
Author Contributions
Conflicts of Interest
References
- Franceschetti, G.; Lanari, R. Synthetic Aperture Radar Processing. In Proceedings of the IGARSS ‘94 International Geoscience and Remote Sensing Symposium, Surface and Atmospheric Remote Sensing: Technologies, Data Analysis and Interpretation, Pasadena, CA, USA, 8–12 August 1994. [Google Scholar]
- Lou, Y.; Clark, D.; Marks, P.; Muellerschoen, R.J.; Wang, C.C. Onboard Radar Processor Development for Rapid Response to Natural Hazards. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2016, 9, 2770–2776. [Google Scholar] [CrossRef]
- Hirose, A.; Rosen, P.A.; Yamada, H.; Zink, M. Foreword to the Special Issue on Advances in SAR and Radar Technology. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 3748–3750. [Google Scholar] [CrossRef]
- Gierull, C.H.; Vachon, P.W. Foreword to the Special Issue on Multichannel Space-Based SAR. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 4995–4997. [Google Scholar] [CrossRef]
- Tralli, D.M.; Blom, R.G.; Zlotnicki, V.; Donnellan, A.; Evans, D.L. Satellite remote sensing of earthquake, volcano, flood, landslide and coastal inundation hazards. ISPRS J. Photogramm. Remote Sens. 2005, 59, 185–198. [Google Scholar] [CrossRef]
- Percivall, G.S.; Alameh, N.S.; Caumont, H.; Moe, K.L.; Evans, J.D. Improving Disaster Management Using Earth Observations—GEOSS and CEOS Activities. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2013, 6, 1368–1375. [Google Scholar] [CrossRef]
- Joyce, K.E.; Belliss, S.E.; Samsonov, S.V.; Mcneill, S.J.; Glassey, P.J. A review of the status of satellite remote sensing and image processing techniques for mapping natural hazards and disasters. Prog. Phys. Geogr. 2009, 33, 183–207. [Google Scholar] [CrossRef]
- Copernicus: Sentinel-1—The SAR Imaging Constellation for Land and Ocean Services. Available online: https://directory.eoportal.org/web/eoportal/satellite-missions/c-missions/copernicus-sentinel-1 (accessed on 23 March 2017).
- TDX (TanDEM-X: TerraSAR-X Add-On for Digital Elevation Measurement). Available online: https://directory.eoportal.org/web/eoportal/satellite-missions/t/tandem-x (accessed on 23 March 2017).
- ALOS-2 (Advanced Land Observing Satellite-2; SAR Mission)/Daichi-2. Available online: https://directory.eoportal.org/web/eoportal/satellite-missions/a/alos-2 (accessed on 23 March 2017).
- RCM (RADARSAT Constellation Mission). Available online: https://directory.eoportal.org/web/eoportal/satellite-missions/r/rcm (accessed on 23 March 2017).
- KOMPSAT-5 (Korea Multi-Purpose Satellite-5)/Arirang-5. Available online: https://directory.eoportal.org/web/eoportal/satellite-missions/k/kompsat-5 (accessed on 23 March 2017).
- Song, W.S.; Baranoski, E.J.; Martinez, D.R. One trillion operations per second on-board VLSI signal processor for Discoverer II space based radar. In Proceedings of the Aerospace Conference Proceedings, Big Sky, MT, USA, 25 March 2000. [Google Scholar]
- Langemeyer, S.; Kloos, H.; Simon-Klar, C.; Friebe, L.; Hinrichs, W.; Lieske, H.; Pirsch, P. A compact and flexible multi-DSP system for real-time SAR applications. In Proceedings of the 2003 IEEE International Geoscience and Remote Sensing Symposium, IGARSS ‘03, Toulouse, France, 21–25 July 2003. [Google Scholar]
- Wai-Chi, F.; Jin, M.Y. On board processor development for NASA’s spaceborne imaging radar with VLSI system-on-chip technology. In Proceedings of the 2004 IEEE International Symposium on Circuits and Systems (IEEE Cat. No. 04CH37512), Vancouver, BC, Canada, 23–26 May 2004. [Google Scholar]
- Le, C.; Chan, S.; Cheng, F.; Fang, W.; Fischman, M.; Hensley, S.; Johnson, R.; Jourdan, M.; Marina, M.; Parham, B.; et al. Onboard FPGA-based SAR processing for future spaceborne systems. In Proceedings of the Radar Conference, Philadelphia, PA, USA, 29 April 2004. [Google Scholar]
- Fang, W.C.; Le, C.; Taft, S. On-board fault-tolerant SAR processor for spaceborne imaging radar systems. In Proceedings of the IEEE International Symposium on Circuits and Systems, Kobe, Japan, 23–26 May 2005. [Google Scholar]
- Greco, J.; Cieslewski, G.; Jacobs, A.; Troxel, I.A.; George, A.D. Hardware/software Interface for High-performance Space Computing with FPGA Coprocessors. In Proceedings of the IEEE Aerospace Conference, Big Sky, MT, USA, 4–11 March 2006. [Google Scholar]
- Desai, N.M.; Kumar, B.S.; Sharma, R.K.; Kunal, A.; Gameti, R.B.; Gujraty, V.R. Near Real Time SAR Processors for ISRO’s Multi-Mode RISAT-I and DMSAR. In Proceedings of the European Conference on Synthetic Aperture Radar, Friedrichshafen, Germany, 2–5 June 2008. [Google Scholar]
- Pfitzner, M.; Cholewa, F.; Pirsch, P.; Blume, H. FPGA based architecture for real-time SAR processing with integrated motion compensation. In Proceedings of the Synthetic Aperture Radar, Tsukuba, Japan, 23–27 September 2013. [Google Scholar]
- Zhang, F.; Li, G.; Li, W.; Hu, W.; Hu, Y. Accelerating Spaceborne SAR Imaging Using Multiple CPU/GPU Deep Collaborative Computing. Sensors 2016, 16. [Google Scholar] [CrossRef] [PubMed]
- Yang, C.; Li, B.; Chen, L.; Wei, C.; Xie, Y.; Chen, H.; Yu, W. A Spaceborne Synthetic Aperture Radar Partial Fixed-Point Imaging System Using a Field-Programmable Gate Array—Application-Specific Integrated Circuit Hybrid Heterogeneous Parallel Acceleration Technique. Sensors 2017, 17. [Google Scholar] [CrossRef] [PubMed]
- Zhou, J. Fine-grained Algorithm and Architecture for Data Processing in SAR Applications. Ph.D. Thesis, National University of Defense Technology, Changsha, China, October 2010. [Google Scholar]
- Gu, C.F.; Chang, W.G.; Li, X.Y.; Liu, Z.H. Multi-Core DSP Based Parallel Architecture for FMCW SAR Real-Time Imaging. Radioengineering 2015, 24, 1084–1090. [Google Scholar] [CrossRef]
- Raney, R.K.; Runge, H.; Bamler, R.; Cumming, I.G.; Wong, F.H. Precision SAR processing using chirp scaling. IEEE Trans. Geosci. Remote Sens. 1994, 32, 786–799. [Google Scholar] [CrossRef]
- Dall, J. A New Frequency Domain Autofocus. In Proceedings of the Geoscience and Remote Sensing Symposium, Espoo, Finland, 3–6 June 1991. [Google Scholar]
- Li, B.; Xie, Y.; Liu, X.; Deng, Y. A new reconfigurable methodology to implement the shift-and-correlate (SAC) algorithm for real-time SAR autofocusing. In Proceedings of the IET International Radar Conference, Hangzhou, China, 14–16 October 2015. [Google Scholar]
- Ackenhusen, J.G. Real Time Signal Processing: Design and Implementation of Signal Processing Systems; Prentice Hall PTR: Upper Saddle River, NJ, USA, 1999. [Google Scholar]
- Wen, Y.; Chen, H.; Xie, Y.; Liu, X. A New Chirp Scaling Algorithm Using Region-Constant Phase Multipliers. Trans. Beijing Inst. Technol. 2014, 3, 304–309. [Google Scholar]
- Zhang, H.; Li, Y.; Su, Y. SAR image quality assessment using coherent correlation function. In Proceedings of the 2012 5th International Congress on Image and Signal Processing, Chongqing, China, 16–18 October 2012. [Google Scholar]
- Bierens, L.; Vollmuller, B.J. On-board Payload Data Processor (OPDP) and its application in advanced multi-mode, multi-spectral and interferometric satellite SAR instruments. In Proceedings of the EUSAR 2012; 9th European Conference on Synthetic Aperture Radar, Nuremberg, Germany, 23–26 April 2012. [Google Scholar]
- Franceschetti, G.; Tesauro, M.; Strollo, A.G.M.; Napoli, E. A VLSI architecture for real time processing of one-bit coded SAR signals. In Proceedings of the Ursi International Symposium on Signals, Systems, and Electronics, Pisa, Italy, 2 October 1998. [Google Scholar]
Window | Azimuthal Direction | Range Direction | |||||
---|---|---|---|---|---|---|---|
PSLR (dB) | ISLR (dB) | RES (m) | PSLR (dB) | ISLR (dB) | RES (m) | ||
Proposed | / | −11.21 | −10.00 | 2.93 | −10.93 | −9.03 | 2.2 |
Matlab | −12.52 | −11.41 | 2.1 | −12.61 | −10.05 | 1.9 | |
Proposed | Taylor window | −27.54 | −21.85 | 4.10 | −27.23 | −22.34 | 2.45 |
Matlab | −29.74 | −25.93 | 2.7 | −28.58 | −24.21 | 2.3 | |
Proposed | Hamming window | −38.73 | −37.21 | 4.61 | −36.85 | −34.73 | 3.10 |
Matlab | −40.71 | −39.10 | 3.16 | −38.32 | −36.08 | 2.69 |
Stage | Cycles (Working Frequency of 100 MHz) |
---|---|
1st PFE | 512 |
2nd PFE | 512 |
3rd PFE | 404 |
Parameter | Value |
---|---|
FPGA main frequency | 100 MHz |
NA | 16,384 |
NR | 16,384 |
Na | 32 |
Nr | 32 |
l | 8 |
400 MHz | |
200 MHz | |
100 MHz | |
W | 64 bit |
Parameter | Range Direction Operation | Azimuthal Direction Operation | ||
---|---|---|---|---|
Proposed | [22] | Proposed | [22] | |
0.74 | 0.9375 | 0.74 | 0.74 | |
1 | 1 | 1 | 0.5 | |
k | 2 | 1 | 4 | 4 |
6.4 GB/s | 6.4 GB/s | 6.4 GB/s | 6.4 GB/s | |
4.8 GB/s | 6 GB/s | 4.8 GB/s | 2.37 GB/s | |
1.6 GB/s | 1.6 GB/s | 1.6 GB/s | 1.6 GB/s | |
0.8 GB/s | 0.8 GB/s | 0.8 GB/s | 0.8 GB/s | |
n | 4 | 2 | 6 | 2 |
Parameter | Value |
---|---|
Number of slice registers | 134,259 (34%) |
Number of LUTs | 122,467 (62%) |
Number of block RAMs/FIFOs | 499 (67%) |
Number of DSP48s | 387 (28%) |
Parameter | Value |
---|---|
MSE | 23.3 |
PSNR (dB) | 30 |
SSIM | 0.99 |
γ (dB) | 4.98 |
Works | Year | Schemes | Data Granularity | Working Frequency | Power Consumption | Processing Time |
---|---|---|---|---|---|---|
Proposed | 2017 | FPGA | 16,384 × 16,384 | 100 MHZ | 17 W | 10.6 s |
[22] | 2017 | FPGA + ASIC | 16,384 × 16,384 | 100 MHZ | 21 W | 12.1 s |
[2] | 2016 | FPGA + Microprocess | 6472 × 3328 | / | 68 W | 8 s |
[21] | 2016 | CPU + GPU | 32,768 × 32,768 | / | >330 W | 2.8 s |
[24] | 2015 | Multi-DSP | 4096 × 4096 | 100 MHZ | / | 2.178 s |
[31] | 2012 | CPU + ASIC | 1024 × 1024 | 100 MHz | 10 W | / |
[19] | 2008 | Multi-DSP | 4096 × 4096 | 100 MHZ | 35 W | 13 s |
[32] | 1998 | ASIC | 1020 × 200 | 10 MHz | 2 W | / |
Parameter | Value (s) |
---|---|
Tac | 7.9 |
Tin | 2.7 |
Tpro | 10.6 |
Tout | 2.7 |
P | 4 |
Indicator | Value |
---|---|
Weight | 10 kg |
Volume | 32 cm × 24 cm × 20 cm |
Power | <100 W |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, B.; Shi, H.; Chen, L.; Yu, W.; Yang, C.; Xie, Y.; Bian, M.; Zhang, Q.; Pang, L. Real-Time Spaceborne Synthetic Aperture Radar Float-Point Imaging System Using Optimized Mapping Methodology and a Multi-Node Parallel Accelerating Technique. Sensors 2018, 18, 725. https://doi.org/10.3390/s18030725
Li B, Shi H, Chen L, Yu W, Yang C, Xie Y, Bian M, Zhang Q, Pang L. Real-Time Spaceborne Synthetic Aperture Radar Float-Point Imaging System Using Optimized Mapping Methodology and a Multi-Node Parallel Accelerating Technique. Sensors. 2018; 18(3):725. https://doi.org/10.3390/s18030725
Chicago/Turabian StyleLi, Bingyi, Hao Shi, Liang Chen, Wenyue Yu, Chen Yang, Yizhuang Xie, Mingming Bian, Qingjun Zhang, and Long Pang. 2018. "Real-Time Spaceborne Synthetic Aperture Radar Float-Point Imaging System Using Optimized Mapping Methodology and a Multi-Node Parallel Accelerating Technique" Sensors 18, no. 3: 725. https://doi.org/10.3390/s18030725
APA StyleLi, B., Shi, H., Chen, L., Yu, W., Yang, C., Xie, Y., Bian, M., Zhang, Q., & Pang, L. (2018). Real-Time Spaceborne Synthetic Aperture Radar Float-Point Imaging System Using Optimized Mapping Methodology and a Multi-Node Parallel Accelerating Technique. Sensors, 18(3), 725. https://doi.org/10.3390/s18030725