1. Introduction
The radar technology is widely used in noncontact medical measurement, through-wall surveillance and post-disaster rescue operation [
1,
2,
3,
4]. According to [
5], the IEEE maximum permissible exposures are 2 W/m
2 for frequencies between 30 and 400 MHz. It ramps up from 2 to 10 W/m
2 between 400 and 2000 MHz. For frequencies greater than 2000 MHz, the maximum permissible exposure is 10 W/m
2. The electromagnetic radiation from the ordinary microwave radar sensors poses no safety threat [
6]. The electromagnetic (EM) waves transmitted by radar can penetrate non-metallic obstacles (such as the walls and the ruins) and detect the vital signs in a standoff distance [
7]. Therefore, life detection based on radar has become a hot research topic in recent years.
Nowadays, most applied radar techniques for human detection are primarily aimed at single human target detection and have made remarkable progress. However, in a real ruin environment, there always exist multiple buried survivors. The detection and localization of multiple human targets is urgently required since it can remarkably improve the efficiency of the post-disaster rescue. However, automatic detection of multiple stationary humans is a tougher problem due to the mutual interference of multiple human bodies. The mutual interference mainly includes the strong sidelobes of multiple humans and the shadow effect. Due to individual difference, reflection intensity of different humans has significant difference. Sidelobes of human target with strong reflection will submerge the signal of the adjacent human targets with weak reflection.
Humans are no longer point targets in high-resolution imaging radar applications. For example, the frequency range of a common high resolution radar system is usually 0.5–3 GHz or even larger. If the frequency band is 2 GHz, the range resolution is 7.5 cm. However, the size of a common human is 1.8 m × 0.5 m × 0.35 m [
8]. In this case, the human will occupy several resolution units. Considering the relationship between human size and resolution, human must be considered as an extended target, which will obscure the EM propagation. Thus, the EM waves, which should irradiate a certain human and reflected back to receiving element, will be partly blocked by the nearby humans. Therefore, the signal reflected from the distant humans will be weakened. We call the above phenomenon the shadow effect [
9]. If the above interactions are misconducted, the leakage alarm rate will increase when there exist multiple humans in the detection region. The secondary damage may occur in the actual post-disaster rescue scenes. Much research effort has been made to solve the multiple stationary humans detection problem.
Continuous wave (CW) radars are by far the most popular platform of stationary human detection, as they require relatively small hardware expense. Zhou et al. [
10] developed a generalized likelihood ratio test (GLRT) for CW radar to distinguish between the presences of 2, 1 or no subjects. However, the drawback of CW radar is that they do not allow the localization of the humans and will lead to increased difficulty in interference suppression and multiple stationary humans discrimination.
Further performance improvements can be achieved by the ultra-wideband (UWB) radar. UWB waveform provides high range resolution ability, and thus has the potential to determine the distance of humans with high accuracy [
11,
12]. This allows accurate localization of the breathing subject and tracking of the small movements of the diaphragm during breathing [
13]. In [
14], UWB impulse radar was used to monitor the breathing rates of two subjects through a cement wall. Wang et al. [
15] proposed a logarithmic method (LM) utilizing the phase variation of the reflected pulses caused by the periodic thorax displacements to monitor multiple subjects at low power consumption. The stepped frequency continuous wave (SFCW) is a new form of UWB radar waveform, which transmits a series of discrete frequencies in a stepwise manner, covering the radar bandwidth in the time domain to realize the UWB. The SFCW radar technology is superior to the time domain impulse radar for its high reliability and relative easy implementation. Liu et al. [
16] used SFCW radar to detect vital signs from a human subject under laboratory conditions. Cardiologic signals can be achieved when the human subject was in the line of sight. However, these UWB systems use a single input and single output (SISO) channel, in which only range profile image of humans from single sight angle can be attained. For the applications of non-line-of-sight (NLOS), the detection performance of SISO degrades [
16]. Meanwhile, three-dimensional space information of vital signs is projected onto the range dimension, which made it difficult to mitigate the mutual interference between multiple human bodies due to the aliasing problem. These challenges constrain the usage of SISO UWB radar in multiple stationary humans detection.
The radar system with multiple receiving channels, which can be termed as single-input multiple-output (SIMO), is used to achieve further improvement [
17,
18]. SIMO radar systems fuse the information from multiple channels to improve the detection performance. Akiyama et al. [
19] used a system with one transmitting antenna and four receiving antennas to improve the signal-to-noise ratio (SNR) with correlation processing. Liu et al. [
18] demonstrated that the SIMO radar systems have the ability to resolve multiple sources and obtain the angle-of-arrival (AOA) of multiple human targets.
Multiple-input and multiple-output (MIMO) radar is a special type of multiple channels radar which emerged in recent years. The MIMO array with M transmitting elements and N receiving elements can obtain a virtual aperture with M × N virtual transceivers, which greatly reduces the weight and cost of the radar system. MIMO radar echo data can be decomposed as the data from multiple SIMO radar system [
20], since the MIMO radar system can attain and use information from more sight angles. UWB MIMO radar combines the high range resolution property of the UWB signaling with the directional resolution property of the multiple antenna elements, so it has the ability of two-dimensional high-resolution imaging [
21]. Compared with the synthetic aperture radar (SAR), the UWB MIMO systems can get the multiple sight angles of target simultaneously, and the high resolution image sequence can be attained to describe the variation of the scenario. These advantages of the UWB MIMO radar have already attracted interests of researchers. It has been used for through-the-wall imaging of building structure surrounding the humans [
22] and indication of moving human targets [
9]. Salmi et al. [
23] validated the performance of localizing a test subject and tracking his breathing under ideal conditions. Takeuchi et al. [
24] localized survivors using ground-penetrating radar (GPR) with two-dimensional array antenna. However, multiple stationary humans detection is not discussed in these literatures.
In this paper, a UWB MIMO radar system is implemented and a novel signal processing method is proposed to improve the detection performance of multiple stationary humans. The stepped frequency continuous wave (SFCW) waveform is adopted to form ultrawide band and high range resolution capacity. As to the problem of the mutual interference among multiple humans, we propose a vital-sign-enhanced imaging algorithm. On one hand, this algorithm fully utilizes two-dimensional high-resolution imaging of UWB MIMO radar to isolate the human bodies and clutters in space, and thus the mutual interference will be mitigated. On the other hand, the mutual interference can be further suppressed by enhanced imaging. Then, a high resolution vital-sign-enhanced image sequence is formed. Aiming at the detection problem in low signal-to-clutter ratio (SCR) and shadow effect of multiple humans, preprocessing is firstly adopted to improve SCR of vital signs, and further an automatic detection algorithm is realized by using constant false alarm rate (CFAR), morphological filtering and clustering. Via testing the local contrast of the image, weak human targets influenced by clutters and shadow effect can be detected by CFAR. The shape and size features of the human target are utilized by morphological filtering and clustering to reduce the false alarms. The simulation and experimental results show that the proposed method can get high resolution images of multiple humans and accurately detect multiple humans even the targets are adjacent to each other. Two-dimensional localization of the subjects can also be precisely estimated by the proposed method.
The paper is structured as follows.
Section 2 builds the vital signs model of UWB MIMO radar.
Section 3 describes the proposed detection method.
Section 4 gives the simulation results.
Section 5 gives a brief description of the UWB MIMO radar system and illustrates the measurement results. Concluding remarks are given in
Section 6.
2. Vital Signs Model of UWB MIMO Radar
When EM waves emitted from transmitting channel illuminates the human body, part of them will be reflected and received by the receiving channels. Due to respiration, the chest cavity expands and contracts periodically, so the round-trip distance
varies periodically around the nominal distance
d0 accordingly. Considering the monostatic scattering and the line-of-sight situation, the human chest wall movement caused by respiration is
where
represents the slow time which corresponds to the acquisition time of each range profile. The range profile represents the projection of the human target scattering centers on the radar line of sight.
db is the amplitude of the chest wall displacement caused by respiration and
is the respiration frequency.
For UWB MIMO radar, the transmitting antennas and receiving antennas are separately placed, so the bistatic model of vital signs should be considered.
Figure 1 shows the propagation procedure of the incident EM waves transmitted by the
m-th transmitting antenna to the target and the scattered EM waves of human body received by the
n-th receiving antenna. The rectangular coordinates are built as
Figure 1, where
x and
y denote the cross range and range direction, respectively. The origin is set to be the center of linear antenna array for convenience. For simplicity
represents the position vector of the subject with the form of
, where
z is the height coordinate. Assuming that the
m-th transmitting antenna locates at
, the
n-th receiving antenna locates at
, the chest locates in
and the human body is an ideal ellipsoid, the round-trip range of the vital sign is
where
,
and
denote the aspect-angles of the transmitting antenna, the receiving antenna and the normal direction of chest, respectively.
denotes the vector length.
Equation (2) shows that the displacement induced by the respiration movement is a function of the bi-static angles and the attitude angle of the human body. Affected by it is possible to be non-line-of-sight case for some channels. While benefiting from multiple sight angles of UWB MIMO radar, the received echo may be approximately line-of-sight in some channels. Therefore, compared with SISO radar, UWB MIMO radar is not so sensitive in the detection of non-line-of-sight human target.
is the fast time which represents the time axis associated with range along each range profile. It can be thought orthogonal to the slow time dimension. Let
be the transmitted signal. The received signal from the
m-th transmitting antenna and the
n-th receiving antenna can be expressed as
where
c is the speed of flight,
is the Dirac function,
denotes the convolution operation,
is the impulse response of the
p-th vital sign,
is the impulse response of the
q-th clutter including directive wave, coupling clutter and stationary or non-stationary clutter,
is the round-trip distance between the
p-th human and the
m-th transmitting antenna and the
n-th receiving antenna,
is the round-trip distance between the
q-th clutter and the
m-th transmitting and the
n-th receiving antenna.
If the fast time is sampled with the sampling interval
δT and each range profile contains
K samples, the sampling interval of slow time is equal to
δT ×
K which is the processing time of one range profile. For the UWB MIMO radar, if
M transmitting elements sequentially emit SFCW signals, the sampling interval of slow time will be
δT ×
K ×
M. Thus, the MIMO radar which sequentially emits signals sacrifices the sampling frequency along slow time domain to get low-complexity radar system design. The discrete signal of each channel can be expressed as two-dimensional matrix
where
is fast time index and
represents the slow time index.
is the response of vital signs,
is the response of clutters and
is the additive noise. The received data set
will be a three-dimensional matrix, which contains fast time, slow time and equivalent channel data, respectively. The information of respiration movement is contained in the received data
and can be utilized in multiple humans detection.
3. Automatic Detection Method
In this section, a novel detection method based on UWB MIMO radar is proposed to detect vital signs of multiple stationary humans automatically. As illustrated in the flowchart shown in
Figure 2, the detection method consists of three main procedures including preprocessing, enhanced imaging and automatic detection and localization.
3.1. Preprocessing
The preprocessing procedure is first applied to the raw data obtained in each channel before enhanced imaging. System calibration, background removal, bandpass filtering along slow-time domain for SNR improvement and inverse fast Fourier transform (IFFT) along fast time domain are included in the preprocessing step.
The aim of system calibration is to guarantee the coherence between multiple channels. The calibration data are collected by the following two manners. One manner is to set up a reference channel. The other manner is to use a point-like object, such as the trihedral, as the reference target, and collect the data of each channel in empty background as the calibration data. In this paper, the phase of calibration data collected by the above two manners is used to calibrate the incoherence.
The background components can be seen as strong and static clutters. Therefore, the vital signs can be enhanced by change detection (CD) [
25]. Background removal is one simple method of CD. A simple way to remove background is subtracting the mean value over the slow-time window as
The respiration frequency is about 0.2–0.3 Hz, which is much lower than the slow time pulse repetition frequencies (PRF) of the UWB MIMO echo. As a result, the raw data is severely oversampled and lots of clutters are induced into the echo. According to the prior knowledge of respiration frequency, bandpass filtering along slow time is used to eliminate the clutters and harmonic components with high frequency and ultra-low frequency components.
For SFCW radar, the echo can be viewed as the frequency response of the target. Thus, the IFFT is performed to get high resolution range profile (HRRP). A frequency window, such as hanning window and hamming window, is added to suppress the range sidelobes.
3.2. Enhanced Imaging of Multiple Vital Signs
3.2.1. BP Imaging
The UWB MIMO radar is usually performed in near-field and bistatic mode. Among the image formation methods, the back-projection (BP) imaging algorithm, which is well-known for its high precision and simplicity and well adaptation to near-field imaging, is employed in this paper as the basic imaging method.
For certain slow-time sample
, the BP image of the MIMO array with
M transmitting channels and
N receiving channels can be obtained by the coherent sum of the images of
M SIMO arrays as
where
and
are the range coordinate and cross-range coordinate of the image grid,
is the image formed via the
m-th SIMO array and can be represented as
where
and
are the window functions of transmitting element and receiving element to calibrate the antenna directional pattern and control the aperture shape.
The range resolution of BP image is determined by
where
is bandwidth of UWB MIMO radar system. The cross-range resolution is determined by [
26]:
where
is the wavelength of the center frequency, and
and
are the accumulated and squint angles between the array and the target, respectively, as depicted in
Figure 1. Wider bandwidth, higher center frequency, shorter range, and longer array result in finer
and
. However, considering the portability and penetrability requirement of the radar, the array length and the center frequency of the UWB MIMO radar system are strictly restricted, accompanied with the limited range and cross-range resolution.
3.2.2. Vital Signs Enhancement Based on CD
The human body can be seen as a complex extended target with certain size and shape [
11]. The reflection of the torso is the strongest, and it has large area and strong sidelobes in BP image. On one hand, the torso will interfere with the arms and legs in the image formation, which makes it difficult to get a silhouette image by the conventional imaging algorithms. On the other hand, strong sidelobes of torsos interfere with each other, which lead to the deteriorated imaging quality of multiple humans. In addition, strong environmental clutters make the detection of multiple stationary humans more difficult. Although nominal resolution of UWB MIMO image is high, the conventional BP image cannot satisfy the requirement of detection and localization of multiple humans.
In through-barrier applications, heavy stationary clutters are usually removed by coherent or noncoherent CD, based on the fact that there always exist features of humans characterized by respiration, movements of limbs, etc. [
27,
28]. The motions of limbs and respiratory movement are prominently distinguishable features between humans and environmental clutters, which can facilitate the human detection. For a stationary human, except for chest fluctuation caused by respiration motion and micro movement of limbs, the other parts of body can be seen relatively stationary. After CD processing, the vital signs are enhanced and stationary body parts are suppressed. The strong sidelobes of human torso are suppressed effectively. Thus, the equivalent distances between multiple humans and environmental clutters become larger in image after CD.
For a trapped human, the limbs are relatively stationary, so the vital signs can be utilized are only respiration signal. The vital signs can be approximated as sinusoidal signal with small amplitudes. For a standing human, the micro motions of the limbs are inevitable even we try to keep stationary. These motions will form signal component with high reflectivity in received echo, but they varies randomly along the slow time. Delay line canceller based on two frames or several frames is unsuitable to extract the random signal [
25].
The CD algorithm should consider the above two conditions. In this paper, we adopt CD with the form of variation for its satisfaction with the above requirement and easy implementation to integrate motion information during long time
where
is the total number of the slow time samples in the attained UWB MIMO image sequence. In Equation (11), UWB MIMO image sequence is projected onto range-cross-range plane by variation along slow time. For each pixel sequence, the change part will be reserved and accumulated. The stationary part will be seen as mean value and removed by variation operation. Therefore, the vital signs with micro-motion from multiple humans are enhanced. However, some weak non-stationary interference in the environment will also be enhanced. This may cause false alarm in the detection procedure.
Weak interference with micro motions can be suppressed by average of BP image sequence along the slow time domain as follows
The operation of Equation (12) is actually a simple low pass filter, which can eliminate the high frequency micro displacement, and thus achieve the purpose of suppressing the micro motion component. The enhanced image of the multiple humans are given as [
29]
where
is the relaxation factor which controls the enhancement of vital signs. In this paper,
is selected to be 1 according to the practical experience.
3.3. Automatic Detection and Localization of Multiple Stationary Humans
3.3.1. Prescreening Based on Global Threshold
In order to avoid false alarms due to weak noise generated in the process of calculating the local contrast of the image in CFAR, prescreening based on global threshold should be performed first to process the near-zero value pixels. This process is represented as Equation (14): the pixel values smaller than
will be replaced by zero; otherwise, the pixels keep the original values.
where
is the global threshold determined by all the pixel values of the enhanced image as
where
is the operation of computing the elements number which meets the given condition in braces.
is determined according to the experimental results and the value is selected to be 0.1 in our experiment. Thus, the pixels whose values are the smallest 10% are set to be zero.
3.3.2. CFAR Detection
Although the vital signs are notably enhanced by the above processing, there still exist heavy clutters in complex scenarios, e.g., detection of trapped survivors in the ruins. Influenced by the shadow effect, reflection of some humans may be much weaker than the other humans. CFAR is adopted to automatically detect multiple vital signs with large magnitude difference in low SCR scenarios.
As is shown in
Figure 3, CFAR uses a 2-D sliding window to scan all pixels in the vital-sign-enhanced image to search suspected vital signs. The pixel to be detected locates at the center of the sliding window. The sliding window includes the guard window and the clutter window. As
Figure 3 shows,
and
are the cross-range and range dimensions of guard window, respectively.
and
are cross-range and range dimensions of the clutter window. The guard window is designed to be overlaid on the vital sign. Clutter window is designed to be superimposed on local background. Generally, vital sign spreads in range and cross-range dimensions because of the multipath reflections of each body part as well as the propagation, attenuation, and reflection of the EM waves inside the body. Hence, guard window is a buffer between the tested pixel and clutter window to ensure the vital sign is not captured by the clutter window as the clutter background.
According to the size of human chest,
and
are set to be:
where
denotes the smallest odd integer larger than
x,
and
denote the grid width in cross-range and range dimension,
and
represent the prior knowledge of the thickness and width of human chest which denote the influence induced by penetration and multiple reflections within the human body,
is the length of the arm. Considering that the stationary human body has micro body movement, the reflection of arms and legs maybe the strongest in this situation. Therefore, the size of guard window is expended by half length of the arm.
Clutter window is defined by , where is the extended distance considering the interference between two human bodies. Usually, when the value of is selected to be large, the pixel number used for distribution parameter estimation of the clutter is large. Thus, the estimation of distribution parameter is relatively accurate. However, in order to eliminate the influence of nearby human body, should be smaller than the distance between two human targets. In most real applications, even the adjacent humans are also separated with an interval of larger than 0.1 m. Thus, in our processing procedure, is set to be 0.2 m.
The statistical distribution model of the surrounding clutter in the clutter window is then estimated. For the real data, the probability density function (PDF) of clutter is unknown. The best probability density function for the corresponding clutter was determined by non-parameter histogram method [
30]. The histograms of background image are firstly constructed. Then, classic PDF models (Gaussian distribution, gamma distribution, Weibull distribution, lognormal distribution, etc.) are compared with the obtained histogram. The model with the minimized mean squared error is chosen as the PDF of clutter. It was found that background clutters are best approximated by the lognormal distribution, which exhibits non-Gaussian characteristic.
The threshold is calculated with a given false alarm rate (FAR) based on the estimated clutter model. The detected pixel is determined to be part of a human target when the amplitude exceeds the threshold, which can be defined as [
31]:
where
denotes the threshold of CFAR.
3.3.3. Morphological Filtering
Mathematical morphology is a well-known nonlinear image processing methodology based on the application of lattice theory to spatial structures. Along with the development of morphological theory, morphological image processing method has gradually become a new trend in image processing field and a favorable tool in weak target detection [
32]. The basic morphological operations include erosion, dilation, opening and closing. We can obtain some important compound operations with different characteristics by combining them. In this paper, morphological filtering is used to eliminate the irrelevant object which is a different size from vital signs.
The opening Top-Hat used to subtract clutters with large size is defined as follows [
33]:
where “
” denotes the morphological opening including erosion operations followed by dilation operations, and
g is the morphological structuring element. The pixels with negative amplitude are set to be zero,
The opening operation is used to subtract clutters of small size:
where
s is the morphological structuring element to subtract small clutters.
In this paper, both g and s are flat, disk-shaped structuring elements but different in radius size. The sizes are determined by combination of experimental data analysis and the prior knowledge of human size.
3.3.4. Clustering
In CFAR image, a small number of scatters typically dominate the target returns and these bright pixels cannot be connected into a region, so the close-by target pixels are clustered in target-size regions via a clustering algorithm, e.g., the K-means clustering method [
34]. Then a chip around each cluster center is taken out and considered as a suspected target.
Assuming that the image contains P vital signs with P centroids , V non-zero points are surrounded around the P centroids. An iterative procedure is used to identify the centroids of suspected vital signs as follows:
- Step 1:
The strongest non-zero pixel is chosen as the initial cluster center . The range between the i-th pixel and the cluster center is computed as . The location of the centroid is then updated as using the pixels of nearby non-zero pixel satisfying the condition of , where is the pixel number used in the updating and is the clustering radius. According to the prior information of human body size, is set to be 0.5 m in this paper. The pixels satisfying are categorized into cluster .
- Step 2:
is obtained by removing the pixel in from . The strongest pixel in is chosen as the initial cluster center . The centroid is updated similar as Step 1, and cluster is obtained.
- Step 3:
Repeat Step 1 and Step 2 until all pixels in are categorized into the according clusters. clusters and corresponding cluster centroids are obtained.
- Step 4:
Compute the number of pixels in each cluster. If , the cluster is removed. is the smallest pixel number of the vital sign, and determined by real experiments. Thus, we get P clusters and P centroids in the final clustering results.
The number of life signs and the detailed vital information are automatically given by the results of the above clustering algorithm. If the number of clusters is zero, we decide that no life sign exists. If the number of clusters is not zero, the locations are given by the cluster centroid. The pixel sequences at the estimated locations are extracted and the respiration frequency is estimated by maximum magnitude of the spectrum.
4. Simulations and Results
The received echo reflected from the displacement of human chest for a UWB MIMO system was simulated with MATLAB 2013. The uniform linear MIMO array is composed of two transmitting elements and four receiving elements with interelement space of 0.5 m. The two transmitting antennas are settled on the leftmost side and rightmost side of the MIMO array. The transmitted signal was SFCW waveform with frequency range from 40 MHz to 4400 MHz. The stepped frequency interval is 5 MHz. The pulse repeated frequency (PRF) is about 110 Hz. The vital signs are simulated as ideal point targets with periodical sinusoidal displacement. In this simulation, the center of the MIMO array is set to be origin of the coordinate system. The range coordinates of three vital signs are 2 m, and the cross-range coordinates are −0.2 m, 0.2 m and 0.3 m, respectively. The vital signs are simulated in free space and their simulated respiration frequencies are 0.2 Hz, 0.3 Hz and 0.4 Hz, respectively.
As
Figure 4a shows, not all the vital signs can be discriminated in range profile of each virtual channel because the range coordinates of three vital signs are similar. In
Figure 4b, three vital signs can be discriminated. However, sidelobes of vital signs are strong and interference among multiple vital signs is serious. In vital-sign-enhanced image, the sidelobes are suppressed and thus the image quality of vital signs is much better. We can distinguish multiple vital signs even when the distance between the two vital signs is 0.1 m.
Figure 5 shows the final detection results of simulated vital signs. In the clustering result, three vital signs are detected. The estimated range coordinates of all the three vital signs are 2.0040 m, and the estimated cross-range coordinates are −0.2000 m, 0.2240 m and 0.2960 m, respectively. The estimation error of the localizations is 0.004 m which is close to the size of the image grid.