基于微波雷达回波信号的智能车道划分方法

引用本文

修超, 曹林, 王东峰, 张帆. 基于微波雷达回波信号的智能车道划分方法[J]. 计算机应用, 2017, 37(10): 3017-3023.DOI: 10.11772/j.issn.1001-9081.2017.10.3017. 复制到剪切板

XIU Chao, CAO Lin, WANG Dongfeng, ZHANG Fan. Automatic lane division method based on echo signal of microwave radar[J]. Journal of Computer Applications, 2017, 37(10): 3017-3023. DOI: 10.11772/j.issn.1001-9081.2017.10.3017. 复制到剪切板

基金项目

国家自然科学基金资助项目（61671069）；北京高等学校高水平人才交叉培养项目

通信作者

曹林, E-mail:charlin26@163.com

作者简介

修超(1991-), 男, 山东烟台人, 硕士研究生, 主要研究方向:信号处理、模式识别;
曹林(1977-), 男, 辽宁沈阳人, 教授, 博士, 主要研究方向:图像处理、模式识别;
王东峰(1974-), 男, 陕西宝鸡人, 教授, 博士, 主要研究方向:雷达信号处理;
张帆(1994-), 男, 安徽亳州人, 硕士研究生, 主要研究方向:信号处理、图像识别

文章历史

收稿日期：2017-04-05
修回日期：2017-06-22

Contents Abstract Full text Figures/Tables PDF

基于微波雷达回波信号的智能车道划分方法

修超¹, 曹林¹, 王东峰^1,2, 张帆¹

1. 北京信息科技大学通信工程系, 北京 100101;
2. 北京川速微波科技有限公司, 北京 100080

收稿日期：2017-04-05；修回日期：2017-06-22

基金项目：国家自然科学基金资助项目（61671069）；北京高等学校高水平人才交叉培养项目

作者简介：修超(1991-), 男, 山东烟台人, 硕士研究生, 主要研究方向:信号处理、模式识别;
曹林(1977-), 男, 辽宁沈阳人, 教授, 博士, 主要研究方向:图像处理、模式识别;
王东峰(1974-), 男, 陕西宝鸡人, 教授, 博士, 主要研究方向:雷达信号处理;
张帆(1994-), 男, 安徽亳州人, 硕士研究生, 主要研究方向:信号处理、图像识别

通信作者：曹林, E-mail:charlin26@163.com

摘要: 利用多目标交通测速雷达进行交通执法时，只有正确地判断出车辆所在的车道，抓拍照片才能作为交通执法的依据。传统的分车道方法主要通过人工测量的固定阈值以及坐标系旋转的方法来达到车道划分的目的，但这种方法误差较大并且不易于操作。基于统计和密度特征的核聚类算法（K-CSDF）分两步进行：首先对雷达获取的车辆数据进行特征提取，包括基于统计特征的阈值处理和基于密度特征的动态半径提取；然后引入基于核的相似性的动态聚类算法对筛选出的有效点进行聚类。通过和高斯混合模型（GMM）算法以及自组织映射神经网络（SOM）算法进行仿真对比表明：当只取100个有效点进行聚类时，K-CSDF和SOM算法能达到90%以上的分车道正确率，而GMM算法不能给出车道中心线；在算法用时上，当取1000个有效点时，K-CSDF和GMM算法用时均小于1s，可以保证实时性，而SOM算法则需要2.5s左右；在算法鲁棒性上，K-CSDF对不均匀样本的适应性优于这两种算法。当取不同数量的有效点进行聚类时，K-CSDF可以达到95%以上的平均分车道正确率。

关键词: 多目标雷达车道划分统计特征动态半径核动态聚类

Automatic lane division method based on echo signal of microwave radar

XIU Chao¹, CAO Lin¹, WANG Dongfeng^1,2, ZHANG Fan¹

1. Department of Telecommunication Engineering, Beijing Information Science and Technology University, Beijing 100101, China;
2. Beijing TransMicrowave Science and Technology Company Limited, Beijing 100080, China

Foundation Item: This work is partially supported by the National Natural Science Foundation of China (61671069), the Cross Training of High Level Talents Real-training Plan of Beijing Municipal Commission of Education

Author introduction: XIU Chao, born in 1991, M. S. candidate. His research interests include signal processing, pattern recognition;
CAO Lin, born in 1977, Ph. D., professor. His research interests include image processing, pattern recognition;
WANG Dongfeng, born in 1974, Ph. D., professor. His research interests include radar signal processing;
ZHANG Fan, born in 1994, M. S. candidate. His research interests include signal processing, image recognition

Abstract: When police carry out traffic law enforcement using multi-target speed measuring radar, one of the most essential things is to judge which lane each vehicle belongs to, and only in this way the captured pictures can serve as the law enforcement evidence. To achieve lane division purpose, traditional way is to obtain a fixed threshold by manual measurement and sometimes the method of coordinate system rotation is also needed, but this method has a large error with difficulty in operating. A new lane division algorithm called Kernel Clustering algorithm based on Statistical and Density Features (K-CSDF) was proposed, which includes two steps: firstly, a feature extraction method based on statistical feature and density feature was used to process the vehicle data captured by radar; secondly, a dynamic clustering algorithm based on kernel and similarity was introduced to cluster the processed data. Simulations with Gaussian Mixture Model (GMM) algorithm and Self-Organizing Maps (SOM) algorithm were conducted. Simulation results show that the proposed algorithm and SOM algorithm can achieve a lane accuracy of more than 90% when only 100 sample points are used, while GMM algorithm cannot detect the lane center line. In terms of running time, when 1000 sample points are taken, the proposed algorithm and GMM algorithm spend less than one second, and the real-time performance can be guaranteed, while SOM algorithm takes about 2.5 seconds. The robustness of the proposed algorithm is better than GMM algorithm and SOM algorithm when sample points have a non-uniform distribution. When different amounts of sample points are used for clustering, the proposed algorithm can achieve an average lane division accuracy of more than 95%.

Key words: multi-target radar lane division statistical feature dynamic radius kernel dynamic clustering

0 引言

在智能交通系统中, 车道检测是一个长期的研究热点。车道检测包括车道线的检测、道路边界的检测以及车辆可通行区域的检测等。目前, 基于视觉的检测技术^[1-2]由于摄像机获取信息量大、成本低等优势应用最为广泛。但摄像机拍摄的图片极易受到光照和天气等外部环境的影响, 对环境条件要求较为苛刻。近年来, 随着雷达探测技术^[3-5]的发展, 研究人员开始采用毫米波雷达和激光雷达来代替或者辅助摄像机。雷达不受光照和恶劣天气等环境因素影响, 并且具有探测范围广、测距精度高等优点。

史鹏波^[6]利用雷达数据并采用了一种双阈值的方法来提取道路边界点, 但需要预先确定两个阈值, 缺乏自适应性; Xu^[7]将雷达获取的数据点分成若干个区域, 计算该区域的随机密度来检测路边, 方法简单, 但计算量大要计算每个区域的协方差矩阵; Han等^[8]采用阈值分割和综合概率数据关联滤波器(Integrated Probabilistic Data Association Filter, IPDAF)算法来检测和跟踪道路边沿; 吴维一等^[9]采用改进的迭代自组织数据分析算法(Iterative Self Organizing Data Analysis Techniques Algorithm, ISODATA)对雷达数据进行聚类, 虽然算法具有一定的自组织性和启发性, 但还是需要给出先验的最小样本数目和长度约束。

本文提出了一种基于统计和密度特征的核聚类算法(Kernel Clustering algorithm based on Statistical and Density Features, K-CSDF), 从毫米波雷达获取的车辆数据中提取道路信息。无论雷达采用正装还是侧装的方式(见图 1), 该算法都可以对车道进行智能划分, 不需要人为地测量雷达的摆角以及雷达安装位置到车道中心的距离等信息, 提高了人工操作的简便性以及车道划分的准确率。K-CSDF的流程见图 2。

图 1 相机抓拍的照片 Figure 1 Pictures captured by camera

图 2 K-CSDF流程 Figure 2 Flow chart of the K-CSDF algorithm

1 雷达数据获取

本文的实验载体是北京川速微波科技有限公司的多目标交通测速雷达, 主要用于在测速卡口对车辆进行超速抓拍。该雷达系统主要包括三部分:相机、雷达和补光灯。其中雷达是系统的核心设备, 它能捕获到车辆并触发相机对车辆进行抓拍。

多目标交通雷达采用频移键控(Frequency Shift Keying, FSK)体制, 利用多普勒频移对目标进行测速, 利用不同发射频率的相位差对目标进行测距, 并通过一发两收的天线设计来测量目标的角度。雷达对目标的测距和测角公式如下:

$R = \frac{{c \cdot \Delta \varphi }}{{4{\rm{\pi }}\left( {{f_1} - {f_2}} \right)}}$

(1)

$\theta {\kern 1pt} {\kern 1pt} {\rm{ = }}{\kern 1pt} {\kern 1pt} \arcsin \left( {\frac{{\lambda \cdot \Delta {\varphi ^\prime }}}{{2{\rm{\pi }}d}}} \right)$

(2)

其中:R为雷达到目标的距离; c为光速; Δφ为同一接收天线两个不同发射频率f₁和f₂的相位差; θ为雷达天线法向与目标的夹角; λ_w为雷达发射电磁波的波长; Δφ′为两个不同接收天线的同一发射频率的相位差; d为两个接收天线之间的距离。

在一段时间内, 雷达识别出的车辆目标的行驶轨迹分布如图 3所示, 为了直观起见, 已将雷达获取的车辆目标的极坐标距离信息(R, θ)转换成直角坐标信息(x, y), 即图 3中每个点坐标(x, y)表示车辆的距离信息, y的正负代表车辆行驶的方向。将每个样本点对应的幅度记为z, 所有点的幅度按由小到大排序得到车辆信号的能量分布, 记为q(z)。

图 3 雷达获取的原始车辆数据 Figure 3 Original vehicle data captured by radar

2 车辆数据特征提取 2.1 基于统计特征的阈值处理

对雷达获取的车辆数据, 首先利用车辆目标幅度信息的统计特征对数据进行阈值处理, 剔除掉部分异常数据。以监测来向车为例, 取y＜0的数据进行分析, 如图 4所示。

图 4 在正装和侧装情况下的来向车辆轨迹分布 Figure 4 Trajectory distribution of coming vehicle in case of front and side mounting

通常, 雷达照射范围内的车辆反射信号很强, 但同时也存在邻近车道的车辆产生的干扰信号, 图 4中“鬼影区”就是受天线的测角范围所限, 由非监测区域的干扰目标所产生的干扰信号, 因此阈值处理的目的就是去掉“鬼影区”的异常数据。

为了提取出有效的车辆信息, 将车辆信号的能量分布q(z)的上分位数α定义如下:

$\alpha = \frac{1}{{{N_q}}}\sum\limits_{z > {z_\alpha }} {q\left( z \right)} $

(3)

其中:α表示能量高于z_α的样本点的百分比, 0＜α＜1;N_q表示样本总数。q(z)的统计分布如图 5所示, 呈现出“双峰”特性, 低峰值处表示“鬼影区”的样本分布, 高峰值处表示监测车道区域的样本分布。

图 5 样本的能量直方图和概率密度曲线 Figure 5 Energy histogram and probability density curve of the sample

对于任一α值, 可以将q(z)分成两组, 取两组数据之间的方差达到最大时的α值作为保留数据的百分比, 即保留能量较高的α·N_q个点, 剔除能量较低的(1-α)N_q个点。假设两组数据的均值分别为λ₁和λ₂, 分别对应于α和1-α, 则样本总体均值λ为:

$\lambda = \alpha {\lambda _1} + \left( {1 - \alpha } \right){\lambda _2}$

(4)

两组数据之间的方差δ²定义如下:

$\begin{array}{l} {\delta ^2}\left( \alpha \right) = \alpha {\left( {{\lambda _1} - \lambda } \right)^2} + \left( {1 - \alpha } \right){\left( {{\lambda _2} - \lambda } \right)^2} = \\ \;\;\;\;\;\;\;\;\;\;\alpha \left( {1 - \alpha } \right){\left( {{\lambda _1} - {\lambda _2}} \right)^2} \end{array}$

(5)

在0~1之间改变α, 便能求得式(5) 取得最大值时的α, 此时阈值的取值为能量分布q(z)中的第(1-α)N_q」个点对应的幅值。经实验验证, 即使q(z)的统计分布无明显的“双峰”特性, 这种方法也能很好地剔除“鬼影区”的异常数据。

2.2 基于样本密度特征的动态半径提取

对样本数据的特征提取对后续算法以及最终结果具有直接的影响, 更好的特征能够降低模型的复杂度并提高车道划分的准确性。

假设经过上述处理后的数据样本集合为X={x⁽¹⁾ , x⁽²⁾ , …, x^(m); x⁽ⁱ⁾∈Rⁿ}。其中:x⁽ⁱ⁾是一个n维的向量, 代表第i个样本的n维信息, m表示样本的数量。

将第i个样本点的局部密度^[10]定义如下:

${\rho _i} = \sum\limits_j {\chi \left( {{d_{ij}} - {d_c}} \right)} $

(6)

其中:

$\chi \left( x \right) = \left\{ {\begin{array}{*{20}{c}} {1,\;\;\;x < 0}\\ {0,\;\;\;x \ge 0} \end{array}} \right.$

(7)

d_ij表示样本x⁽ⁱ⁾与x^(j)之间的距离, d_c是截断距离(Cut-off distance)。

由定义可知, ρ_i表示与样本x⁽ⁱ⁾距离小于d_c的样本点的个数, 当ρ_i大于N时, 该样本被视为有效样本。其中d_c和N是超参数, 需要人为指定, 参数设置的不同可能会导致结果的较大差异。为了降低算法的参数敏感性, 把d_c看作一个变量, 将第i个样本点的动态半径定义如下:

${\tau _i} = \mathop {\min }\limits_{{\rho _i} > N} \left( {{d_c}} \right)$

(8)

其中:τ_i表示样本x⁽ⁱ⁾达到密度N所需要的最小半径(如图 6所示), τ的值越小, 表明该样本点越可能为有效点; τ的值越大, 表明该样本点越可能为噪声点。

图 6 样本的动态半径示意图 Figure 6 Schematic diagram of dynamic radius of the sample

3 基于统计和密度特征的核聚类算法

本章主要介绍K-CSDF的第二步:对提取出的有效点进行聚类。首先分析了传统K均值算法的不足, 然后通过引入核函数改进了原有算法, 最后给出了K-CSDF的聚类实现流程。

3.1 K均值算法的不足

K均值算法^[11]是一种简单、高效的动态聚类算法, 其时间复杂度接近线性, 因此在工业中有广泛的应用。

K均值算法采用迭代的思想, 利用最小误差平方和准则来判断失真函数(Distortion function)是否收敛, 定义失真函数如下:

${J_{c,\mathit{\boldsymbol{\mu }}}} = \frac{1}{m}{\sum\limits_{i = 1}^m {\left\| {{\mathit{\boldsymbol{x}}^{\left( i \right)}} - {\mathit{\boldsymbol{\mu }}_{{c^{\left( i \right)}}}}} \right\|} ^2}$

(9)

其中:

${\mathit{\boldsymbol{\mu }}_j} = \frac{1}{{{l_j}}}\sum\limits_{\mathit{\boldsymbol{x}} \in class\;j} \mathit{\boldsymbol{x}} ;j = 1,2,...,k$

(10)

μ_j是第j类样本的均值向量, 代表第j类的聚类中心; l_j表示第j类样本的数量; 下标c⁽ⁱ⁾表示第i个样本的类别标签; k表示类别数。当J_{c, μ}取得最小值时的聚类就是误差平方和准则下的最优结果。

在本文的应用场景中, 直接采用K均值算法来进行聚类是不合适的。由于该算法采用欧氏距离来定义样本间的相似性, 并用均值来更新聚类中心, 只有当类内样本分布为超球状或接近超球状时, 才能取得较好的效果。另一种距离度量方法是采用闵可夫斯基距离(Minkowski distance), 设两个样本为(a₁, a₂, …, a_n)和(b₁, b₂, …, b_n), 则它们之间的闵可夫斯基距离定义为:

$d = {\left( {\sum\limits_{i = 1}^n {{{\left| {{a_i} - {b_i}} \right|}^p}} } \right)^{1/p}}$

(11)

图 7显示了当p取不同值时, 样本逼近聚类中心的趋势。但采用闵可夫斯基距离也不能解决所有问题, 更一般的距离或相似性度量方式可以通过引入核函数^[12]的方法来实现。

图 7 p取不同值时, 样本逼近聚类中心的趋势 Figure 7 Trend of sample's approaching to the cluster center with different p

3.2 K-CSDF的实现

通过上述分析可知, 设计一个有效的核函数是K-CSDF第二步的关键。在本文应用场景中, 样本数据是由车辆在车道内行驶而产生的, 因此样本数据的特点是集中在相应的主轴方向, 即车道中心线的方向。因此, 定义主轴核函数如下:

${K_j}\left( {{\mathit{\boldsymbol{x}}^{\left( i \right)}},{\mathit{\boldsymbol{U}}_j}} \right) = {\mathit{\boldsymbol{U}}_j}^{\rm{T}}{\mathit{\boldsymbol{x}}^{\left( i \right)}}$

(12)

其中:U_j是样本类内离散度矩阵S_j的最大特征值所对应的特征向量。

${\mathit{\boldsymbol{S}}_j} = \sum\limits_{\mathit{\boldsymbol{x}} \in class\;j} {\left( {\mathit{\boldsymbol{x}} - {\mathit{\boldsymbol{\mu }}_j}} \right){{\left( {\mathit{\boldsymbol{x}} - {\mathit{\boldsymbol{\mu }}_j}} \right)}^{\rm{T}}}} $

(13)

相应地, 可以将样本与核函数之间的距离ζ定义如下:

$\xi \left( {{\mathit{\boldsymbol{x}}^{\left( i \right)}},{K_j}} \right) = {\mathit{\boldsymbol{\eta }}^{\rm{T}}}\mathit{\boldsymbol{\eta }}$

(14)

$\mathit{\boldsymbol{\eta }} = \left( {{\mathit{\boldsymbol{x}}^{\left( i \right)}} - {\mathit{\boldsymbol{\mu }}_j}} \right) - {\mathit{\boldsymbol{U}}_j}{\mathit{\boldsymbol{U}}_j}^{\rm{T}}\left( {{\mathit{\boldsymbol{x}}^{\left( i \right)}} - {\mathit{\boldsymbol{\mu }}_j}} \right)$

(15)

图 8 样本到核函数的距离示意图 Figure 8 Schematic diagram of the distance from sample to kernel

在定义了代表不同类的核函数以及样本与核函数之间的距离之后, 就可以参照K均值算法来构造K-CSDF的聚类部分, 具体流程如下。

输入样本集合{x⁽¹⁾ , x⁽²⁾ , …, x^(m); x⁽ⁱ⁾∈Rⁿ}

初始化将样本x⁽ⁱ⁾初始化成k类; 随机初始化每类的核K_j。

Repeat

对每个样本x⁽ⁱ⁾利用式(14) 计算出它到初始核的距离ζ, 取最小距离并将样本归为c⁽ⁱ⁾类;

对所有归为c⁽ⁱ⁾类的样本利用式(13) 求出其S_j和U_j, 并更新核K_j

Until

${J_{c,\mathit{\boldsymbol{\mu }}}} = \frac{1}{m}{\sum\limits_{i = 1}^m {\left\| {{\mathit{\boldsymbol{x}}^{\left( i \right)}} - \Delta } \right\|} ^2}$的值不再改变或改变量小于ε=0.01

4 实验仿真和路测结果 4.1 实验仿真结果

首先, 给出K-CSDF特征提取的实验仿真流程, 见图 9。经过K-CSDF第一步的特征提取后通常可以得到2000个左右的有效样本点, 取100个有效点进行聚类, 最终的聚类结果见图 10。从图 10中可以看出算法识别出的三条车道中心线是由每一类的样本在核方向上的投影产生的。

图 9 K-CSDF特征提取过程 Figure 9 Feature extraction process of K-CSDF algorithm

图 10 样本在核方向上的投影示意图 Figure 10 Schematic diagram of the projection of samples in the direction of kernel

其次, 分别取样本点数量为100、500、1000、2000对聚类效果进行分析。此时, 聚类结果的实验仿真如图 11所示, 通过对100组实际采集的数据进行分析, 结果表明即使只取100个样本点, 算法仍能很好地识别出车道中心线。表 1给出了图 11的聚类结果所对应的聚类中心μ_j, 样本类内离散度矩阵S_j和其最大特征值所对应的最大特征向量U_j的数值。

图 11 聚类识别车道中心线示意图 Figure 11 Schematic diagram of identification of lane center line by clustering

表 1 实验结果对应的聚类参数值 Table 1 Values of clustering parameters corresponding to the experimental results

样本点数	聚类中心μ_j	类内离散度矩阵S_j	最大特征向量U_j
100	$\left[ \begin{matrix} -5.23 \\ 2.24 \\ \end{matrix} \right]$, $\left[ \begin{matrix} -0.07 \\ -0.29 \\ \end{matrix} \right]$, $\left[ \begin{matrix} 4.53 \\ -1.20 \\ \end{matrix} \right]$	$\left[ \begin{matrix} 0.94 & -1.23 \\ -1.23 & 4.02 \\ \end{matrix} \right]$, $\left[ \begin{matrix} 1.41 & -1.91 \\ -1.91 & 4.80 \\ \end{matrix} \right]$, $\left[ \begin{matrix} 1.04 & -1.91 \\ -1.91 & 4.84 \\ \end{matrix} \right]$	$\left[ \begin{matrix} -0.33 \\ 0.94 \\ \end{matrix} \right]$, $\left[ \begin{matrix} -0.41 \\ 0.91 \\ \end{matrix} \right]$, $\left[ \begin{matrix} -0.38 \\ 0.92 \\ \end{matrix} \right]$
500	$\left[ \begin{matrix} -4.33 \\ 1.03 \\ \end{matrix} \right]$, $\left[ {\begin{array}{{20}{c}} {0.45}\\ { - 0.08} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} {5.01}\\ { - 1.26} \end{array}} \right]$	$\left[ {\begin{array}{{20}{c}} {1.28}&{ - 1.71}\\ { - 1.71}&{3.82} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} {1.06}&{ - 1.33}\\ { - 1.33}&{4.15} \end{array}} \right]$, $\left[ {\begin{array}{*{20}{c}} {0.85}&{ - 1.33}\\ { - 1.33}&{3.94} \end{array}} \right]$	$\left[ {\begin{array}{{20}{c}} { - 0.45}\\ {0.89} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} { - 0.35}\\ {0.94} \end{array}} \right]$, $\left[ {\begin{array}{*{20}{c}} { - 0.35}\\ {0.93} \end{array}} \right]$
1000	$\left[ {\begin{array}{{20}{c}} { - 4.79}\\ {1.44} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} {0.10}\\ { - 0.14} \end{array}} \right]$, $\left[ {\begin{array}{*{20}{c}} {4.65}\\ { - 1.14} \end{array}} \right]$	$\left[ {\begin{array}{{20}{c}} {1.16}&{ - 1.62}\\ { - 1.62}&{3.86} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} {1.21}&{ - 1.70}\\ { - 1.70}&{4.86} \end{array}} \right]$, $\left[ {\begin{array}{*{20}{c}} {1.05}&{ - 1.58}\\ { - 1.58}&{4.16} \end{array}} \right]$	$\left[ {\begin{array}{{20}{c}} { - 0.42}\\ {0.91} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} { - 0.37}\\ {0.93} \end{array}} \right]$, $\left[ {\begin{array}{*{20}{c}} { - 0.39}\\ {0.92} \end{array}} \right]$
2000	$\left[ {\begin{array}{{20}{c}} { - 4.38}\\ {1.02} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} {0.20}\\ {0.03} \end{array}} \right]$, $\left[ {\begin{array}{*{20}{c}} {4.75}\\ { - 1.29} \end{array}} \right]$	$\left[ {\begin{array}{{20}{c}} {2.21}&{ - 2.67}\\ { - 2.67}&{4.78} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} {1.21}&{ - 1.73}\\ { - 1.73}&{4.52} \end{array}} \right]$, $\left[ {\begin{array}{*{20}{c}} {1.13}&{ - 1.84}\\ { - 1.84}&{4.74} \end{array}} \right]$	$\left[ {\begin{array}{{20}{c}} { - 0.53}\\ {0.85} \end{array}} \right]$, $\left[ {\begin{array}{{20}{c}} { - 0.39}\\ {0.92} \end{array}} \right]$, $\left[ {\begin{array}{*{20}{c}} { - 0.39}\\ {0.92} \end{array}} \right]$

表 1 实验结果对应的聚类参数值 Table 1 Values of clustering parameters corresponding to the experimental results

此外, 在实际采集数据时很容易遇到的一个问题就是, 各个车道的过车数量可能并不均匀, 这会导致采集到的样本分布不均且有明显的“间断”。为了衡量聚类效果, 定义评价指标eva如下:

$eva={\left( {{\beta }_{i}}-{{\alpha }_{i}} \right)}/{\max \left( {{\alpha }_{i}},{{\beta }_{i}} \right)}\;$

(16)

其中:α_i表示第i个样本到此类的其他样本的平均距离; β_i表示第i个样本分别到其他各类样本的平均距离中的最小值; eva值在-1~1范围内, 越接近1表明聚类效果越好。如图 12所示, 仿真结果能识别出车道中心线并且大部分样本的eva值都大于0.6, 表明算法能很好地适应这种情况。

图 12 过车不均时的聚类效果 Figure 12 Clustering results with non-uniform traffic flow

最后, 考虑到国内该类产品仍处于研发阶段并涉及商业机密, 关于多目标交通雷达的车道划分理论研究还未见公开的文献以及实际的工程结果, 因此将本文提出的K-CSDF和另外两种具有代表性的聚类算法在算法用时以及车道划分的准确率上进行对比:

1) 高斯混合模型(Gaussian Mixture Model, GMM):相对于K均值算法强制地将每个样本分给某个类, GMM算法给出的是样本分到每个类的概率, 因而又称作软聚类(soft clustering)。图 13是GMM的聚类结果以及样本分类的后验概率。

图 13 GMM算法的聚类效果 Figure 13 Clustering results by GMM algorithm

2) 自组织映射神经网络(Self-Organizing Maps, SOM):采用2×3的拓扑网络将样本分成6类, 训练次数取200次。SOM网络的聚类结果给出了6个类的中心, 如图 14所示, 分别取左、中、右拓扑结构的两个聚类中心的连线作为识别出的车道中心线。

图 14 SOM算法的聚类效果 Figure 14 Clustering results by SOM algorithm

实验仿真显示三种聚类算法在大多数情况下都能达到90%以上的分车道正确率。其中, 本文算法和SOM算法可以给出车道中心线, 而GMM算法不能; 在样本分布不均衡的情况下, 本文算法也具有很好的鲁棒性, 仍能达到95%以上的正确率, GMM算法可以达到90%左右的正确率, 而SOM算法无法正确分类; 在算法用时上, 以取1000个样本点为例, GMM算法用时最快, 在0.2 s左右, 本文算法需要0.8 s左右, SOM算法需要2.5 s左右。具体对比见表 2和图 15。

表 2 本文算法和其他两种算法的性能以及训练时间对比 Table 2 Comparison of performance and training time among the proposed and the other two algorithms

图 15 取不同数量样本点时分车道正确率对比 Figure 15 Comparison of the lane division accuracy with different sample points

4.2 路测结果

测试设备:PC、雷达、相机和三脚架等。

测试地点:天桥。

测试步骤:

1) 将雷达用三脚架正装, 并连接相机和PC, 然后用雷达采集车辆数据5 min(约20辆车)。

2) 通过上位机发送命令, 执行车道划分算法, 通过聚类结果的聚类中心和特征向量对车道进行拟合, 上位机界面见图 16。

图 16 上位机界面 Figure 16 Interface of host computer

3) 将雷达设置成工作状态, 对车辆进行正常抓拍并保存原始数据、抓拍照片和视频用于统计分析。

4) 对雷达进行侧装, 重复上述三个步骤。

对三组测试结果分别统计分车道正确率, 并对每辆过车取10帧数据进行单帧分析, 如图 17所示。路测统计结果见表 3。

图 17 实际视频的单帧图像以及对应的单帧数据 Figure 17 Single frame image from video and the corresponding single frame data

表 3 多目标雷达路测结果统计 Table 3 Statistical results of the road test for multi-target radar

表 3中, 第1组数据为正装时采集, 第2、3组为侧装时采集, 可以看出正装的车道划分正确率要稍高于侧装时的正确率, 但从总体上, 该方法在两种安装方式下都可以达到95%以上的车道划分正确率, 可以满足实际应用的需求。

5 结语

本文提出了K-CSDF用于车道划分, 该算法主要包括对原始数据的特征提取以及基于核的相似性的动态聚类两步。从实验仿真和路测结果可以看出, 该方法在保证实时性的同时, 可以达到95%以上的分车道正确率；当只取100个样本点进行拟合时, 算法也具有很好的鲁棒性。但本文只对监测3个车道的情况进行了分析, 对于监测更多车道以及更复杂环境下的道路情况, 还需要后续的研究。

参考文献(References)

[1]	BEYELER M, MIRUS F, VERL A. Vision-based robust road lane detection in urban environments [C]//ICRA 2014: Proceedings of the 2014 IEEE International Conference on Robotics and Automation. Piscataway, NJ: IEEE, 2014: 4920-4925.
[2]	DU X, TAN K K, HTET K K K. Vision-based lane line detection for autonomous vehicle navigation and guidance [C]//ASCC 2015: Proceedings of the 2015 10th Asian Control Conference. Piscataway, NJ: IEEE, 2015: 1-5.
[3]	FELGUERA-MARTÍN D, GONZÁLEZ-PARTIDA J T, ALMOROX-GONZÁLEZ P, et al. Vehicular traffic surveillance and road lane detection using radar interferometry[J]. IEEE Transactions on Vehicular Technology, 2012, 61(3): 959-970. DOI:10.1109/TVT.2012.2186323
[4]	THUY M, LEÓN F. Lane detection and tracking based on lidar data[J]. 4 THUY M LEÓN F Lane detection and tracking based on lidar data Metrology and Measurement Systems 2010 17 3 311 321 THUY M, LEÓN F. Lane detection and tracking based on lidar data[J]. Metrology and Measurement Systems, 2010, 17(3): 311-321. 2fj 4 THUY M LEÓN F Lane detection and tracking based on lidar data Metrology and Measurement Systems 2010 17 3 311 321 THUY M, LEÓN F. Lane detection and tracking based on lidar data[J]. Metrology and Measurement Systems, 2010, 17(3): 311-321. 2fmms.2010.xvii.issue-3 4 THUY M LEÓN F Lane detection and tracking based on lidar data Metrology and Measurement Systems 2010 17 3 311 321 THUY M, LEÓN F. Lane detection and tracking based on lidar data[J]. Metrology and Measurement Systems, 2010, 17(3): 311-321. 2fv10178-010-0027-3 4 THUY M LEÓN F Lane detection and tracking based on lidar data Metrology and Measurement Systems 2010 17 3 311 321 THUY M, LEÓN F. Lane detection and tracking based on lidar data[J]. Metrology and Measurement Systems, 2010, 17(3): 311-321. 2fv10178-010-0027-3.pdf?t:ac=j 4 THUY M LEÓN F Lane detection and tracking based on lidar data Metrology and Measurement Systems 2010 17 3 311 321 THUY M, LEÓN F. Lane detection and tracking based on lidar data[J]. Metrology and Measurement Systems, 2010, 17(3): 311-321. 2fmms.2010.xvii.issue-3 4 THUY M LEÓN F Lane detection and tracking based on lidar data Metrology and Measurement Systems 2010 17 3 311 321 THUY M, LEÓN F. Lane detection and tracking based on lidar data[J]. Metrology and Measurement Systems, 2010, 17(3): 311-321. 2fv10178-010-0027-3 4 THUY M LEÓN F Lane detection and tracking based on lidar data Metrology and Measurement Systems 2010 17 3 311 321 THUY M, LEÓN F. Lane detection and tracking based on lidar data[J]. Metrology and Measurement Systems, 2010, 17(3): 311-321. 2fv10178-010-0027-3.xml" target="_blank" title="点击浏览原文">Metrology and Measurement Systems, 2010, 17(3): 311-321.
[5]	JANDA F, PANGERL S, SCHINDLER A. A road edge detection approach for marked and unmarked lanes based on video and radar [C]//FUSION 2013: Proceedings of the 2013 16th International Conference on Information Fusion. Piscataway, NJ: IEEE, 2013: 871-876.
[6]	史鹏波. 基于单线激光雷达的道路特征检测[D]. 南京: 南京理工大学, 2013. (SHI P B. Road feature detection based on single line lidar [D]. Nanjing: Nanjing University of Science and Technology, 2013.) http://cdmd.cnki.com.cn/Article/CDMD-10288-1013165272.htm
[7]	XU Z. Laser rangefinder based road following[C]//ICMA 2005: Proceedings of the 2005 IEEE International Conference on Mechatronics and Automation. Piscataway, NJ: IEEE, 2005, 2: 713-717.
[8]	HAN J, KIM D, LEE M, et al. Enhanced road boundary and obstacle detection using a downward-looking LIDAR sensor[J]. IEEE Transactions on Vehicular Technology, 2012, 61(3): 971-985. DOI:10.1109/TVT.2012.2182785
[9]	吴维一, 刘大学, 戴斌. 一种处理激光雷达数据的聚类分析方法[J]. 计算机仿真, 2007, 24(8): 236-240. (WU W Y, LIU D X, DAI B. A clustering analysis for lidar data[J]. Computer Simulation, 2007, 24(8): 236-240.)
[10]	RODRIGUEZ A, LAIO A. Clustering by fast search and find of density peaks[J]. Science, 2014, 344(6191): 1492-1496. DOI:10.1126/science.1242072
[11]	FORGY E W. Cluster analysis of multivariate data: efficiency versus interpretability of classifications[J]. Biometrics, 1965, 61(3): 768-769.
[12]	SCHÖLKOPF B, SMOLA A, MVLLER K R. Nonlinear component analysis as a kernel eigenvalue problem[J]. Neural Computation, 1998, 10(5): 1299-1319. DOI:10.1162/089976698300017467