Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2022, Vol. 44 ›› Issue (5): 691-700.doi: 10.19562/j.chinasae.qcgc.2022.05.006

Special Issue: 智能网联汽车技术专题-规划&控制2022年

Previous Articles     Next Articles

K-means Complementary Iterative Vehicle Information Data Clustering Based on DHSSA Optimization

He Huang1,2(),Wenlong Li1,2,Lan Yang1,Huifeng Wang1,Biao Wang1,Feng Ru1,2   

  1. 1.Chang’an University,Xi’an  710064
    2.Xi’an Key Laboratory of Intelligent Expressway Information Fusion and Control,Xi’an  710064
  • Received:2021-11-22 Revised:2021-12-19 Online:2022-05-25 Published:2022-05-27
  • Contact: He Huang E-mail:huanghe@chd.edu.cn

Abstract:

For the problems that the traditional method is greatly affected by the initialization center in the process of vehicle information data clustering, resulting in low clustering accuracy and poor robustness, and the selection of clustering center by calculating the mean in the iterative process is greatly affected by the outliers, a K-means complementary iterative vehicle information data clustering optimized by DHSSA is proposed. Firstly, for the problem of insufficient update of discoverer position and insufficient population diversity in SSA algorithm, a disturbance factor-head optimization strategy is designed. The influence of the optimal individual is strengthened by the adaptive head strategy, and the search space is expanded by the disturbance factor, which improves the accuracy of cluster center searching. Secondly, the initialization of cluster centers optimized by screening maximum and minimum distance product method (SMMP) is designed, and the screening mechanism is added on the basis of MMP, so that the initial centers are more evenly distributed in each cluster as much as possible. Finally, DHSSA and SMMP are integrated to optimize the K-means complementary iteration, which reduces the number of iterations and increases the search efficiency to obtain better clustering results. Using a variety of data sets for testing, through the convergence curve and performance indicators in the experimental results, it can be seen that the proposed DHSSA-KMC method is of higher search accuracy, convergence speed and lower clustering cost than SSA-KMC, IMFO-KMC, KMC and KMC++, and the time consumption is reduced compared with SSA-KMC and IMFO-KMC, which proves the effectiveness and superiority of the algorithm. In the process of vehicle information data processing, DHSSA-KMC can efficiently cluster and generate competitive models for consumers to choose, with obvious application value.

Key words: KMC, screening maximum and minimum distance product, SSA, data sets, car type information data