Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2025, Vol. 47 ›› Issue (10): 1914-1922.doi: 10.19562/j.chinasae.qcgc.2025.10.007

Previous Articles    

High-Fidelity Unsupervised Image Translation for Mismatched Semantic Data

Zhuo Li1,Libo Cao1(),Jiacai Liao2,Haowei Cui3,Yue Zhang3   

  1. 1.Hunan University,State Key Laboratory of Advanced Design and Manufacturing Technology for Vehicle,Changsha 410085
    2.College of Mechanical and Vehicle Engineering,Changsha University of Science & Technology,Changsha 410076
    3.North China Vehicle Research Institute,Beijing 100072
  • Received:2025-03-18 Revised:2025-04-30 Online:2025-10-25 Published:2025-10-20
  • Contact: Libo Cao E-mail:hdclb@163.com

Abstract:

In the field of intelligent driving environment perception, image translation models often fail when there is a significant semantic mismatch between source and target domains, leading to semantic inversion and detail degradation. To address this challenge, in this paper a high-fidelity image translation method tailored for asymmetric domain data is proposed. Based on the diffusion-model generator structure, a multi-adaptive skip connection (MASC) module and a high-dimensional vector consistency loss (HVC loss) are proposed. The MASC module combines dynamic normalization and attention mechanism to adaptively process semantic and style information in skip connection, while the HVC loss constrains semantic mapping relationship in high-dimensional symbolic space. Compared to the optimal results, the proposed model reduces the FID and KID scores by 15.36 and 0.003 4 on RainSurface, and by 1.44 and 0.000 8 on public datasets, respectively.

Key words: image translation, deep learning, data augmentation, diffusion models