In this paper, a human-like behavior decision-making strategy (HBDS) is established by analyzing drivers’ driving behavior generation mechanism. HBDS has a framework that matches the driving behavior generation mechanism, obtains the human-like reward function through maximum entropy inverse reinforcement learning, and adopts the Boltzman noisily-rational model to build the mapping relationship between behavior probability and its cumulative reward. By discretizing the expected trajectory space, the curse of dimensionality in the integration of continuous high-dimensional space is avoided, and based on statistical law and safety constraint, the expected trajectory space is compressed and pruned, enhancing the sampling efficiency of HBDS. The strategy is trained and tested on NGSIM dataset, and the results show that HBDS can make behavior decisions that conform to the driver’s personalized cognitive and behavioral characteristics.