基于深度学习模型的电磁超表面设计关键技术

Key technologies in electromagnetic metasurface design based on deep learning models

  • 摘要:
    目的 针对基于深度学习的超表面优化高度依赖大规模全波仿真数据所导致的优化效率受限问题,需深入开展超表面设计场景中的数据集构建方法、深度学习模型结构构建方法、深度学习模型训练方法以及超表面优化方法研究。
    方法 首先,建立以样本重要性为依据的训练样本集采样方法,在减少训练所需样本数量的同时降低模型估算超表面电磁响应的误差;其次,构建一种多模态深度学习模型用于同时提取超表面向量化结构参数特征以及图案特征,提高模型估算响应的性能;然后,提出一种利用深度学习域外泛化特性的模型训练方法,充分利用模型自身泛化性能生成低保真度样本,从而减少高保真度训练样本集的规模;最后,不使用高代价、高准确性的精确模型,而是提出一种使用有限样本规模训练得到的粗略深度学习模型进行超表面设计的新方法。
    结果 数值仿真结果表明,在维持深度学习模型性能的前提下,所提出的数据集构建方法、多模态模型构建方法和训练方法能够将训练所需的全波仿真样本减少30%~50%,大幅减少优化超表面所需的全波仿真工作量。此外,所提出的基于粗略深度学习模型的优化方法能够以数十次迭代和全波仿真设计得到性能优越的超表面。
    结论 所构建的贯穿“数据−模型−训练−应用”闭环的低数据依赖型系统框架,通过4个层面的针对性重构,系统性地缓解了制约超表面设计效率的数据规模难题,为深度学习在电磁工程领域的低成本应用提供了通用的方法论支撑与范式。

     

    Abstract:
    Objective To address the issue that deep-learning-based metasurface optimization relies heavily on large-scale full-wave simulation data, thereby limiting optimization efficiency, this study investigates dataset construction methods, deep learning model architecture design methods, deep learning model training strategies, and optimization approaches for metasurface structure design scenarios.
    Method  First, a training-set sampling method based on sample importance is developed. By evaluating the gradient information of the loss function with respect to individual samples, this method strategically identifies and selects highly informative data points, significantly reducing the required sample size while improving the accuracy of electromagnetic response prediction for metasurface structures. Second, a multimodal deep learning model is constructed to simultaneously extract and integrate features from both vectorized structural parameters and pixelated pattern representations. Through a systematic feature fusion mechanism, the model enhances structural representation capability and further improves electromagnetic response prediction performance. Third, a novel training strategy that exploits the out-of-distribution (OOD) generalization capability of the deep learning model is proposed. This strategy leverages the intrinsic generalization ability of the model to synthesize and incorporate low-fidelity response samples outside the original training distribution, dynamically expanding the feature space and thereby reducing the required scale of the high-fidelity training dataset. Finally, instead of relying on a globally accurate model that incurs substantial computational training costs, an efficient optimization method for metasurface structure design is proposed. This approach employs a coarse deep learning model trained on a strictly limited dataset and combines it with an iterative refinement mechanism to guide the optimization process.
    Results Numerical results demonstrate the high efficiency of the proposed data-driven framework. Specifically, while rigorously maintaining the baseline performance of the deep learning models, the proposed dataset construction method, multimodal model architecture, and OOD-based training strategy each reduce the number of full-wave simulation samples required for initial training by 30%–50%. This substantial reduction significantly alleviates the computational burden associated with generating high-fidelity datasets. Furthermore, during the practical optimization stage, the proposed optimization algorithm based on a coarse deep learning model is shown to achieve rapid convergence. Metasurface structures with excellent electromagnetic performance can be successfully designed and synthesized using only dozens of additional iterative updates and full-wave simulation validations. These results demonstrate the capability of the proposed method to eliminate the dependence on highly accurate yet computationally expensive surrogate models.
    Conclusion A low-data-dependency system framework covering the complete process of "data−model−training−application" is established. Through targeted restructuring at four different levels, it systematically addresses the challenge of limited data scale that constrains the efficiency of metasurface design. As a result, it provides a general methodological foundation and practical paradigm for the low-cost application of deep learning techniques in the field of electromagnetic engineering.

     

/

返回文章
返回