电熔镁炉多模态信息工况智能识别方法研究

李帷韬; 顾嘉钦; 王殿辉; 吴高昌

引用本文:	李帷韬,顾嘉钦,王殿辉,吴高昌.电熔镁炉多模态信息工况智能识别方法研究[J].控制理论与应用,2025,42(5):931~946.[点击复制]
	LI Wei-tao,GU Jia-qin,WANG Dian-hui,WU Gao-chang.Research on intelligent recognition method of multimodal information working condition in fused magnesium furnace[J].Control Theory & Applications,2025,42(5):931~946.[点击复制]

电熔镁炉多模态信息工况智能识别方法研究

Research on intelligent recognition method of multimodal information working condition in fused magnesium furnace

摘要点击 3238 全文点击 63 投稿时间：2023-05-29 修订日期：2024-10-20

查看全文查看/发表评论下载PDF阅读器

DOI编号 10.7641/CTA.2024.30367

2025,42(5):931-946

中文关键词电熔镁炉多模态强化学习 Transformer 工况识别

英文关键词 fused magnesium furnace multi-modal reinforcement learning Transformer working condition recognition

基金项目国家重点研发计划项目(2018AAA0100304), 国家自然科学基金项目(62173120, 62103092), 安徽省自然科学基金项目(2108085UD11), 111引智项目(BP0719039)资助.

作者	单位	E-mail
李帷韬	合肥工业大学电气与自动化工程学院	wtli@hfut.edu.cn
顾嘉钦	合肥工业大学电气与自动化工程学院
王殿辉^*	中国矿业大学人工智能研究院	dh.wang@deepscn.com
吴高昌	东北大学流程工业综合自动化国家重点实验室

中文摘要

针对电熔镁炉识别过程中出现的模态完备性不足问题, 本文阐述了一种电熔镁炉多模态信息工况智能识别方法. 首先, 通过语义神经网络提取工况图像特征和双向编码语言模型提取语言特征, 构建多模态工况的完备联合特征向量, 并通过Transformer编码层实现全局交互, 捕捉视觉与语言信息的细粒度对齐. 引入自适应Transformer 解码层的自注意力机制, 采用全连接网络获得多模态工况识别结果. 基于强化学习定义门控单元评估策略, 实时评估不确定工况识别结果, 构建解码层的动态调节机制, 以获取多模态工况的细粒度特征, 并采用模糊积分集成模型库的识别结果. 实验证明了本方法的有效性和鲁棒性.

英文摘要

Aiming at the lack of modal completeness in the identification process of fused magnesium furnace, an intelligent recognition method of multi-modal information working condition of electric smelting magnesia furnace is described in this paper. Firstly, we use semantic neural network to extract image features and two-way coding language model to extract linguistic features to construct a complete joint feature vector of multimodal working conditions, and then we realize global interaction through the Transformer encoding layer to capture the fine-grained alignment of visual and linguistic information. The self-attention mechanism of the adaptive Transformer decoding layer is introduced to obtain the multimodal work condition recognition results using fully connected networks. Based on reinforcement learning, we define the evaluation strategy of gating unit, evaluate the recognition results of uncertain working conditions in real time, construct the dynamic adjustment mechanism of the decoding layer to obtain the fine-grained features of multimodal working conditions, and use the fuzzy integration to integrate the recognition results of the model library. Experiments demonstrate the effectiveness and robustness of this method.