肖庆汇, 刘治纲, 陶冶华. 面向边缘计算的轻量化YOLOv8学生课堂行为识别模型[J]. 南昌航空大学学报(自然科学版), 2025, 39(5): 87-96. DOI: 10.3969/j.issn.2096-8566.2025.05.011
引用本文: 肖庆汇, 刘治纲, 陶冶华. 面向边缘计算的轻量化YOLOv8学生课堂行为识别模型[J]. 南昌航空大学学报(自然科学版), 2025, 39(5): 87-96. DOI: 10.3969/j.issn.2096-8566.2025.05.011
Qinghui XIAO, Zhigang LIU, Yehua TAO. Lightweight YOLOv8 Model for Student Classroom Behavior Recognition Toward Edge Computing[J]. Journal of nanchang hangkong university(Natural science edition), 2025, 39(5): 87-96. DOI: 10.3969/j.issn.2096-8566.2025.05.011
Citation: Qinghui XIAO, Zhigang LIU, Yehua TAO. Lightweight YOLOv8 Model for Student Classroom Behavior Recognition Toward Edge Computing[J]. Journal of nanchang hangkong university(Natural science edition), 2025, 39(5): 87-96. DOI: 10.3969/j.issn.2096-8566.2025.05.011

面向边缘计算的轻量化YOLOv8学生课堂行为识别模型

Lightweight YOLOv8 Model for Student Classroom Behavior Recognition Toward Edge Computing

  • 摘要: 针对课堂场景中学生行为存在目标密集、遮挡频繁以及边缘设备算力受限等问题,本文提出一种面向边缘计算的轻量化YOLOv8 学生课堂行为识别方法,在保证检测精度的同时显著降低模型复杂度。首先,提出一种 Neck-DSC 轻量化特征融合结构,通过深度可分离卷积重构特征融合路径,在基本不损失表达能力的前提下降低计算量。其次,设计了一种动态稀疏注意力机制(DSA),以检测置信度为先验,仅对潜在遮挡区域自适应分配注意力计算资源,从而增强遮挡场景下的行为特征建模能力。最后,结合 TensorRT 实现 INT8 量化部署,进一步提升模型在边缘设备上的推理效率。在本文课堂行为数据集上的实验结果表明,所提出方法在模型参数量仅为 2.39×106 的情况下,相较原 YOLOv8n 减少 24.13%,同时保持约 95%的mAP。消融实验验证了 Neck-DSC 与 DSA 模块在轻量化与遮挡鲁棒性方面的有效性。

     

    Abstract: To address the problems of dense targets, frequent mutual occlusions in student behaviors with classroom scenarios and limited computing power of edge devices, this paper proposes a lightweight YOLOv8-based student classroom behavior recognition method for edge computing, which maintains detection accuracy while significantly reducing model complexity. First, a Neck-DSC lightweight feature fusion structure is proposed. It reconstructs the feature fusion path through depthwise separable convolutions, reducing computational complexity basically without losing representation capability. Secondly, a Dynamic Sparse Attention (DSA) mechanism is designed. With detection confidence as a prior, it adaptively allocates attention computation resources only to potential occluded regions, thereby enhancing the behavioral feature modeling capability in occlusion scenarios. Finally, the INT8 quantization deployment is achieved by integrating TensorRT, further improving the inference efficiency of the model on edge devices. Experiment results on a self-constructed classroom behavior dataset show that the proposed method has a model parameter count of only 2.39×106, presenting a 24.13% reduction compared with the original YOLOv8n, while maintaining mAP at approximately 95%. Ablation studies validate the effectiveness of the Neck-DSC and DSA modules in terms of lightweight capability and occlusion robustness.

     

/

返回文章
返回