面向密集行人場景的YOLOv8n改進算法-AET-電子技術應用

面向密集行人場景的YOLOv8n改進算法

電子技術應用

王麗黎1，2，樊盼盼1，張詩雨1

1.西安理工大學自動化與信息工程學院；2.無線光通信與網絡研究重點實驗室

摘要： 為了解決傳統(tǒng)算法在密集行人場景中識別精度不足和檢測不準確的問題，提出一種基于YOLOv8n的改進型密集行人檢測模型。首先，引入SPPELAN模塊替換骨干網絡中的SPPF模塊，以提升模型對多尺度目標的特征感知能力。其次，設計一種殘差注意力機制，提高模型對細微特征的提取能力，進而提高檢測精度。最后通過添加DySample算子、改進的小目標檢測層提高模型對小尺度目標的定位識別能力。實驗結果顯示，改進的模型相較于YOLOv8n在CrowdHuman數(shù)據集上的召回率、mAP50和mAP50-95分別提升了2.5%、2.9%和2.4%，并且該模型在WiderPerson和CityPersons數(shù)據集上表現(xiàn)優(yōu)異。實驗結果表明，該算法能更好適用于密集行人檢測任務。

關鍵詞： YOLOv8n 密集行人檢測 SPPELAN模塊殘差注意力機制 DySample

中圖分類號：TP391.41 文獻標志碼：A DOI: 10.16157/j.issn.0258-7998.245589
中文引用格式： 王麗黎，樊盼盼，張詩雨. 面向密集行人場景的YOLOv8n改進算法[J]. 電子技術應用，2025，51(2)：15-20.
英文引用格式： Wang Lili，F(xiàn)an Panpan，Zhang Shiyu. An improved YOLOv8n algorithm for dense pedestrian scenarios[J]. Application of Electronic Technique，2025，51(2)：15-20.

An improved YOLOv8n algorithm for dense pedestrian scenarios

Wang Lili1，2，F(xiàn)an Panpan1，Zhang Shiyu1

1.School of Automation and Information Engineering， Xi’an University of Technology； 2.Key Laboratory of Wireless Optical Communication and Network Research

Abstract： To address the issues of insufficient recognition accuracy and inaccurate detection of traditional algorithms in dense pedestrian scenarios, an improved dense pedestrian detection model based on YOLOv8n is proposed. Firstly, by introducing the SPPELAN module to replace the SPPF module in the backbone network, the model’s ability to perceive features of multi-scale targets is enhanced. Secondly, a residual attention mechanism is devised to improve the model’s ability to capture subtle features, thereby enhancing detection accuracy. Finally, by adding DySample operator and improving the small object detection layer, the model’s ability to locate and recognize small-scale objects is enhanced. Experimental results show that the improved model, compared to YOLOv8n, increases recall rate, mAP50, and mAP50-95 by 2.5%, 2.9%, and 2.4%, respectively, on the CrowdHuman dataset, and performs excellently on the WiderPerson and CityPersons datasets. The results of the experiments show that this algorithm is more effective for dense pedestrian detection tasks.

Key words : YOLOv8n；dense pedestrian detection；SPPELAN module；residual attention mechanism；DySample；small object detection layer

引言

近年來隨著自動駕駛、視頻監(jiān)控系統(tǒng)的高速發(fā)展，現(xiàn)實需求對行人檢測要求越來越高。傳統(tǒng)的行人檢測算法根據檢測方法的不同，分為雙階段檢測算法和單階段檢測算法。常見的雙階段目標檢測算法主要包含目標定位和目標分類兩個步驟。首先，提取出圖像中所有可能包含目標的區(qū)域，即目標定位。然后對所提取的每個區(qū)域進行分類，確定含有的目標類型。常見算法包括Faster R-CNN[1]、RetinaNet[2]等。單階段檢測算法是直接將從圖像中提取的特征進行位置預測和種類識別，常見的單階段檢測算法包括YOLO等。

傳統(tǒng)的目標檢測算法在大多數(shù)現(xiàn)實場景中都取得了不錯的效果，但是在一些密集場景中，如大型商場、景區(qū)、人流量大的客運站等，算法對行人目標的檢測性能仍需要提升，主要原因是密集場景中的行人目標密集且行人尺度不一。

針對密集目標檢測中目標數(shù)量多、尺寸不一的問題，王澤宇等[3]基于YOLOv8n提出密集行人檢測算法MER-YOLO，使用MobileVit作為主干網絡提升模型對聚集區(qū)域的特征提取能力，之后通過引入高效多尺度注意力（Efficient Multi-scale Attention，EMA）提高全局信息交互能力，采用排斥損失函數(shù)作為邊界損失函數(shù)降低模型的漏檢情況。黃昆等[4]基于YOLOv8提出檢測算法Crowd-YOLOv8，通過使用nostride-Conv-SPD模塊加強網絡對細微信息的特征提取能力，并引入小目標檢測頭和上采樣算子對多尺度特征進行融合，增強了小尺度目標檢測效果。呂志軒等[5]提出了多分支無錨框行人檢測算法MBAN，該算法通過在主干網絡后引入多分支網絡結構，并結合距離損失函數(shù)來對關鍵區(qū)域進行指導，使網絡對行人局部檢測位置進行差異化學習，強化對行人局部特征的捕捉，改善網絡對遮擋行人的識別能力。

為了有效解決上述問題，本文提出了一種基于YOLOv8n模型改進的密集行人檢測算法，旨在增強對多尺度行人目標的特征提取能力，加強檢測頭的定位檢測能力，更好識別密集行人目標，提升檢測精度。

本文詳細內容請下載：

http://forexkbc.com/resource/share/2000006319

作者信息：

王麗黎1，2，樊盼盼1，張詩雨1

（1.西安理工大學自動化與信息工程學院，陜西西安 710048；

2.無線光通信與網絡研究重點實驗室，陜西西安 710048）

Magazine.Subscription.jpg

原創(chuàng)聲明：此內容為AET網站原創(chuàng)，未經授權禁止轉載。

相關內容