Application of Electronic Technique
High precision DOA estimation of microphone array for shipboard teleconferencing
Application of Electronic Technique, 2022, Issue 3
Liu Yuji1,2,3, Tong Feng1,2,3, Chen Dongsheng1,2,3, Lu Rongfu4, Feng Wanjian4
1. Key Laboratory of Underwater Acoustic Communication and Marine Information Technique of the Ministry of Education, Xiamen University, Xiamen 361002, China; 2. College of Earth and Ocean Sciences, Xiamen University, Xiamen 361002, China; 3. Shenzhen Research Institute of Xiamen University, Shenzhen 518000, China; 4. Xiamen Yilian Network Technology Co., Ltd., Xiamen 361000, China
Abstract: As ships become more intelligent, shipboard teleconferencing systems play an important role in improving emergency response capability and advancing the construction of integrated ship-shore networks. The microphone array is a key speech front end that ensures the audio quality of a teleconferencing system and supports multimodal interaction. However, the cramped dimensions of ship cabins mean that only small arrays can be used, while the strong reverberation of small cabins and loud cabin noise severely degrade the performance of traditional microphone array algorithms. Targeting direction of arrival (DOA) estimation with a small array in the complex environment of a ship cabin, this paper proposes a lightweight Mask-DOA estimation neural network model. The method introduces a Mask algorithm into the DOA estimation neural network to reduce interference from noise and reverberation, and extracts the enhanced GCC-PHAT as the network feature, thereby achieving high-precision DOA estimation on a small microphone array. Simulation and experimental results show that the proposed Mask-DOA model is more robust and generalizes better in the complex ship cabin environment.
Key words: direction of arrival estimation; ship cabin noise and reverberation environment; neural network; time-frequency masking
CLC number: TN912.3
Document code: A
DOI: 10.16157/j.issn.0258-7998.212108
Citation: Liu Yuji, Tong Feng, Chen Dongsheng, et al. High precision DOA estimation of microphone array for shipboard teleconferencing[J]. Application of Electronic Technique, 2022, 48(3): 32-36, 77.

0 Introduction

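Shipboard teleconferencing systems play a significant role in making ships more intelligent; in particular, they improve emergency response capability and advance the construction of integrated ship-shore networks. In recent years, shipboard teleconferencing and monitoring systems have developed rapidly [1-3]. By providing accurate direction of arrival (DOA) estimates, a microphone array enables speech enhancement and can also supply speaker position information to the teleconferencing system's camera for multimodal interaction. It has therefore become an important speech front end for teleconferencing systems [4-5].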

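Ordinary teleconferencing venues are acoustically fairly benign, so larger microphone arrays are typically used to secure DOA estimation and to improve speech enhancement and multimodal interaction. The cabins that host shipboard teleconferences, however, are a typical complex acoustic scene. On the one hand, cabins are cramped, which both causes severe reverberation and makes it hard to accommodate a large teleconferencing microphone array. On the other hand, cabins suffer severe noise interference [6]: strong internal noise from the many electrical devices, engines, and other equipment packed into the limited space of each cabin, and external noise from other vessels, waves, and similar sources. All of this complicates the acoustic characteristics of ship cabins and makes DOA estimation with a microphone array considerably more challenging.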

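In recent years, with the development of artificial intelligence, Xiao et al. proposed using a multilayer perceptron (MLP) for DOA estimation [7], exploiting deep networks and large amounts of data to raise DOA estimation accuracy well beyond that of traditional DOA estimation algorithms. Diaz-Guerra et al. used steered response power with phase transform features as input and built a neural network model that casts the DOA estimation task as a regression problem [8]. Nguyen et al. used a 2D convolutional neural network with multi-task learning to robustly estimate the number of sound sources and their directions of arrival from short-time spatial pseudo-spectra [9]; this approach reduces the unnecessary associations the network learns between sound class and direction information and speeds up model convergence.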
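For orientation, the kind of traditional estimator that these learning-based methods are compared against can be sketched with GCC-PHAT, the feature named in the abstract. The snippet below is a minimal illustration for a single two-microphone pair under a far-field assumption. It is not the Mask-DOA model of this paper. The function names `gcc_phat` and `tdoa_to_angle`, the 0.04 m spacing, the 16 kHz sampling rate, and the 343 m/s speed of sound are all assumptions chosen for the example.

```python
import numpy as np

def gcc_phat(sig, ref, fs, max_tau=None, interp=16):
    """Delay of `sig` relative to `ref`, in seconds, estimated with GCC-PHAT."""
    n = sig.shape[0] + ref.shape[0]            # zero-pad to avoid circular wrap-around
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)
    cross = SIG * np.conj(REF)                 # cross-power spectrum
    cross /= np.abs(cross) + 1e-12             # PHAT weighting: keep phase only
    cc = np.fft.irfft(cross, n=interp * n)     # zero-padded inverse FFT interpolates the correlation
    max_shift = interp * n // 2
    if max_tau is not None:
        # Restrict the search to physically possible delays (at least one lag).
        max_shift = max(1, min(int(interp * fs * max_tau), max_shift))
    # Reorder so that index `max_shift` corresponds to zero lag.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(cc)) - max_shift
    return shift / float(interp * fs)

def tdoa_to_angle(tau, spacing, c=343.0):
    """Far-field angle in degrees from broadside for a two-microphone pair."""
    return float(np.degrees(np.arcsin(np.clip(c * tau / spacing, -1.0, 1.0))))

# Assumed toy setup: white noise reaching the second microphone one sample later.
fs = 16000          # sampling rate in Hz (assumption)
spacing = 0.04      # microphone spacing in metres (assumption)
rng = np.random.default_rng(0)
x = rng.standard_normal(4097)
ref = x[1:]         # signal at the reference microphone
sig = x[:-1]        # same signal delayed by exactly one sample
tau = gcc_phat(sig, ref, fs, max_tau=spacing / 343.0)
theta_deg = tdoa_to_angle(tau, spacing)
```

With the assumed 0.04 m spacing, the largest possible inter-microphone delay is about 0.04 / 343 × 16000 ≈ 1.9 samples at 16 kHz, so the whole angular range maps onto fewer than four integer lags. This is why the sketch interpolates the correlation, and it gives a rough sense of why small arrays in noisy, reverberant cabins are a demanding case for estimators of this kind.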




The full text of this article can be downloaded from: http://forexkbc.com/resource/share/2000003998








