引用本文: | 刘秉权,徐帅,李相前.双阈值的特定英语音频句子边界检测[J].哈尔滨工业大学学报,2010,42(2):259.DOI:10.11918/j.issn.0367-6234.2010.02.018 |
| LIU Bing-quan,XU Shuai,LI Xiang-qian.Boundary detection of special English audio sentence based on dual-threshold[J].Journal of Harbin Institute of Technology,2010,42(2):259.DOI:10.11918/j.issn.0367-6234.2010.02.018 |
|
摘要: |
为了提高英语音频句子切分的效果,提出了基于双阈值的句子边界检测方法.该方法针对VOA、BBC等特别适合英语学习者的音频所具有的波形规范、环境噪声小、速率通常比较稳定等特点,利用静音能量阈值和静音时延阈值来检测音频句子的边界,并辅以对照文本信息进行校正.针对VOA慢速英语的实验结果表明:单纯使用双阈值方法,音频切分的召回率超过96%,精确率超过94%;利用对照文本校正后,可进一步提高精确率. |
关键词: 音频切分 边界检测 双阈值 |
DOI:10.11918/j.issn.0367-6234.2010.02.018 |
分类号:TN912.34 |
基金项目:国家自然科学基金资助项目(60673037);国家高技术研究发展计划资助项目(2006AA01Z197);黑龙江省自然科学基金资助项目(E200635) |
|
Boundary detection of special English audio sentence based on dual-threshold |
LIU Bing-quan, XU Shuai, LI Xiang-qian
|
School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,China
|
Abstract: |
To improve the effect of sentence-evel English audio segmentation,a method of sentence boundary detection based on dual-threshold is proposed.With consideration of the characteristics of normative waveform,small noise and stable speed for English audio such as VOA and BBC,the method in this paper detects the sentence boundary of English audio via its quiet energy threshold and quiet delay threshold,and the corresponding audio text is used to revise the segmentation result.Experiments on special English of VOA show that the recall rate of segmentation exceeds 96% and the precision rate exceeds 94% by using the dual-threshold method only.After revision via the corresponding text,the precision rate can be improved further. |
Key words: audio segmentation boundary detection dual-threshold |