ES 202 050-2002
语音处理 传输和质量方面(STQ);分布式语音识别;先进的前端特征提取算法;压缩算法(V1.1.1)

Speech Processing@ Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms (V1.1.1)

2014-04

 

 

非常抱歉,我们暂时无法提供预览,您可以试试: 免费下载 ES 202 050-2002 前三页,或者稍后再访问。

您也可以尝试购买此标准,
点击右侧 “购买” 按钮开始采购(由第三方提供)。

点击下载后,生成下载文件时间比较长,请耐心等待......

 

标准号
ES 202 050-2002
发布
2002年
发布单位
ETSI - European Telecommunications Standards Institute
替代标准
ES 202 050-2003
当前最新
ES 202 050-2007
 
 
适用范围
"The present document specifies algorithms for advanced front-end feature extraction and their transmission which form part of a system for distributed speech recognition. The specification covers the following components: - the algorithm for advanced front-end feature extraction to create Mel-Cepstrum parameters; - the algorithm to compress these features to provide a lower data transmission rate; - the formatting of these features with error protection into a bitstream for transmission; - the decoding of the bitstream to generate the advanced front-end features at a receiver together with the associated algorithms for channel error mitigation. The present document does not cover the ""back-end"" speech recognition algorithms that make use of the received DSR advanced front-end features. The algorithms are defined in a mathematical form or as flow diagrams. Software implementing these algorithms written in the 'C' programming language is contained in the ZIP file es~202050v010101pO.zipw hich accompanies the present document. Conformance tests are not specified as part of the standard. The recognition performance of proprietary implementations of the standard can be compared with those obtained using the reference 'C' code on appropriate speech databases. It is anticipated that the DSR bitstream will be used as a payload in other higher level protocols when deployed in specific systems supporting DSR applications. In particular@ for packet data transmission@ it is anticipated that the IETF AVT RTP DSR payload definition (see bibliography) will be used to transport DSR features using the frame pair format described in clause 7. The Advanced DSR standard is designed for use with discontinuous transmission and to support the transmission of Voice Activity information. Annex A describes a VAD algorithm that is recommended for use in conjunction with the Advanced DSR standard@ however it is not part of the present document and manufacturers may choose to use an alternative VAD algorithm."

ES 202 050-2002相似标准


推荐

人机交互语音识别技术

线性预测编码技术(Linear Predict Coding, LPC)被Itakura成功应用于语音识别;SakoeChiba将动态规划思想应用到语音识别并提出动态时间规整算法,有效解决了语音信号特征提取不等长语音匹配问题;同时提出了矢量量化(VQ)隐马尔可夫模型(HMM)理论。...

这有一份来自腾讯AI Lab2018答卷

2018 年,我们又提出了一些新方法改进,在语音增强、语音分离、语音识别语音合成等技术方向都取得了一定进展。语音唤醒上,我们针对误唤醒、噪声环境中唤醒、快语速唤醒儿童唤醒等问题,提出了一种新语音唤醒模型[1],能显著提升关键词检测质量,在有噪声环境下表现突出,还能显著降低前端关键词检测模块功耗需求。...

AI时代,10 年之后我们还能干什么?|71页完整报告

目前,中国占据全球 44% 需求,海康威视已成为安防摄像头全球龙头。后端设备强化计算分析功能后端设备强化计算分析功能,以支持复杂视频分析:传统 NVR(Network video recorder)主要功能是压缩存储视频信息。通过添加 GPU 等人工智能加速芯片应用处理软件,智能 NVR 能够实现图像识别特征提取,人体识别、人员检索等功能。...

Nature纪念人工智能60周年专题:深度学习综述

这些方法在许多方面都带来了显著改善,包括较先进语音识别、视觉对象识别、对象检测许多其它领域,例如药物发现基因组学等。深度学习能够发现大数据中复杂结构。它是利用BP算法来完成这个发现过程。BP算法能够指导机器如何从前一层获取误差而改变本层内部参数,这些内部参数可以用于计算表示。...





Copyright ©2007-2022 ANTPEDIA, All Rights Reserved
京ICP备07018254号 京公网安备1101085018 电信与信息服务业务经营许可证:京ICP证110310号