Dimension-based Quality Modeling of Transmitted Speech

It includes a perceptual and a cognitive model which simulate the whole quality judgment process.

Speech Quality in Telecommunications

As this book deals with the measurement, analysis and prediction of the speech transmission quality as perceived by a listener, this chapter will describe the perceptual characteristics of speech production and perception that.

This exhaustive description leads to the definition of perceived quality. This chapter deals with measurement methods particularly relevant to any assessment of the perceived quality of voice and speech.

Such voice and speech quality measurement methods are employed in several scientific fields, such as medicine e. Each field has its own assessment paradigm.

Both are defined in Sec. From available data, instrumental models have thus been developed mainly for Narrow-Band (NB) speech.

However, for the network, the introduction of a new transmission paradigm implies some flexibility that is hardly possible with the fixed-line telephony system. For instance, a wideband (WB) transmission requires a specific speech codec. Moreover, in the near future, both NB and WB transmissions will be available.

However, this mixed-band context requires instrumental models to assess both bandwidths on a single quality scale. An instrumental measure should provide a correct ranking of various speech processing systems. Even though new instrumental methods have been developed for the perceptual assessment of VoIP transmissions, none of the current models correctly estimates the degradations introduced by all in-use telecommunication systems. The reliable assessment of very different connections i.
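The ranking requirement stated above can be checked numerically. The following is a minimal sketch, assuming that ranking agreement is measured with the Spearman rank correlation between subjective scores and model estimates, computed per processing condition. The function names and this choice of statistic are illustrative assumptions, not taken from the book.

```python
from __future__ import annotations

from typing import Sequence


def average_ranks(values: Sequence[float]) -> list[float]:
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def spearman(subjective: Sequence[float], estimated: Sequence[float]) -> float:
    """Spearman rank correlation: Pearson correlation of the two rank vectors."""
    if len(subjective) != len(estimated) or len(subjective) < 2:
        raise ValueError("need two equally long sequences with at least 2 items")
    rx, ry = average_ranks(subjective), average_ranks(estimated)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rank correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5
```

A value of 1 means the model orders all conditions exactly as the listeners did. A value near 0 means the model's ordering is unrelated to theirs, and negative values mean it tends to reverse it.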

The current ITU-T standards show some limits in their quality estimations. These limits are mainly caused by the growing complexity of network topologies and their speech processing components.

These limits have been detailed in the previous Chapter. This selection process is detailed in Sec.


A new model, called Diagnostic Instrumental Assessment of Listening quality (DIAL), developed as part of this standardization program, is presented in this chapter. The Core model assesses the nonlinear degradations introduced by a speech transmission system, whereas the dimension estimators quantify the linear degradations on the four perceptual dimensions defined in Sec. Then, an aggregation of all the degradations simulates a cognitive process employed by human subjects during the quality judgment process. In accordance with the development procedure of an instrumental model introduced in Sec.

The validation process of this candidate model is presented in Chap. The resulting dimension scores, together with the respective overall quality ratings, form the basis for a new parametric model for the quality estimation of transmitted speech based on the perceptual dimensions. In a two-step model approach, instrumental dimension models estimate dimension impairment factors in a first step.
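The two-step idea can be illustrated with a small sketch. It assumes, purely for illustration, that the second step subtracts the sum of the dimension impairment factors from a maximum score on a 0 to 100 scale, in the spirit of the additive impairment principle of the ITU-T E-model. The dimension names, the scale and the combination rule are hypothetical placeholders and are not confirmed by the text above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DimensionImpairments:
    """Step 1 output: one impairment factor per perceptual dimension.

    The four field names are placeholders chosen for this example.
    """
    coloration: float
    discontinuity: float
    noisiness: float
    loudness: float


def overall_quality(imp: DimensionImpairments, r_max: float = 100.0) -> float:
    """Step 2 (assumed rule): subtract summed impairments from r_max, then clamp."""
    total = imp.coloration + imp.discontinuity + imp.noisiness + imp.loudness
    return max(0.0, min(r_max, r_max - total))


# Made-up impairment values for a single hypothetical connection.
example = DimensionImpairments(coloration=10.0, discontinuity=5.0,
                               noisiness=0.0, loudness=2.5)
score = overall_quality(example)
```

Keeping the per-dimension factors separate until the final step is what makes such a model diagnostic: a low overall score can be traced back to the dimension that contributed most of the impairment.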

