Perceptual Evaluation of Speech Quality explained

Perceptual Evaluation of Speech Quality (PESQ) is a family of standards comprising a test methodology for automated assessment of the speech quality as experienced by a user of a telephony system. It was standardized as Recommendation ITU-T P.862[1] in 2001. PESQ is used for objective voice quality testing by phone manufacturers, network equipment vendors and telecom operators. Its usage requires a license. The first edition of PESQ's successor POLQA (Recommendation ITU-T P.863[2]) entered into force in 2011.

Measurement scope

PESQ was developed to model subjective tests commonly used in telecommunications (e.g., Recommendation ITU-T P.800) to assess the voice quality perceived by human beings. Consequently, it employs true voice samples as test signals. In order to characterize the listening quality as perceived by users, it is of paramount importance to load modern telecom equipment with speech-like signals. Many systems are optimized for speech and would respond in an unpredictable way to non-speech signals (e.g., tones, noise). Guidelines for proper applications of voice test samples are defined in the PESQ application guide contained in Recommendation ITU-T P.862.3.[3]

Genealogy of related standards

ITU-T's family of full reference objective voice quality measurements started in 1997 with Recommendation ITU-T P.861 (PSQM), which was superseded by ITU-T P.862 (PESQ) in 2001. P.862 was later complemented with Recommendations ITU-T P.862.1[4] (mapping of PESQ scores to a MOS scale), ITU-T P.862.2[5] (wideband measurements) and ITU-T P.862.3 (application guide). The first edition of ITU-T P.863 (POLQA) entered into force in 2011. An Application guide for Recommendation ITU-T P.863 was approved in 2019 and published as ITU-T P.863.1.[6]

In addition to the above listed full reference methods, the list of ITU-T's objective voice quality measurement standards also includes ITU-T P.563[7] (no-reference algorithm).

Testing typology

Depending on the information that is made available to an algorithm, voice-quality test algorithms can be divided into two main categories:

PESQ is a full-reference algorithm and analyzes the speech signal sample-by-sample after a temporal alignment of corresponding excerpts of reference and test signal. PESQ can be applied to provide an end-to-end (E2E) quality assessment for a network, or characterize individual network components.

PESQ results principally model mean opinion scores (MOS) that cover a scale from 1 (bad) to 5 (excellent). A mapping function to MOS-LQO is outlined in Recommendation ITU-T P.862.1.

See also

References

External links

Notes and References

  1. Web site: P.862 : Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs. 2021-04-20. www.itu.int.
  2. Web site: P.863 : Perceptual objective listening quality prediction. 2021-04-11. www.itu.int.
  3. Web site: P.862.3 : Application guide for objective quality measurement based on Recommendations P.862, P.862.1 and P.862.2. 2021-04-20. www.itu.int.
  4. Web site: P.862.1 : Mapping function for transforming P.862 raw result scores to MOS-LQO. 2021-04-11. www.itu.int.
  5. Web site: P.862.2 : Wideband extension to Recommendation P.862 for the assessment of wideband telephone networks and speech codecs. 2021-04-11. www.itu.int.
  6. Web site: P.863.1 : Application guide for Recommendation ITU-T P.863. 2021-04-11. www.itu.int.
  7. Web site: P.563 : Single-ended method for objective speech quality assessment in narrow-band telephony applications. 2021-04-11. www.itu.int.