Singapore Institute of Technology
Single channel AI speech reverberation time modification for room dimension matching

Reverberation arises when sound reflects from walls, ceilings and other plane surfaces, adding delayed echoes whose pattern depends on the geometry of the environment. Reverberation in speech contributes to interpretation, giving listeners a sense of the size, shape and materials of the room. In an immersive VR/AR scenario where a speaker is presented in a virtual environment that differs from their actual environment, a mismatch between the recorded and purported room geometries causes psychoacoustic dissonance. This paper presents an AI-based algorithm that estimates and removes the effects of the room impulse response from speech recorded in a source environment, then adds reverberation modeled on the target environment. Effectiveness is measured via psychoacoustic assessment.
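
As a rough illustration of the final step described in the abstract (adding reverberation modeled on a target room), the sketch below uses classical signal processing rather than the paper's AI-based method, which this record does not detail. It assumes a shoebox room, Sabine's formula for reverberation time, and an exponentially decaying noise model for the impulse response. The function names, the average absorption coefficient of 0.3 and the 16 kHz sample rate are all hypothetical choices made for illustration.

```python
import numpy as np


def sabine_rt60(length_m, width_m, height_m, absorption=0.3):
    """Sabine estimate of RT60 in seconds for a shoebox room.

    `absorption` is an assumed average absorption coefficient
    applied to every surface (not a value from the paper).
    """
    volume = length_m * width_m * height_m
    surface = 2.0 * (length_m * width_m
                     + length_m * height_m
                     + width_m * height_m)
    return 0.161 * volume / (surface * absorption)


def synthetic_rir(rt60_s, sample_rate=16000, seed=0):
    """Noise-based impulse response whose envelope decays by 60 dB over rt60_s."""
    rng = np.random.default_rng(seed)
    n = int(rt60_s * sample_rate)
    t = np.arange(n) / sample_rate
    # Amplitude envelope: 10^(-3 t / RT60), i.e. -60 dB at t = RT60.
    envelope = np.exp(-3.0 * np.log(10.0) * t / rt60_s)
    rir = rng.standard_normal(n) * envelope
    rir[0] = 1.0  # stand-in for the direct sound path
    return rir


def add_reverb(dry, rir):
    """Convolve a dry (dereverberated) signal with a room impulse response."""
    return np.convolve(dry, rir)


# Example: a hypothetical 5 m x 4 m x 3 m target room.
rt60 = sabine_rt60(5.0, 4.0, 3.0)
rir = synthetic_rir(rt60)
dry = np.zeros(100)
dry[0] = 1.0  # unit impulse standing in for dereverberated speech
wet = add_reverb(dry, rir)
```

In this simplified picture, a larger target room or less absorbent surfaces give a longer RT60 and therefore a longer synthetic tail. The estimation and removal of the source room's reverberation, which the abstract attributes to the AI-based algorithm, is not attempted here.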

Journal/Conference/Book title

2024 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct)

Publication date

2024-10-21

Version

  • Post-print

Rights statement

© 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.