Event Details

Spatial Sound Rendering Using Measured Room Impulse Responses

Presenter: Yan Li
Supervisor: Dr. Peter Driessen and Dr. George Tzanetakis

Date: Fri, August 6, 2010
Time: 10:00:00 - 11:00:00
Place: EOW 430

ABSTRACT

ABSTRACT:

Spatial sound rendering aims at artificially reproducing the acoustics of a space. It has many applications such as music production, movies, electronic gaming and teleconferencing. Conventionally, spatial sound rendering is implemented by digital signal processing algorithms derived from perceptual models or simplified physical models. While being flexible and/or efficient, these models are not able to capture the acoustical impression of a space faithfully. On the other side, convolving the sound sources with properly measured impulse responses produces highest possible fidelity, but it is not practically useful for many applications because the sources or the listeners can not be relocated.

In this thesis, techniques for measuring multichannel room impulse responses (MMRIR) are reviewed. Then, methods for analyzing measured MMRIR and rendering virtual acoustical environment based on such analysis are presented and evaluated. The analysis can be performed off-line. During this stage, a set of filters that represents the characteristics of the air and walls inside the acoustic space are obtained. Then, appropriate segments that can be used as reverb tails are extracted from the measured MMRIR. The rendering system, often performed on-line, first constructs an early reflection model based on the positions of the listener-source pair and the filters derived, then combines with the late reverb segments to form a complete listener-source-room acoustical model that can be used to synthesize high quality multi-channel audio for arbitrary listener-source positions. Another merit of the proposed framework is that it is scalable.At the expense of slightly degraded rendering quality, the computational complexity can be greatly reduced. This makes this framework suitable for a wide range of applications that have different quality and complexity requirements.

The proposed framework has been evaluated by formal listening tests. These tests have proven the effectiveness in preserving the spatial quality while positioning the listener-source pair accurately.