Sound Source Localization (SSL)

Sound Source Localization (SSL) is an important component to microphone array signal processing for speech applications. SSL serves as the front-end to acoustic beamforming by steering the beam to the desired sound source. The location estimation can also be used as a visual cue for smart and augmented reality devices.

The methods used for Sound Source Localization of speech signals differ from classical direction of arrival (DOA) solutions. Classical DOA approaches (Capon Minimum Variance, MUSIC) used in radar and sonar applications have the advantage of narrowband and statistically stationary far-field source signals with minimal multi-path reflections. Speech is a wideband non-stationary sound source, often located in reverberant environments.

To help combat the challenges of direction finding for speech applications, Sound Source Localization solutions consist of multiple stages. The first stage is to precondition the signal to focus on the frequencies of interest that have the best signal-to-noise ratio. The next step is to perform the time delay estimation and sound localization. The final stage is to cluster and smooth the multitude of estimates to make a final decision.

The engineering tradeoffs for SSL are quite straightforward. Adding more microphones into the system design adds redundancy and helps to improve accuracy. As one would expect, this will increase the computational complexity of the solution. VOCAL is an engineering design house, which can help guide you with the product design utilizing this technology.

Sound Source Localization Applications

Augmented Reality Systems
Beamsteering for Beamforming algorithms
Video Conferencing Camera Steering
Noise Localization
Robot Steering

Platforms

VOCAL’s optimized sound source localization software is available for the following platforms. Please contact us for specific supported platforms and performance information.

Processors	Operating Systems
Texas Instruments – C6xx (TMS320C62x, TMS320C64x, TMS320C645x, TMS320C66x, TMS320C67x), DaVinci, OMAP, C5xx (TMS320C54x, TMS320C55x) Analog Devices – Blackfin, ADSP-21xx, TigerSHARC, SHARC PowerPC, PowerQUICC MIPS – MIPS32, MIPS64, MIPS4Kc ARM – ARM7, ARM9, ARM9E, ARM10E, ARM11, StrongARM, ARM Cortex-A8/A9/A15, Cortex-M3/M4 Intel / AMD – x86, x64 (both 32 and 64 bit modes)	Linux, uClinux, BSD, Unix Microsoft Windows ACM / RTC / CE / Mobile Apple iOS / iPhone / iPad & MacOS eCOS / eCOSPro Google Android Green Hills Integrity Micrium μCOS Symbian Wind River VxWorks VOCAL LANsEN

Complete Communications Engineering

Sound Source Localization

Sound Source Localization Applications

Platforms

More Information on Sound Source Localization

Complete Communications Engineering

Sound Source Localization

Sound Source Localization Applications

Platforms

<img loading="lazy" decoding="async" class="alignnone wp-image-9971" src="https://vocal.com/wp-content/uploads/2012/05/img-supported.png" alt="supported-platforms" width="447" height="45" />

More Information on Sound Source Localization