Complete Communications Engineering

Sound Source Localization (SSL) is an important component of microphone array signal processing for speech applications. SSL serves as the front end to acoustic beamforming by supplying the direction in which to steer the beam toward the desired sound source. The location estimate can also be used as a visual cue in smart and augmented reality devices.


The methods used for Sound Source Localization of speech signals differ from classical direction of arrival (DOA) solutions. Classical DOA approaches (Capon Minimum Variance, MUSIC) used in radar and sonar applications benefit from narrowband, statistically stationary, far-field source signals with minimal multipath reflections. Speech, by contrast, is a wideband, non-stationary sound source that is often located in a reverberant environment.

To help combat the challenges of direction finding for speech applications, Sound Source Localization solutions consist of multiple stages. The first stage preconditions the signal to focus on the frequencies of interest that have the best signal-to-noise ratio. The second stage performs time delay estimation and sound localization. The final stage clusters and smooths the many individual estimates to reach a final decision.
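The three stages above can be sketched for a single microphone pair as follows. This is a minimal illustration under stated assumptions, not VOCAL's implementation: it uses GCC-PHAT as the time delay estimator, a fixed 300 to 3400 Hz band as the preconditioning step, a far-field model to convert delay to angle, and a plain median as a stand-in for clustering and smoothing. The function names, the band limits, and the speed-of-sound constant are all hypothetical choices made for this example.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def gcc_phat_delay(x, y, fs, band=(300.0, 3400.0), max_delay=None):
    """Estimate the delay of y relative to x, in seconds, using GCC-PHAT."""
    n = len(x) + len(y)  # zero-pad so the circular correlation does not wrap
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = Y * np.conj(X)
    # PHAT weighting: discard magnitude, keep only phase
    cross /= np.abs(cross) + 1e-12
    # Preconditioning (assumed band): zero the bins outside the band of interest
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    cross[(freqs < band[0]) | (freqs > band[1])] = 0.0
    cc = np.fft.irfft(cross, n=n)
    # Limit the search to physically possible lags when max_delay is given
    max_shift = n // 2
    if max_delay is not None:
        max_shift = max(1, min(int(round(max_delay * fs)), n // 2))
    # Reorder so that lags run from -max_shift to +max_shift
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(cc)) - max_shift
    return lag / fs


def delay_to_angle(tau, mic_spacing):
    """Far-field angle in degrees from broadside for a two-microphone pair."""
    s = np.clip(SPEED_OF_SOUND * tau / mic_spacing, -1.0, 1.0)
    return float(np.degrees(np.arcsin(s)))


def smooth_estimates(angles):
    """Stand-in for the clustering stage: median of per-frame estimates."""
    return float(np.median(angles))
```

In use, `gcc_phat_delay` would be called once per short frame of audio, each delay converted with `delay_to_angle`, and the per-frame angles passed to `smooth_estimates`. The median is chosen here only because it rejects isolated outliers caused by reverberation; a real system would likely track multiple clusters over time.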

The main engineering tradeoff for SSL is accuracy versus computational cost. Adding more microphones to the system design adds redundancy and helps to improve accuracy, but it also increases the computational complexity of the solution. VOCAL is an engineering design house and can help guide your product design using this technology.
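To give a rough sense of how cost grows with microphone count, the sketch below counts the distinct microphone pairs in an array. It assumes a pairwise approach in which a time delay is estimated for every pair, which is one common design but is not stated here as the method VOCAL uses; `microphone_pairs` is a hypothetical helper name.

```python
from itertools import combinations


def microphone_pairs(num_mics):
    """Return every distinct pair of microphone indices."""
    return list(combinations(range(num_mics), 2))


for m in (2, 4, 8):
    # The pair count grows as m * (m - 1) / 2
    print(m, "microphones:", len(microphone_pairs(m)), "pairs")
```

Under that assumption, going from 2 to 8 microphones raises the number of delay estimates per frame from 1 to 28, which is why the added redundancy comes at a noticeable processing cost.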

Sound Source Localization Applications

Platforms


VOCAL’s optimized sound source localization software is available for the following platforms. Please contact us for specific supported platforms and performance information.

Processors
  • Texas Instruments – C6xx (TMS320C62x, TMS320C64x, TMS320C645x, TMS320C66x, TMS320C67x), DaVinci, OMAP, C5xx (TMS320C54x, TMS320C55x)
  • Analog Devices – Blackfin, ADSP-21xx, TigerSHARC, SHARC
  • PowerPC, PowerQUICC
  • MIPS – MIPS32, MIPS64, MIPS4Kc
  • ARM – ARM7, ARM9, ARM9E, ARM10E, ARM11, StrongARM, ARM Cortex-A8/A9/A15, Cortex-M3/M4
  • Intel / AMD – x86, x64 (both 32 and 64 bit modes)

Operating Systems
  • Linux, uClinux, BSD, Unix
  • Microsoft Windows ACM / RTC / CE / Mobile
  • Apple iOS / iPhone / iPad & MacOS
  • eCOS / eCOSPro
  • Google Android
  • Green Hills Integrity
  • Micrium μCOS
  • Symbian
  • Wind River VxWorks
  • VOCAL LANsEN

More Information on Sound Source Localization