Projects

Current Projects

TÜBİTAK 1001 RESEARCH PROJECT (EEEAG-119E254), METU II, Ankara, Turkey (2019-ongoing)

"Audio Signal Processing for Six Degrees of Freedom (6DOF) Immersive Media"

The ISO/IEC MPEG-I standard expected to be finalised in the second half of 2021 concerns the capture, coding, and reproduction of immersive multimedia signals. More specifically, the standard encompasses navigable 360° videos. This type of content allows a user to navigate an immersive video in six degrees of freedom and has applications in cinema, virtual reality, journalism, broadcast Technologies and (following the widespread adoption of 5G networks) in immersive telepresence applications. There exist two components in the capture, coding and reproduction of immersive content: video and audio. The former necessitates the use of Technologies such as light field capture or video capture using omnidirectional cameras. The first commercial examples of such cameras have recently been demonstrated by Facebook recently. Work on the capture and processing of sound fields have very recently started and although some theoretical results have been obtained for interpolating pressure fields fort his purpose, the applicability of such an approach in an actual 6DOF system is limited. This project will investigate techniques for 6DOF sound field capture and processing. The research will concentrate on the development of algorithms for the interpolation of sound fields using rigid spherical microphone array (RSMA) recordings. The overarching aim of the project will be to synthesise the sound field at a location in the recording volume using a limited number of RSMA recordings of the acoustic scene. The techniques to be developed to that aim will include sound field extrapolation using spherical harmonic decompositions and object-based sparse approximations using a dictionary-based representation of the plane wave decomposition of the sound field. The planned research also includes work on transcoding the sound field for interactive binaural reproduction. Specifically, transcoding between interpolated higher-order Ambisonics and binaural audio will be investigated. The theoretical results of the project will be validated using real recordings to be made during the project. The quality of experience afforded by the developed algorithms will be tested using subjective evaluations. The project is planned to have 7 work packages and to have a duration of 36 months. The project will the first in Turkey addressing Part 4 of the upcoming MPEG-I standard and it will run in parallel with the development of such while contributing to it directly or indirectly. Apart from its ambitious dissemination targets, another important contribution of the project will be the training of experts in this emerging field.


Past Projects

TÜBİTAK 1001 RESEARCH PROJECT (EEEAG-113E513), METU II, Ankara, Turkey (2014-2018)

“Spatial audio reproduction using analysis-based synthesis techniques”

This project aims to develop a spatial audio recording, coding, reproduction chain based on the analysis of sound scenes from recordings made with special microphone arrays. The project will investigate novel sound source localisation and separation algorithms and direct/diffuse separation from recordings; develop sound scene authoring tools and necessary methods for 3D and spatial sound source generation.

METU BAP-1 STARTUP GRANT (BAP-08-11-2013-057), METU II, Ankara, Turkey (2013-2015)

“Sound source localisation using open spherical acoustic intensity probes”

This project aims to design and develop open spherical microphone arrays to accurately measure acoustic intensity and use the acoustic intensity vectors for localising sound sources, even under highly reverberant environments.