What Is Ambisonics
What Is Ambisonics
What Is Ambisonics
Introduction
Ambisonics is a type of spatial audio which was invented in the 1970s by British engineer Michael
gerzon, it wasn’t commercially success back then due to lack of technology. It has always attracted
researchers in the spatial audio field.
It is mostly recorded using special microphones placed at the centre of sound field giving the listener
a 360 immersive experience of the field. It is being widely used in the VR industry.
What is Ambisonics?
The basic approach of Ambisonics is to consider an audio scene as a 360-degree sound field with
signals from different directions around a centre point, covering both the vertical and horizontal axis
(Gerzon, 1973). The centre point is where the microphone is placed during recording or where the
listener's 'sweet spot' is during playback.(waves)
It is a channel-based audio format. In Ambisonics, each channel has information about specific
physical properties of the acoustic field, such as the pressure or the acoustic velocity compared to
traditional multichannel audio (e.g. stereo, 5.1 and 7.1 surround) where each channel contains the
signal of a specific speaker. (Arteaga, 15).
Orders
0. Ambisonics at zeroth order has information about the pressure field at the origin or the
centre point as mentioned before (recording of an omnidirectional microphone at the origin).
The channel for the pressure field is called W.
1. Ambisonics at first order adds information about the acoustic velocity and pressure at the
origin by using recording of three figure-of-eight microphones along each one of the axis).
These channels are called X, Y, Z. X –
channel containing information from front
and rear. Y- Channel containing from the left
and right and the Z – channel containing
from top and bottom of sound field.
(Arteaga, 15).
Formats
A-Format at first order contains raw recording of 4 channels. These recordings don’t have a specific
space in the sound field. These can be individual mono recordings or synthesized sources.
B-Format contains the components of ambisonics channels after placing them in the acoustic field.
(W, X, Y, Z) (Arteaga, 2015)
AmbiX Vs FuMa
There are two conventions within the Ambisonics B- format standard: AmbiX and FuMa. They are
quite similar but not interchangeable: they differ in the order in which the four channels are
arranged, AmbiX being WYZX while FuMa is WXYZ.
FuMa files have an extension of .amb and Ambix has .caf
The main disadvantage of FuMa is the limitation of the file size to 4 GB due to the Microsoft WAV
format as data container. (Nachbar et al, 2011)
Encoding and recording ambisonics
The first step in the Ambisonics chain is to generate a sound field by placing the components in the
acoustic field. These components can synthesized or be a mono recordings or recorded using a
special microphone. (Which we will discuss later.) (Arteaga, 15)
Encoding
A simple Ambisonics panner (or encoder) takes a source signal S and two parameters, the horizontal
angle θ and the elevation angle Ф . It uses them to position the source at the desired angle by
distributing the signal over the Ambisonics components with different gains:
W = S . 1/√2
X = S . cos θ sin Ф
Y = S . sin θ cos Ф
Recording Ambisonics
In practice it is not possible to place all microphones at the same point and hence four cardioid or
sub cardioid capsules are placed in the vertices of a tetrahedron. The 4 microphones give A format
recording.(Arteaga, 2015)
1) W = FLU+FRD+BLD+BRU
2) X = FLU+FRD-BLD-BRU
3) Y = FLU-FRD+BLD-BRU
4) Z = FLU-FRD-BLD+BRU (A-format to B-format conversion)
Most of the microphones deliver an A-Format recording and need to be converted to B-Format using
external software which is sometimes provided with the microphone. (Arteaga, 2015)Some of them
discussed above.
The SoundField ST450 is sold together with a preamplifier, which includes an encoder. This device
enables to encode from the recorded A-format directly the B-format or a stereo format.( Kurz, E.,
Pfahler, F., Frank, M., 2015)
Decoding ambisonics
Ambisonics can be played on almost any loudspeaker array and reproduce the spherical sound field
at the listening position. To do this, however, you must decode the four B-format channels for the
specific speaker array. All four B-format channels are summed to loudspeaker array. Each of the four
channels is summed with different gain and phase, depending on the direction of the speakers.
Some of the sources in the mix are summed in phase, while others are summed out of phase at each
speaker. The result is that sources placed in the direction of the speaker are louder, while the
sources in the opposite direction are
softer. (Ambisonics Explained: A Guide
for Sound Engineers) An Ambisonic
decoder generates loudspeaker signals through linear combination of the individual signal
components. ( Kronlachner, M.,2014)
This can be done using plugins like the IEM AllRADecoder which allows you to manually enter a
speaker layout or import loudspeaker coordinates and channel indices, you can have your
ambisonics file decoded for the specific layout. Sometimes an imaginary speaker might need to be
added below whose signal is omitted to make the decoder mathematically functional. (Zotter, F.,
Frank, M., 2019. Ambisonics )
The AllRADecoder is based on All-Round Ambisonic Panning (AllRAP) it is an algorithm for arbitrary
loudspeaker arrangements, aiming at the creation of sources of stable loudness and adjustable
width it uses the combination of a virtual optimal loudspeaker arrangement with Vector-Base
Amplitude Panning. (Zotter, F., Frank, M., 2012)
Binaural Decoding is decoding of ambisonics to headphones this can be done with plugins.
additional effects like Eq, Reverb, Width and more can be added to this signal
Reaper (reaper.fm) is highly recommended, as it enables higher order ambisonics by allowing tracks
with up to 64 channels. It is also relatively inexpensive and there is a fully functional free trial version
available. You can also use any other DAW that supports VST and sufficiently many multi-track
channels.(Zotter, F., Frank, M., 2019.)
Plugins
Free plugin bundles from Kronlachner’s ambix plugin suite and the IEM plugin suite have all
the basic plugins needed to get started.(Zotter, F., Frank, M., 2019.) These plugin bundles
also include effect plugins like reverb, delay, compressor and a few other which can help
improve the project these need to be added into the chain before the decoding stage.
( Rudrich, D., 2018.)
aXBundle from SSA plugins offers bundles for the entire ambisonics chain.(SSA plugins)
Conclusion
We have seen what is ambisonics at first order and how an ambisonics chain works. The encoding of
a chain by using a recorded source from an ambisonics microphone or mono recordings and
synthesized source have been seen using plugins. The principle behind the working of an ambisonics
microphone and the conversion of it to different format. The decoding of it to play it back on
loudspeakers and headphones. Some suitable DAWs and plugins have been suggested to get one
started. This essay only introduces one to ambisonics in first order which is a small part of
ambisonics. There was no discussion about Higher orders of Ambisonics which can be investigated.
Further research can be done to understand the scientific and mathematic principles on the topic.
Book
Zotter, F., Frank, M., 2019. Ambisonics : A Practical 3D Audio Theory for Recording, Studio
Production, Sound Reinforcement, and Virtual Reality. Springer Nature.
Papers
Nachbar, C., Zotter, F., Deleflie, E., Sontacchi, A., n.d. AMBIX - A SUGGESTED AMBISONICS FORMAT
2011.
Michael A. Gerzon, Periphony: With-Height Sound Reproduction. Journal of the Audio Engineering
Society, 1973, 21
Kurz, E., Pfahler, F., Frank, M., 2015. Comparison of first-order Ambisonic microphone arrays.
Best Ambisonic Microphones (First-Order) [WWW Document], n.d. . Acoustic Nature. URL
https://acousticnature.com/journal/best-ambisonic-microphones-first-order (accessed 12.29.22).
Braun, S., Frank, M., n.d. Localization of 3D Ambisonic Recordings and Ambisonic Virtual Sources.
(2011)
Zotter, F., Frank, M., 2012. All-Round Ambisonic Panning and Decoding. J. Audio Eng. Soc. 60.
Kronlachner, M.,2014. n.d. Plug-in Suite for Mastering the Production and Playback in Surround
Sound and Ambisonics.
Frank, M., Zotter, F., Sontacchi, A., 2015. Producing 3D Audio in Ambisonics, in: Audio Engineering
Society Conference: 57th International Conference: The Future of Audio Entertainment Technology –
Cinema, Television and the Internet.