At this year’s Orkney International Science Festival, Clive and Marianne Greated will both be speaking about sound, from different perspectives. Dr Marianne Greated is an artist who has been researching the relationships between sound and vision and how this relates to current art practice. Her father, physicist and musician Professor Clive Greated, has studied the way in which sounds are produced in various musical instruments and the effect of the spaces in which they are played. Here they give an overview of the basic concepts of sound which forms a backdrop to the two presentations.
As human beings we live at the bottom of a sea of air. This sea is relatively shallow as about three-quarters of the air mass lies within a distance of 11 kilometres from the earth’s surface.
The pressure at the bottom of this sea of air, however, is extremely high: about 100,000 pascals (the pascal is the unit of pressure). We don't feel it, though, as our bodies are completely surrounded by air. To put this pressure into perspective, it is equivalent to the increase in pressure a diver in the sea experiences at a depth of about 10 metres.
The ear, however, is sensitive to tiny pressure changes: a sound level of 100 decibels, for instance, corresponds to pressure changes of only about 2 pascals.
Sound waves
The pressure fluctuations that we call sound come in the form of waves of different frequency. The lowest frequency we can hear is about 20 Hertz (20 vibrations per second) and the highest frequency is about 20,000Hz.
Unlike the waves we see on the surface of the deep ocean, sound waves are non-dispersive. In other words, all wavelengths travel at the same speed: about 330 metres per second in air.
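For readers who like numbers, here is a minimal sketch in Python of the relationship between frequency and wavelength, assuming the 330 metres per second figure quoted above:

```python
# A minimal sketch: wavelength from frequency, assuming a sound speed
# of roughly 330 m/s in air (the value quoted above).
def wavelength_m(frequency_hz: float, speed_m_per_s: float = 330.0) -> float:
    """Return the wavelength in metres for a given frequency."""
    return speed_m_per_s / frequency_hz

# The audible range spans wavelengths from about 16.5 m down to under 2 cm.
print(wavelength_m(20))      # ~16.5 m   (lowest audible frequency)
print(wavelength_m(20_000))  # ~0.0165 m (highest audible frequency)
```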
Sounds can be measured with a microphone which, like the ear, responds to pressure fluctuations. There are many different types, all having different directional characteristics, but the most common are the condenser microphone and the dynamic microphone.
Loudness
The loudness of a sound is measured by its sound pressure level in decibels. This can be measured with a sound level meter which has been calibrated against a standard sound.
Some typical sound levels are:
* a rifle shot: 140dB
* a pneumatic drill: 100dB
* street traffic: 80dB
* normal conversation: 60dB
The threshold of hearing is about 0dB.
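As a rough illustration of how the decibel scale relates to pressure, here is a small Python sketch using the standard reference pressure of 20 micropascals (the threshold of hearing); the figures it prints tie in with the levels listed above:

```python
import math

# A rough sketch of the decibel scale: sound pressure level relative to
# the standard reference pressure of 20 micropascals.
P_REF = 20e-6  # pascals

def spl_db(pressure_pa: float) -> float:
    """Sound pressure level in decibels for an RMS pressure in pascals."""
    return 20 * math.log10(pressure_pa / P_REF)

def pressure_pa(level_db: float) -> float:
    """Inverse: RMS pressure in pascals for a given level in decibels."""
    return P_REF * 10 ** (level_db / 20)

print(spl_db(2.0))       # ~100 dB -- the 2-pascal figure mentioned earlier
print(pressure_pa(60))   # ~0.02 Pa, roughly normal conversation
print(pressure_pa(140))  # ~200 Pa, a rifle shot
```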
Musical sounds
Sounds from most musical instruments have a definite pitch. The pressure fluctuations that the instruments produce are in the form of a complex but cyclical wave which can be decomposed into components known as harmonics.
For example, the sound wave from a musical instrument might repeat itself 1,000 times a second corresponding to a pitch of about C two octaves above middle C on the piano.
This complex wave can be decomposed into pure sinusoidal waves of frequency 1,000Hz, 2,000Hz, 3,000Hz and so on, referred to as the fundamental, second harmonic, third harmonic and so on.
In fact any cyclical wave can be decomposed into a set of harmonics which are integer multiples of the fundamental – a process known as Fourier decomposition.
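As an illustrative sketch, the following Python snippet builds a cyclical wave from a 1,000Hz fundamental plus two harmonics (the amplitudes are arbitrary) and then recovers the components with a discrete Fourier transform:

```python
import numpy as np

# A small sketch of Fourier decomposition: build a cyclical wave from a
# 1,000 Hz fundamental plus two harmonics, then recover the components
# with a discrete Fourier transform.
sample_rate = 44_100                 # samples per second
t = np.arange(0, 1.0, 1 / sample_rate)

wave = (1.0 * np.sin(2 * np.pi * 1000 * t)      # fundamental
        + 0.5 * np.sin(2 * np.pi * 2000 * t)    # second harmonic
        + 0.25 * np.sin(2 * np.pi * 3000 * t))  # third harmonic

spectrum = np.abs(np.fft.rfft(wave))
freqs = np.fft.rfftfreq(len(wave), 1 / sample_rate)

# The largest peaks sit at integer multiples of the fundamental.
peaks = freqs[np.argsort(spectrum)[-3:]]
print(sorted(peaks))   # approximately [1000.0, 2000.0, 3000.0]
```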
Sounds from different instruments

The sound from the violin comes from the vibration of a string, which produces a complete set of harmonics. The saxophone, flute and clarinet, on the other hand, are wind instruments which generate their sounds from the resonances of the air inside them.
Both the saxophone and clarinet are reed instruments, but they have different shapes: the saxophone has a conical bore and the clarinet a cylindrical one. Because of this the saxophone produces a complete set of harmonics, whereas the clarinet produces only odd harmonics.
Like the clarinet, the flute has a cylindrical bore, but it has no reed and so is effectively open at both ends, giving rise to a complete set of harmonics. The harmonics produced have implications for the fingering of the instruments: when a flute is over-blown the pitch rises an octave, so the fingerings in the first octave are essentially the same as those in the second. The same applies to the saxophone but not to the clarinet.
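The difference between the bores can be illustrated with the idealised textbook formulas for tube resonances; the 0.6-metre length below is purely illustrative, and end corrections and the real bore shapes are ignored:

```python
# An idealised sketch: resonant frequencies of a cylindrical tube open at
# both ends (flute-like) versus one closed at one end (clarinet-like).
SPEED_OF_SOUND = 330.0  # m/s, as above

def open_tube_resonances(length_m: float, n: int = 4) -> list[float]:
    """All harmonics: f_k = k * c / (2 L)."""
    return [k * SPEED_OF_SOUND / (2 * length_m) for k in range(1, n + 1)]

def closed_tube_resonances(length_m: float, n: int = 4) -> list[float]:
    """Odd harmonics only: f_k = (2k - 1) * c / (4 L)."""
    return [(2 * k - 1) * SPEED_OF_SOUND / (4 * length_m) for k in range(1, n + 1)]

# A 0.6 m tube, purely for illustration:
print(open_tube_resonances(0.6))    # [275.0, 550.0, 825.0, 1100.0] -- full series
print(closed_tube_resonances(0.6))  # [137.5, 412.5, 687.5, 962.5]  -- odd multiples only
```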
Instruments like the piano produce a set of resonant frequencies that are very close to true harmonics; in fact just a little sharper. In this case the degree of inharmonicity contributes to the characteristic quality of the sound.
The voice
When we speak or sing, the sound is produced by the vibration of the vocal folds. Air from the lungs passes between the vocal folds and through the vocal tract, which has a filtering effect on the sound.
Rather like the clarinet, the vocal tract behaves as a cylindrical tube closed at one end. Sounds which lie close to a resonance of this tube are accentuated whilst those with frequencies far from a resonance are suppressed. This leads to concentrations of acoustic energy around certain frequencies, known as formants.
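As a very rough illustration, treating the neutral vocal tract as a uniform tube of about 17 centimetres (a typical adult male figure, assumed here purely for the example) closed at one end gives resonances close to the classic formant values of roughly 500, 1,500 and 2,500Hz:

```python
# A very rough sketch: the neutral vocal tract treated as a uniform tube
# closed at the vocal folds and open at the lips. The 0.17 m length is an
# assumed, typical adult male figure.
SPEED_OF_SOUND = 350.0  # m/s, warm moist air in the vocal tract

def vocal_tract_resonances(length_m: float = 0.17, n: int = 3) -> list[float]:
    """Quarter-wave resonances: f_k = (2k - 1) * c / (4 L)."""
    return [(2 * k - 1) * SPEED_OF_SOUND / (4 * length_m) for k in range(1, n + 1)]

print([round(f) for f in vocal_tract_resonances()])  # roughly [515, 1544, 2574]
```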
The formants can be altered by changing the mouth opening and the position of the tongue. Formant frequencies are highest for young children and lowest for mature males.
Acoustic spaces
When you listen to someone speaking to you in a room you are hearing two things: (a) the direct sound from the mouth, and (b) the sound reflected from the ceiling, walls, floor and objects in the room. This reflected sound is known as the reverberant sound.
If you are close to the speaker, you are hearing more direct sound and this will be very clear. If you are standing far away, for example near the back of a concert hall, you will be hearing mostly the multiple reflections which all arrive at different times so the sound is less clear and more diffuse.
At a certain distance from the sound source the direct and reverberant sound levels will be equal; this is known as the room radius and is typically about 2 or 3 metres.
When recording an orchestra or live band, it is usual to place some microphones closer than the room radius to record the direct sound. Other microphones are placed at a distance greater than the room radius to record the reverberant sound. These are then mixed to obtain the desired ambience.
Reverberation
Arguably the most important quantitative characteristic of a space is the reverberation time. The concept of reverberation time was introduced by Wallace Sabine. He defined it as the time for the sound level of an impulse in a room to fall by 60dB – that is, by a factor of one part in a million. It is dependent on the volume of the room and the total sound absorption.
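Sabine's relationship can be written as RT60 ≈ 0.161 V/A, with the volume V in cubic metres and the total absorption A in metric sabins (square metres of perfect absorber). The sketch below uses an illustrative hall volume and absorption chosen purely for the example, and also recovers a room radius of a couple of metres, in line with the figure quoted earlier:

```python
import math

# A hedged sketch of Sabine's formula, RT60 = 0.161 * V / A, plus the usual
# textbook estimate of the room radius (critical distance) for an
# omnidirectional source. Both are approximations only.
def reverberation_time_s(volume_m3: float, absorption_m2: float) -> float:
    """Sabine reverberation time: time for the level to fall by 60 dB."""
    return 0.161 * volume_m3 / absorption_m2

def room_radius_m(volume_m3: float, rt60_s: float) -> float:
    """Distance at which direct and reverberant levels are roughly equal."""
    return 0.057 * math.sqrt(volume_m3 / rt60_s)

# An illustrative medium-sized hall: 2,000 cubic metres with 320 sabins of absorption.
rt = reverberation_time_s(2000, 320)
print(round(rt, 2))                       # ~1.0 s
print(round(room_radius_m(2000, rt), 1))  # ~2.5 m, in line with the figure above
```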
Soft surfaces such as carpets and ceiling tiles generally have a high absorption, whereas hard surfaces such as stone walls have low absorption. However, the amount of absorption depends on the frequency of the sound. Concert halls with good acoustic characteristics, such as the Waterfront in Belfast, have seats designed so that the total absorption does not change significantly whether the hall is full or empty.
Air absorbs sound. It absorbs high frequencies more quickly than low frequencies, hence a thunderclap sounds like a low rumble at a long distance from the source.
Absorbing or echoing
Laboratory testing in acoustics is normally done in either an anechoic chamber or a reverberation chamber.
In an anechoic chamber the walls and ceiling are covered with sound-absorbing wedges to suppress reflections, virtually eliminating the reverberant sound field. In this case the direct sound from the source is dominant.
In a reverberation chamber, on the other hand, the walls are hard plastered to reflect as much sound as possible so the reverberant sound field dominates over the direct sound.
Noise
Noise is generally un-pitched or has a wide range of frequency components making the pitch ill-defined.
There are analogies between the visual spectrum of colours and the sound spectrum. White noise has an essentially flat spectrum of frequency components corresponding to the concept of white light.
Sonically, you can obtain different colours of sound by passing white noise through filters with different frequency characteristics. Isolating specific bands of frequencies from the initial signal in this way creates the different colours (the result is sometimes referred to as band-pass filtered white noise).
The names used for coloured noise give an indication of the optical frequencies. For example, red noise has more energy at the low frequencies, corresponding to the long wavelengths of red light. Purple (sometimes called violet) noise, on the other hand, has a predominance of high frequencies corresponding to the short wavelengths in the colour spectrum.
The analogy even follows through to colours in the mid frequency range. For example, yellow noise is band-pass limited around 3,000Hz.
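A rough sketch of the idea, using an off-the-shelf Butterworth band-pass filter; the 2,500-3,500Hz band is chosen simply to illustrate the mid-range example above:

```python
import numpy as np
from scipy.signal import butter, sosfilt

# Start with white noise (a flat spectrum) and band-pass filter it to make
# a "coloured" noise concentrated around 3,000 Hz.
fs = 44_100                                  # sample rate in Hz
rng = np.random.default_rng(0)
white = rng.standard_normal(fs)              # one second of white noise

# 4th-order Butterworth band-pass around 3,000 Hz.
sos = butter(4, [2500, 3500], btype="bandpass", fs=fs, output="sos")
coloured = sosfilt(sos, white)

# The filtered signal now concentrates its energy around 3,000 Hz.
spectrum = np.abs(np.fft.rfft(coloured))
freqs = np.fft.rfftfreq(len(coloured), 1 / fs)
print(round(freqs[spectrum.argmax()]))       # somewhere near 3,000
```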
Environmental noise is normally considered to be unwanted sound and is often un-pitched. It may also contain hidden pitched components: for example, the noise generated by a wind turbine can have a well-defined pitch superimposed on a broad-band noise background.
Spatial Hearing
The horizontal placing of the ears on humans maximises the ability to localise the position of sounds in the horizontal plane, rather than from above or below. This matches the terrain-based environment in which humans live.
Some animals are different: the ears on a barn owl, for example, are of different sizes and placed at different heights, enhancing its ability to localise sounds in the vertical plane.
Location, location, location
Lord Rayleigh (1842-1919) was the first person to give an explanation of how we localise the directions from which sounds are coming. The most important cues that allow us to localise an auditory event in the horizontal plane come from the intensity and time differences experienced by the two ears. These cues are described as the interaural time difference (ITD) and the interaural level difference (ILD), which is also sometimes called the interaural intensity difference (IID).
Rayleigh found that for low frequencies (less than 1,500Hz) the brain primarily uses interaural time differences to localise the sound. This is because at low frequencies the head diffracts (bends) the sound waves, so that even if the sound is coming from the right or left, both ears hear approximately the same sound intensity.
At higher frequencies the brain primarily uses the intensity differences between the two ears to localise the direction.
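A commonly used approximation for the time difference is Woodworth's spherical-head formula, ITD ≈ (r/c)(θ + sin θ), where θ is the source direction measured from straight ahead; the head radius used below is a typical assumed value, not a measurement:

```python
import math

# A hedged sketch of Woodworth's spherical-head approximation for the
# interaural time difference. The 0.0875 m head radius is a commonly
# assumed figure.
HEAD_RADIUS_M = 0.0875
SPEED_OF_SOUND = 343.0  # m/s

def itd_seconds(azimuth_deg: float) -> float:
    """Interaural time difference for a source at the given azimuth."""
    theta = math.radians(azimuth_deg)
    return (HEAD_RADIUS_M / SPEED_OF_SOUND) * (theta + math.sin(theta))

print(round(itd_seconds(90) * 1e6))  # ~656 microseconds for a source directly to one side
print(round(itd_seconds(0) * 1e6))   # 0 for a source straight ahead
```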
Recording the sound as we hear it
With the advent of iPhones and other portable devices used with earphones, binaural sound recording techniques are increasingly being used. Miniature microphones are placed in the ears so that when the recorded sounds are replayed through headphones the listener gets a very realistic reproduction of the actual sound.
For recording in concert halls, a dummy head is often used rather than having the microphones inserted into the ears of an actual person.
Underwater sound
In water the speed of sound is much higher than in air, about 1,500 metres per second, and so for a given frequency the wavelengths are much longer.
Sounds in the ocean are generated by a wide range of natural sources such as rain falling onto the surface, breaking waves, sea ice and sounds from marine life.
In recent years there has also been an increasing amount of sound introduced by shipping and general industrial activity, such as the construction of offshore installations.
Many of the sounds in the ocean are created by bubbles. For example, rain on the surface and breaking waves entrain air in the form of bubbles, and rotating propellers or other underwater machinery often generate them too.
Bubbles of different sizes have their own characteristic resonant frequencies, with small bubbles being associated with high-pitched sounds and large ones with low-frequency sounds. This can be demonstrated in a tank of water.
Because water pressure decreases towards the surface, bubbles grow in size as they rise. Sounds are initiated when the bubbles become detached from the small tube supplying the air. As the size of a bubble increases, the sound resonating in it falls in pitch, and the sounds gradually die away as the bubbles rise.
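The relationship between bubble size and pitch is captured by Minnaert's resonance formula, f = (1/2πR)√(3γP/ρ); the sketch below assumes an air bubble in water near the surface:

```python
import math

# A hedged sketch of Minnaert's resonance formula for a gas bubble in a
# liquid. The values below assume an air bubble in water near the surface.
GAMMA = 1.4          # ratio of specific heats for air
PRESSURE = 101_325   # ambient pressure in pascals (roughly 1 atmosphere)
DENSITY = 1000.0     # density of water in kg/m^3

def bubble_resonance_hz(radius_m: float) -> float:
    """Resonant frequency of a bubble of the given radius."""
    return math.sqrt(3 * GAMMA * PRESSURE / DENSITY) / (2 * math.pi * radius_m)

# Smaller bubbles ring at higher pitches, larger ones at lower pitches.
print(round(bubble_resonance_hz(0.001)))  # ~3,300 Hz for a 1 mm radius bubble
print(round(bubble_resonance_hz(0.01)))   # ~330 Hz for a 1 cm radius bubble
```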