SIZE DISCRIMINATION OF TRANSIENT SOUNDS:
PERCEPTION AND MODELLING

Humans are able to get an impression of the size of an object by hearing it resonate. While this ability is well described for periodic speech sounds we investigate here the ability to discriminate the size of non-periodic transient impact sounds. Three experiments were performed on normal listeners (n=19) to investigate the importance of the spectral cue in different frequency regions. Recordings from pulse resonance sounds made by a metal ball hitting polystyrene spheres of 5 different sizes were used in the experiments. Recordings were manipulated in order to show that the same cues used in speaker size discrimination are used for transient signals. Results show that the most prominent resonances are the most important cue, but frequencies above 8 kHz also contribute. The results are explained by physiologically inspired model of size discrimination that is based on the Auditory Image Model, and its key part is the Mellin transform. The model can predict which of two objects is bigger. We conclude that similar cues that are used for speaker size discrimination are important for transient sounds.

REFERENCES (12)

Patterson RD, Smith DRR, Dinther R, Walters TC et al. Size Information in the Production and Perception of Communication Sounds. In: Yost WA, Popper NA, Fay PR (eds.). Auditory Perception of Sound Sources. New York, Springer Science+Business Media, 2008l; 43–75.

Google Scholar

Irino T, Patterson RD, Kawahara H. Speech segregation using an auditory vocoder with event-synchronous enhancements. In: IEEE Transactions on Audio, Speech and Language Processing. [Online]. November 2002 Wakayama Univ, Fac Syst Engn, Wakayama 6408510, Japan Univ Cambridge, Dept Physiol Dev & Neuroci, Ctr Neural Basis Hearing, Cambridge CB2 3EG, England. 2002; 2212–21.

Google Scholar

Ives DT, Smith DRR, Patterson RD. Discrimination of speaker size from syllable phrases. Journal of the Acoustical Society of America. [Online], 2005; 118(6): 3816–22.

Google Scholar

Patterson RD, Robinson KEN, Holdsworth J, McKeown D, et al. Complex sounds and auditory images. In: K. Horner Y Cazals, L. Demany (eds.). Auditory physiology and perception, Proc. 9th International Symposium on Hearing. 1992; Oxford, Pergamon.

Google Scholar

Bleeck S, Ives T, Patterson RD, Ives DT. Aim-mat: The Auditory Image Model in MATLAB. Acta Acustica, 2004; 90(4): 781–87.

Google Scholar

Irino T, Patterson RD. Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform. Speech Communication [Online], 2002; 36(3–4): 181–203.

Google Scholar

Irino T. Noise suppression using a time-varying, analysis/synthesis gamma chirp filterbank. 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258). [Online], 1999; 1: 97–100.

Google Scholar

Carello C, Anderson KL, Kunkler-Peck AJ. Perception of Object Length by Sound. Psychological Science [Online], 1998; 9(3): 211–14.

Google Scholar

Houben MMJ, Kohlrausch A, Hermes DJ. Perception of the size and speed of rolling balls by sound. Speech Communication [Online], 2004; 43(4): 331–45.

Google Scholar

10.

Grassi M. Recognising the size of objects from sounds with manipulated acoustical parameters. Fechner Day 2002: Proceedings of the International … [Online], 2002.

Google Scholar

11.

Irino T, Patterson RD. A Dynamic Compressive Gammachirp Auditory Filterbank. IEEE transactions on audio, speech, and language processing. [Online], 2006; 14(6): 2222–32.

Google Scholar

12.

Patterson R, Holdsworth J. A functional model of neural activity patterns and auditory images. Advances in speech, hearing and … [Online], 1996; 3547–58.

Google Scholar

Before submitting

Submit your paper

Send by email

A PSYCHOPHYSICAL MODEL OF ACCEPTABLE NOISE LEVEL FROM A HUMAN SOUND PROCESSING PERSPECTIVE

MATURATION OF TEMPORAL PROCESSING IN CHILDREN: MEASUREMENTS USING SPEECH AND NON-SPEECH STIMULI

Indexes

Keywords index

Topics index

Authors index