"

Hearing in Complex Environments

68 Sound Segregation

Learning Objectives

Be able to describe how to use primitive strategies to separate sounds (bottom-up strategies).

Be able to explain schema-based segregation strategies (top-down strategies).

Know the cocktail party problem.

It is rare that we hear just one sound at a time. Usually, we’re hearing several things—a bird chirping, a car driving by, a conversation on the sidewalk—and we have to separate them from each other in order to make sense of them. This is called sound segregation. Thankfully, we have several tools to help us.

Schema-based strategies are top-down. A schema is a structure in our brain that holds and organizes the information that we obtain throughout our lifetime. Schema-based strategies are essentially using prior knowledge to locate and understand the distance of the sound. Primitive strategies, on the other hand, are bottom-up strategies: reflexive strategies that help us group sounds together based on similarity or location.

First, there is the primitive auditory stream segregation, where the brain groups the sound perceptually to form a consistent representation of the object from the sound it makes. A good example of this is when we hear an orchestra. As they play, we separate sounds with similar features (ex. the blare of trumpets) from non-like sounds (ex. the whisper of flutes). Grouping sounds by timbre like the trumpet/flute example is one primitive strategy. Other primitive strategies are grouping by pitch or grouping by location.

 

Fig.7.7.1. Listening to Music. We use primary auditory stream segregation to identify the melody of a song by its individual notes. Our schema-based analysis will tell us whose voice we hear and the exact instruments being used. (Credit: Jarod Davis. Provided by: University of Minnesota. License: CC-BY 4.0)

The other process we can use to identify the information given from a mixture of sounds is the schema-based analysis. This is a top-down strategy in which the brain matches the sensory signal from the knowledge stored in the memory. An example of this is when you hear a distinct noise at the park and recognize it as a bird chirping because your schema of what a bird sounds like helps you differentiate the sound of the bird from all the rest of the sounds in the park (Bregman, 1990).

Even with the help of primitive strategies like grouping by similarity and schema-based strategies like recognizing familiar voices, we still have to work to focus on a sound of interest in a complex auditory environment to hear it. Imagine talking to someone in a loud, crowded environment and trying to make out what they are saying. We are able to hear the sound of interest (the other person’s voice) by focusing intently on it. The task of picking out a certain sound in a complex auditory environment is known as the “cocktail party problem” (Cherry, 1953).

 

Exercises

  1. The cocktail party problem, where an individual listens to a target sound more while in a noisy environment, works by:
    A. making background noise less interesting to the individual listening
    B. subconsciously increasing attention to a sound
    C. making the individual instinctively leave a noisy environment so that they can listen more clearly
    D.  subconsciously tuning out background noise

Answer: B

 

CC LICENSED CONTENT, SHARED PREVIOUSLY
Cheryl Olman PSY 3031 Detailed Outline
Provided by: University of Minnesota
Download for free at http://vision.psych.umn.edu/users/caolman/courses/PSY3031/
License of original source: CC Attribution 4.0
Adapted by: Jin Yong Lee & Rachel Lam, David Girin
References:

Bregman, A. S. (1990). Auditory scene analysis: The perceptual organization of sound. Cambridge, MA: MIT Press

Cherry, E. C. (1953). Some experiments on the recognition of speech with one and with two ears. Journal of the Acoustical Society of America, 25, 975-979.

License

Icon for the Creative Commons Attribution 4.0 International License

Introduction to Sensation and Perception Copyright © 2022 by Students of PSY 3031 and Edited by Dr. Cheryl Olman is licensed under a Creative Commons Attribution 4.0 International License, except where otherwise noted.