Fast Computation of Excitation Pattern, Auditory Pattern and Loudness

Andreas Spanias (Inventor)

Research output: Patent

Abstract

Modeling the way human ears perceive sound has resulted in microphones and software that can better select a persons voice while filtering out background noise. The inner ear filters sound waves based on their shape and frequency, containing groups of auditory nerve cells that respond to specific intervals of frequencies known as critical bands. Multiple overlapping critical bands combine via auditory masking to form an excitation pattern that the brain perceives as loudness. Frequency-weighting techniques, such as those commonly used in broadcasting, do not consider masking effects and perform poorly amidst loudness. Additionally, they cannot be used to calculate excitation patterns needed for digital compression or volume control in hearing aids. Recently, more elaborate models have been developed that imitate the ear. However, they rely on large banks of critical band-pass filters and are computationally expensive, making them unsuitable for real-time applications. Researchers at ASU have developed a method for accurately and efficiently estimating excitation patterns and loudness with real-time functionality. This method reduces the filter bank size by iteratively selecting only the perceptually relevant components of sound waves, such that the total neural activity is preserved and the general shape of the excitation pattern is initially captured. Critical bands are then chosen based on their relevancy to the general shape, resulting in greater accuracy of estimated excitation patterns and loudness measures with minimal computational overhead. This method can be applied to loudness measurement systems, perceptual loudness based volume control, loudness equalization systems, and digital audio encoding schemes. Potential Applications Audio Coding Cloud Voice Hearing Aids Microphones Speech Recognition Voice Over IP Benefits and Advantages Accurate Iterative reevaluation of critical bands results in greater accuracy of excitation patterns and loudness measurements. Efficient Calculates final loudness estimate and intermediate quantities with minimal computational overhead. Versatile Can be used for computationally demanding encoding schemes as well as rapid real-time streaming. Saves Power Ideal for light-weight software processes or portable audio devices. Download Original PDF For more information about the inventor(s) and their research, please see Dr. Andreas Spanias's directory webpage
Original languageEnglish (US)
StatePublished - Mar 8 2014

Fingerprint

Hearing aids
Acoustic waves
Microphones
Speech coding
Filter banks
Bandpass filters
Broadcasting
Speech recognition
Acoustic noise
Neurons
Brain

Cite this

@misc{dc3214d01f3a4da888f7c54761a0901c,
title = "Fast Computation of Excitation Pattern, Auditory Pattern and Loudness",
abstract = "Modeling the way human ears perceive sound has resulted in microphones and software that can better select a persons voice while filtering out background noise. The inner ear filters sound waves based on their shape and frequency, containing groups of auditory nerve cells that respond to specific intervals of frequencies known as critical bands. Multiple overlapping critical bands combine via auditory masking to form an excitation pattern that the brain perceives as loudness. Frequency-weighting techniques, such as those commonly used in broadcasting, do not consider masking effects and perform poorly amidst loudness. Additionally, they cannot be used to calculate excitation patterns needed for digital compression or volume control in hearing aids. Recently, more elaborate models have been developed that imitate the ear. However, they rely on large banks of critical band-pass filters and are computationally expensive, making them unsuitable for real-time applications. Researchers at ASU have developed a method for accurately and efficiently estimating excitation patterns and loudness with real-time functionality. This method reduces the filter bank size by iteratively selecting only the perceptually relevant components of sound waves, such that the total neural activity is preserved and the general shape of the excitation pattern is initially captured. Critical bands are then chosen based on their relevancy to the general shape, resulting in greater accuracy of estimated excitation patterns and loudness measures with minimal computational overhead. This method can be applied to loudness measurement systems, perceptual loudness based volume control, loudness equalization systems, and digital audio encoding schemes. Potential Applications Audio Coding Cloud Voice Hearing Aids Microphones Speech Recognition Voice Over IP Benefits and Advantages Accurate Iterative reevaluation of critical bands results in greater accuracy of excitation patterns and loudness measurements. Efficient Calculates final loudness estimate and intermediate quantities with minimal computational overhead. Versatile Can be used for computationally demanding encoding schemes as well as rapid real-time streaming. Saves Power Ideal for light-weight software processes or portable audio devices. Download Original PDF For more information about the inventor(s) and their research, please see Dr. Andreas Spanias's directory webpage",
author = "Andreas Spanias",
year = "2014",
month = "3",
day = "8",
language = "English (US)",
type = "Patent",

}

TY - PAT

T1 - Fast Computation of Excitation Pattern, Auditory Pattern and Loudness

AU - Spanias, Andreas

PY - 2014/3/8

Y1 - 2014/3/8

N2 - Modeling the way human ears perceive sound has resulted in microphones and software that can better select a persons voice while filtering out background noise. The inner ear filters sound waves based on their shape and frequency, containing groups of auditory nerve cells that respond to specific intervals of frequencies known as critical bands. Multiple overlapping critical bands combine via auditory masking to form an excitation pattern that the brain perceives as loudness. Frequency-weighting techniques, such as those commonly used in broadcasting, do not consider masking effects and perform poorly amidst loudness. Additionally, they cannot be used to calculate excitation patterns needed for digital compression or volume control in hearing aids. Recently, more elaborate models have been developed that imitate the ear. However, they rely on large banks of critical band-pass filters and are computationally expensive, making them unsuitable for real-time applications. Researchers at ASU have developed a method for accurately and efficiently estimating excitation patterns and loudness with real-time functionality. This method reduces the filter bank size by iteratively selecting only the perceptually relevant components of sound waves, such that the total neural activity is preserved and the general shape of the excitation pattern is initially captured. Critical bands are then chosen based on their relevancy to the general shape, resulting in greater accuracy of estimated excitation patterns and loudness measures with minimal computational overhead. This method can be applied to loudness measurement systems, perceptual loudness based volume control, loudness equalization systems, and digital audio encoding schemes. Potential Applications Audio Coding Cloud Voice Hearing Aids Microphones Speech Recognition Voice Over IP Benefits and Advantages Accurate Iterative reevaluation of critical bands results in greater accuracy of excitation patterns and loudness measurements. Efficient Calculates final loudness estimate and intermediate quantities with minimal computational overhead. Versatile Can be used for computationally demanding encoding schemes as well as rapid real-time streaming. Saves Power Ideal for light-weight software processes or portable audio devices. Download Original PDF For more information about the inventor(s) and their research, please see Dr. Andreas Spanias's directory webpage

AB - Modeling the way human ears perceive sound has resulted in microphones and software that can better select a persons voice while filtering out background noise. The inner ear filters sound waves based on their shape and frequency, containing groups of auditory nerve cells that respond to specific intervals of frequencies known as critical bands. Multiple overlapping critical bands combine via auditory masking to form an excitation pattern that the brain perceives as loudness. Frequency-weighting techniques, such as those commonly used in broadcasting, do not consider masking effects and perform poorly amidst loudness. Additionally, they cannot be used to calculate excitation patterns needed for digital compression or volume control in hearing aids. Recently, more elaborate models have been developed that imitate the ear. However, they rely on large banks of critical band-pass filters and are computationally expensive, making them unsuitable for real-time applications. Researchers at ASU have developed a method for accurately and efficiently estimating excitation patterns and loudness with real-time functionality. This method reduces the filter bank size by iteratively selecting only the perceptually relevant components of sound waves, such that the total neural activity is preserved and the general shape of the excitation pattern is initially captured. Critical bands are then chosen based on their relevancy to the general shape, resulting in greater accuracy of estimated excitation patterns and loudness measures with minimal computational overhead. This method can be applied to loudness measurement systems, perceptual loudness based volume control, loudness equalization systems, and digital audio encoding schemes. Potential Applications Audio Coding Cloud Voice Hearing Aids Microphones Speech Recognition Voice Over IP Benefits and Advantages Accurate Iterative reevaluation of critical bands results in greater accuracy of excitation patterns and loudness measurements. Efficient Calculates final loudness estimate and intermediate quantities with minimal computational overhead. Versatile Can be used for computationally demanding encoding schemes as well as rapid real-time streaming. Saves Power Ideal for light-weight software processes or portable audio devices. Download Original PDF For more information about the inventor(s) and their research, please see Dr. Andreas Spanias's directory webpage

M3 - Patent

ER -