Stimulus-spectrum irregularity and the generation of evoked and spontaneous otoacoustic emissions: Comments on the model of Nobili et al.
Christopher
A. Shera
Eaton-Peabody Laboratory of Auditory Physiology
Massachusetts Eye & Ear Infirmary
243 Charles Street
Boston, MA 02114, USA
Department of Otology & Laryngology
Harvard Medical School
Boston, MA 02115, USA
Arnold Tubis
Department of Physics
Purdue University
West Lafayette, IN 47907, USA
Institute for Nonlinear Science
University of California, San Diego
La Jolla, CA 92093, USA
Carrick L. Talmadge
National Center for Physical Acoustics
University of Mississippi
University, MI 38677, USA
The present white paper
can be ALSO downloaded as a pdf file
(104 k) clicking here.
The original article by Dr Nobili et al
titled Otoacoustic Emissions from Residual
Oscillations of the Cochlear Basilar Membrane in a
Human Ear Model, can be downloaded from here
as a pdf file (380 k) . In addition
a chapter titled "OTOACOUSTIC EMISSIONS SIMULATED
IN THE TIME-DOMAIN BY A HYDROYNAMIC MODEL OF THE HUMAN
COCHLEA" with relative material on Dr. Nolibi's model
can le downloaded from here
as a pdf file (350 k).
|
|
I. Introduction
Ever since their discovery, transient-evoked otoacoustic
emissions (TEOAEs) have generally been ascribed to the reflection
of cochlear traveling-wave energy by mechanical impedance perturbations
arrayed (or induced) in various ways along the cochlear partition
(e.g., Kemp, 1978; Manley, 1983; Ruggero
et al., 1983; Sutton and Wilson, 1983; Zwicker, 1986; Furst and Lapid,
1988; Strube, 1989; but see Yates and Withnell, 1999).
Nobili and colleagues, however, have recently proposed a new mechanism
for generating TEOAEs (Nobili, 2000;
Nobili et al., 2003a,b). As an alternative to scattering
by mechanical perturbations, Nobili et al. suggest that TEOAEs result
from prolonged “residual oscillations” of the basilar membrane (BM)
that trace their origin to spectral irregularities in middle-ear transmission.
In their model simulations, residual BM oscillations
and TEOAEs appear when the evoking stimulus has an intensity sufficient
to partially saturate the nonlinear amplification mechanisms within
the cochlea and a frequency spectrum irregular enough to produce a
complex spatial vibration pattern along the basilar membrane. They
suggest that most sounds, no matter how smooth their frequency spectra
appear in the ear canal, acquire the necessary spectral irregularity
simply by passing through the middle ear, from which they inherit
the spectral features characteristic of middleear transfer functions.
Nobili et al. also find—in agreement with many others before them—that
simulated TEOAEs can be produced by introducing small mechanical perturbations
into the equations representing the organ of Corti. In their model,
however, these mechanical perturbations often create spontaneous emissions
(SOAEs), an emission type not invariably associated with TEOAEs. Nobili
et al. therefore argue that “when found in the absence of spontaneous
emissions, transient evoked OAEs are mainly attributable to the characteristics
of forward middle-ear filtering.”
Nobili et
al. show several computer simulations but provide relatively little
comparison between theory and experiment. This paper grew out of
our attempt to understand whether the predictions of the proposed
mechanism were truly “in impressive accord with experimental data,”
as claimed. Evaluating the model in this way requires knowing what
the model predicts. As we demonstrate below, however, Nobili et
al. have left their readers unable to determine which features of
the OAE-like oscillations evident in the simulations are actually
predicted by the model and which features may result from unrecognized
artifacts of the computation. Nevertheless, by identifying qualitative
predictions of the proposed middle-ear filtering mechanism that
can be deduced without recourse to numerical modeling, we sidestep
the uncertainties surrounding Nobili et al.’s simulations and demonstrate
that their model fails to reproduce basic empirical properties of
actual evoked OAEs.
II. Accuracy of the Model Simulations
Although Nobili
et al. solve their time-domain equations of motion using a computational
technique better known for its conceptual simplicity than for its
numerical accuracy (e.g., Press et
al., 1992; Diependaal et al., 1987), they report
no checks of the validity of their model simulations. An elementary
test—necessary but not sufficient to ensure the integrity of the
computation—would be to verify that the purported solution does
not change significantly when the integration step size is decreased.
We performed such a test using the program Nobili et al. published
on the internet to allow readers to simulate emissions using the
model (Nobili, 2003).
Although limitations of the program precluded a definitive analysis,
our preliminary results were not reassuring.
To rebut
our analysis, Nobili and Mammano performed an authoritative test
as part of their review of a previous version of this manuscript.
At one location along the BM they computed the model response to
an acoustic click applied at the eardrum and compared the answer
obtained using their standard integration step size to a computation
performed when the temporal resolution was increased by a factor
of 2. As Nobili and Mammano point out in their review, the two oscillatory
responses are qualitatively similar in appearance. However, when
the responses are overlaid on the same graph, large quantitative
differences immediately become apparent. The two curves are clearly
distinguishable as early as 3 ms after stimulus onset, and by 10
ms the waveforms are more than 180 degrees out of phase with one
another. The presence of such large quantitative discrepancies at
times coincident with the appearance of TEOAE-like oscillations
in their simulations demonstrates that Nobili et al.’s published
otoacoustic responses cannot be reliable solutions of their model
equations.
The tests
described above explore only the accuracy of the time-domain integration.
Nobili et al. also necessarily discretized the spatial coordinate
in their model. To approximate the desired spatial integration they
divided the cochlear partition into 500 longitudinal segments and
summed the results from each section, weighting each response by
a numerical approximation to the local hydrodynamic Green’s
function. Just as with the temporal integration, employing too coarse
a grid can lead to spurious results. Unfortunately, Nobili et al.
again present no checks on the validity of their procedure. Since
the optimal grid spacing depends on both the numerical algorithms
employed and the size of the acceptable error, there are no hard
and fast rules for determining the number of required sections.
In the context of modeling OAEs, a reasonable lower bound might
be the number (Nmin) necessary to represent the spatial frequencies
important for emission generation. The theory of coherent reflection
filtering yields the estimate Nmin ~ 8L / λ
sections, where L is the cochlear length and λ is
the wavelength at the traveling-wave peak for the frequency of interest.
[Note that 8 = 4 * 2, where the factor of 4 is needed to encompass
the range of spatial frequencies falling within the pass-band of
the “spatial-frequency filter” (e.g., Zweig
and Shera, 1995, Fig. 6) and the additional factor
of 2 arises from Nyquist’s sampling theorem.] Using estimates
of λ for the human cochlea obtained
from measurements of SFOAEs (Shera
and Guinan, 2003, Table II) yields Nmin ~ 650 sections
for a model that matches human SFOAE group delays over the full
range of human hearing. Nobili et al. use a nonuniform grid spacing
with a number of sections roughly equivalent to this lower bound.
Whether any particular grid spacing suffices in practice can only
be determined by detailed numerical analysis. As with the time-domain
integration, decreasing the grid spacing until the solution no longer
varies on scales relevant to the issues at hand often provides a
useful empirical assay. Our experience modeling OAEs indicates that
the necessary number of sections can sometimes be significantly
greater than the lower bound estimated above. Talmadge
et al. (1998), for example, found that obtaining
reliable solutions can require as many as 4000 sections, considerably
more than the number used by Nobili et al.
The
computational dangers are especially acute when simulating OAEs
in active cochlear models. Active models propagate and amplify numerical
errors much as they do actual responses to the stimulus. Once they
appear, small errors can grow rapidly and thereafter masquerade
as genuine otoacoustic responses. Since relative OAE amplitudes
are often quite small—human TEOAEs and SFOAEs are typically
10–100 times smaller than the stimulus—computational
procedures that suffice when solving solely for the primary response
to the stimulus may fail completely when calculating OAEs.
III. Qualitative Tests of the Model
The unresolved
computational issues discussed above imply that the characteristics
of any OAEs predicted by Nobili et al.’s proposed mechanism
remain largely unknown. It is therefore difficult to compare the
model’s predictions with measured OAEs and determine whether
the responses are indeed “strikingly similar to those well
known to audiologists.” For example, measured TEOAEs are generally
rather dispersive, meaning that their waveforms exhibit a decrease
over time in their instantaneous frequency of oscillation (e.g.,
Kemp, 1978). The TEOAE waveforms computed from the
model, however, show little evidence of this phenomenon (see Nobili
et al. 2003b, Fig. 1e).Unfortunately, the reader
cannot easily determine whether the apparent absence of frequency
dispersion reflects a shortcoming of the model or whether it arises
as an artifact of inaccurate numerical methods.
Despite limitations
such as these on any quantitative comparison between theory and
experiment, several important qualitative predictions of the proposed
middleear filtering mechanism can be deduced without solving the
model equations. It is easy to demonstrate, for example, that the
mechanism cannot account for stimulusfrequency OAEs (SFOAEs): Although
TEOAEs are evoked by transient stimuli containing many frequency
components, and are therefore potentially sensitive to frequency
variations in middle-ear transmission as proposed, SFOAEs are evoked
by pure (single-frequency) tones and, ipso facto, cannot originate
via any mechanism that operates across frequency. Since SOAE bandwidths
are much smaller than any significant variation in middle-ear transfer
functions, similar remarks apply to the generation of spontaneous
emissions.
Because
their proposed mechanism can produce neither SFOAEs nor SOAEs, Nobili
et al. are forced to introduce mechanical perturbations along the
BM, a model for which there is longstanding precedent. Compelled
to invoke two different mechanisms to explain the appearance of
TEOAEs, SFOAEs, and SOAEs, Nobili et al. conclude that “there
are at least two main sources of OAEs in the cochlea: one related
to CA [cochlear-amplifier] gain irregularities [i.e., mechanical
perturbations] and the other to middle-ear characteristics.”
But since mechanical perturbations, by themselves, can give rise
to all three emission types (e.g., Shera
and Zweig, 1993b; Talmadge and Tubis, 1993; Zweig and Shera, 1995;
Talmadge et al., 1998), the proposed middle-ear
filtering mechanism may be entirely superfluous.
If it operates at all, the proposed mechanism is clearly limited
to high sound pressure levels. Nobili et al. remark that a linearized
version of their model, identical to the nonlinear model at low
sound pressure levels, yields unmeasurable TEOAEs. But any emission
mechanism that relies on driving the cochlear amplifier into saturation
is inconsistent with some of the key phenomenology of evoked OAEs.
Both TEOAEs and SFOAEs can be measured at sound-pressure levels
near threshold (i.e., far below saturation levels and in the regime
of near-linear BM amplification). Indeed, it is at near-threshold
stimulus levels that the amplitudes of both TEOAEs and SFOAEs are
largest relative to the evoking stimulus (e.g.,
Kemp, 1978; Wilson, 1980; Zwicker and Schloth, 1984; Shera and Zweig,
1993a). Rather than rapidly disappearing to zero,
as Nobili et al.’s model predicts, TEOAE amplitudes actually
grow relative to the stimulus as sound-pressure levels decrease
below 30–40 dB SPL. Nobili et al.’s proposed mechanism
cannot reproduce this basic empirical finding. If the mechanism
worked in the linear regime near the threshold of hearing, then
the otoacoustic response to any transient (i.e., TEOAEs) could be
synthesized by superposition from responses to pure tones (i.e.,
SFOAEs). But, as discussed above, the proposed mechanism cannot
produce SFOAEs at any level of stimulation; as a result, the model
cannot produce TEOAEs at low sound levels, in clear contradiction
with experimental data.
Although
this is hardly the place to rehearse the arguments in detail (for
which see, e.g., Zweig and Shera,
1995; Talmadge et al., 1998; Shera and Guinan, 1999; Talmadge et
al., 2000; Shera, 2003), we note that models that
trace the origin of reflection-source OAEs to scattering by mechanical
perturbations suffer none of these deficiencies. Indeed, these models
provide both a unified framework for exploring the generation of
TEOAEs, SFOAEs, and SOAEs (as well as DPOAE fine-structure) and
a successful, quantitative account of what Zwicker
and Schloth (1984) once characterized as the manifold
“interrelations of different otoacoustic emissions.”
IV. Summary
By circumventing
uncertainties about the numerical accuracy of their published simulations,
we have demonstrated that Nobili et al.’s proposed middle-ear
filtering mechanism fails to reproduce basic empirical properties
of actual evoked OAEs (e.g., persistence of TEOAEs at low stimulation
levels, relations between TEOAEs and SFOAEs, etc). In addition, their
model cannot produce both TEOAEs and SFOAEs (or SOAEs) unless the
model is supplemented with mechanical perturbations along the BM;
in this case, Nobili et al.’s results corroborate findings well
established in the literature (e.g, Sutton
and Wilson, 1983; Zwicker, 1986; Furst and Lapid, 1988; Strube, 1989;
Shera and Zweig, 1993b; Zweig and Shera, 1995; Talmadge et al., 1998).
Acknowledgements
We
thank Egbert de Boer, Paul F. Fahey, John J. Guinan, Jr., Radha Kalluri,
and Robert H. Withnell for valuable discussions and suggestions. Renato
Nobili, Fabio Mammano, and a third, anonymous reviewer provided helpful
comments on the manuscript. This work was supported by grants R01
DC03687 and R29 DC03094 from the NIDCD, National Institutes of Health.
References
Diependaal, R. J., H. Duifhuis, H. W. Hoogstraten, and
M. A. Viergever (1987). Numerical methods for solving one-dimensional
cochlear models in the time domain. J. Acoust. Soc. Am. 82, 1655–1666.
Furst, M. and M. Lapid (1988). A cochlear model
for acoustic emissions. J. Acoust. Soc. Am. 84, 222–229. Kemp,
D. T. (1978). Stimulated acoustic emissions from within the human
auditory system. J. Acoust. Soc. Am. 64, 1386–1391.
Manley, G. A. (1983). Frequency spacing of acoustic
emissions: A possible explanation. In W. R. Webster and L. M. Aitkin
(Eds.), Mechanisms of Hearing, pp. 36–39. Clayton, Australia:
Monash University Press.
Nobili, R., A. Vete¹sn´¦k, L. Turicchia, and F. Mammano
(2003b). Otoacoustic emissions from residual oscillations of the
cochlear basilar membrane in a human ear model. J. Assoc. Res. Otolaryngol.
4, 478–494.
Nobili, R., A. Vete¹sn´¦k, L. Turicchia, and F. Mammano
(2003a). Otoacoustic emissions simulated in the time domain by a
hydrodynamic model of the human cochlea. In A. W. Gummer (Ed.),
Biophysics of the Cochlea: From Molecules to Models, pp. 524–530.
Singapore: World Scientific.
Nobili, R. (2000). Otoacoustic emissions simulated
by a realistic cochlear model. In H. Wada, T. Takasaka, K. Ikeda,
K. Ohyama, and T. Koike (Eds.), Recent Developments in Auditory
Mechanics, pp. 402–408. Singapore: World Scientific.
Nobili, R. (2003). Simoae at http://www.pd.infn.it/~rnobili/simoae/simoae.htm
(as of 29 July 2003).
Press, W. H., S. A. Teukolsky, W. T. Vetterling, and B.
P. Flannery (1992). Numerical Recipes in C: The Art of
Scientific Computing. Cambridge: Cambridge University Press.
Ruggero, M. A., N. C. Rich, and R. Freyman (1983).
Spontaneous and impulsively evoked otoacoustic emissions: Indicators
of cochlear pathology? Hear. Res. 10, 283–300.
Shera, C. A. and J. J. Guinan (1999). Evoked otoacoustic
emissions arise by two fundamentally different mechanisms: A taxonomy
for mammalian OAEs. J. Acoust. Soc. Am. 105, 782–798.
Shera, C. A. and J. J. Guinan (2003). Stimulus-frequency-emission
group delay: A test of coherent reflection filtering and a window
on cochlear tuning. J. Acoust. Soc. Am. 113, 2762–2772.
Shera, C. A. and G. Zweig (1993a). Noninvasive
measurement of the cochlear traveling-wave ratio. J. Acoust. Soc.
Am. 93, 3333–3352.
Shera, C. A. and G. Zweig (1993b). Order from
chaos: Resolving the paradox of periodicity in evoked otoacoustic
emission. In H. Duifhuis, J. W. Horst, P. van Dijk, and S. M. van
Netten (Eds.), Biophysics of Hair Cell Sensory Systems, pp. 54–63.
Singapore: World Scientific.
Shera, C. A. (2003). Mammalian spontaneous otoacoustic
emissions are amplitudestabilized cochlear standing waves. J. Acoust.
Soc. Am. 114, 244–262.
Strube, H. W. (1989). Evoked otoacoustic emissions
as cochlear Bragg reflections. Hear. Res. 38, 35–45. Sutton,
G. T. and J. P. Wilson (1983). Modelling cochlear echoes: The influence
of irregularities in frequency mapping on summed cochlear activity.
In E. Boer and M. A. Viergever (Eds.), Mechanics of Hearing, pp.
83–90. The Hague: Martinus Nijhoff.
Talmadge, C. L., A. Tubis, G. R. Long, and P. Piskorski
(1998). Modeling otoacoustic emission and hearing threshold
fine structures. J. Acoust. Soc. Am. 104, 1517–1543.
Talmadge, C. L., A. Tubis, G. R. Long, and C. Tong
(2000). Modeling the combined effects of basilar membrane nonlinearity
and roughness on stimulus frequency otoacoustic emission fine structure.
J. Acoust. Soc. Am. 108, 2911–2932.
Talmadge, C. L. and A. Tubis (1993). On modeling
the connection between spontaneous and evoked otoacoustic emissions.
In H. Duifhuis, J. W. Horst, P. van Dijk, and S. M. van Netten (Eds.),
Biophysics of Hair Cell Sensory Systems, pp. 25–32. Singapore:
World Scientific.
Wilson, J. P. (1980). Evidence for a cochlear
origin for acoustic re-emissions, threshold fine-structure and tonal
tinnitus. Hear. Res. 2, 233–252.
Yates, G. K. and R. H. Withnell (1999). The role
of intermodulation distortion in transient-evoked otoacoustic emissions.
Hear. Res. 136, 49–64.
Zweig, G. and C. A. Shera (1995). The origin of
periodicity in the spectrum of evoked otoacoustic emissions. J.
Acoust. Soc. Am. 98, 2018–2047.
Zwicker, E. and E. Schloth (1984). Interrelation
of different oto-acoustic emissions. J. Acoust. Soc. Am. 75, 1148–1154.
Zwicker, E. (1986). ‘Otoacoustic’
emissions in a nonlinear cochlear hardware model with feedback.
J. Acoust. Soc. Am. 75, 154–162.
|