Temporal envelope and fine structure explained

Temporal envelope (ENV) and temporal fine structure (TFS) are changes in the amplitude and frequency of sound perceived by humans over time. These temporal changes are responsible for several aspects of auditory perception, including loudness, pitch and timbre perception and spatial hearing.

Complex sounds such as speech or music are decomposed by the peripheral auditory system of humans into narrow frequency bands. The resulting narrow-band signals convey information at different time scales ranging from less than one millisecond to hundreds of milliseconds. A dichotomy between slow "temporal envelope" cues and faster "temporal fine structure" cues has been proposed to study several aspects of auditory perception (e.g., loudness, pitch and timbre perception, auditory scene analysis, sound localization) at two distinct time scales in each frequency band.[1] [2] [3] [4] Over the last decades, a wealth of psychophysical, electrophysiological and computational studies based on this envelope/fine-structure dichotomy have examined the role of these temporal cues in sound identification and communication, how these temporal cues are processed by the peripheral and central auditory system, and the effects of aging and cochlear damage on temporal auditory processing. Although the envelope/fine-structure dichotomy has been debated and questions remain as to how temporal fine structure cues are actually encoded in the auditory system, these studies have led to a range of applications in various fields including speech and audio processing, clinical audiology and rehabilitation of sensorineural hearing loss via hearing aids or cochlear implants.

Definition

Notions of temporal envelope and temporal fine structure may have different meanings in many studies. An important distinction to make is between the physical (i.e., acoustical) and the biological (or perceptual) description of these ENV and TFS cues.

Any sound whose frequency components cover a narrow range (called a narrowband signal) can be considered as an envelope (ENVp, where p denotes the physical signal) superimposed on a more rapidly oscillating carrier, the temporal fine structure (TFSp).[5]

Many sounds in everyday life, including speech and music, are broadband; the frequency components spread over a wide range and there is no well-defined way to represent the signal in terms of ENVp and TFSp. However, in a normally functioning cochlea, complex broadband signals are decomposed by the filtering on the basilar membrane (BM) within the cochlea into a series of narrowband signals.[6] Therefore, the waveform at each place on the BM can be considered as an envelope (ENVBM) superimposed on a more rapidly oscillating carrier, the temporal fine structure (TFSBM).[7] The ENVBM and TFSBM depend on the place along the BM. At the apical end, which is tuned to low (audio) frequencies, ENVBM and TFSBM vary relatively slowly with time, while at the basal end, which is tuned to high frequencies, both ENVBM and TFSBM vary more rapidly with time.[7]

Both ENVBM and TFSBM are represented in the time patterns of action potentials in the auditory nerve[8] these are denoted ENVn and TFSn. TFSn is represented most prominently in neurons tuned to low frequencies, while ENVn is represented most prominently in neurons tuned to high (audio) frequencies.[8] [9] For a broadband signal, it is not possible to manipulate TFSp without affecting ENVBM and ENVn, and it is not possible to manipulate ENVp without affecting TFSBM and TFSn.[10] [11]

Temporal envelope (ENV) processing

Neurophysiological aspects

The neural representation of stimulus envelope, ENVn, has typically been studied using well-controlled ENVp modulations, that is sinusoidally amplitude-modulated (AM) sounds. Cochlear filtering limits the range of AM rates encoded in individual auditory-nerve fibers. In the auditory nerve, the strength of the neural representation of AM decreases with increasing modulation rate. At the level of the cochlear nucleus, several cell types show an enhancement of ENVn information. Multipolar cells can show band-pass tuning to AM tones with AM rates between 50 and 1000 Hz.[12] [13] Some of these cells show an excellent response to the ENVn and provide inhibitory sideband inputs to other cells in the cochlear nucleus giving a physiological correlate of comodulation masking release, a phenomenon whereby the detection of a signal in a masker is improved when the masker has correlated envelope fluctuations across frequency (see section below).[14]

Responses to the temporal-envelope cues of speech or other complex sounds persist up the auditory pathway, eventually to the various fields of the auditory cortex in many animals. In the Primary Auditory Cortex, responses can encode AM rates by phase-locking up to about 20–30 Hz,[15] [16] [17] [18] while faster rates induce sustained and often tuned responses.[19] [20] A topographical representation of AM rate has been demonstrated in the primary auditory cortex of awake macaques.[21] This representation is approximately perpendicular to the axis of the tonotopic gradient, consistent with an orthogonal organization of spectral and temporal features in the auditory cortex. Combining these temporal responses with the spectral selectivity of A1 neurons gives rise to the spectro-temporal receptive fields that often capture well cortical responses to complex modulated sounds.[22] [23] In secondary auditory cortical fields, responses become temporally more sluggish and spectrally broader, but are still able to phase-lock to the salient features of speech and musical sounds.[24] [25] [26] [27] Tuning to AM rates below about 64 Hz is also found in the human auditory cortex [28] [29] [30] [31] as revealed by brain-imaging techniques (fMRI) and cortical recordings in epileptic patients (electrocorticography). This is consistent with neuropsychological studies of brain-damaged patients[32] and with the notion that the central auditory system performs some form of spectral decomposition of the ENVp of incoming sounds. The ranges over which cortical responses encode well the temporal-envelope cues of speech have been shown to be predictive of the human ability to understand speech. In the human superior temporal gyrus (STG), an anterior-posterior spatial organization of spectro-temporal modulation tuning has been found in response to speech sounds, the posterior STG being tuned for temporally fast varying speech sounds with low spectral modulations and the anterior STG being tuned for temporally slow varying speech sounds with high spectral modulations.[33]

One unexpected aspect of phase locking in the auditory cortex has been observed in the responses elicited by complex acoustic stimuli with spectrograms that exhibit relatively slow envelopes (< 20 Hz), but that are carried by fast modulations that are as high as hundreds of Hertz. Speech and music, as well as various modulated noise stimuli have such temporal structure.[34] For these stimuli, cortical responses phase-lock to both the envelope and fine-structure induced by interactions between unresolved harmonics of the sound, thus reflecting the pitch of the sound, and exceeding the typical lower limits of cortical phase-locking to the envelopes of a few 10's of Hertz. This paradoxical relation[34] [35] between the slow and fast cortical phase-locking to the carrier “fine structure” has been demonstrated both in the auditory[34] and visual[36] cortices. It has also been shown to be amply manifested in measurements of the spectro-temporal receptive fields of the primary auditory cortex giving them unexpectedly fine temporal accuracy and selectivity bordering on a 5-10 ms resolution.[34] [36] The underlying causes of this phenomenon have been attributed to several possible origins, including nonlinear synaptic depression and facilitation, and/or a cortical network of thalamic excitation and cortical inhibition.[34] [37] [38] [39] There are many functionally significant and perceptually relevant reasons for the coexistence of these two complementary dynamic response modes. They include the ability to accurately encode onsets and other rapid ‘events’ in the ENVp of complex acoustic and other sensory signals, features that are critical for the perception of consonants (speech) and percussive sounds (music), as well as the texture of complex sounds.[34] [40]

Psychoacoustical aspects

The perception of ENVp depends on which AM rates are contained in the signal. Low rates of AM, in the 1–8 Hz range, are perceived as changes in perceived intensity, that is  loudness fluctuations (a percept that can also be evoked by frequency modulation, FM); at higher rates, AM is perceived as roughness, with the greatest roughness sensation occurring at around 70 Hz;[41] at even higher rates, AM can evoke a weak pitch percept corresponding to the modulation rate.[42] Rainstorms, crackling fire, chirping crickets or galloping horses produce "sound textures" - the collective result of many similar acoustic events - which perception is mediated by ENVn statistics.[43] [44]

The auditory detection threshold for AM as a function of AM rate, referred to as the temporal modulation transfer function (TMTF),[45] is best for AM rates in the range from 4 – 150 Hz and worsens outside that range[45] [46] [47] The cutoff frequency of the TMTF gives an estimate of temporal acuity (temporal resolution) for the auditory system. This cutoff frequency corresponds to a time constant of about 1 - 3 ms for the auditory system of normal-hearing humans.

Correlated envelope fluctuations across frequency in a masker can aid detection of a pure tone signal, an effect known as comodulation masking release.[48]

AM applied to a given carrier can perceptually interfere with the detection of a target AM imposed on the same carrier, an effect termed modulation masking.[49] Modulation-masking patterns are tuned (greater masking occurs for masking and target AMs close in modulation rate), suggesting that the human auditory system is equipped with frequency-selective channels for AM. Moreover, AM applied to spectrally remote carriers can perceptually interfere with the detection of AM on a target sound, an effect termed modulation detection interference.[50] The notion of modulation channels is also supported by the demonstration of selective adaptation effects in the modulation domain.[51] [52] [53] These studies show that AM detection thresholds are selectively elevated above pre-exposure thresholds when the carrier frequency and the AM rate of the adaptor are similar to those of the test tone.

Human listeners are sensitive to relatively slow "second-order" AMs cues correspond to fluctuations in the strength of AM. These cues arise from the interaction of different modulation rates, previously described as "beating" in the envelope-frequency domain. Perception of second-order AM has been interpreted as resulting from nonlinear mechanisms in the auditory pathway that produce an audible distortion component at the envelope beat frequency in the internal modulation spectrum of the sounds.[54] [55] [56]

Interaural time differences in the envelope provide binaural cues even at high frequencies where TFSn cannot be used.[57]

Models of normal envelope processing

The most basic computer model of ENV processing is the leaky integrator model.[58] [45] This model extracts the temporal envelope of the sound (ENVp) via bandpass filtering, half-wave rectification (which may be followed by fast-acting amplitude compression), and lowpass filtering with a cutoff frequency between about 60 and 150 Hz. The leaky integrator is often used with a decision statistic based on either the resulting envelope power, the max/min ratio, or the crest factor. This model accounts for the loss of auditory sensitivity for AM rates higher than about 60–150 Hz for broadband noise carriers.[45] Based on the concept of frequency selectivity for AM,[59] the perception model of Torsten Dau[60] incorporates broadly tuned bandpass modulation filters (with a Q value around 1) to account for data from a broad variety of psychoacoustic tasks and particularly AM detection for noise carriers with different bandwidths, taking into account their intrinsic envelope fluctuations. This model of has been extended to account for comodulation masking release (see sections above).[61] The shapes of the modulation filters have been estimated[62] and an “envelope power spectrum model” (EPSM) based on these filters can account for AM masking patterns and AM depth discrimination.[63] The EPSM has been extended to the prediction of speech intelligibility[64] and to account for data from a broad variety of psychoacoustic tasks.[65] A physiologically based processing model simulating brainstem responses has also been developed to account for AM detection and AM masking patterns.[66]

Temporal fine structure (TFS) processing

Neurophysiological aspects

The neural representation of temporal fine structure, TFSn, has been studied using stimuli with well-controlled TFSp: pure tones, harmonic complex tones, and frequency-modulated (FM) tones.

Auditory-nerve fibres are able to represent low-frequency sounds via their phase-locked discharges (i.e., TFSn information). The upper frequency limit for phase locking is species dependent. It is about 5 kHz in the cat, 9 kHz in the barn owl and just 4 kHz in the guinea pig. We do not know the upper limit of phase locking in humans but current, indirect, estimates suggest it is about 4–5 kHz.[67] Phase locking is a direct consequence of the transduction process with an increase in probability of transduction channel opening occurring with a stretching of the stereocilia and decrease in channel opening occurring when pushed in the opposite direction. This has led some to suggest that phase locking is an epiphenomenon. The upper limit appears to be determined by a cascade of low pass filters at the level of the inner hair cell and auditory-nerve synapse.[68] [69]

TFSn information in the auditory nerve may be used to encode the (audio) frequency of low-frequency sounds, including single tones and more complex stimuli such as frequency-modulated tones or steady-state vowels (see role and applications to speech and music).

The auditory system goes to some length to preserve this TFSn information with the presence of giant synapses (End bulbs of Held) in the ventral cochlear nucleus. These synapses contact bushy cells (Spherical and globular) and faithfully transmit (or enhance) the temporal information present in the auditory nerve fibers to higher structures in the brainstem.[70] The bushy cells project to the medial superior olive and the globular cells project to the medial nucleus of the trapezoid body (MNTB). The MNTB is also characterized by giant synapses (calyces of Held) and provides precisely timed inhibition to the lateral superior olive. The medial and lateral superior olive and MNTB are involved in the encoding of interaural time and intensity differences. There is general acceptance that the temporal information is crucial in sound localization but it is still contentious as to whether the same temporal information is used to encode the frequency of complex sounds.

Several problems remain with the idea that the TFSn is important in the representation of the frequency components of complex sounds. The first problem is that the temporal information deteriorates as it passes through successive stages of the auditory pathway (presumably due to the low pass dendritic filtering). Therefore, the second problem is that the temporal information must be extracted at an early stage of the auditory pathway. No such stage has currently been identified although there are theories about how temporal information can be converted into rate information (see section Models of normal processing: Limitations).

Psychoacoustical aspects

It is often assumed that many perceptual capacities rely on the ability of the monaural and binaural auditory system to encode and use TFSn cues evoked by components in sounds with frequencies below about 1–4 kHz. These capacities include discrimination of frequency,[71] [72] [73] [74] discrimination of the fundamental frequency of harmonic sounds,[73] [72] [74] detection of FM at rates below 5 Hz,[75] melody recognition for sequences of pure tones and complex tones,[71] [72] lateralization and localization of pure tones and complex tones,[76] and segregation of concurrent harmonic sounds (such as speech sounds).[77] It appears that TFSn cues require correct tonotopic (place) representation to be processed optimally by the auditory system.[78] Moreover, musical pitch perception has been demonstrated for complex tones with all harmonics above 6 kHz, demonstrating that it is not entirely dependent on neural phase locking to TFSBM (i.e., TFSn) cues.[79]

As for FM detection, the current view assumes that in the normal auditory system, FM is encoded via TFSn cues when the FM rate is low (<5 Hz) and when the carrier frequency is below about 4 kHz,[75] [80] [81] and via ENVn cues when the FM is fast or when the carrier frequency is higher than 4 kHz.[75] [82] [83] [84] [81] This is supported by single-unit recordings in the low brainstem.[70] According to this view, TFSn cues are not used to detect FM with rates above about 10 Hz because the mechanism decoding the TFSn information is “sluggish” and cannot track rapid changes in frequency.[75] Several studies have shown that auditory sensitivity to slow FM at low carrier frequency is associated with speech identification for both normal-hearing and hearing-impaired individuals when speech reception is limited by acoustic degradations (e.g., filtering) or concurrent speech sounds.[85] [86] [87] This suggests that robust speech intelligibility is determined by accurate processing of TFSn cues.

Models of normal processing: limitations

The separation of a sound into ENVp  and TFSp appears inspired partly by how sounds are synthesized and by the availability of a convenient way to separate an existing sound into ENV and TFS, namely the Hilbert transform. There is a risk that this view of auditory processing[88] is dominated by these physical/technical concepts, similarly to how cochlear frequency-to-place mapping was for a long time conceptualized in terms of the Fourier transform. Physiologically, there is no indication of a separation of ENV and TFS in the auditory system for stages up to the cochlear nucleus. Only at that stage does it appear that parallel pathways, potentially enhancing ENVn or TFSn information (or something akin to it), may be implemented through the temporal response characteristics of different cochlear nucleus cell types.[70] It may therefore be useful to better simulate cochlear nucleus cell types to understand the true concepts for parallel processing created at the level of the cochlear nucleus. These concepts may be related to separating ENV and TFS but are unlikely realized like the Hilbert transform.

A computational model of the peripheral auditory system[89] [90] may be used to simulate auditory-nerve fiber responses to complex sounds such as speech, and quantify the transmission (i.e., internal representation) of ENVn and TFSn cues. In two simulation studies,[91] [92] the mean-rate and spike-timing information was quantified at the output of such a model to characterize, respectively, the short-term rate of neural firing (ENVn) and the level of synchronization due to phase locking (TFSn) in response to speech sounds degraded by vocoders. The best model predictions of vocoded-speech intelligibility were found when both ENVn and TFSn cues were included, providing evidence that TFSn cues are important for intelligibility when the speech ENVp cues are degraded.

At a more fundamental level, similar computational modeling was used to demonstrate that the functional dependence of human just-noticeable-frequency-differences on pure-tone frequency were not accounted for unless temporal information was included (notably most so for mid-high frequencies, even above the nominal cutoff in physiological phase locking).[93] [94] However, a caveat of most TFS models is that optimal model performance with temporal information typically over-estimates human performance.

An alternative view is to assume that TFSn information at the level of the auditory nerve is converted into rate-place (ENVn) information at a later stage of the auditory system (e.g., the low brainstem). Several modelling studies proposed that the neural mechanisms for decoding TFSn are based on correlation of the outputs of adjacent places.[95] [96] [97] [98] [99]

Role in speech and music perception

Role of temporal envelope in speech and music perception

The ENVp plays a critical role in many aspects of auditory perception, including in the perception of speech and music.[4] [100] [101] Speech recognition is possible using cues related to the ENVp, even in situations where the original spectral information and TFSp are highly degraded.[102] Indeed, when the spectrally local TFSp from one sentence is combined with the ENVp from a second sentence, only the words of the second sentence are heard.[103] The ENVp rates most important for speech are those below about 16 Hz, corresponding to fluctuations at the rate of syllables.[104] [105] [106] On the other hand, the fundamental frequency (“pitch”) contour of speech sounds is primarily conveyed via TFSp cues,[105] although some information on the contour can be perceived via rapid envelope fluctuations corresponding to the fundamental frequency.[107] For music, slow ENVp rates convey rhythm and tempo information, whereas more rapid rates convey the onset and offset properties of sound (attack and decay, respectively) that are important for timbre perception.[108]

Role of TFS in speech and music perception

The ability to accurately process TFSp information is thought to play a role in our perception of pitch (i.e., the perceived height of sounds), an important sensation for music perception, as well as our ability to understand speech, especially in the presence of background noise.[72]

Role of TFS in pitch perception

Although pitch retrieval mechanisms in the auditory system are still a matter of debate,[74] [109] TFSn information may be used to retrieve the pitch of low-frequency pure tones[73] and estimate the individual frequencies of the low-numbered (ca. 1st-8th) harmonics of a complex sound,[110] frequencies from which the fundamental frequency of the sound can be retrieved according to, e.g., pattern-matching models of pitch perception.[111] A role of TFSn information in pitch perception of complex sounds containing intermediate harmonics (ca. 7th-16th) has also been suggested[112] and may be accounted for by temporal or spectrotemporal[113] models of pitch perception. The degraded TFSn cues conveyed by cochlear implant devices may also be partly responsible for impaired music perception of cochlear implant recipients.[114]

Role of TFS cues in speech perception

TFSp cues are thought to be important for the identification of speakers and for tone identification in tonal languages.[115] In addition, several vocoder studies have suggested that TFSp cues contribute to the intelligibility of speech in quiet and noise.[116] Although it is difficult to isolate TFSp from ENVp cues,[101] [117] there is evidence from studies in hearing-impaired listeners that speech perception in the presence of background noise can be partly accounted for by the ability to accurately process TFSp,[118] [119] although the ability to “listen in the dips” of fluctuating maskers does not seem to depend on periodic TFSp cues.[120]

Role in environmental sound perception

Environmental sounds can be broadly defined as nonspeech and nonmusical sounds in the listener's environment that can convey meaningful information about surrounding objects and events.[121] Environmental sounds are highly heterogeneous in terms of their acoustic characteristics and source types, and may include human and animal vocalizations, water and weather related events, mechanical and electronic signaling sounds. Given a great variety in sound sources that give rise to environmental sounds both ENVp and TFSp play an important role in their perception. However, the relative contributions of ENVp and TFSp can differ considerably for specific environmental sounds. This is reflected in the variety of acoustic measures that correlate with different perceptual characteristics of objects and events.[122] [123] [124]

Early studies highlighted the importance of envelope-based temporal patterning in perception of environmental events. For instance, Warren & Verbrugge, demonstrated that constructed sounds of a glass bottle dropped on the floor were perceived as bouncing when high-energy regions in four different frequency bands were temporally aligned, producing amplitude peaks in the envelope.[125] In contrast, when the same spectral energy was distributed randomly across bands the sounds were heard as breaking. More recent studies using vocoder simulations of cochlear implant processing demonstrated that many temporally-patterned sounds can be perceived with little original spectral information, based primarily on temporal cues.[123] [124] Such sounds as footsteps, horse galloping, helicopter flying, ping-pong playing, clapping, typing were identified with a high accuracy of 70% or more with a single channel of envelope-modulated broadband noise or with only two frequency channels. In these studies, envelope-based acoustic measures such as number of bursts and peaks in the envelope were predictive of listeners’ abilities to identify sounds based primarily on ENVp cues. On the other hand, identification of brief environmental sounds without strong temporal patterning in ENVp may require a much larger number of frequency channels to perceive. Sounds such as a car horn or a train whistle were poorly identified even with as many as 32 frequency channels.[123] Listeners with cochlear implants, which transmit envelope information for specific frequency bands, but do not transmit TFSp, have considerably reduced abilities in identification of common environmental sounds.[126] [127] [128]

In addition, individual environmental sounds are typically heard within the context of larger auditory scenes where sounds from multiple sources may overlap in time and frequency. When heard within an auditory scene, accurate identification of individual environmental sounds is contingent on the ability to segregate them from other sound sources or auditory streams in the auditory scene, which involves further reliance on ENVp and TFSp cues (see Role in auditory scene analysis).

Role in auditory scene analysis

Auditory scene analysis refers to the ability to perceive separately sounds coming from different sources. Any acoustical difference can potentially lead to auditory segregation,[129] and so any cues based either on ENVp or TFSp are likely to assist in segregating competing sound sources.[130] Such cues involve percepts such as pitch.[131] [132] [133] [134] Binaural TFSp cues producing interaural time differences have not always resulted in clear source segregation, particularly with simultaneously presented sources, although successful segregation of sequential sounds, such as noise or speech, have been reported.[135]

Effects of age and hearing loss on temporal envelope processing

Developmental aspects

In infancy, behavioral AM detection thresholds[136] and forward or backward masking thresholds[136] [137] [138] observed in 3-month olds are similar to those observed in adults. Electrophysiological studies conducted in 1-month-old infants using 2000 Hz AM pure tones indicate some immaturity in envelope following response (EFR). Although sleeping infants and sedated adults show the same effect of modulation rate on EFR, infants’ estimates were generally poorer than adults’.[139] [140] This is consistent with behavioral studies conducted with school-age children showing differences in AM detection thresholds compared to adults. Children systematically show worse AM detection thresholds than adults until 10–11 years. However, the shape of the TMTF (the cutoff) is similar to adults’ for younger children of 5 years.[141] [142] Sensory versus non-sensory factors for this long maturation are still debated,[143] but the results generally appear to be more dependent on the task or on sound complexity for infants and children than for adults.[144] Regarding the development of speech ENVp processing, vocoder studies suggest that infants as young as 3 months are able to discriminate a change in consonants when the faster ENVp information of the syllables is preserved (< 256 Hz) but less so when only the slowest ENVp is available (< 8 Hz).[145] Older children of 5 years show similar abilities than adults to discriminate consonant changes based on ENVp cues (< 64 Hz).[146]

Neurophysiological aspects

The effects of hearing loss and age on neural coding are generally believed to be smaller for slowly varying envelope responses (i.e., ENVn) than for rapidly varying temporal fine structure (i.e., TFSn).[147] [148] Enhanced ENVn coding following noise-induced hearing loss has been observed in peripheral auditory responses from single neurons[149] and in central evoked responses from the auditory midbrain.[150] The enhancement in ENVn coding of narrowband sounds occurs across the full range of modulation frequencies encoded by single neurons.[151] For broadband sounds, the range of modulation frequencies encoded in impaired responses is broader than normal (extending to higher frequencies), as expected from reduced frequency selectivity associated with outer-hair-cell dysfunction.[152] The enhancement observed in neural envelope responses is consistent with enhanced auditory perception of modulations following cochlear damage, which is commonly believed to result from loss of cochlear compression that occurs with outer-hair-cell dysfunction due to age or noise overexposure.[153]  However, the influence of inner-hair-cell dysfunction (e.g., shallower response growth for mild-moderate damage and steeper growth for severe damage) can confound the effects of outer-hair-cell dysfunction on overall response growth and thus ENVn coding.[149] [154] Thus, not surprisingly the relative effects of outer-hair-cell and inner-hair-cell dysfunction have been predicted with modeling to create individual differences in speech intelligibility based on the strength of envelope coding of speech relative to noise.

Psychoacoustical aspects

For sinusoidal carriers, which have no intrinsic envelope (ENVp) fluctuations, the TMTF is roughly flat for AM rates from 10 to 120 Hz, but increases (i.e. threshold worsens) for higher AM rates,[47] [155] provided that spectral sidebands are not audible. The shape of the TMTF for sinusoidal carriers is similar for young and older people with normal audiometric thresholds, but older people tend to have higher detection thresholds overall, suggesting poorer “detection efficiency” for ENVn cues in older people.[156] [157] Provided that the carrier is fully audible, the ability to detect AM is usually not adversely affected by cochlear hearing loss and may sometimes be better than normal, for both noise carriers [158] [159] and sinusoidal carriers,[155] [160] perhaps because loudness recruitment (an abnormally rapid growth of loudness with increasing sound level) “magnifies” the perceived amount of AM (i.e., ENVn cues). Consistent with this, when the AM is clearly audible, a sound with a fixed AM depth appears to fluctuate more for an impaired ear than for a normal ear. However, the ability to detect changes in AM depth can be impaired by cochlear hearing loss.[160] Speech that is processed with noise vocoder such that mainly envelope information is delivered in multiple spectral channels was also used in investigating envelope processing in hearing impairment. Here, hearing-impaired individuals could not make use of such envelope information as well as normal-hearing individuals, even after audibility factors were taken into account.[161] Additional experiments suggest that age negatively affects the binaural processing of ENVp at least at low audio-frequencies.[162]

Models of impaired temporal envelope processing

The perception model of ENV processing[60] that incorporates selective (bandpass) AM filters accounts for many perceptual consequences of cochlear dysfunction including enhanced sensitivity to AM for sinusoidal and noise carriers,[163] [164] abnormal forward masking (the rate of recovery from forward masking being generally slower than normal for impaired listeners),[165] stronger interference effects between AM and FM [166] and enhanced temporal integration of AM.[164] The model of Torsten Dau[60] has been extended to account for the discrimination of complex AM patterns by hearing-impaired individuals and the effects of noise-reduction systems.[167] The performance of the hearing-impaired individuals was best captured when the model combined the loss of peripheral amplitude compression resulting from the loss of the active mechanism in the cochlea[163] [164] [165] with an increase in internal noise in the ENVn domain.[163] [164] [166] Phenomenological models simulating the response of the peripheral auditory system showed that impaired AM sensitivity in individuals experiencing chronic tinnitus with clinically normal audiograms could be predicted by substantial loss of  auditory-nerve fibers with low spontaneous rates and some loss of auditory-nerve fibers with high-spontaneous rates.[168]

Effects of age and hearing loss on TFS processing

Developmental aspects

Very few studies have systematically assessed TFS processing in infants and children. Frequency-following response (FFR), thought to reflect phase-locked neural activity, appears to be adult-like in 1-month-old infants when using a pure tone (centered at 500, 1000 or 2000 Hz) modulated at 80 Hz with a 100% of modulation depth.[139]

As for behavioral data, six-month-old infants require larger frequency transitions to detect an FM change in a 1-kHz tone compared to adults.[169] However, 4-month-old infants are able to discriminate two different FM sweeps,[170] and they are more sensitive to FM cues swept from 150 Hz to 550 Hz than at lower frequencies.[171] In school-age children, performance in detecting FM change improves between 6 and 10 years and sensitivity to low modulation rate (2 Hz) is poor until 9 years.[172]

For speech sounds, only one vocoder study has explored the ability of school age children to rely on TFSp cues to detect consonant changes, showing the same abilities for 5-years-olds than adults.[146]

Neurophysiological aspects

Psychophysical studies have suggested that degraded TFS processing due to age and hearing loss may underlie some suprathreshold deficits, such as speech perception;[7] however, debate remains about the underlying neural correlates.[147] [148] The strength of phase locking to the temporal fine structure of signals (TFSn) in quiet listening conditions remains normal in peripheral single-neuron responses following cochlear hearing loss.[149] Although these data suggest that the fundamental ability of auditory-nerve fibers to follow the rapid fluctuations of sound remains intact following cochlear hearing loss, deficits in phase locking strength do emerge in background noise.[173] This finding, which is consistent with the common observation that listeners with cochlear hearing loss have more difficulty in noisy conditions, results from reduced cochlear frequency selectivity associated with outer-hair-cell dysfunction.[153]  Although only limited effects of age and hearing loss have been observed in terms of TFSn coding strength of narrowband sounds, more dramatic deficits have been observed in TFSn coding quality in response to broadband sounds, which are more relevant for everyday listening.  A dramatic loss of tonotopicity can occur following noise induced hearing loss, where auditory-nerve fibers that should be responding to mid frequencies (e.g., 2–4 kHz) have dominant TFS responses to lower frequencies (e.g., 700 Hz).[174]  Notably, the loss of tonotopicity generally occurs only for TFSn coding but not for ENVn coding, which is consistent with greater perceptual deficits in TFS processing.[7] This tonotopic degradation is likely to have important implications for speech perception, and can account for degraded coding of vowels following noise-induced hearing loss in which most of the cochlea responds to only the first formant, eliminating the normal tonotopic representation of the second and third formants.

Psychoacoustical aspects

Several psychophysical studies have shown that older people with normal hearing and people with sensorineural hearing loss often show impaired performance for auditory tasks that are assumed to rely on the ability of the monaural and binaural auditory system to encode and use TFSn cues, such as: discrimination of sound frequency,[74] [175] [176] discrimination of the fundamental frequency of harmonic sounds,[74] [175] [176] [177] detection of FM at rates below 5 Hz,[178] [179] [180] melody recognition for sequences of pure tones and complex sounds,[181] lateralization and localization of pure tones and complex tones,[76] [182] [162] and segregation of concurrent harmonic sounds (such as speech sounds).[77] However, it remains unclear to which extent deficits associated with hearing loss reflect poorer TFSn processing or reduced cochlear frequency selectivity.[181]

Models of impaired processing

The quality of the representation of a sound in the auditory nerve is limited by refractoriness, adaptation, saturation, and reduced synchronization (phase locking) at high frequencies, as well as by the stochastic nature of actions potentials.[183] However, the auditory nerve contains thousands of fibers. Hence, despite these limiting factors, the properties of sounds are reasonably well represented in the population nerve response over a wide range of levels[184] and audio frequencies (see Volley Theory).

The coding of temporal information in the auditory nerve can be disrupted by two main mechanisms: reduced synchrony and loss of synapses and/or auditory nerve fibers.[185] The impact of disrupted temporal coding on human auditory perception has been explored using physiologically inspired signal-processing tools. The reduction in neural synchrony has been simulated by jittering the phases of the multiple frequency components in speech,[186] although this has undesired effects in the spectral domain. The loss of auditory nerve fibers or synapses has been simulated by assuming (i) that each afferent fiber operates as a stochastic sampler of the sound waveform, with greater probability of firing for higher-intensity and sustained sound features than for lower-intensity or transient features, and (ii) that deafferentation can be modeled by reducing the number of samplers.[183] However, this also has undesired effects in the spectral domain. Both jittering and stochastic undersampling degrade the representation of the TFSn more than the representation of the ENVn. Both jittering and stochastic undersampling impair the recognition of speech in noisy backgrounds without degrading recognition in silence, support the argument that TFSn is important for recognizing speech in noise.[187] Both jittering and stochastic undersampling mimic the effects of aging on speech perception.[188]

Transmission by hearing aids and cochlear implants

Temporal envelope transmission

Individuals with cochlear hearing loss usually have a smaller than normal dynamic range between the level of the weakest detectable sound and the level at which sounds become uncomfortably loud.[189] [190] To compress the large range of sound levels encountered in everyday life into the small dynamic range of the hearing-impaired person, hearing aids apply amplitude compression, which is also called automatic gain control (AGC). The basic principle of such compression is that the amount of amplification applied to the incoming sound progressively decreases as the input level increases. Usually, the sound is split into several frequency “channels”, and AGC is applied independently in each channel. As a result of compressing the level, AGC reduces the amount of envelope fluctuation in the input signal (ENVp) by an amount that depends on the rate of fluctuation and the speed with which the amplification changes in response to changes in input sound level.[191] [192] AGC can also change the shape of the envelope of the signal.[193] Cochlear implants are devices that electrically stimulate the auditory nerve, thereby creating the sensation of sound in a person who would otherwise be profoundly or totally deaf. The electrical dynamic range is very small,[194] so cochlear implants usually incorporate AGC prior to the signal being filtered into multiple frequency channels.[195] The channel signals are then subjected to instantaneous compression to map them into the limited dynamic range for each channel.[196]

Cochlear implants differ than hearing aids in that the entire acoustic hearing is replaced with direct electric stimulation of the auditory nerve, achieved via an electrode array placed inside the cochlea. Hence, here, other factors than device signal processing also strongly contribute to overall hearing, such as etiology, nerve health, electrode configuration and proximity to the nerve, and overall adaptation process to an entirely new mode of hearing.[197] [198] [199] [200] Almost all information in cochlear implants is conveyed by the envelope fluctuations in the different channels. This is sufficient to give reasonable perception of speech in quiet, but not in noisy or reverberant conditions.[201] [202] [203] [204] [115] [102] [205] [206] [207] [208] The processing in cochlear implants is such that the TFSp is discarded in favor of fixed-rate pulse trains amplitude-modulated by the ENVp within each frequency band. Implant users are sensitive to these ENVp modulations, but performance varies across stimulation site, stimulation level, and across individuals.[209] [210] The TMTF shows a low-pass filter shape similar to that observed in normal-hearing listeners.[210] [211] [212] Voice pitch or musical pitch information, conveyed primarily via weak periodicity cues in the ENVp, results in a pitch sensation that is not salient enough to support music perception,[213] [214] talker sex identification,[215] [216] lexical tones,[217] [218] or prosodic cues.[219] [220] [221] Listeners with cochlear implants are susceptible to interference in the modulation domain[222] [223] which likely contributes to difficulties listening in noise.

Temporal fine structure transmission

Hearing aids usually process sounds by filtering them into multiple frequency channels and applying AGC in each channel. Other signal processing in hearing aids, such as noise reduction, also involves filtering the input into multiple channels.[224] The filtering into channels can affect the TFSp of sounds depending on characteristics such as the phase response and group delay of the filters. However, such effects are usually small. Cochlear implants also filter the input signal into frequency channels. Usually, the ENVp of the signal in each channel is transmitted to the implanted electrodes in the form an electrical pulses of fixed rate that are modulated in amplitude or duration. Information about TFSp is discarded. This is justified by the observation that people with cochlear implants have a very limited ability to process TFSp information, even if it is transmitted to the electrodes,[225] perhaps because of a mismatch between the temporal information and the place in the cochlea to which it is delivered[74] Reducing this mismatch may improve the ability to use TFSp information and hence lead to better pitch perception.[226] Some cochlear implant systems transmit information about TFSp in the channels of the cochlear implants that are tuned to low audio frequencies, and this may improve the pitch perception of low-frequency sounds.[227]

Training effects and plasticity of temporal-envelope processing

Perceptual learning resulting from training has been reported for various auditory AM detection or discrimination tasks,[228] [229] [230] suggesting that the responses of central auditory neurons to ENVp cues are plastic and that practice may modify the circuitry of ENVn processing.[230] [231]

The plasticity of ENVn processing has been demonstrated in several ways. For instance, the ability of auditory-cortex neurons to discriminate voice-onset time cues for phonemes is degraded following moderate hearing loss (20-40 dB HL) induced by acoustic trauma.[232] Interestingly, developmental hearing loss reduces cortical responses to slow, but not fast (100 Hz) AM stimuli, in parallel with behavioral performance.[233] As a matter of fact, a transient hearing loss (15 days) occurring during the "critical period" is sufficient to elevate AM thresholds in adult gerbils.[234] Even non-traumatic noise exposure reduces the phase-locking ability of cortical neurons as well as the animals' behavioral capacity to discriminate between different AM sounds.[235] Behavioral training or pairing protocols involving neuromodulators also alter the ability of cortical neurons to phase lock to AM sounds.[236] [237] In humans, hearing loss may result in an unbalanced representation of speech cues: ENVn cues are enhanced at the cost of TFSn cues (see: Effects of age and hearing loss on temporal envelope processing). Auditory training may reduce the representation of speech ENVn cues for elderly listeners with hearing loss, who may then reach levels comparable to those observed for normal-hearing elderly listeners.[238] Last, intensive musical training induces both behavioral effects such as higher sensitivity to pitch variations (for Mandarin linguistic pitch) and a better synchronization of brainstem responses to the f0-contour of lexical tones for musicians compared with non-musicians.[239]

Clinical evaluation of TFS sensitivity

Fast and easy to administer psychophysical tests have been developed to assist clinicians in the screening of TFS-processing abilities and diagnosis of suprathreshold temporal auditory processing deficits associated with cochlear damage and ageing. These tests may also be useful for audiologists and hearing-aid manufacturers to explain and/or predict the outcome of hearing-aid fitting in terms of perceived quality, speech intelligibility or spatial hearing.[240] [241] These tests may eventually be used to recommend the most appropriate compression speed in hearing aids [242] or the use of directional microphones. The need for such tests is corroborated by strong correlations between slow-FM or spectro-temporal modulation detection thresholds and aided speech intelligibility in competing backgrounds for hearing-impaired persons.[87] [243] Clinical tests can be divided into two groups: those assessing monaural TFS processing capacities (TFS1 test) and those assessing binaural capacities (binaural pitch, TFS-LF, TFS-AF).

TFS1: this test assesses the ability to discriminate between a harmonic complex tone and its frequency-transposed (and thus, inharmonic) version.[244] [245] [246] [156] Binaural pitch: these tests evaluate the ability to detect and discriminate binaural pitch, and melody recognition using different types of binaural pitch.[181] [247] TFS-LF: this test assesses the ability to discriminate low-frequency pure tones that are identical at the two ears from the same tones differing in interaural phase.[248] [249] TFS AF: this test assesses the highest audio frequency of a pure tone up to which a change in interaural phase can be discriminated.[250]

Objective measures using envelope and TFS cues

Signal distortion, additive noise, reverberation, and audio processing strategies such as noise suppression and dynamic-range compression can all impact speech intelligibility and speech and music quality.[251] [252] [253] [254] [255] These changes in the perception of the signal can often be predicted by measuring the associated changes in the signal envelope and/or temporal fine structure (TFS). Objective measures of the signal changes, when combined with procedures that associate the signal changes with differences in auditory perception, give rise to auditory performance metrics for predicting speech intelligibility and speech quality.

Changes in the TFS can be estimated by passing the signals through a filterbank and computing the coherence[256] between the system input and output in each band. Intelligibility predicted from the coherence is accurate for some forms of additive noise and nonlinear distortion,[251] [255] but works poorly for ideal binary mask (IBM) noise suppression.[253] Speech and music quality for signals subjected to noise and clipping distortion have also been modeled using the coherence [257] or using the coherence averaged across short signal segments.[258]

Changes in the signal envelope can be measured using several different procedures. The presence of noise or reverberation will reduce the modulation depth of a signal, and multiband measurement of the envelope modulation depth of the system output is used in the speech transmission index (STI) to estimate intelligibility.[259] While accurate for noise and reverberation applications, the STI works poorly for nonlinear processing such as dynamic-range compression.[260] An extension to the STI estimates the change in modulation by cross-correlating the envelopes of the speech input and output signals.[261] [262] A related procedure, also using envelope cross-correlations, is the short-time objective intelligibility (STOI) measure,[253] which works well for its intended application in evaluating noise suppression, but which is less accurate for nonlinear distortion.[263] Envelope-based intelligibility metrics have also been derived using modulation filterbanks [64] and using envelope time-frequency modulation patterns.[264] Envelope cross-correlation is also used for estimating speech and music quality.[265] [266]

Envelope and TFS measurements can also be combined to form intelligibility and quality metrics. A family of metrics for speech intelligibility,[263] speech quality,[267] [268] and music quality [269] has been derived using a shared model of the auditory periphery [270] that can represent hearing loss. Using a model of the impaired periphery leads to more accurate predictions for hearing-impaired listeners than using a normal-hearing model, and the combined envelope/TFS metric is generally more accurate than a metric that uses envelope modulation alone.[263] [267]

See also

Notes and References

  1. Book: Human Psychophysics. Viemeister. Neal F.. Plack. Christopher J.. Time Analysis . vanc . 1993. Springer, New York, NY. 978-1-4612-7644-9. Springer Handbook of Auditory Research. 3 . 116–154. 10.1007/978-1-4612-2728-1_4.
  2. De Boer E. Pitch of inharmonic signals. Nature. 178. 4532. 535–6. September 1956. 13358790. 10.1038/178535a0. 1956Natur.178..535B. 4241424.
  3. Zeng FG, Nie K, Liu S, Stickney G, Del Rio E, Kong YY, Chen H. On the dichotomy in auditory perception between temporal envelope and fine structure cues. The Journal of the Acoustical Society of America. 116. 3. 1351–4. September 2004. 15478399. 10.1121/1.1777938. 2004ASAJ..116.1351Z.
  4. Plomp. Reinier. vanc . 1983. Perception of speech as a modulated signal. Proceedings of the 10th International Congress of Phonetic Sciences, Utrecht. 19–40.
  5. Book: Hilbert, David. Grundzüge einer allgemeinen theorie der linearen integralgleichungen. vanc . 1912. Leipzig, B. G. Teubner. University of California Libraries.
  6. Ruggero MA. Response to noise of auditory nerve fibers in the squirrel monkey. Journal of Neurophysiology. 36. 4. 569–87. July 1973. 4197339. 10.1152/jn.1973.36.4.569.
  7. Book: Moore, Brian C. J.. Auditory Processing of Temporal Fine Structure: Effects of Age and Hearing Loss. vanc. 2014-05-04. World Scientific Publishing Company. 9789814579650. New Jersey.
  8. Joris PX, Louage DH, Cardoen L, van der Heijden M. Correlation index: a new metric to quantify temporal coding. Hearing Research. 216–217. 19–30. June 2006. 16644160. 10.1016/j.heares.2006.03.010. 32416471.
  9. Heinz MG, Swaminathan J. Quantifying envelope and fine-structure coding in auditory nerve responses to chimaeric speech. Journal of the Association for Research in Otolaryngology. 10. 3. 407–23. September 2009. 19365691. 3084379. 10.1007/s10162-009-0169-8.
  10. Søndergaard. Peter L.. Decorsière. Rémi. Dau. Torsten. vanc. 2011-12-15. On the relationship between multi-channel envelope and temporal fine structure. Proceedings of the International Symposium on Auditory and Audiological Research. 3. 363–370.
  11. Shamma S, Lorenzi C. On the balance of envelope and temporal fine structure in the encoding of speech in the early auditory system. The Journal of the Acoustical Society of America. 133. 5. 2818–33. May 2013. 23654388. 3663870. 10.1121/1.4795783. 2013ASAJ..133.2818S.
  12. Joris PX, Schreiner CE, Rees A. Neural processing of amplitude-modulated sounds. Physiological Reviews. 84. 2. 541–77. April 2004. 15044682. 10.1152/physrev.00029.2003.
  13. Frisina RD. Subcortical neural coding mechanisms for auditory temporal processing. Hearing Research. 158. 1–2. 1–27. August 2001. 11506933. 10.1016/S0378-5955(01)00296-9. 36727875.
  14. Pressnitzer D, Meddis R, Delahaye R, Winter IM. Physiological correlates of comodulation masking release in the mammalian ventral cochlear nucleus. The Journal of Neuroscience. 21. 16. 6377–86. August 2001. 11487661. 6763188. 10.1523/JNEUROSCI.21-16-06377.2001.
  15. Eggermont JJ. Temporal modulation transfer functions for AM and FM stimuli in cat auditory cortex. Effects of carrier type, modulating waveform and intensity. Hearing Research. 74. 1–2. 51–66. April 1994. 8040099. 10.1016/0378-5955(94)90175-9. 4766840.
  16. 10.1007/BF00228100 . 8815035 . 108 . 2 . Auditory responsive cortex in the squirrel monkey: neural responses to amplitude-modulated sounds . 1996 . Exp Brain Res . 273–84 . Bieser A, Müller-Preuss P. 19565315 .
  17. Liang L, Lu T, Wang X. Neural representations of sinusoidal amplitude and frequency modulations in the primary auditory cortex of awake primates. Journal of Neurophysiology. 87. 5. 2237–61. May 2002. 11976364. 10.1152/jn.2002.87.5.2237. 3184199 .
  18. Schreiner CE, Urbas JV. Representation of amplitude modulation in the auditory cortex of the cat. II. Comparison between cortical fields. Hearing Research. 32. 1. 49–63. January 1988. 3350774. 10.1016/0378-5955(88)90146-3. 8416108.
  19. Lu T, Liang L, Wang X. Temporal and rate representations of time-varying signals in the auditory cortex of awake primates. Nature Neuroscience. 4. 11. 1131–8. November 2001. 11593234. 10.1038/nn737. 1417232.
  20. Eggermont JJ. Rate and synchronization measures of periodicity coding in cat primary auditory cortex. Hearing Research. 56. 1–2. 153–67. November 1991. 1769910. 10.1016/0378-5955(91)90165-6. 4704094.
  21. Baumann S, Joly O, Rees A, Petkov CI, Sun L, Thiele A, Griffiths TD. The topography of frequency and time representation in primate auditory cortices. eLife. 4. January 2015. 25590651. 4398946. 10.7554/eLife.03256 . free .
  22. Book: Handbook of Modern Techniques in Auditory Cortex. 2014-01-15. Nova Science Pub Inc. 9781628088946. Depireux. Didier A.. first . Elhilali. Mounya . vanc.
  23. Kowalski N, Depireux DA, Shamma SA. Analysis of dynamic spectra in ferret primary auditory cortex. I. Characteristics of single-unit responses to moving ripple spectra. Journal of Neurophysiology. 76. 5. 3503–23. November 1996. 8930289. 10.1152/jn.1996.76.5.3503. 1903/5688 . free.
  24. Mesgarani N, Chang EF. Selective cortical representation of attended speaker in multi-talker speech perception. Nature. 485. 7397. 233–6. May 2012. 22522927. 3870007. 10.1038/nature11020. 2012Natur.485..233M.
  25. John MS, Picton TW. Human auditory steady-state responses to amplitude-modulated tones: phase and latency measurements. Hearing Research. 141. 1–2. 57–79. March 2000. 10713496. 10.1016/S0378-5955(99)00209-9. 12478710.
  26. Atiani S, David SV, Elgueda D, Locastro M, Radtke-Schuller S, Shamma SA, Fritz JB. Emergent selectivity for task-relevant stimuli in higher-order auditory cortex. Neuron. 82. 2. 486–99. April 2014. 24742467. 4048815. 10.1016/j.neuron.2014.02.029.
  27. Schreiner CE, Urbas JV. Representation of amplitude modulation in the auditory cortex of the cat. I. The anterior auditory field (AAF). Hearing Research. 21. 3. 227–41. 1986. 3013823. 10.1016/0378-5955(86)90221-2. 4703229.
  28. Giraud AL, Lorenzi C, Ashburner J, Wable J, Johnsrude I, Frackowiak R, Kleinschmidt A. Representation of the temporal envelope of sounds in the human brain. Journal of Neurophysiology. 84. 3. 1588–98. September 2000. 10980029. 10.1152/jn.2000.84.3.1588. 9020690 .
  29. Liégeois-Chauvel C, Lorenzi C, Trébuchon A, Régis J, Chauvel P. Temporal envelope processing in the human left and right auditory cortices. Cerebral Cortex. 14. 7. 731–40. July 2004. 15054052. 10.1093/cercor/bhh033. free.
  30. Herdener M, Esposito F, Scheffler K, Schneider P, Logothetis NK, Uludag K, Kayser C. Spatial representations of temporal and spectral sound cues in human auditory cortex. Cortex; A Journal Devoted to the Study of the Nervous System and Behavior. 49. 10. 2822–33. November 2013. 23706955. 10.1016/j.cortex.2013.04.003. 19454517.
  31. Schönwiesner M, Zatorre RJ. Spectro-temporal modulation transfer function of single voxels in the human auditory cortex measured with high-resolution fMRI. Proceedings of the National Academy of Sciences of the United States of America. 106. 34. 14611–6. August 2009. 19667199. 10.1073/pnas.0907682106. 2732853. 2009PNAS..10614611S. free.
  32. Griffiths TD, Penhune V, Peretz I, Dean JL, Patterson RD, Green GG. Frontal processing and auditory perception. NeuroReport. 11. 5. 919–22. April 2000. 10790855. 10.1097/00001756-200004070-00004. 17386864.
  33. Hullett PW, Hamilton LS, Mesgarani N, Schreiner CE, Chang EF. Human Superior Temporal Gyrus Organization of Spectrotemporal Modulation Tuning Derived from Speech Stimuli. The Journal of Neuroscience. 36. 6. 2014–26. February 2016. 26865624. 4748082. 10.1523/JNEUROSCI.1779-15.2016.
  34. Elhilali M, Fritz JB, Klein DJ, Simon JZ, Shamma SA. Dynamics of precise spike timing in primary auditory cortex. The Journal of Neuroscience. 24. 5. 1159–72. February 2004. 14762134. 6793586. 10.1523/JNEUROSCI.3825-03.2004.
  35. Book: Boer, E. de. Time Resolution in Auditory Systems. 1985. Springer, Berlin, Heidelberg. 9783642706240. Proceedings in Life Sciences. 141–158. 10.1007/978-3-642-70622-6_9. Auditory Time Constants: A Paradox?.
  36. Bair W, Koch C. Temporal precision of spike trains in extrastriate cortex of the behaving macaque monkey. Neural Computation. 8. 6. 1185–202. August 1996. 8768391. 10.1162/neco.1996.8.6.1185. 8615893.
  37. Simon JZ, Depireux DA, Klein DJ, Fritz JB, Shamma SA. Temporal symmetry in primary auditory cortex: implications for cortical connectivity. Neural Computation. 19. 3. 583–638. March 2007. 17298227. 10.1162/neco.2007.19.3.583 . q-bio/0608027. 10011056.
  38. Theunissen FE, Sen K, Doupe AJ. Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds. The Journal of Neuroscience. 20. 6. 2315–31. March 2000. 10704507. 6772498. 10.1523/JNEUROSCI.20-06-02315.2000.
  39. David SV, Mesgarani N, Fritz JB, Shamma SA. Rapid synaptic depression explains nonlinear modulation of spectro-temporal tuning in primary auditory cortex by natural stimuli. The Journal of Neuroscience. 29. 11. 3374–86. March 2009. 19295144. 2774136. 10.1523/JNEUROSCI.5249-08.2009.
  40. Bieser A, Müller-Preuss P. Auditory responsive cortex in the squirrel monkey: neural responses to amplitude-modulated sounds. Experimental Brain Research. 108. 2. 273–84. March 1996. 8815035 . 10.1007/bf00228100. 19565315.
  41. Book: Fast, Hugo. Psychoacoustics - Facts and Models. vanc. Springer. 9783540231592. 2007 .
  42. Burns . Edward M. . Viemeister . Neal F. . vanc . Played-again SAM: Further observations on the pitch of amplitude-modulated noise . The Journal of the Acoustical Society of America . December 1981 . 70 . 6 . 1655–1660 . 10.1121/1.387220 . 1981ASAJ...70.1655B.
  43. McDermott JH, Simoncelli EP. Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis. Neuron. 71. 5. 926–40. September 2011. 21903084. 4143345. 10.1016/j.neuron.2011.06.032.
  44. McWalter R, Dau T. Cascaded Amplitude Modulations in Sound Texture Perception. Frontiers in Neuroscience. 11. 485. 2017-09-11. 28955191. 5601004. 10.3389/fnins.2017.00485. free.
  45. Viemeister NF. Temporal modulation transfer functions based upon modulation thresholds. The Journal of the Acoustical Society of America. 66. 5. 1364–80. November 1979. 500975. 10.1121/1.383531. 1979ASAJ...66.1364V.
  46. Sheft S, Yost WA. Temporal integration in amplitude modulation detection. The Journal of the Acoustical Society of America. 88. 2. 796–805. August 1990. 2212305. 10.1121/1.399729. 1990ASAJ...88..796S.
  47. Kohlrausch A, Fassel R, Dau T. The influence of carrier level and frequency on modulation and beat-detection thresholds for sinusoidal carriers. The Journal of the Acoustical Society of America. 108. 2. 723–34. August 2000. 10955639. 10.1121/1.429605. 2000ASAJ..108..723K.
  48. Hall JW, Haggard MP, Fernandes MA. Detection in noise by spectro-temporal pattern analysis. The Journal of the Acoustical Society of America. 76. 1. 50–6. July 1984. 6747111. 10.1121/1.391005. 1984ASAJ...76R..50H.
  49. Bacon SP, Grantham DW. Modulation masking: effects of modulation frequency, depth, and phase. The Journal of the Acoustical Society of America. 85. 6. 2575–80. June 1989. 2745880. 10.1121/1.397751. 1989ASAJ...85.2575B.
  50. Yost WA, Sheft S. Across-critical-band processing of amplitude-modulated tones. The Journal of the Acoustical Society of America. 85. 2. 848–57. February 1989. 2925999. 10.1121/1.397556. 1989ASAJ...85..848Y.
  51. Kay RH, Matthews DR. On the existence in human auditory pathways of channels selectively tuned to the modulation present in frequency-modulated tones. The Journal of Physiology. 225. 3. 657–77. September 1972. 5076392. 1331136. 10.1113/jphysiol.1972.sp009962.
  52. Tansley BW, Suffield JB. Time course of adaptation and recovery of channels selectively sensitive to frequency and amplitude modulation. The Journal of the Acoustical Society of America. 74. 3. 765–75. September 1983. 6630734. 10.1121/1.389864. 1983ASAJ...74..765T.
  53. Wojtczak M, Viemeister NF. Suprathreshold effects of adaptation produced by amplitude modulation. The Journal of the Acoustical Society of America. 114. 2. 991–7. August 2003. 12942978. 10.1121/1.1593067. 2003ASAJ..114..991W. free.
  54. Lorenzi C, Simpson MI, Millman RE, Griffiths TD, Woods WP, Rees A, Green GG. Second-order modulation detection thresholds for pure-tone and narrow-band noise carriers. The Journal of the Acoustical Society of America. 110. 5 Pt 1. 2470–8. November 2001. 11757936. 10.1121/1.1406160. 2001ASAJ..110.2470L.
  55. Ewert SD, Verhey JL, Dau T. Spectro-temporal processing in the envelope-frequency domain. The Journal of the Acoustical Society of America. 112. 6. 2921–31. December 2002. 12509013. 10.1121/1.1515735. 2002ASAJ..112.2921E.
  56. Füllgrabe C, Moore BC, Demany L, Ewert SD, Sheft S, Lorenzi C. Modulation masking produced by second-order modulators. The Journal of the Acoustical Society of America. 117. 4 Pt 1. 2158–68. April 2005. 15898657. 2708918. 10.1121/1.1861892. 2005ASAJ..117.2158F.
  57. Klein-Hennig M, Dietz M, Hohmann V, Ewert SD. June 2011. The influence of different segments of the ongoing envelope on sensitivity to interaural time delays. The Journal of the Acoustical Society of America. 129. 6. 3856–72. 10.1121/1.3585847. 21682409. 2011ASAJ..129.3856K.
  58. Strickland EA, Viemeister NF. Cues for discrimination of envelopes. The Journal of the Acoustical Society of America. 99. 6. 3638–46. June 1996. 8655796. 10.1121/1.414962. 1996ASAJ...99.3638S.
  59. Houtgast T. Frequency selectivity in amplitude-modulation detection. The Journal of the Acoustical Society of America. 85. 4. 1676–80. April 1989. 2708683. 10.1121/1.397956. 1989ASAJ...85.1676H.
  60. Dau T, Kollmeier B, Kohlrausch A. Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers. The Journal of the Acoustical Society of America. 102. 5 Pt 1. 2892–905. November 1997. 9373976. 10.1121/1.420344. 1997ASAJ..102.2892D. 3500293 .
  61. Piechowiak T, Ewert SD, Dau T. Modeling comodulation masking release using an equalization-cancellation mechanism. The Journal of the Acoustical Society of America. 121. 4. 2111–26. April 2007. 17471726. 10.1121/1.2534227. 2007ASAJ..121.2111P. 5001122 .
  62. Ewert SD, Dau T. Characterizing frequency selectivity for envelope fluctuations. The Journal of the Acoustical Society of America. 108. 3 Pt 1. 1181–96. September 2000. 11008819. 10.1121/1.1288665. 2000ASAJ..108.1181E.
  63. Wakefield GH, Viemeister NF. Discrimination of modulation depth of sinusoidal amplitude modulation (SAM) noise. The Journal of the Acoustical Society of America. 88. 3. 1367–73. September 1990. 2229672. 10.1121/1.399714. 1990ASAJ...88.1367W.
  64. Jørgensen S, Ewert SD, Dau T. A multi-resolution envelope-power based model for speech intelligibility. The Journal of the Acoustical Society of America. 134. 1. 436–46. July 2013. 23862819. 10.1121/1.4807563. 2013ASAJ..134..436J.
  65. Biberger T, Ewert SD. Envelope and intensity based prediction of psychoacoustic masking and speech intelligibility. The Journal of the Acoustical Society of America. 140. 2. 1023–1038. August 2016. 27586734. 10.1121/1.4960574. 2016ASAJ..140.1023B.
  66. Nelson PC, Carney LH. Cues for masked amplitude-modulation detection. The Journal of the Acoustical Society of America. 120. 2. 978–90. August 2006. 16938985. 2572864. 10.1121/1.2213573. 2006ASAJ..120..978N.
  67. Verschooten E, Robles L, Joris PX. Assessment of the limits of neural phase-locking using mass potentials. The Journal of Neuroscience. 35. 5. 2255–68. February 2015. 25653380. 6705351. 10.1523/JNEUROSCI.2979-14.2015.
  68. Palmer AR, Russell IJ. Phase-locking in the cochlear nerve of the guinea-pig and its relation to the receptor potential of inner hair-cells. Hearing Research. 24. 1. 1–15. 1986. 3759671. 10.1016/0378-5955(86)90002-X. 40262271.
  69. Weiss TF, Rose C. A comparison of synchronization filters in different auditory receptor organs. Hearing Research. 33. 2. 175–9. May 1988. 3397327. 10.1016/0378-5955(88)90030-5. 20291529.
  70. Paraouty N, Stasiak A, Lorenzi C, Varnet L, Winter IM. Dual Coding of Frequency Modulation in the Ventral Cochlear Nucleus. The Journal of Neuroscience. 38. 17. 4123–4137. April 2018. 29599389. 6596033. 10.1523/JNEUROSCI.2107-17.2018.
  71. Moore BC. Frequency difference limens for short-duration tones. The Journal of the Acoustical Society of America. 54. 3. 610–9. September 1973. 4754385. 10.1121/1.1913640. 1973ASAJ...54..610M.
  72. Moore BC. The role of temporal fine structure processing in pitch perception, masking, and speech perception for normal-hearing and hearing-impaired people. Journal of the Association for Research in Otolaryngology. 9. 4. 399–406. December 2008. 18855069. 2580810. 10.1007/s10162-008-0143-x.
  73. Book: Moore, Brian. An Introduction to the Psychology of Hearing: Sixth Edition. vanc . 2013-04-05. BRILL. 9789004252424. 6th . Leiden.
  74. Book: Plack, Christopher J.. Pitch - Neural Coding and Perception. vanc. Springer. 2005. 9780387234724. Springer Handbook of Auditory Research.
  75. Moore BC, Sek A. Detection of frequency modulation at low modulation rates: evidence for a mechanism based on phase locking. The Journal of the Acoustical Society of America. 100. 4 Pt 1. 2320–31. October 1996. 8865639. 10.1121/1.417941. 1996ASAJ..100.2320M.
  76. Lacher-Fougère S, Demany L. Consequences of cochlear damage for the detection of interaural phase differences. The Journal of the Acoustical Society of America. 118. 4. 2519–26. October 2005. 16266172. 10.1121/1.2032747. 2005ASAJ..118.2519L. 1218899. free.
  77. Hopkins K, Moore BC, Stone MA. Effects of moderate cochlear hearing loss on the ability to benefit from temporal fine structure information in speech. The Journal of the Acoustical Society of America. 123. 2. 1140–53. February 2008. 18247914. 2688774. 10.1121/1.2824018. 2008ASAJ..123.1140H.
  78. Oxenham AJ, Bernstein JG, Penagos H. Correct tonotopic representation is necessary for complex pitch perception. Proceedings of the National Academy of Sciences of the United States of America. 101. 5. 1421–5. February 2004. 14718671. 337068. 10.1073/pnas.0306958101. free.
  79. Oxenham AJ, Micheyl C, Keebler MV, Loper A, Santurette S. Pitch perception beyond the traditional existence region of pitch. Proceedings of the National Academy of Sciences of the United States of America. 108. 18. 7629–34. May 2011. 21502495. 3088642. 10.1073/pnas.1015291108. free.
  80. Demany L, Semal C. Detection thresholds for sinusoidal frequency modulation. The Journal of the Acoustical Society of America. 85. 3. 1295–301. March 1989. 2708671. 10.1121/1.397460. 1989ASAJ...85.1295D.
  81. Ernst SM, Moore BC. Mechanisms underlying the detection of frequency modulation. The Journal of the Acoustical Society of America. 128. 6. 3642–8. December 2010. 21218896. 10.1121/1.3506350. 2010ASAJ..128.3642E.
  82. Zwicker. E. 1956-01-01. Die elementaren Grundlagen zur Bestimmung der Informationskapazität des Gehörs. Acta Acustica United with Acustica. 6. 4. 365–381.
  83. Maiwald. D. 1967. Ein Funktionsschema des Gehors zur Beschreibung der Erkennbarkeit kleiner Frequenz und Amplitudenanderungen. Acustica. 18. 81–92.
  84. Saberi K, Hafter ER. A common neural code for frequency- and amplitude-modulated sounds. Nature. 374. 6522. 537–9. April 1995. 7700378. 10.1038/374537a0. 1995Natur.374..537S. 4236436.
  85. Ruggles D, Bharadwaj H, Shinn-Cunningham BG. Normal hearing is not enough to guarantee robust encoding of suprathreshold features important in everyday communication. Proceedings of the National Academy of Sciences of the United States of America. 108. 37. 15516–21. September 2011. 21844339. 3174666. 10.1073/pnas.1108912108. 2011PNAS..10815516R. free.
  86. Johannesen PT, Pérez-González P, Kalluri S, Blanco JL, Lopez-Poveda EA. The Influence of Cochlear Mechanical Dysfunction, Temporal Processing Deficits, and Age on the Intelligibility of Audible Speech in Noise for Hearing-Impaired Listeners. Trends in Hearing. 20. 233121651664105. September 2016. 27604779. 5017567. 10.1177/2331216516641055.
  87. Lopez-Poveda EA, Johannesen PT, Pérez-González P, Blanco JL, Kalluri S, Edwards B. Predictors of Hearing-Aid Outcomes. Trends in Hearing. 21. 2331216517730526. January 2017. 28929903. 5613846. 10.1177/2331216517730526.
  88. Ewert SD, Paraouty N, Lorenzi C. A two-path model of auditory modulation detection using temporal fine structure and envelope cues. The European Journal of Neuroscience. January 2018. 51. 5. 1265–1278. 29368797. 10.1111/ejn.13846. 46843680.
  89. Zilany MS, Bruce IC, Nelson PC, Carney LH. A phenomenological model of the synapse between the inner hair cell and auditory nerve: long-term adaptation with power-law dynamics. The Journal of the Acoustical Society of America. 126. 5. 2390–412. November 2009. 19894822. 2787068. 10.1121/1.3238250. 2009ASAJ..126.2390Z.
  90. Zilany MS, Bruce IC, Carney LH. Updated parameters and expanded simulation options for a model of the auditory periphery. The Journal of the Acoustical Society of America. 135. 1. 283–6. January 2014. 24437768. 3985897. 10.1121/1.4837815. 2014ASAJ..135..283Z.
  91. Wirtzfeld MR, Ibrahim RA, Bruce IC. Predictions of Speech Chimaera Intelligibility Using Auditory Nerve Mean-Rate and Spike-Timing Neural Cues. Journal of the Association for Research in Otolaryngology. 18. 5. 687–710. October 2017. 28748487. 5612921. 10.1007/s10162-017-0627-7.
  92. Moon IJ, Won JH, Park MH, Ives DT, Nie K, Heinz MG, Lorenzi C, Rubinstein JT. Optimal combination of neural temporal envelope and fine structure cues to explain speech identification in background noise. The Journal of Neuroscience. 34. 36. 12145–54. September 2014. 25186758. 4152611. 10.1523/JNEUROSCI.1025-14.2014.
  93. Heinz MG, Colburn HS, Carney LH. Evaluating auditory performance limits: i. one-parameter discrimination using a computational model for the auditory nerve. Neural Computation. 13. 10. 2273–316. October 2001. 11570999. 10.1162/089976601750541804. 14775452.
  94. Heinz MG, Colburn HS, Carney LH. Evaluating auditory performance limits: II. One-parameter discrimination with random-level variation. Neural Computation. 13. 10. 2317–38. October 2001. 11571000. 10.1162/089976601750541813. 7756946.
  95. Carney . Laurel H. . Heinzy . Michael G. . Evilsizer . Mary E. . Gilkeyz . Robert H. . Colburn . H. Steven . Auditory Phase Opponency: A Temporal Model for Masked Detection at Low Frequencies . Acta Acustica United with Acustica . 2002 . 88 . 3 . 334–47 .
  96. Deng L, Geisler CD. A composite auditory model for processing speech sounds. The Journal of the Acoustical Society of America. 82. 6. 2001–12. December 1987. 3429735. 10.1121/1.395644. 1987ASAJ...82.2001D.
  97. Loeb GE, White MW, Merzenich MM. Spatial cross-correlation. A proposed mechanism for acoustic pitch perception. Biological Cybernetics. 47. 3. 149–63. 1983. 6615914. 10.1007/BF00337005. 454354.
  98. Shamma S, Klein D. The case of the missing pitch templates: how harmonic templates emerge in the early auditory system. The Journal of the Acoustical Society of America. 107. 5 Pt 1. 2631–44. May 2000. 10830385. 10.1121/1.428649. 2000ASAJ..107.2631S . 1903/6017 . free.
  99. Shamma SA. Speech processing in the auditory system. II: Lateral inhibition and the central processing of speech evoked activity in the auditory nerve. The Journal of the Acoustical Society of America. 78. 5. 1622–32. November 1985. 3840813. 10.1121/1.392800. 1985ASAJ...78.1622S.
  100. Van Tasell DJ, Soli SD, Kirby VM, Widin GP. Speech waveform envelope cues for consonant recognition. The Journal of the Acoustical Society of America. 82. 4. 1152–61. October 1987. 3680774. 10.1121/1.395251. 1987ASAJ...82.1152V.
  101. Ghitza O. On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception. The Journal of the Acoustical Society of America. 110. 3 Pt 1. 1628–40. September 2001. 11572372. 10.1121/1.1396325. 2001ASAJ..110.1628G.
  102. Shannon RV, Zeng FG, Kamath V, Wygonski J, Ekelid M. Speech recognition with primarily temporal cues. Science. 270. 5234. 303–4. October 1995. 7569981. 10.1126/science.270.5234.303. 1995Sci...270..303S. 581599.
  103. Smith ZM, Delgutte B, Oxenham AJ. Chimaeric sounds reveal dichotomies in auditory perception. Nature. 416. 6876. 87–90. March 2002. 11882898. 2268248. 10.1038/416087a. 2002Natur.416...87S.
  104. Drullman R, Festen JM, Plomp R. Effect of temporal envelope smearing on speech reception. The Journal of the Acoustical Society of America. 95. 2. 1053–64. February 1994. 8132899. 10.1121/1.408467. 1994ASAJ...95.1053D.
  105. Varnet L, Ortiz-Barajas MC, Erra RG, Gervain J, Lorenzi C. A cross-linguistic study of speech modulation spectra. The Journal of the Acoustical Society of America. 142. 4. 1976–1989. October 2017. 29092595. 10.1121/1.5006179. 2017ASAJ..142.1976V. 25269485 .
  106. Singh NC, Theunissen FE. Modulation spectra of natural sounds and ethological theories of auditory processing. The Journal of the Acoustical Society of America. 114. 6 Pt 1. 3394–411. December 2003. 14714819. 10.1121/1.1624067. 2003ASAJ..114.3394S.
  107. Rosen S. Temporal information in speech: acoustic, auditory and linguistic aspects. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 336. 1278. 367–73. June 1992. 1354376. 10.1098/rstb.1992.0070. 1992RSPTB.336..367R.
  108. Iverson P, Krumhansl CL. Isolating the dynamic attributes of musical timbre. The Journal of the Acoustical Society of America. 94. 5. 2595–603. November 1993. 8270737. 10.1121/1.407371. 1993ASAJ...94.2595I. 16374075.
  109. Book: Cheveigné, Alain de. Pitch. 24. 2005. Springer, New York, NY. 9780387234724. Springer Handbook of Auditory Research. 169–233. 10.1007/0-387-28958-5_6. Pitch Perception Models.
  110. Moore BC, Glasberg BR, Low KE, Cope T, Cope W. Effects of level and frequency on the audibility of partials in inharmonic complex tones. The Journal of the Acoustical Society of America. 120. 2. 934–44. August 2006. 16938981. 10.1121/1.2216906. 2006ASAJ..120..934M.
  111. Terhardt E. Pitch, consonance, and harmony. The Journal of the Acoustical Society of America. 55. 5. 1061–9. May 1974. 4833699. 10.1121/1.1914648. 1974ASAJ...55.1061T.
  112. Santurette S, Dau T. The role of temporal fine structure information for the low pitch of high-frequency complex tones. The Journal of the Acoustical Society of America. 129. 1. 282–92. January 2011. 21303009. 10.1121/1.3518718. 2011ASAJ..129..282S.
  113. Santurette S, Dau T, Oxenham AJ. On the possibility of a place code for the low pitch of high-frequency complex tones. The Journal of the Acoustical Society of America. 132. 6. 3883–95. December 2012. 23231119. 3528728. 10.1121/1.4764897. 2012ASAJ..132.3883S.
  114. Gfeller K, Turner C, Oleson J, Zhang X, Gantz B, Froman R, Olszewski C. Accuracy of cochlear implant recipients on pitch perception, melody recognition, and speech reception in noise. Ear and Hearing. 28. 3. 412–23. June 2007. 17485990. 10.1097/AUD.0b013e3180479318. 21933200.
  115. Zeng FG, Nie K, Stickney GS, Kong YY, Vongphoe M, Bhargave A, Wei C, Cao K. Speech recognition with amplitude and frequency modulations. Proceedings of the National Academy of Sciences of the United States of America. 102. 7. 2293–8. February 2005. 15677723. 546014. 10.1073/pnas.0406460102. 2005PNAS..102.2293Z. free.
  116. Lorenzi C, Gilbert G, Carn H, Garnier S, Moore BC. Speech perception problems of the hearing impaired reflect inability to use temporal fine structure. Proceedings of the National Academy of Sciences of the United States of America. 103. 49. 18866–9. December 2006. 17116863. 1693753. 10.1073/pnas.0607364103. 2006PNAS..10318866L. free.
  117. Apoux F, Yoho SE, Youngdahl CL, Healy EW. Role and relative contribution of temporal envelope and fine structure cues in sentence recognition by normal-hearing listeners. The Journal of the Acoustical Society of America. 134. 3. 2205–12. September 2013. 23967950. 3765279. 10.1121/1.4816413. 2013ASAJ..134.2205A.
  118. Strelcyk O, Dau T. Relations between frequency selectivity, temporal fine-structure processing, and speech reception in impaired hearing. The Journal of the Acoustical Society of America. 125. 5. 3328–45. May 2009. 19425674. 10.1121/1.3097469. 2009ASAJ..125.3328S. 8062937 .
  119. Hopkins K, Moore BC. The effects of age and cochlear hearing loss on temporal fine structure sensitivity, frequency selectivity, and speech reception in noise. The Journal of the Acoustical Society of America. 130. 1. 334–49. July 2011. 21786903. 10.1121/1.3585848. 2011ASAJ..130..334H.
  120. Freyman RL, Griffin AM, Oxenham AJ. Intelligibility of whispered speech in stationary and modulated noise maskers. The Journal of the Acoustical Society of America. 132. 4. 2514–23. October 2012. 23039445. 3477190. 10.1121/1.4747614. 2012ASAJ..132.2514F.
  121. Book: 10.1016/b978-0-12-407794-2.00089-4 . Environmental Sounds . Neurobiology of Language . 1121–1138 . 2016 . Dick . Frederic . Krishnan . Saloni . Leech . Robert . Saygin . Ayşe Pinar . 978-0-12-407794-2.
  122. Book: 10.1007/978-3-319-63450-0_3 . Acoustics and Psychoacoustics of Sound Scenes and Events . Computational Analysis of Sound Scenes and Events . 41–67 . 2018 . Lemaitre . Guillaume . Grimault . Nicolas . Suied . Clara . 978-3-319-63449-4.
  123. Shafiro . Valeriy . Identification of Environmental Sounds With Varying Spectral Resolution . Ear and Hearing . June 2008 . 29 . 3 . 401–420 . 10.1097/AUD.0b013e31816a0cf1 . 18344871. 44607434 .
  124. Gygi . Brian . Kidd . Gary R. . Watson . Charles S. . Spectral-temporal factors in the identification of environmental sounds . The Journal of the Acoustical Society of America . March 2004 . 115 . 3 . 1252–1265 . 10.1121/1.1635840 . 15058346 . 2004ASAJ..115.1252G.
  125. Warren . William H. . Verbrugge . Robert R. . Auditory perception of breaking and bouncing events: A case study in ecological acoustics. . Journal of Experimental Psychology: Human Perception and Performance . 1984 . 10 . 5 . 704–712 . 10.1037/0096-1523.10.5.704 . 6238128.
  126. Inverso . Yell . Limb . Charles J. . Cochlear Implant-Mediated Perception of Nonlinguistic Sounds . Ear and Hearing . August 2010 . 31 . 4 . 505–514 . 10.1097/AUD.0b013e3181d99a52 . 20588119. 24076889 .
  127. Shafiro . Valeriy . Gygi . Brian . Cheng . Min-Yu . Vachhani . Jay . Mulvey . Megan . Perception of Environmental Sounds by Experienced Cochlear Implant Patients . Ear and Hearing . July 2011 . 32 . 4 . 511–523 . 10.1097/AUD.0b013e3182064a87 . 21248643 . 3115425.
  128. Harris . Michael S. . Boyce . Lauren . Pisoni . David B. . Shafiro . Valeriy . Moberly . Aaron C. . The Relationship Between Environmental Sound Awareness and Speech Recognition Skills in Experienced Cochlear Implant Users . Otology & Neurotology . October 2017 . 38 . 9 . e308–e314 . 10.1097/MAO.0000000000001514 . 6205294 . 28731964.
  129. Moore BC, Gockel HE. Properties of auditory stream formation. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 367. 1591. 919–31. April 2012. 22371614. 3282308. 10.1098/rstb.2011.0355.
  130. Cusack R, Roberts B. Effects of differences in the pattern of amplitude envelopes across harmonics on auditory stream segregation. Hearing Research. 193. 1–2. 95–104. July 2004. 15219324. 10.1016/j.heares.2004.03.009. 2056844.
  131. Vliegen J, Oxenham AJ. Sequential stream segregation in the absence of spectral cues. The Journal of the Acoustical Society of America. 105. 1. 339–46. January 1999. 9921660. 10.1121/1.424503. 1999ASAJ..105..339V.
  132. Grimault N, Micheyl C, Carlyon RP, Arthaud P, Collet L. Influence of peripheral resolvability on the perceptual segregation of harmonic complex tones differing in fundamental frequency. The Journal of the Acoustical Society of America. 108. 1. 263–71. July 2000. 10923890. 10.1121/1.429462. 2000ASAJ..108..263G.
  133. Grimault N, Bacon SP, Micheyl C. Auditory stream segregation on the basis of amplitude-modulation rate. The Journal of the Acoustical Society of America. 111. 3. 1340–8. March 2002. 11931311. 10.1121/1.1452740. 2002ASAJ..111.1340G.
  134. Yamagishi S, Otsuka S, Furukawa S, Kashino M. Comparison of perceptual properties of auditory streaming between spectral and amplitude modulation domains. Hearing Research. 350. 244–250. July 2017. 28323019. 10.1016/j.heares.2017.03.006. free.
  135. David M, Lavandier M, Grimault N, Oxenham AJ. Discrimination and streaming of speech sounds based on differences in interaural and spectral cues. The Journal of the Acoustical Society of America. 142. 3. 1674–1685. September 2017. 28964066. 5617732. 10.1121/1.5003809. 2017ASAJ..142.1674D.
  136. Levi EC, Werner LA. 1996 . Amplitude modulation detection in infancy: Update on 3-month-olds. Assoc. Res. Otolaryngol. 19. 142.
  137. Werner LA. The development of auditory behavior (or what the anatomists and physiologists have to explain). Ear and Hearing. 17. 5. 438–46. October 1996. 8909892. 10.1097/00003446-199610000-00010. 37813999.
  138. Werner LA. Forward masking among infant and adult listeners. The Journal of the Acoustical Society of America. 105. 4. 2445–53. April 1999. 10212425. 10.1121/1.426849. 1999ASAJ..105.2445W.
  139. Levi EC, Folsom RC, Dobie RA. Coherence analysis of envelope-following responses (EFRs) and frequency-following responses (FFRs) in infants and adults. Hearing Research. 89. 1–2. 21–7. September 1995. 8600128. 10.1016/0378-5955(95)00118-3. 4764350.
  140. Levi EC, Folsom RC, Dobie RA. Amplitude-modulation following response (AMFR): effects of modulation rate, carrier frequency, age, and state. Hearing Research. 68. 1. 42–52. June 1993. 8376214. 10.1016/0378-5955(93)90063-7. 4703747.
  141. Hall JW, Grose JH. Development of temporal resolution in children as measured by the temporal modulation transfer function. The Journal of the Acoustical Society of America. 96. 1. 150–4. July 1994. 7598757. 10.1121/1.410474. 1994ASAJ...96..150H.
  142. Peter V, Wong K, Narne VK, Sharma M, Purdy SC, McMahon C. Assessing spectral and temporal processing in children and adults using temporal modulation transfer function (TMTF), Iterated Ripple Noise (IRN) perception, and spectral ripple discrimination (SRD). Journal of the American Academy of Audiology. 25. 2. 210–8. February 2014. 24828221. 10.3766/jaaa.25.2.9.
  143. Werner LA. Issues in human auditory development. Journal of Communication Disorders. 40. 4. 275–83. 2007. 17420028. 1975821. 10.1016/j.jcomdis.2007.03.004.
  144. Buss E, Hall JW, Grose JH, Dev MB. Development of adult-like performance in backward, simultaneous, and forward masking. Journal of Speech, Language, and Hearing Research. 42. 4. 844–9. August 1999. 10450905. 10.1044/jslhr.4204.844.
  145. Cabrera L, Werner L. Infants' and Adults' Use of Temporal Cues in Consonant Discrimination. Ear and Hearing. 38. 4. 497–506. July 2017. 28338496. 5482774. 10.1097/AUD.0000000000000422.
  146. Bertoncini J, Serniclaes W, Lorenzi C. Discrimination of speech sounds based upon temporal envelope versus fine structure cues in 5- to 7-year-old children. Journal of Speech, Language, and Hearing Research. 52. 3. 682–95. June 2009. 18952853. 10.1044/1092-4388(2008/07-0273).
  147. Book: Le Prell, Colleen G.. Noise-Induced Hearing Loss - Scientific Advances. vanc. Springer. 2012 . 9781441995223 . Springer Handbook of Auditory Research.
  148. Book: Manley, Geoffrey A.. Understanding the Cochlea. vanc. Springer. 2017 . 9783319520711 . Springer Handbook of Auditory Research.
  149. Kale S, Heinz MG. Envelope coding in auditory nerve fibers following noise-induced hearing loss. Journal of the Association for Research in Otolaryngology. 11. 4. 657–73. December 2010. 20556628. 2975881. 10.1007/s10162-010-0223-6.
  150. Zhong Z, Henry KS, Heinz MG. Sensorineural hearing loss amplifies neural coding of envelope information in the central auditory system of chinchillas. Hearing Research. 309. 55–62. March 2014. 24315815. 3922929. 10.1016/j.heares.2013.11.006.
  151. Kale S, Heinz MG. Temporal modulation transfer functions measured from auditory-nerve responses following sensorineural hearing loss. Hearing Research. 286. 1–2. 64–75. April 2012. 22366500. 3326227. 10.1016/j.heares.2012.02.004.
  152. Henry KS, Kale S, Heinz MG. Noise-induced hearing loss increases the temporal precision of complex envelope coding by auditory-nerve fibers. Frontiers in Systems Neuroscience. 8. 20. 2014-02-17. 24596545. 3925834. 10.3389/fnsys.2014.00020. free.
  153. Ruggero MA, Rich NC. Furosemide alters organ of corti mechanics: evidence for feedback of outer hair cells upon the basilar membrane. The Journal of Neuroscience. 11. 4. 1057–67. April 1991. 2010805. 3580957. 10.1523/JNEUROSCI.11-04-01057.1991.
  154. Heinz MG, Young ED. Response growth with sound level in auditory-nerve fibers after noise-induced hearing loss. Journal of Neurophysiology. 91. 2. 784–95. February 2004. 14534289. 2921373. 10.1152/jn.00776.2003.
  155. Moore BC, Glasberg BR. Temporal modulation transfer functions obtained using sinusoidal carriers with normally hearing and hearing-impaired listeners. The Journal of the Acoustical Society of America. 110. 2. 1067–73. August 2001. 11519575. 10.1121/1.1385177. 2001ASAJ..110.1067M.
  156. Füllgrabe C, Moore BC, Stone MA. Age-group differences in speech identification despite matched audiometrically normal hearing: contributions from auditory temporal processing and cognition. Frontiers in Aging Neuroscience. 6. 347. 2014. 25628563. 4292733. 10.3389/fnagi.2014.00347. free.
  157. Wallaert N, Moore BC, Lorenzi C. Comparing the effects of age on amplitude modulation and frequency modulation detection. The Journal of the Acoustical Society of America. 139. 6. 3088–3096. June 2016. 27369130. 10.1121/1.4953019. 2016ASAJ..139.3088W.
  158. Bacon SP, Gleitman RM. Modulation detection in subjects with relatively flat hearing losses. Journal of Speech and Hearing Research. 35. 3. 642–53. June 1992. 1608256. 10.1044/jshr.3503.642.
  159. Moore BC, Shailer MJ, Schooneveldt GP. Temporal modulation transfer functions for band-limited noise in subjects with cochlear hearing loss. British Journal of Audiology. 26. 4. 229–37. August 1992. 1446186. 10.3109/03005369209076641.
  160. Schlittenlacher J, Moore BC. Discrimination of amplitude-modulation depth by subjects with normal and impaired hearing. The Journal of the Acoustical Society of America. 140. 5. 3487–3495. November 2016. 27908066. 10.1121/1.4966117. 2016ASAJ..140.3487S. free.
  161. Başkent D . Speech recognition in normal hearing and sensorineural hearing loss as a function of the number of spectral channels . The Journal of the Acoustical Society of America . November 2006 . 120 . 5 . 2908–2925 . 10.1121/1.2354017. 17139748 . 2006ASAJ..120.2908B.
  162. King A, Hopkins K, Plack CJ. The effects of age and hearing loss on interaural phase difference discrimination. The Journal of the Acoustical Society of America. 135. 1. 342–51. January 2014. 24437774. 10.1121/1.4838995. 2014ASAJ..135..342K.
  163. Derleth RP, Dau T, Kollmeier B. Modeling temporal and compressive properties of the normal and impaired auditory system. Hearing Research. 159. 1–2. 132–49. September 2001. 11520641. 10.1016/S0378-5955(01)00322-7. 1288804.
  164. Wallaert N, Moore BC, Ewert SD, Lorenzi C. Sensorineural hearing loss enhances auditory sensitivity and temporal integration for amplitude modulation. The Journal of the Acoustical Society of America. 141. 2. 971–980. February 2017. 28253641. 10.1121/1.4976080. 2017ASAJ..141..971W.
  165. Jepsen ML, Dau T. Characterizing auditory processing and perception in individual listeners with sensorineural hearing loss. The Journal of the Acoustical Society of America. 129. 1. 262–81. January 2011. 21303008. 10.1121/1.3518768. 2011ASAJ..129..262J.
  166. Paraouty N, Ewert SD, Wallaert N, Lorenzi C. Interactions between amplitude modulation and frequency modulation processing: Effects of age and hearing loss. The Journal of the Acoustical Society of America. 140. 1. 121–131. July 2016. 27475138. 10.1121/1.4955078. 2016ASAJ..140..121P.
  167. Ives DT, Kalluri S, Strelcyk O, Sheft S, Miermont F, Coez A, Bizaguet E, Lorenzi C. Effects of noise reduction on AM perception for hearing-impaired listeners. Journal of the Association for Research in Otolaryngology. 15. 5. 839–48. October 2014. 24899379. 4164688. 10.1007/s10162-014-0466-8.
  168. Paul BT, Bruce IC, Roberts LE. Evidence that hidden hearing loss underlies amplitude modulation encoding deficits in individuals with and without tinnitus. Hearing Research. 344. 170–182. February 2017. 27888040. 10.1016/j.heares.2016.11.010. 205103970.
  169. Aslin RN. Discrimination of frequency transitions by human infants. The Journal of the Acoustical Society of America. 86. 2. 582–90. August 1989. 2768673. 10.1121/1.398237. 1989ASAJ...86..582A.
  170. Colombo J, Horowitz FD. Infants' attentional responses to frequency modulated sweeps. Child Development. 57. 2. 287–91. April 1986. 3956313. 10.2307/1130583. 1130583.
  171. Leibold . Lori J. . Werner . Lynne A.. vanc . 2007-09-01. Infant Auditory Sensitivity to Pure Tones and Frequency-Modulated Tones. Infancy . 12 . 2 . 225–233 . 10.1111/j.1532-7078.2007.tb00241.x . 33412743 . 10.1.1.525.6267.
  172. Dawes P, Bishop DV. Maturation of visual and auditory temporal processing in school-aged children. Journal of Speech, Language, and Hearing Research. 51. 4. 1002–15. August 2008. 18658067. 10.1044/1092-4388(2008/073). 13953101.
  173. Henry KS, Heinz MG. Diminished temporal coding with sensorineural hearing loss emerges in background noise. Nature Neuroscience. 15. 10. 1362–4. October 2012. 22960931. 3458164. 10.1038/nn.3216.
  174. Henry KS, Kale S, Heinz MG. Distorted Tonotopic Coding of Temporal Envelope and Fine Structure with Noise-Induced Hearing Loss. The Journal of Neuroscience. 36. 7. 2227–37. February 2016. 26888932. 4756156. 10.1523/JNEUROSCI.3944-15.2016.
  175. Moore BC, Peters RW. Pitch discrimination and phase sensitivity in young and elderly subjects and its relationship to frequency selectivity. The Journal of the Acoustical Society of America. 91. 5. 2881–93. May 1992. 1629481. 10.1121/1.402925. 1992ASAJ...91.2881M.
  176. Book: Moore, Brian C.J.. Cochlear Hearing Loss: Physiological, Psychological and Technical Issues. Second. Wiley Online Library. 10.1002/9780470987889. 2008. Brian C.J. Moore. vanc. 9780470987889. 142251782 .
  177. Hopkins K, Moore BC. Moderate cochlear hearing loss leads to a reduced ability to use temporal fine structure information. The Journal of the Acoustical Society of America. 122. 2. 1055–68. August 2007. 17672653. 10.1121/1.2749457. 2007ASAJ..122.1055H.
  178. Moore BC, Skrodzka E. Detection of frequency modulation by hearing-impaired listeners: effects of carrier frequency, modulation rate, and added amplitude modulation. The Journal of the Acoustical Society of America. 111. 1 Pt 1. 327–35. January 2002. 11833538. 10.1121/1.1424871. 2002ASAJ..111..327M.
  179. Grose JH, Mamo SK. Frequency modulation detection as a measure of temporal processing: age-related monaural and binaural effects. Hearing Research. 294. 1–2. 49–54. December 2012. 23041187. 3505233. 10.1016/j.heares.2012.09.007.
  180. Buss E, Hall JW, Grose JH. Temporal fine-structure cues to speech and pure tone modulation in observers with sensorineural hearing loss. Ear and Hearing. 25. 3. 242–50. June 2004. 15179115. 10.1097/01.AUD.0000130796.73809.09. 11620271.
  181. Santurette S, Dau T. Binaural pitch perception in normal-hearing and hearing-impaired listeners. Hearing Research. 223. 1–2. 29–47. January 2007. 17107767. 10.1016/j.heares.2006.09.013. 6209193.
  182. Grose JH, Mamo SK. Processing of temporal fine structure as a function of age. Ear and Hearing. 31. 6. 755–60. December 2010. 20592614. 2966515. 10.1097/AUD.0b013e3181e627e7.
  183. Lopez-Poveda EA, Barrios P. Perception of stochastically undersampled sound waveforms: a model of auditory deafferentation. Frontiers in Neuroscience. 7. 124. 2013-07-16. 23882176. 3712141. 10.3389/fnins.2013.00124. free.
  184. Young ED, Sachs MB. Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. The Journal of the Acoustical Society of America. 66. 5. 1381–1403. November 1979. 500976. 10.1121/1.383532. 1979ASAJ...66.1381Y.
  185. Zeng FG, Kong YY, Michalewski HJ, Starr A. Perceptual consequences of disrupted auditory nerve activity. Journal of Neurophysiology. 93. 6. 3050–63. June 2005. 15615831. 10.1152/jn.00985.2004. 8628394 .
  186. Pichora-Fuller MK, Schneider BA, Macdonald E, Pass HE, Brown S. Temporal jitter disrupts speech intelligibility: a simulation of auditory aging. Hearing Research. 223. 1–2. 114–21. January 2007. 17157462. 10.1016/j.heares.2006.10.009. 6541722.
  187. Drullman R. Temporal envelope and fine structure cues for speech intelligibility. The Journal of the Acoustical Society of America. 97. 1. 585–92. January 1995. 7860835. 10.1121/1.413112. 1995ASAJ...97..585D.
  188. Lopez-Poveda EA. Why do I hear but not understand? Stochastic undersampling as a model of degraded neural encoding of speech. Frontiers in Neuroscience. 8. 348. 2014-10-30. 25400543. 4214224. 10.3389/fnins.2014.00348. free.
  189. Fowler EP. 1936-12-01. A method for the early detection of otosclerosis: a study of sounds well above threshold. Archives of Otolaryngology–Head & Neck Surgery. 24. 6. 731–741. 10.1001/archotol.1936.00640050746005.
  190. Moore BC. Testing the concept of softness imperception: loudness near threshold for hearing-impaired ears. The Journal of the Acoustical Society of America. 115. 6. 3103–11. June 2004. 15237835. 10.1121/1.1738839. 2004ASAJ..115.3103M. free.
  191. Stone MA, Moore BC. Syllabic compression: effective compression ratios for signals modulated at different rates. British Journal of Audiology. 26. 6. 351–61. December 1992. 1292819. 10.3109/03005369209076659.
  192. Plomp R. The negative effect of amplitude compression in multichannel hearing aids in the light of the modulation-transfer function. The Journal of the Acoustical Society of America. 83. 6. 2322–7. June 1988. 3411024. 10.1121/1.396363. 1988ASAJ...83.2322P. free.
  193. Stone MA, Moore BC. Quantifying the effects of fast-acting compression on the envelope of speech. The Journal of the Acoustical Society of America. 121. 3. 1654–64. March 2007. 17407902. 10.1121/1.2434754. 2007ASAJ..121.1654S.
  194. Book: Bacon, Sid. Compression: From Cochlea to Cochlear Implants. vanc. Springer. 9780387004969. 2004. Springer Handbook of Auditory Research.
  195. Boyle PJ, Büchner A, Stone MA, Lenarz T, Moore BC. Comparison of dual-time-constant and fast-acting automatic gain control (AGC) systems in cochlear implants. International Journal of Audiology. 48. 4. 211–21. April 2009. 19363722. 10.1080/14992020802581982. 2235920.
  196. Clark GM, Blamey PJ, Brown AM, Gusby PA, Dowell RC, Franz BK, Pyman BC, Shepherd RK, Tong YC, Webb RL. The University of Melbourne--nucleus multi-electrode cochlear implant. Advances in Oto-Rhino-Laryngology. 38. V–IX, 1–181. 1987. 10.1159/000414597. 3318305.
  197. Book: Başkent D, Gaudrain E, Tamati TN, Wagner A. Chapter 12: Perception and Psychoacoustics of Speech in Cochlear Implant Users. Cacace AT, de Kleine E, Holt AG, van Dijk P. Scientific foundations of Audiology: Perspectives from Physics, Biology, Modeling, and Medicine . 2016 . Plural Publishing, Inc . San Diego, CA . 978-1-59756-652-0 . 285–319 . https://www.pluralpublishing.com/publication_sfa.htm.
  198. Bierer JA, Faulkner KF. Identifying cochlear implant channels with poor electrode-neuron interface: partial tripolar, single-channel thresholds and psychophysical tuning curves. Ear and Hearing. 31. 2. 247–58. April 2010. 20090533. 2836401. 10.1097/AUD.0b013e3181c7daf4.
  199. Lazard DS, Vincent C, Venail F, Van de Heyning P, Truy E, Sterkers O, Skarzynski PH, Skarzynski H, Schauwers K, O'Leary S, Mawman D, Maat B, Kleine-Punte A, Huber AM, Green K, Govaerts PJ, Fraysse B, Dowell R, Dillier N, Burke E, Beynon A, Bergeron F, Başkent D, Artières F, Blamey PJ. 6. Pre-, per- and postoperative factors affecting performance of postlinguistically deaf adults using cochlear implants: a new conceptual model over time. PLOS ONE. 7. 11. e48739. November 2012. 23152797. 3494723. 10.1371/journal.pone.0048739. 2012PLoSO...748739L. free.
  200. Holden LK, Firszt JB, Reeder RM, Uchanski RM, Dwyer NY, Holden TA. Factors Affecting Outcomes in Cochlear Implant Recipients Implanted With a Perimodiolar Electrode Array Located in Scala Tympani. Otology & Neurotology. 37. 10. 1662–1668. December 2016. 27755365. 5113723. 10.1097/MAO.0000000000001241.
  201. Boyle PJ, Nunn TB, O'Connor AF, Moore BC. STARR: a speech test for evaluation of the effectiveness of auditory prostheses under realistic conditions. Ear and Hearing. 34. 2. 203–12. March 2013. 23135616. 10.1097/AUD.0b013e31826a8e82. 11733708.
  202. Won JH, Drennan WR, Nie K, Jameyson EM, Rubinstein JT. Acoustic temporal modulation detection and speech perception in cochlear implant listeners. The Journal of the Acoustical Society of America. 130. 1. 376–88. July 2011. 21786906. 3155593. 10.1121/1.3592521. 2011ASAJ..130..376W.
  203. Fu QJ. Temporal processing and speech recognition in cochlear implant users. NeuroReport. 13. 13. 1635–9. September 2002. 12352617. 10.1097/00001756-200209160-00013. 44385994.
  204. Friesen LM, Shannon RV, Baskent D, Wang X. Speech recognition in noise as a function of the number of spectral channels: comparison of acoustic hearing and cochlear implants. The Journal of the Acoustical Society of America. 110. 2. 1150–63. August 2001. 11519582. 10.1121/1.1381538. 2001ASAJ..110.1150F.
  205. Moore DR, Shannon RV. Beyond cochlear implants: awakening the deafened brain. Nature Neuroscience. 12. 6. 686–91. June 2009. 19471266. 10.1038/nn.2326. 35141874.
  206. Stickney GS, Zeng FG, Litovsky R, Assmann P. Cochlear implant speech recognition with speech maskers. The Journal of the Acoustical Society of America. 116. 2. 1081–91. August 2004. 15376674. 10.1121/1.1772399. 2004ASAJ..116.1081S.
  207. Blamey P, Artieres F, Başkent D, Bergeron F, Beynon A, Burke E, Dillier N, Dowell R, Fraysse B, Gallégo S, Govaerts PJ, Green K, Huber AM, Kleine-Punte A, Maat B, Marx M, Mawman D, Mosnier I, O'Connor AF, O'Leary S, Rousset A, Schauwers K, Skarzynski H, Skarzynski PH, Sterkers O, Terranti A, Truy E, Van de Heyning P, Venail F, Vincent C, Lazard DS. 6. Factors affecting auditory performance of postlinguistically deaf adults using cochlear implants: an update with 2251 patients. Audiology & Neuro-Otology. 18. 1. 36–47. 2013. 23095305. 10.1159/000343189. 4668675.
  208. Başkent D, Clarke J, Pals C, Benard MR, Bhargava P, Saija J, Sarampalis A, Wagner A, Gaudrain E. Cognitive compensation of speech perception with hearing impairment, cochlear implants, and aging . Trends in Hearing . October 2016 . 20 . 233121651667027 . 10.1177/2331216516670279. 5056620.
  209. Pfingst BE, Burkholder-Juhasz RA, Xu L, Thompson CS. Across-site patterns of modulation detection in listeners with cochlear implants. The Journal of the Acoustical Society of America. 123. 2. 1054–62. February 2008. 18247907. 2431465. 10.1121/1.2828051. 2008ASAJ..123.1054P.
  210. Chatterjee M, Oberzut C. Detection and rate discrimination of amplitude modulation in electrical hearing. The Journal of the Acoustical Society of America. 130. 3. 1567–80. September 2011. 21895095. 3188971. 10.1121/1.3621445. 2011ASAJ..130.1567C.
  211. Shannon RV. Temporal modulation transfer functions in patients with cochlear implants. The Journal of the Acoustical Society of America. 91. 4 Pt 1. 2156–64. April 1992. 1597606. 10.1121/1.403807. 1992ASAJ...91.2156S.
  212. Cazals Y, Pelizzone M, Saudan O, Boex C. Low-pass filtering in amplitude modulation detection associated with vowel and consonant identification in subjects with cochlear implants. The Journal of the Acoustical Society of America. 96. 4. 2048–54. October 1994. 7963020. 10.1121/1.410146. 1994ASAJ...96.2048C.
  213. Cooper WB, Tobey E, Loizou PC. Music perception by cochlear implant and normal hearing listeners as measured by the Montreal Battery for Evaluation of Amusia. Ear and Hearing. 29. 4. 618–26. August 2008. 18469714. 2676841. 10.1097/AUD.0b013e318174e787.
  214. Galvin JJ, Fu QJ, Nogaki G. Melodic contour identification by cochlear implant listeners. Ear and Hearing. 28. 3. 302–19. June 2007. 17485980. 3627492. 10.1097/01.aud.0000261689.35445.20.
  215. Fu QJ, Chinchilla S, Nogaki G, Galvin JJ. Voice gender identification by cochlear implant users: the role of spectral and temporal resolution. The Journal of the Acoustical Society of America. 118. 3 Pt 1. 1711–8. September 2005. 16240829. 10.1121/1.1985024. 2005ASAJ..118.1711F.
  216. Fuller CD, Gaudrain E, Clarke JN, Galvin JJ, Fu QJ, Free RH, Başkent D. Gender categorization is abnormal in cochlear implant users. Journal of the Association for Research in Otolaryngology. 15. 6. 1037–48. December 2014. 25172111. 4389960. 10.1007/s10162-014-0483-7.
  217. Peng SC, Lu HP, Lu N, Lin YS, Deroche ML, Chatterjee M. Processing of Acoustic Cues in Lexical-Tone Identification by Pediatric Cochlear-Implant Recipients. Journal of Speech, Language, and Hearing Research. 60. 5. 1223–1235. May 2017. 28388709. 5755546. 10.1044/2016_JSLHR-S-16-0048.
  218. Wang W, Zhou N, Xu L. Musical pitch and lexical tone perception with cochlear implants. International Journal of Audiology. 50. 4. 270–8. April 2011. 21190394. 5662112. 10.3109/14992027.2010.542490.
  219. Chatterjee M, Peng SC. Processing F0 with cochlear implants: Modulation frequency discrimination and speech intonation recognition. Hearing Research. 235. 1–2. 143–56. January 2008. 18093766. 2237883. 10.1016/j.heares.2007.11.004.
  220. Fu QJ, Galvin JJ. Vocal emotion recognition by normal-hearing listeners and cochlear implant users. Trends in Amplification. 11. 4. 301–15. December 2007. 18003871. 4111530. 10.1177/1084713807305301.
  221. Chatterjee M, Zion DJ, Deroche ML, Burianek BA, Limb CJ, Goren AP, Kulkarni AM, Christensen JA. Voice emotion recognition by cochlear-implanted children and their normally-hearing peers. Hearing Research. 322. 151–62. April 2015. 25448167. 4615700. 10.1016/j.heares.2014.10.003.
  222. Chatterjee M, Oba SI. Across- and within-channel envelope interactions in cochlear implant listeners. Journal of the Association for Research in Otolaryngology. 5. 4. 360–75. December 2004. 15675001. 2504569. 10.1007/s10162-004-4050-5.
  223. Chatterjee M, Kulkarni AM. Modulation detection interference in cochlear implant listeners under forward masking conditions. The Journal of the Acoustical Society of America. 143. 2. 1117–1127. February 2018. 29495705. 5821512. 10.1121/1.5025059. 2018ASAJ..143.1117C.
  224. Alcántara JL, Moore BC, Kühnel V, Launer S. Evaluation of the noise reduction system in a commercial digital hearing aid. International Journal of Audiology. 42. 1. 34–42. January 2003. 12564514. 10.3109/14992020309056083. 8131281.
  225. Moore BC. Coding of sounds in the auditory system and its relevance to signal processing and coding in cochlear implants. Otology & Neurotology. 24. 2. 243–54. March 2003. 12621339. 10.1097/00129492-200303000-00019. 16232602.
  226. Rader T, Döge J, Adel Y, Weissgerber T, Baumann U. Place dependent stimulation rates improve pitch perception in cochlear implantees with single-sided deafness. Hearing Research. 339. 94–103. September 2016. 27374479. 10.1016/j.heares.2016.06.013. 4853875.
  227. Roy AT, Carver C, Jiradejvong P, Limb CJ. Musical Sound Quality in Cochlear Implant Users: A Comparison in Bass Frequency Perception Between Fine Structure Processing and High-Definition Continuous Interleaved Sampling Strategies. Ear and Hearing. 36. 5. 582–90. October 2015. 25906173. 10.1097/AUD.0000000000000170. 35712387.
  228. Fitzgerald MB, Wright BA. Perceptual learning and generalization resulting from training on an auditory amplitude-modulation detection task. The Journal of the Acoustical Society of America. 129. 2. 898–906. February 2011. 21361447. 3070992. 10.1121/1.3531841. 2011ASAJ..129..898F.
  229. Fitzgerald MB, Wright BA. A perceptual learning investigation of the pitch elicited by amplitude-modulated noise. The Journal of the Acoustical Society of America. 118. 6. 3794–803. December 2005. 16419824. 10.1121/1.2074687. 2005ASAJ..118.3794F.
  230. Sabin AT, Eddins DA, Wright BA. Perceptual learning evidence for tuning to spectrotemporal modulation in the human auditory system. The Journal of Neuroscience. 32. 19. 6542–9. May 2012. 3519395. 10.1523/JNEUROSCI.5732-11.2012. 22573676.
  231. Joosten ER, Shamma SA, Lorenzi C, Neri P. Dynamic Reweighting of Auditory Modulation Filters. PLOS Computational Biology. 12. 7. e1005019. July 2016. 4939963. 10.1371/journal.pcbi.1005019. 27398600. 2016PLSCB..12E5019J . free .
  232. Aizawa N, Eggermont JJ. Effects of noise-induced hearing loss at young age on voice onset time and gap-in-noise representations in adult cat primary auditory cortex. Journal of the Association for Research in Otolaryngology. 7. 1. 71–81. March 2006. 16408166. 2504589. 10.1007/s10162-005-0026-3.
  233. Rosen MJ, Sarro EC, Kelly JB, Sanes DH. Diminished behavioral and neural sensitivity to sound modulation is associated with moderate developmental hearing loss. PLOS ONE. 7. 7. e41514. 2012-07-26. 22848517. 3406049. 10.1371/journal.pone.0041514. 2012PLoSO...741514R. free.
  234. Caras ML, Sanes DH. Sustained Perceptual Deficits from Transient Sensory Deprivation. The Journal of Neuroscience. 35. 30. 10831–42. July 2015. 26224865. 4518056. 10.1523/JNEUROSCI.0837-15.2015.
  235. Zhou X, Merzenich MM. Environmental noise exposure degrades normal listening processes. Nature Communications. 3. 843. May 2012. 22588305. 10.1038/ncomms1849. 2012NatCo...3..843Z. free.
  236. Bao S, Chang EF, Woods J, Merzenich MM. Temporal plasticity in the primary auditory cortex induced by operant perceptual learning. Nature Neuroscience. 7. 9. 974–81. September 2004. 15286790. 10.1038/nn1293. 8038109.
  237. Kilgard MP, Merzenich MM. Plasticity of temporal information processing in the primary auditory cortex. Nature Neuroscience. 1. 8. 727–31. December 1998. 10196590. 2948964. 10.1038/3729.
  238. Anderson S, White-Schwoch T, Choi HJ, Kraus N. Training changes processing of speech cues in older adults with hearing loss. Frontiers in Systems Neuroscience. 7. 97. 2013. 24348347. 3842592. 10.3389/fnsys.2013.00097. free.
  239. Wong PC, Skoe E, Russo NM, Dees T, Kraus N. Musical experience shapes human brainstem encoding of linguistic pitch patterns. Nature Neuroscience. 10. 4. 420–2. April 2007. 17351633. 4508274. 10.1038/nn1872.
  240. Perez E, McCormack A, Edmonds BA. Sensitivity to temporal fine structure and hearing-aid outcomes in older adults. Frontiers in Neuroscience. 8. 7. 2014. 24550769. 3914396. 10.3389/fnins.2014.00007. free.
  241. Rönnberg J, Lunner T, Ng EH, Lidestam B, Zekveld AA, Sörqvist P, Lyxell B, Träff U, Yumba W, Classon E, Hällgren M, Larsby B, Signoret C, Pichora-Fuller MK, Rudner M, Danielsson H, Stenfelt S. Hearing impairment, cognition and speech understanding: exploratory factor analyses of a comprehensive test battery for a group of hearing aid users, the n200 study. International Journal of Audiology. 55. 11. 623–42. November 2016. 27589015. 5044772. 10.1080/14992027.2016.1219775.
  242. Moore BC, Sęk A. Preferred Compression Speed for Speech and Music and Its Relationship to Sensitivity to Temporal Fine Structure. Trends in Hearing. 20. 233121651664048. September 2016. 27604778. 5017572. 10.1177/2331216516640486.
  243. Bernstein JG, Danielsson H, Hällgren M, Stenfelt S, Rönnberg J, Lunner T. Spectrotemporal Modulation Sensitivity as a Predictor of Speech-Reception Performance in Noise With Hearing Aids. Trends in Hearing. 20. 233121651667038. November 2016. 27815546. 5098798. 10.1177/2331216516670387.
  244. Sęk A, Moore BC. Implementation of two tests for measuring sensitivity to temporal fine structure. International Journal of Audiology. 51. 1. 58–63. January 2012. 22050366. 10.3109/14992027.2011.605808. 2954679.
  245. Moore BC, Vickers DA, Mehta A. The effects of age on temporal fine structure sensitivity in monaural and binaural conditions. International Journal of Audiology. 51. 10. 715–21. October 2012. 22998412. 10.3109/14992027.2012.690079. 29277736.
  246. Füllgrabe C. Age-dependent changes in temporal-fine-structure processing in the absence of peripheral hearing loss. American Journal of Audiology. 22. 2. 313–5. December 2013. 23975124. 10.1044/1059-0889(2013/12-0070).
  247. Santurette S, Dau T. Relating binaural pitch perception to the individual listener's auditory profile. The Journal of the Acoustical Society of America. 131. 4. 2968–86. April 2012. 22501074. 10.1121/1.3689554. 2012ASAJ..131.2968S. 206468031 .
  248. Hopkins K, Moore BC. Development of a fast method for measuring sensitivity to temporal fine structure information at low frequencies. International Journal of Audiology. 49. 12. 940–6. December 2010. 20874427. 10.3109/14992027.2010.512613. 46058919.
  249. Füllgrabe C, Harland AJ, Sęk AP, Moore BC. Development of a method for determining binaural sensitivity to temporal fine structure. International Journal of Audiology. 56. 12. 926–935. December 2017. 28859494. 10.1080/14992027.2017.1366078. free.
  250. Füllgrabe C, Moore BC. Evaluation of a Method for Determining Binaural Sensitivity to Temporal Fine Structure (TFS-AF Test) for Older Listeners With Normal and Impaired Low-Frequency Hearing. Trends in Hearing. 21. 2331216517737230. January 2017. 29090641. 5669320. 10.1177/2331216517737230.
  251. Kates JM, Arehart KH. Coherence and the speech intelligibility index. The Journal of the Acoustical Society of America. 117. 4 Pt 1. 2224–37. April 2005. 15898663. 10.1121/1.1862575. 2005ASAJ..117.2224K.
  252. Arehart KH, Kates JM, Anderson MC. Effects of noise, nonlinear processing, and linear filtering on perceived speech quality. Ear and Hearing. 31. 3. 420–36. June 2010. 20440116. 10.1097/AUD.0b013e3181d3d4f3. 34271886.
  253. Taal. Cees H. . Hendriks. Richard C. . Heusdens . Richard . Jensen . Jesper. vanc. September 2011 . An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech. IEEE Transactions on Audio, Speech, and Language Processing. 19. 7. 2125–2136. 10.1109/tasl.2011.2114881. 13972181 .
  254. Croghan NB, Arehart KH, Kates JM. Music preferences with hearing aids: effects of signal properties, compression settings, and listener characteristics. Ear and Hearing. 35. 5. e170–84. September 2014. 25010635. 10.1097/AUD.0000000000000056. 10459417.
  255. Arehart K, Souza P, Kates J, Lunner T, Pedersen MS. Relationship Among Signal Fidelity, Hearing Loss, and Working Memory for Digital Noise Suppression. Ear and Hearing. 36. 5. 505–16. 2015. 25985016. 10.1097/aud.0000000000000173. 4549215.
  256. Carter . G. . Knapp . C. . Nuttall . A. . Estimation of the magnitude-squared coherence function via overlapped fast Fourier transform processing . IEEE Transactions on Audio and Electroacoustics. August 1973 . 21 . 4 . 337–344 . 10.1109/TAU.1973.1162496.
  257. Arehart KH, Kates JM, Anderson MC, Harvey LO. Effects of noise and distortion on speech quality judgments in normal-hearing and hearing-impaired listeners. The Journal of the Acoustical Society of America. 122. 2. 1150–64. August 2007. 17672661. 10.1121/1.2754061. 2007ASAJ..122.1150A.
  258. Tan CT, Moore BC. Perception of nonlinear distortion by hearing-impaired people. International Journal of Audiology. 47. 5. 246–56. May 2008. 18465409. 10.1080/14992020801945493. 28018246.
  259. Houtgast . T. . Steeneken . H. J. M. . A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria . The Journal of the Acoustical Society of America . March 1985 . 77 . 3 . 1069–1077 . 10.1121/1.392224 . 1985ASAJ...77.1069H. free .
  260. Hohmann V, Kollmeier B. The effect of multichannel dynamic compression on speech intelligibility. The Journal of the Acoustical Society of America. 97. 2. 1191–5. February 1995. 7876441. 10.1121/1.413092. 1995ASAJ...97.1191H. free.
  261. Goldsworthy RL, Greenberg JE. Analysis of speech-based Speech Transmission Index methods with implications for nonlinear operations. The Journal of the Acoustical Society of America. 116. 6. 3679–89. December 2004. 15658718. 10.1121/1.1804628. 2004ASAJ..116.3679G.
  262. Ludvigsen C, Elberling C, Keidser G, Poulsen T. Prediction of intelligibility of non-linearly processed speech. Acta Oto-Laryngologica. Supplementum. 469. 190–5. 1990. 10.1080/00016489.1990.12088428. 2356726.
  263. Kates . James M. . Arehart . Kathryn H. . The Hearing-Aid Speech Perception Index (HASPI) . Speech Communication . November 2014 . 65 . 75–93 . 10.1016/j.specom.2014.06.002.
  264. Chi T, Gao Y, Guyton MC, Ru P, Shamma S. Spectro-temporal modulation transfer functions and speech intelligibility. The Journal of the Acoustical Society of America. 106. 5. 2719–32. November 1999. 10573888. 10.1121/1.428100. 1999ASAJ..106.2719C . 1903/6121 . free.
  265. Huber . R. . Kollmeier . B. . PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception . IEEE Transactions on Audio, Speech, and Language Processing. November 2006 . 14 . 6 . 1902–1911 . 10.1109/TASL.2006.883259. 7815134 .
  266. Huber R, Parsa V, Scollie S. Predicting the perceived sound quality of frequency-compressed speech. PLOS ONE. 9. 11. e110260. 2014-11-17. 25402456. 4234248. 10.1371/journal.pone.0110260. 2014PLoSO...9k0260H. free.
  267. Kates. James. Arehart. Kathryn. vanc . 2014-03-20. The Hearing-Aid Speech Quality Index (HASQI) Version 2. Journal of the Audio Engineering Society. 62. 3. 99–117. 10.17743/jaes.2014.0006. 1549-4950.
  268. Kates . James . Arehart . Kathryn . vanc . The Hearing-Aid Speech Quality Index (HASQI) Version 2 . Journal of the Audio Engineering Society . 20 March 2014 . 62 . 3 . 99–117 . 10.17743/jaes.2014.0006.
  269. Kates JM, Arehart KH. The Hearing-Aid Audio Quality Index (HAAQI). IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24. 2. 354–365. February 2016. 27135042. 4849486. 10.1109/taslp.2015.2507858.
  270. Kates. James. vanc . 2013. An auditory model for intelligibility and quality predictions. Proceedings of Meetings on Acoustics. Acoustical Society of America . 133 . 5 . 050184. 10.1121/1.4799223. 2013ASAJ..133.3560K. free.