Neural Encoding of Spectro-Temporal Cues at Slow and Near Speech-Rate in Cochlear Implant Users Jaime A. Undurraga ∗1, Lindsey Van Yper1, Manohar Bance2, David McAlpine1, and Deborah Vickers2 1Department of Linguistics, 16 University Avenue, Macquarie University, NSW 2109, Australia 2Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, CB2 0QQ, UK The authors declare no competing financial interests. ∗Corresponding author: Jaime A. Undurraga, jaime.undurraga@mq.edu.au Abstract The ability to process rapid modulations in the spectro-temporal structure of sounds is critical for speech comprehension. For users of cochlear implants (CIs), spectral cues in speech are conveyed by differential stimulation of electrode contacts along the cochlea, and temporal cues in terms of the amplitude of stimulating electrical pulses, which track the amplitude-modulated (AM’ed) envelope of speech sounds. Whilst survival of inner-ear neurons and spread of electrical current are known factors that limit the representation of speech information in CI listeners, limitations in the neural representation of dynamic spectro-temporal cues common to speech are also likely to play a role. We assessed the ability of CI listeners to process spectro-temporal cues varying at rates typically present in human speech. Employing an auditory change complex (ACC) paradigm, and a low (0.5 Hz) alternating rate between stimulating electrodes, or different AM frequencies, to evoke a transient cortical ACC, we demonstrate that CI listeners—like normal-hearing listeners—are sensitive to transitions in the spectral- and temporal-domain. However, CI listeners showed impaired cortical responses when either spectral or temporal cues were alternated at faster, speech-like (6-7 Hz), rates. Specifically, auditory change following responses—reliably obtained in normal-hearing listeners—were small or absent in CI users, indicating that cortical adaptation to alternating cues at speech-like rates is stronger under electrical stimulation. In CI listeners, temporal processing was also influenced by the polarity— behaviourally—and rate of presentation of electrical pulses—both neurally and behaviorally. Limitations in the ability to process dynamic spectro-temporal cues will likely impact speech comprehension in CI users. 1 Introduction Since their inception, cochlear implants (CIs) have proven a transformative technology, restoring, or providing de novo, functional hearing and the ability to communicate using spoken language, in severe-profoundly deaf adults and children. CIs work by filtering acoustic signals into frequency (spectral) channels, and extracting the slowly varying amplitude envelope in each channel, which is then used to modulate a train of electrical pulses delivered by each electrode along the implanted array, directly stimulating the auditory nerve fibres (ANFs). 2 Despite their undoubted success, speech perception remains highly variable across the population of CI users (e.g. Holden et al., 2013). Potential sources of this variance are numerous, but likely include the quality of the electrode-neural interface (Pfingst et al., 2008; Bierer et al., 2010)—the degree of neural survival in the deafened cochlea, for example—or traumatic events arising from surgical insertion of the electrode array. A poor electrode-neural interface can result in broader neural activation than would otherwise be the case, as more current is required to excite ANFs. Further, stimulation parameters such as pulse rate, electrode configuration (monopolar vs. bipolar), and stimulus polarity can interact with the degree of neural health near to a stimulating electrode, affecting the overall fidelity of information encoded in ANFs (Macherey et al., 2008, 2017; Undurraga et al., 2010, 2012, 2013). Given the already reduced number of independent frequency channels in implanted, compared to normal-hearing (NH) ears, any additional distortions of the internal representation of the sound will contribute to reduced speech understanding (Eisen and Franck, 2005; Bierer, 2007). CI listeners rely on spectral cues conveyed in terms of which electrode contacts along the inserted array are activated, and temporal cues conveyed in terms of the rate at which electrical pulses are modulated in amplitude, to understand speech. Since speech perception correlates with the number of electrodes accurately conveying temporal information (Croghan et al., 2017), and because different amplitude modulation (AM) frequencies convey distinct temporal features of speech (Rosen, 1992), the ability to discriminate between different AM frequencies represents a more ecologically valid approach to predicting speech perception outcomes in CI patients than simply detecting the presence of AM cues. Here, we employed a measure of cortical function—the auditory change complex (ACC)—to assess neural representation of spectro-temporal cues in CI users. In contrast to the standard approach, in which a long, silent interval is presented between pairs of sounds, with the cortical potential at the transition between them—the ACC—reflecting neural/behavioural sensitivity to the physical/perceptual difference between the sounds (Fig. 1A), we employed an ‘alternating ACC’ paradigm in which stimuli are continuously and periodically alternated (i.e. without silent gaps), at a controlled rate such that every change triggers a cortical transient response (Fig. 1B) or a steady-state auditory change following response (AC-FR) (Fig. 1C). [Figure 1 about here.] 3 We confirm that CI listeners are sensitive to slow alternating (0.5 Hz) spectral cues; like NH listeners, they can distinguish similarly alternating transitions in AM frequencies at the behavioural and neural levels, indicating no specific limitation to processing AM cues per se. As neural coding of AM cues may reflect interactions between stimulus polarity and the site of generation of action potentials in ANFs (Joshi et al., 2017), we also assessed the effect of stimulus polarity (anodic and cathodic) in CI listeners using a slow (126 pulses per second (pps)) pulse rate, and find that, behaviourally, anodic pulses are more distinguishable than cathodic ones, and neurally, cortical responses are weak regardless of polarity. Finally, employing speech-like (6-7 Hz) alternating spectro-temporal cues, we demonstrate a specific reduction in cortical responses in otherwise high-performing CI listeners, compared to NH listeners, indicative of greater neural adaptation at this higher alternating rate. Stronger cortical adaptation to speech-like rates of spectro-temporal modulations are likely the consequence of the greater neural synchrony of inner-ear neural responses to electrical stimulation, suggesting that current stimulation strategies might impose a fundamental limitation to speech performance outcomes in CI users. 2 Materials and Methods Two experiments were conducted to measure neural responses to spectral or temporal cues. The first experiment was designed to measure neural responses to slow alternating cues (spectral or temporal) in an ongoing stimulus - a transient-like cortical ACC response. The alternating ACC paradigm was used at a low alternating rate (0.5 Hz), thus allowing us to minimize neural adaptation. The second experiment was designed to trigger neural adaptation. Thus, the alternating paradigm was used at a higher rate (≈ 6 to 7 Hz), where neural adaptation is expected to be apparent. This alternating rate is interesting because it is close to the syllabic rate encountered in English and other languages, a rate to which the brain can entrain (4 to 7 Hz; Lewis, 2014; Peelle and Davis, 2012; Chandrasekaran et al., 2009). Hence, providing neural information at a rate critical for speech processing. In addition, this rate range is useful because it could be used to elicit an steady-state ACC response—the AC-FRs (Fig. 1C)—similar to those reported in NH studies using binaural stimuli (Dajani and Picton, 2006; Haywood et al., 2015; Undurraga 4 et al., 2016). Note that, for simplicity, we use the term ACC, AC-FR, or auditory steady-state response (ASSR) indistinctly for acoustically- or electrically-evoked responses. 2.1 Participants There were six NH listeners who were screened for NH, defined as pure-tone thresholds ≤ 20 dB HL, at octave frequencies from 250 to 8000 Hz; and five CI participants who were unilaterally implanted with the Cochlear Limited device and met the following inclusion criteria: (1) full intra-cochlear electrode insertion, (2) post implant activation experience of more than 9 months, and (3) post-lingual (after the acquisition of speech and language) onset of severe/ profound bilateral hearing loss. Ethical approval was obtained from Macquarie University ethics committee (Ref. Number 5201700786) and by the Sydney Cochlear Implant Centre (SCIC) Cochlear Implant Program Research Governance Committee. All subjects provided informed consent before beginning the experiments and were paid an honorarium for their time. 2.2 Stimulus With CI stimulation, stimuli consisted of the so-called quadra-phasic (QP) pulses (Macherey et al., 2017) where a positive (anodic) or negative (cathodic) pulse is emulated by concatenating two biphasic pulses (separated by 7 µs) of opposite polarity. Therefore, to generate an “anodic” stimulus a cathodic-first biphasic pulse is concatenated to an anodic-first biphasic pulse so that the two central phases have the same anodic polarity—the quadra-phasic-anodic (QP-A) pulse (see Fig. 1E). A “cathodic” stimulus is generated in the same way by inverting the polarity—the quadra-phasic-cathodic (QP-C) pulse. Spectral processing was assessed using the new alternating ACC paradigm we developed to detect neural responses to alternations between two fixed stimulating electrodes (Fig. 1B). This measure also served as a reference response since we had previously found reliable transient ACC responses using the standard ACC paradigm (Mathew et al., 2017, 2018). This was performed on three different loudness balanced (see Stimulus Presentation) electrode pairs, thus providing an objective assessment of electrode discrimination whilst minimizing the contribution 5 of loudness differences in the response. Stimuli consisted of QP-C monopolar pulse trains at a rate of 827 pps, 43µs pulse width and 7 µs inter-phase gap (IPG). For 4 out of 5 participants three apical electrode pairs were tested: 14-15, 15-16, and 16-17. For one participant (S4) the electrode pair 21-22 was used instead of 16-17 (see Table 1 for details). Middle-apical electrodes were preferred as they convey most of the speech information by the CI, and also for their importance in speech perception (Busby and Clark, 2000; Geier and Norton, 1992; Henry et al., 2000). Transient ACC responses were evoked using an alternating rate of 0.5 Hz. Each recording lasted 10 minutes and data were epoched in 300 blocks of 2 seconds. Adapted responses, measured by the AC-FR, were elicited using the alternating paradigm at a rate of 6.1 Hz. Each electrode pair was tested for about 5.43 minutes and data were epoched in 84 blocks of 3.92 seconds. Note that longer epochs for AC-FR analyses improve the spectral resolution in the frequency-domain (0.25 Hz resolution). Temporal processing employed the alternating ACC paradigm to detect neural responses elicited by alternations between two fixed AM frequencies (Fig. 1D). We measured responses from both NH and CI listeners at two modulation depths, which was the same for the two AM frequencies—a paradigm that reduces the potentially confounding contribution of loudness cues to envelope encoding in CI listeners (McKay and Henshall, 2010); loudness perception is almost constant as a function of AM frequencies (Gransier et al., 2016; Kreft et al., 2010). An advantageous feature of this paradigm is that in addition to evoking the ACC response, the ASSR to the AMs can be obtained simultaneously. This is an indication of the phase locking ability in the auditory system (Picton et al., 2003), thus providing additional information about AM processing. For the NH listeners a 500 Hz sinusoidal carrier was used and amplitude-modulated (AM’ed). AM frequencies alternated between of 20.0 and 35.5 Hz. Two modulation depths were used in different conditions, but did not change within a condition. These were fully modulated (100%), and halfway modulated (50%, Fig. 1D). The change of AM frequency was always performed at the minima of the AM cycle to avoid other cues (e.g. clicks) occurring that would elicit an exogenous response. For the CI listeners, stimuli AMs were presented to electrode 16. This middle-apical electrode 6 was selected because animal data indicate that apical neurons may encode temporal cues more strongly than basal neurons (Middlebrooks and Snyder, 2010). QP pulses were AM’ed between 20 and 35 Hz, except for S5, where AMs alternated between 20 and 28 Hz (subject S5 could also discriminate this condition). Note that, in all cases, the magnitude of the AM frequency change was above discrimination limens reported in previous studies (≈ 12% of the base frequency for CI and ≈ 8% for NH listeners; e.g. Chen and Zeng (2004) and Kreft et al. (2013)). As with NH listeners, two AM depths were used at either 100 or 50% of the subject’s dynamic range (in Current Units), defined as the difference between the comfortable and the threshold level (see Table 1 for details). Two stimulation pulse rates 801 pps (4 subjects) and 126 pps (5 subjects) were used to encode the AM envelopes. We chose a low stimulation rate of 126 pps for two reasons. First, by using a low pulse rate, the effect of stimulus polarity on ANFs is maximized as the interactions between adjacent pulses decrease with increasing inter-stimulus interval (ISI) (Macherey et al., 2008; Macherey and Carlyon, 2012; Macherey et al., 2017; Undurraga et al., 2010, 2012, 2013). Second, this stimulation rate is suitable for reliably removing the electrical artefact from the ASSR via interpolation (Hofmann and Wouters, 2010; Luke et al., 2015; Deprez et al., 2017), hence, allowing us to estimate the amount of phase-locking to each AM frequency by means of the ASSR which is elicited alongside the ACC response. The effect of polarity was investigated at 126 pps using positive (QP-A) and negative (QP-C) pulses, and all stimuli were checked in advance to ensure that they were discriminable at 100% modulation depth for the QP-A pulse shape. At 801 pps, we only used QP-C pulses mainly because the short ISI between QPs pulses results in a “pseudo alternating” polarity pulse train. This can be observed in Fig. 1E, where the reduction of the ISI brings the last phase of the first QP pulse closer to the first phase of the second QP pulse. For both NH and CI listeners, transient ACC responses were evoked by alternating the AM at a rate of 0.5 Hz. Each recording lasted 10 minutes and data were (300 epochs of 2 seconds). Neural adaptation to rapid AM frequency alternations was measured for both NH and CI listeners. For NH listeners, AC-FRs were measured to AM frequencies alternating between 19.2 7 and 32.0 Hz at a rate of at 6.4 Hz (5.02 mins, 69 epochs of 4.38 seconds, 0.23 Hz resolution). This slight difference in AM frequencies (compared to AM frequencies used in the low alternating rate) was necessary to ensure that an integer number of modulations fitted within each epoch. In order to elicit AC-FRs to AM frequency alternations in CI users, we used an alternating rate of 6.9 Hz (4.81 mins, 72 epochs of 4.01 seconds, 0.25 Hz resolution). It should be noted that the alternating rate was adjusted to ensure a fix integer number of pulses and AM cycles within an epoch which also depended on the stimulation pulse rate. This resulted in a slightly higher alternating rate than for the NH listeners (0.5 Hz difference). However, in all cases, the alternating rate was confined within the optimal range in which the AC-FR has been obtained using binaural paradigms (≈ 6 to 8 Hz; e.g. Dajani and Picton, 2006; Haywood et al., 2015; McAlpine et al., 2016; Undurraga et al., 2016). A summary including pulse rate, AM frequencies and depths, pulse shape, electrodes tested, and different alternating rates for both spectral and temporal processing is provided in Table 2. 2.3 Stimulus Presentation For CI listeners, all stimuli were presented at a loudness balanced comfortable level estimated using a loudness scale chart. Comfortable levels were obtained for each electrode, pulse rate, and polarity by gradually increasing the stimulation level until participants reported a loudness of 6 on a 10-point loudness scale chart. This procedure was repeated twice and the final balanced comfortable level corresponded to the average of these two measures. For the AM stimuli, comfortable levels were determined only for the loudest condition, i.e. 50% AM depth. However, all participants ranked the fully AM’ed stimuli as 6 in the loudness scale chart. Thresholds for the AM stimuli were obtained using unmodulated pulses in a similar way but set as the level corresponding to a loudness of 1 (“barely audible”) on the loudness scale chart. For CI listeners, the stimuli were directly presented to the CI using the Nucleus Implant Controller (NIC) version 3 via the RF-Generator-XS (Cochlear Limited, Sydney, Australia) using a custom interface developed in Python. For NH listeners, acoustic stimuli were presented via earphones on the right ear (Etymotic ER2) at 72 dB SPL. All acoustic stimuli were generated in a custom interface “aep_gui” developed in Matlab and delivered via RME Fireface UC at a sampling rate of 48 kHz and calibrated using a 2 cc GRASS artificial ear. 8 2.4 EEG recordings Electroencephography (EEG) recordings were made using a 64-channel high resolution BioSemi ActiveTwo (Amsterdam, The Netherlands) at a sampling rate of 16 384 Hz and a resolution of 24 bits/sample (31 nV LSB). Channels were arranged according to the international 10–20 system. Two additional channels were placed on the left and right mastoid and eye movements were recorded with right infra-orbital and right lateral canthus channels. The voltage offset was always below ± 40 mV and usually within ± 20 mV. For NH listeners, recordings were referenced to Cz. With CI users, scalp channels around the CI receiver package were not connected (typically 1-3 scalp electrodes) and responses were referenced to the contralateral mastoid. 2.5 EEG analysis EEG analysis was as follows: Firstly, data was referenced to remove line noise. Afterwards, unconnected electrodes (those near the CI) or noisy EEG electrodes due to poor contact (usually electrodes with voltage offsets greater than 40 mV or strong DC components) were automatically detected and removed from the analysis. For the low pulse rate recordings (126 pps), CI artefacts were removed by interpolation between points with minimum artefact contamination (a detailed description is provided in the Appendix). For transient responses, time-domain data were down sampled to 1024 Hz and band-pass filtered between 2 and 30 Hz (zero-phase, third-order finite impulse response (FIR) Kaiser filter). Eye blink artefacts were removed using a template matching suppression method (Valderrama et al., 2018). Next, data were epoched and residual CI artefacts were identified and removed using denoising source separation (DSS) (details provided in the Appendix; de Cheveigné and Simon (2008) and Mathew et al. (2017)). Per-channel averages were obtained using a weighted average (full details provided in Mathew et al. (2017)). The objective detection of an ACC response and signal- to-noise ratio (SNR) estimations utilised the Hotelling’s t-squared test (Hotelling-T2, Golding et al. (2009) and Mathew et al. (2017)). The significance of the response was evaluated using a time window spanning between 50 and 400 ms relative to stimulus change. The P1-N1-P2 peaks and latencies were automatically detected within this time window by obtaining the minimum 9 (N1) and maxima before and after N1. For AC-FRs, the analysis pipeline was similar with the exception that data was not down-sampled and only a high pass filter (at 2 Hz) was applied. The objective detection of a frequency-domain response was also determined using Hotelling-T2 test. In this case, confidence intervals are estimated from the two-dimensional distribution of the frequency bin of interest (Picton et al., 1987, 2003). To reduce the dimensionality of the data existing across all EEG electrodes, we estimated the global field power (GFP), which corresponds to the spatial standard deviation of the EEG data across all recording electrodes (Skrandies, 1990). For the transient ACC response, the GFP estimation was modified. The modification consisted of computing the standard deviation of the N1-P2 amplitudes across channels rather than taking the peak amplitude of the waveform resulting by computing the standard deviation across all channels, thus taking into account the contribution of both peaks, and not just the main peak as with the standard GFP. The latency of the steady-state responses was estimated following a series of adjustments previously described (Rodriguez et al., 1986; Herdman et al., 2002; John and Picton, 2000; Undurraga et al., 2016). First, the phase (φ) of the frequency of interest (fi) was adjusted by pi rad when data were referenced to Cz and by 0 rad when referenced to the contralateral mastoid. This phase adjustment accounts for polarity differences arising the location of the reference electrode. Next, the phase of responses generated by the AM were adjusted by pi/2 rad in order to compensate for the frequency transformation, which was computed in terms of a cosine phase. AC-FRs generated by rapid alternating cues did not include this correction as it was assumed that the response followed each transition, i.e. the transition had zero phase. Once the phase was adjusted (φa), the phase delay (φd) was computed as: φd = 2pi − φa (1) and the latency (in seconds) was estimated by: L = 12pi · fi (φd + 2pi · n+ 2pi ·m) (2) where n is an integer number set to correct for circularity ambiguity (phase unwrapping) 10 associated to phase and m is an integer number set to account for the number stimulus cycles occurring before a response is evoked. Here, m was set to zero when studying AC-FRs whilst it was set to 1 when analysing latency of the ASSRs generated by the AMs. All signal processing was carried out off-line using a custom analysis module “pyeeg-python” developed in Python 3. 2.6 Behavioural discrimination In all experiments, the ability of CI participants to behaviourally discriminate either between AM frequencies or between electrodes was assessed with a 3-interval 2-alternative forced choice task. Each interval lasted 400 ms and there was a 500 ms silent gap between intervals. The first interval always contained the reference stimulus, and the second and third contained either the reference or the target, and the participant indicated whether the 2nd or 3rd presentation was different to the other two. A total of 20 trials were presented for each pair of contrasts and based on binomial significance levels (p < 0.01) the discrimination threshold was set at 80%. The stimuli used in the behavioural discrimination were identical to those used to measure the brain response. [Table 1 about here.] [Table 2 about here.] 2.7 Statistical analyses Analysis of variance (ANOVA) were carried out by means of linear mixed-effects (LME) models for which the factor subject was set as random. LME models with more than one factor were fitted and reduced using a backward stepwise reduction method. The degrees of freedoms were estimated by means of Satterthwaite approximation (Kuznetsova et al., 2017). If outliers were detected these were removed and the LME model was refitted. Conditional, random, and marginal residuals of the LME model were checked by visual inspection. For all LME models, the variance explained by the fixed factors, marginal R2 (R2m), and by fixed and random factors, conditional R2 (R2c), are reported as a tandem (R2m/c; Nakagawa and Schielzeth (2013) and Harrison et al. (2018)). Post-hoc comparisons were carried using trimmed means when heteroscedasticity was observed (Field and Wilcox, 2017; Mair and Wilcox, 2020). 11 All statistical analyses were performed using the R software package (R Development Core Team, 2020) and the significance level was set to α = 0.05. 3 Results 3.1 Behavioural discrimination of temporal cues is affected by pulse polarity in CI listeners We first examined the behavioural ability of CI listeners to discriminate between stimulation of different electrodes (spectral discrimination) and different AM frequencies (temporal discrim- ination) using high rate cathodic (QP-C) pulses, and low rate cathodic (QP-C) and anodic (QP-A) pulses. Overall, CI listeners performed well in both listening tasks (Fig. 2), and only one participant (S4) was unable to discriminate between stimulation of the most apical electrode pair (Fig. 2A). Similarly, in the AM discrimination task, although discrimination using the high pulse rate (801 pps) was at ceiling for all participants (Fig. 2B), sensitivity to AM cues using the low pulse rate (126 pps) was more variable (Fig. 2C). To determine which factors might explain the variability in performance across listeners at the lower (126 pps) pulse rate, a generalized LME model (using a logit family) was fitted to the data, with the significance of the response as the dependent variable. The model including factors modulation depth and polarity was significantly better than the null hypothesis model [χ2(2) = 6.12, p = 0.046, R2m/c = 0.427/0.654]; in only one case (S5, Fig. 2C) the AM frequency was not discriminable with QP-A pulses, whilst this occurred for four cases using QP-C pulses. As well as differences between listeners, we also observed differences within CI listeners in their ability to discriminate AM frequencies. Subjects S1 and S3 could discriminate between the two AM frequencies for both polarities and modulation depths, whilst subject S5 could only discriminate AM frequencies at 100% modulation depth for both polarities. Subject S2 could discriminate only QP-A pulses for both modulation depths, and S4 could discriminate all conditions except for QP-C pulses at 50% modulation depth. Overall, our data demonstrate that CI listeners were able to discriminate differences in spectral cues elicited by stimulation of different electrode pairs, and that discrimination of AM cues varies as a function of the electrical pulse rate at which the cues are conveyed, as well the depth 12 of modulation and the pulse polarity. [Figure 2 about here.] 3.2 CI users show robust neural discrimination between electrodes during ongoing stimulation We next assessed cortical sensitivity to spectral cues in CI listeners using our alternating ACC paradigm, alternating periodically, at a slow rate (0.5 Hz), the stimulated electrode of an adjacent pair of electrodes (Fig. 1B). As with the standard paradigm (Mathew et al., 2017, 2018; He et al., 2014; Brown et al., 2008) neural responses to the electrode discrimination task evoked large ACC responses. Typical neural responses and topographic maps from the three electrode pairs confirm that electrical artefacts were successfully attenuated from the recordings (see Appendix for details), evidenced by the absence of a large area of activation on the side nearest the implanted (left) ear (Fig. 3, for CI participant S5). Overall, transient ACC responses were obtained from all participants and for all electrode pairs (Fig. 4) using the alternating ACC paradigm. To facilitate description of the EEG data, we selected the channel with the largest SNR for each participant. Across participants, F-values estimated via Hotelling-T2 test ranged between 4.38 and 144.79 (with a critical F for a significance level of p < 0.05 of 1.97). A LME model with factor electrode pair indicated that neither the GFP [F(2, 8) = 3.87, p = 0.07, R2m/c = 0.033/0.941] nor N1 latency [F(2, 8) = 0.21, p = 0.81, R2m/c = 0.006/0.787] were explained by this factor. Across subjects and electrodes, N1-P2 amplitudes ranged between 1.89 and 10.39 µV (s.d. = 2.80µV), whilst N1 latencies spanned 93.0 to 207.5 ms (mean = 122.1 ms, s.d. = 33.3 ms). There was good correspondence between behavioural and ACC responses, with 14 out 15 agreements between successful behavioural discrimination and objective detection of the response. Note that participant S4, who showed poor electrode discrimination for one electrode pair in the behavioural task (Fig. 2A), nevertheless showed strong ACC responses for all three electrodes comparisons. This is consistent with our previous study where we found that, for some participants, the ACC response preceded the ability to discriminate electrodes in a behavioural task (Mathew et al., 2018), and suggest that electrode discrimination may be limited at a more central (maladapted) stage in the brain. 13 The data confirm that periodic electrode alternations at a low rate in an ongoing and continuous stimulus evoke reliable ACC responses (Mathew et al., 2017, 2018). [Figure 3 about here.] [Figure 4 about here.] 3.3 Neural temporal discrimination differs between NH and CI listeners Sensitivity to AM, and specifically to changes in AM frequency, is a critical factor in speech understanding. To assess the dynamics of AM sensitivity, we measured the cortical response to transitions in AM frequency in both NH and CI listeners, employing the transient ACC evoked by a low, alternating rate (0.5 Hz) in AM frequency (alternating between ≈ 20 and 35 Hz; Table 1). In NH listeners, reliable cortical activation to transitions in AM frequency—assessed in terms of the GFP of transient ACC responses—were evoked at 100% AM depth (Fig. 5A), and were significantly smaller [F(1, 5) = 24.05, p = 0.004, R2m/c = 0.452/0.793] at 50% AM depth (Fig. 5B & E). No significant effect was observed in terms of the latency of the transient ACC as a function of modulation depth (Fig. 5F). Across NH listeners, Hotelling-T2 F-values ranged between 3.8 and 8.6 for 50% modulation depth and 6.1 and 37.3 for 100% modulation depth. In addition, significant ASSRs were simultaneously obtained from all NH listeners at both 50% and 100% AM depths (Fig. 5C & D), and, for these, we observed a significant interaction between the AM frequency and modulation depth [F(1, 15) = 4.79, p = 0.045, R2m/c = 0.179/0.610]. Overall, 35 Hz ASSRs were significantly larger at 100%, than at 50% AM depth [t(5) = 6.34, p = 0.001, d′ = 0.38], whilst 20 Hz ASSRs were not significantly different as a function of AM depth. These results suggest that, for NH listeners, ACC responses are highly sensitive to AM depth, and that ASSRs, whilst affected by the AM depth, are able to encode AM frequencies at both depths. [Figure 5 about here.] Like NH listeners, CI listeners also showed significant alternating ACC responses to dynamic AM cues, but this sensitivity was highly dependent on pulse rate. 14 For the high pulse rate of 801 pps (presented as QP-C pulses) ACC responses were robust; in agreement with the behavioural data (Fig. 2B), transient ACC responses in 4 CI subjects (S5 was unable to take part in this experimental session) were clearly evoked to both AM modulation depths at this higher pulse rate (Figures 6A & B). This indicates that neurally, as well as behaviourally, the two AM rates are reliably encoded in CI listeners. In contrast to NH listeners, however, the magnitude of the transient ACC response was not significantly affected by modulation depth; a LME model indicated that the factor modulation depth was not significant for the GFP [F(1, 1.93) = 0.19, p = 0.699, R2m/c = 0.006/0.851] or for the latency [F(1, 2.08) = 0.76, p = 0.471, R2 = 0.014/0.903] (Figures 6M & N). Hotelling-T2 F-values ranged between 10.3 and 38.6, comparable to the range observed in NH listeners. We compared the GFP and the latency of ACC responses between NH and CI using a LME model with factors modulation depth and group (NH or CI) as independent variables. The analysis indicated a significant interaction between modulation depth and group [F(1, 7.20) = 8.94, p = 0.009, R2m/c = 0.449/0.813], confirming that ACC responses for the two modulation depths differed for NH and CI listeners. Latencies, however, were not significantly affected by modulation depth. The mean N1 latencies were 182 ms (s.d. = 62 ms) and 164 ms (s.d. = 17 ms) for NH and CI listeners, respectively. Next, we compared ACC responses generated by alternating between different electrodes (spectral alternations) and between different AM frequencies (temporal alternations) at 801 pps pulse rate in CI listeners. A LME model with the factor type (electrode and AM) indicated that the GFP [F(1, 14.01) = 54.52, p < 0.001, R2m/c = 0.115/0.965] and latency [F(1, 15.11) = 34.35, p < 0.001, R2m/c = 0.428/0.768] differed significantly, with ACC responses evoked by alternating electrodes (stimulated at 827 pps) around 50% larger, and latencies 42 ms shorter, than those evoked by AM changes (stimulated at 801 pps). [Figure 6 about here.] In contrast to the 801 pps pulse rate, the 126 pps pulse rate showed a relatively poor corre- spondence between behaviourally sensitivity (Fig. 2C) and transient ACC responses, with only 2 out of 10 agreements when anodic (QP-A) pulses were employed. This was improved only marginally (4 out of 10 agreements) when cathodic (QP-C) pulses were employed. Overall, behavioural discrimination was more sensitive than ACC to changes in AM frequency; transient 15 ACC responses for both QP-A (Fig. 6E & F) and QP-C (Fig. 6I & J) pulse trains, respectively, across all subjects, revealed to be relatively weak and variable. Indeed, for QP-A pulses, average traces were dominated by the responses of S3 at 100% (Fig. 6E), and S3 and S5 at 50% (Fig. 6F) modulation depth, whilst for QP-C average traces were dominated by responses from S2 and S3 at 100% (Fig. 6I), and S1, S2, S3 at 50% (Fig. 6J) modulation depth. Similarly, ASSRs (embedded on the transient waveform) to the two AM frequencies were clearly observed in the frequency domain for S2 and S3, whose responses dominate the average waveform (Fig. 6G, H, K, & L). Across all participants and polarities, Hotelling-T2 F-values ranged between 2.6 and 18.1 for 50% modulation depth and 2.0 and 18.4 for 100% modulation depth (with a critical F of 1.97 for a significance level of p < 0.05). Although this analysis suggests that significant transient ACC responses were obtained from all participants, visual inspections revealed that some significant responses were not evoked by transitions in the AM frequency but, rather, by the 20 Hz ASSR component present in the response (Appendix ??). A LME model confirmed that neither modulation depth nor polarity had a significant effect on the GFP or the latency of response (Fig. 6O-R). The data suggest that low pulse rates do not elicit a strong ACC and, consequently, are relatively insensitive to polarity effects. A plausible explanation for this is that the perceptual change resulting from transitions in the AM frequency is relatively weak due to the low carrier-rate used to convey the AMs; the 126 pps pulse train is only about 4 times higher than the highest AM rate we employed. This is reflected in the relative increase in variability of behavioural responses (cf. Figures 2B & C). Thus, whilst the stimuli are behaviourally discriminable, neural synchronization and/or activation may be reduced, leading to poor transient ACC responses. Overall, the data confirm that, as in NH listeners, the ACC is sensitive to alternating AM frequencies in CI users, but that this (and behavioural) sensitivity, depends upon the specific choice of electrical pulse rates applied in the CI processor; poor cortical sensitivity to the low (126 pps) pulse rate was not dependent on pulse polarity. Further, despite high (801 pps) pulse rates evoking large cortical ACC responses in CI listeners, these responses were insensitive to changes in AM depth, and cortical responses to alternating AM frequencies (temporal cues) were significantly smaller, and of longer latency, than responses to alternating electrodes (spectral cues). 16 3.4 CI listeners show hyper-adapted cortical responses to speech-like alternating cues While CI listeners show robust ACC responses to slow spectral and temporal transitions, it is unclear whether these cues remain robust at more realistic (i.e. speech-like) alternating rates. Here, we re-assessed neural encoding of spectral and temporal cues by presenting them at a higher alternating rate (6 to 7 Hz) to evoke AC-FRs (Fig. 1C). We first recorded AC-FRs to alternations in AM frequency in NH listeners at a rate of 6.4 Hz (Fig. 7A-D). At this alternating rate, clear AC-FRs were observed (see peaks in frequency-domain responses in Fig. 7C-D), although ASSRs to the AM frequencies were much smaller, likely because the short alternating period (156 ms), which is insufficient to generate substantial ASSRs in NH listeners (Ross et al., 2002). Significant responses were obtained from all participants at 100% AM depth, with Hotelling-T2 F-values ranging between 3.25 and 9.0 (critical F value 3.14 for p < 0.05). A LME model indicated that both frequency [F(2, 6.01) = 7.99, p = 0.020, R2m/c = 0.508/0.723] and modulation depth [F(1, 22.00) = 7.01, p = 0.015, R2m/c = 0.508/0.723] had a significant effect on the GFP (Fig. 7E). The mean latency of the steady-state responses (Eq. 2; Fig. 7F) generated by the alternating rate (6.4 Hz) between the two AM frequencies, and the AM frequencies in itself (19.2 and 32 Hz) were 90.0, 65.8, and 44.8 ms, respectively. Latencies of the responses to 6.4 Hz (where responses were reliably obtained) were not affected by the modulation depth. These data are consistent with those from the low alternating rate and demonstrate that AC-FRs to AM frequency alternations could be obtained from all NH listeners at rapid alternating rates, in a similar way to those evoked with binaural stimuli (Dajani and Picton, 2006; Haywood et al., 2015; McAlpine et al., 2016; Undurraga et al., 2016). [Figure 7 about here.] In contrast to NH listeners, steady-state cortical responses in CI listeners were considerably less robust to speech-like alternating rates. We assessed whether AC-FRs in CI listeners could phase-lock to the rapidly alternating AM frequencies (6.9 Hz), for pulse rates of 801 or 126 pps. However, in comparison to the robust ACC generated by the low (0.5 Hz) alternating rate using the 801 pps pulse rate, this same pulse rate did not generate a significant AC-FR; ASSRs were evident at the AM frequencies, but not AC-FRs at the 6.9 Hz alternating rate (Fig. 8A-D). Note 17 that the magnitude of the AC-FRs did not change when data were reanalyzed without applying spatial filtering for artifact rejection. Unsurprisingly, at a pulse rate of 126 pps, AC-FRs were largely absent for either anodic or cathodic pulse shapes (Fig. 8E-L), consistent with the transient ACC responses obtained under the same conditions. In contrast, relatively strong ASSRs were observed at the two AM frequencies (Fig. 8, vertical lines in Fig. 8G, H, K, & L) as well as comodulations around those rates, as demonstrated by the spectral peaks near the main AM frequencies. Comodulations were caused by the beating of each AM frequency within the analysis window used to compute the frequency-domain response (4.01 seconds). Individually, each AM frequency is effectively presented at a rate of 3.45 Hz (half the alternating rate), resulting in a temporal beating which is equivalent to a second order modulation observed around each AM frequency ± 3.45 Hz. The data confirm that, for CI listeners, ASSRs are observed, even when the duration of each AM frequency block (≈ 145 ms) is close to the minimum time required to evoke ASSRs in NH listeners (Ross et al., 2002). However, consistent with an excess of cortical neural adaptation in CI listeners, AC-FRs were smaller than in NH listeners at this alternating rate (≈ 6 dB difference at full modulation depth). [Figure 8 about here.] We next investigated whether a similar excess of neural adaptation was observed with responses to alternating spectral cues (i.e. alternating between stimulating electrodes) in CI listeners. AC-FRs generated by alternating electrodes (6.1 Hz; Fig. 9) were clearly observed, although of smaller amplitude than AC-FRs from NH listeners. A LME analysis of the 6.1 Hz response indicated that neither GFP [F(2, 7.99) = 0.05, p = 0.951, R2m/c = 0.007/0.016] nor latencies [F(2, 8) = 1.25, p = 0.336, R2m/c = 0.124/0.307] were affected by which electrode pair was assessed in the alternating paradigm. In general, although relatively small, significant responses were observed at all electrodes; Hotelling-T2 F-values ranged between 3.32 and 24.0 (critical F value of 3.1 for a significance level of p<0.05). A reanalysis of these results without using spatial filtering to reject electrical artefacts led to the same results (data from S4, electrode pair 3 was excluded from this reanalysis due to unacceptable artefact contamination). The data indicate that AC-FRs from CI listeners were significantly reduced when compared to their transient ACC responses. This reduction is consistent with the hypothesis that, compared 18 to acoustic stimulation in NH listeners, electrical stimulation in CI listeners generates greater adaptation in cortical neurons in response to speech-like rates alternations of spectro-temporal cues. To quantify the overall level of neural adaptation between low and high alternating rates, the GFP obtained with the ACC and the AC-FR were used to compute the adaptation index (AI) as: AI = 20 · log10 (GFPACC/GFPAC−FR) (3) [Figure 9 about here.] First, we assessed whether the AM depth or the electrode discrimination affected the AI in CI. Neither modulation depth [F(1, 5) = 0.95, p = 0.374, R2m/c = 0.057/0.340 for NH; F(1, 2) = 0.18, p = 0.713, R2m/c = 0.014/0.614 for CI] nor electrode pair [F(2, 8) = 2.26, p = 0.166, R2m/c = 0.059 / 0.818] had a significant effect on the AI. We then assessed whether the AI differed between alternating AM frequencies and electrode stimulation in CI listeners. The analysis indicated that AI was not affected by the experiment type [F(1, 1.78) = 0.272, p = 0.659, R2m/c = 0.004/0.784]. Next, we compared whether the AI for the AM experiment was different between CI and NH listeners. The mean AI for CI listeners across all conditions was 25.6 dB (s.d. = 6.6 dB), significantly greater [F(1, 7) = 7.58, p = 0.028, R2m/c = 0.392/0.665] than that for the NH listeners (mean AI of 19.1 dB; s.d. = 4.8 dB). The difference between NH and CI listeners (pooling the AI across all conditions; Fig. 10) was confirmed by a robust non-parametric analysis of variance [F(1, 18.72) = 14.01, p = 0.001, ξ = 0.67]. Furthermore, to control for a potential bias of our artefact cancellation, in which the AC- FR amplitudes may have resulted smaller when removing DSS components, we repeated the previous analysis using the raw AC-FR, i.e. without artefact removal. The results (not reported here) remained virtually the same. These data confirm that, when presented at a speech-like alternating rate, spectral or temporal cues are considerably more adapted in CI listeners than in NH listeners, potentially accounting for some of the difficulties CI listeners have in encoding dynamic spectral and temporal cues critical to speech processing. [Figure 10 about here.] 19 4 Discussion We assessed the ability of CI users to process dynamic patterns of spectro-temporal modulations, including at rates present in typical speech sounds. By means of the alternating ACC paradigm, we elicited cortical responses to dynamic spectral and temporal changes, generated by alternating the stimulating electrode of a pair of electrodes, or the AM frequency of a single electrode, respectively. We obtained significant cortical responses when alternating between stimulation of pairs of CI electrodes at a slow, 0.5 Hz, rate. For the same, slow alternating rate, we also demonstrated that transient ACC responses were evoked in CI listeners by alternating the AM frequency, from all participants at the high (801 pps) pulse rate, and for some listeners at the low (126 pps) rate. ACC responses were dramatically reduced in NH listeners when AM depth was reduced from 100% to 50%. This was not the case for CI users, where similar high-amplitude responses were obtained for both 100% and 50% AM depths, when stimulating at the highest pulse rate (801 pps). Overall, anodic (QP-A) pulse trains were easier to discriminate than cathodic (QP-C) pulse trains. However, cortical ACC responses evoked with low-rate pulse trains (126 pps) were weak regardless of stimulus polarity. Finally, we showed that rapid alternating cues evoked AC-FRs in both NH and CI listeners. However, responses from CI participants were much more adapted than those from NH listeners, confirming that cortical activity decreased faster for CI than NH listeners. 4.1 Neural adaptation to changes in an ongoing stimuli Multiple factors likely contribute to poor speech understanding in CI users. Whilst neural entrainment to speech envelopes plays a critical role in the encoding of speech (e.g. Ding and Simon, 2012; Peelle et al., 2013; Broderick et al., 2019), our data suggest that the inability to represent changes in spectro-temporal cues at rates common to natural speech may be critical in CI listeners. This does not preclude CI users exploiting already degraded static spectro-temporal processing. Although it was not possible to evoke robust AC-FRs to rapid (≈ 7 Hz) alternating rates in CI users, significant ASSRs (20 and 35 Hz) were observed at both slow (126 pps) and fast (801 pps) pulse rates. This suggests that strong neural synchronization to electrical stimulation in primary auditory neurons allows for sub-cortical neural generators of the ASSR to follow AM 20 frequencies in the range of a few tens of Hz, but that the cortical generator of the AC-FR is rapidly adapted and, consequently, unable to track AM transitions that occur at speech-like rates. This supports the hypothesis that rapid changes in spectro-temporal cues conveyed by highly synchronized electrical pulses induce greater neural adaptation at the cortical level. This is not the case in NH listeners, for whom neural activity is likely less synchronized, allowing for sparse neural activation and neural tracking of rapidly fluctuating cues. If the ability to encode and discriminate dynamic spectro-temporal cues at a cortical level is intrinsic to speech understanding, it suggests one reason why CI listeners may be limited in exploiting these cues (Zhang et al., 2019); excessive adaptation in the cortex to changing spectro-temporal cues. Together with the limited number of independent channels, the highly synchronized neural response of the inner ear may lead to an overly adapted neural response at the cortical level. 4.2 Temporal processing in NH and CI listeners At a pulse rate of 801 pps—representative of pulse rates employed in clinical devices—ACC responses were well defined, and we observed two distinct differences between NH and CI listeners. First, in NH listeners, ACC and AC-FRs were significantly smaller when the AM depth was reduced to 50%. Conversely, CI listeners were less sensitive to changes in AM depth—likely arising from the sharp amplitude growth functions and narrow dynamic range typical of CI users (e.g. Cohen, 2009; Zeng and Shannon, 1992; Zeng et al., 2002)—and ACC responses were indistinguishable in terms of magnitude and latency for the two modulation depths (Figures 6K & L), whilst AC-FRs were absent. This indicates that the poor perceptual difference between fully and partially AM’ed stimuli for CI listeners is reflected by large cortical activations, and indicates that shallow AMs may be overrepresented, potentially deteriorating speech perception (Moore et al., 1995; Moore and Glasberg, 1993). At a lower pulse rate (126 pps), cortical representation was poor compared to 801 pps, where ACC responses were well defined and behavioural performance was at ceiling, supporting the hypothesis that low pulse rates have a detrimental effect on the representation of temporal envelopes. 21 4.3 Effect of polarity in AM encoding At low pulse rates (126 pps) AM frequencies conveyed by anodic QP-A pulses were easier to discriminate (at both AM depths) than those conveyed by cathodic (QP-C) pulses, consistent with observations in CI listeners that anodic pulses are more effective than cathodic ones (Macherey et al., 2008; Undurraga et al., 2010, 2012, 2013). This supports the hypothesis that interactions between place-of-excitation and pulse polarity may distort envelope cues at the level of the auditory nerve (Joshi et al., 2017). In our study, two participants showed some polarity sensitivity, and of the 5 instances where CI listeners were unable to distinguish behaviourally different AM frequencies, 4 came from these two participants. Contrary to their behavioural scores, however, transient ACC/ASSRs represented the predominant feature of their cortical responses (albeit of relatively small magnitude), but were largely absent in other participants. This suggests that responses from participants with lower anodic thresholds may reflect highly synchronized somatic or post-somatic activation of ANFs, compared to the relatively less-synchronized activation of peripheral processes evoked by cathodic stimulation (Undurraga et al., 2010, 2012, 2013). Furthermore, Macherey et al. (2017) measured loudness growth functions in human CI listeners using both QP-C and QP-A pulse trains presented at a rate of 100 pps. They found that loudness growth functions with QP-A pulse trains increased monotonically with the stimulus amplitude. This was not the case for QP-C pulse trains, and several listeners show clear non-monotonic loudness growth functions. Thus, it may also be possible that the effective modulation depth is shallower for QP-C than for QP-A pulse trains, difficulting the encoding the different AM frequencies, and thus, discrimination. 4.4 AM discrimination as a measure of temporal envelope processing Our use of an AM frequency discrimination task to assess temporal envelope processing is highly novel. CI users are able to understand speech based on the AM’ed cues conveyed by the envelope at each channel (Shannon et al., 1995; Fu and Shannon, 2002), and many studies have measured AM discrimination thresholds behaviourally (e.g. Galvin and Fu, 2005; Garadat et al., 2012; Chatterjee and Peng, 2008; Galvin and Fu, 2009) or objectively (Han and Dimitrijevic, 2015; Han and Dimitrijevic, 2020; Gransier et al., 2019). However, these studies compared an unmodulated reference signal to a modulated target signal in which the AM depth was reduced until it was 22 no longer distinguishable from the reference. A problem associated with this method is the confounding factor of available loudness cues, which may contribute to participants’ judgments in a detection or discrimination task (McKay and Henshall, 2010). Whilst these cue can be minimized—for example, by adjusting signal intensity to maintain loudness perception as a function of the AM depth, or roving the overall signal amplitude between trials—this procedure usually requires obtaining significant behavioural data and, even then, does not remove the cue as a potential confound. This problem can be largely avoided if AM frequency discrimination is measured with both AM frequencies having the same AM depth—the loudness of low pulse rate AM pulses is almost constant (Kreft et al., 2010; Gransier et al., 2016; McKay and Henshall, 2010). Thus, provided that the AM frequency difference is above the discrimination limen at 100% AM depth, the AM-frequency discrimination task should reflect the participant’s ability to exploit temporal envelope cues per se, and not loudness cues. 4.5 Neural processing of spectral and temporal envelope cues The ACC response is largely dependent on the extent to which a stimulus change can be perceived. A potential explanation for AM alternations generating longer-latency responses than electrode changes in CI listeners is that neural responses to AM were perceptually weaker than those elicited by electrode changes. However, taking into account that behavioural discrimination of AM was at ceiling at 801 pps, an alternate interpretation is that the generator of the response requires a longer integration time to process the change in AM. The average period of a 20 Hz (50 ms) and a 35 Hz (28 ms) AM cycle is 39 ms, almost identical to the latency difference (42 ms) between electrode and envelope ACC responses, suggesting that neurons require at least one cycle of the AM stimuli to process the switch in AM frequency. 4.6 Conclusions Using the alternating ACC paradigm at rates that regulate the amount of neural adaptation, we demonstrate that spectro-temporal cues are considerable more adapted in CI listeners compared to NH listeners. Excess of cortical adaptation to spectro-temporal modulation patterns may limit the extraction of important dynamic cues for the understanding of speech, in particular, in background noise and may explain some of the variability in performance observed across CI 23 listeners, in addition to the limited number of independent channels, and differential sensitivity to stimulus polarity reflecting neural health. Finally, whilst the results were consistent across the high-performing CI listeners that took place in our study, the sample size is very limited and results should not be generalized. Further studies should investigate the degree of neural adaptation to speech-like alternating rates in a much larger sample size. Acknowledgements Authors acknowledge support from the National Cochlear Implant Users Association of the UK and from the SCIC. The authors also thank Dr WaiKong Lai (SCIC) for his useful comments and discussions during data collection. The authors also thank Simone de Rijk for implanting the human head and assistance during the artefact assessment. Author Deborah Vickers is the recipient and supported by a Medical Research Council Senior Fellowship (MR/2002537/1), author Manohar Bance is supported by Evelyn Trust Cambridge, and authors Jaime Undurraga, Lindsey Van Yper, and David McAlpine are recipients and supported by a grant from the Australian Research Council (FL160100108). References Bierer, JA (2007). “Threshold and channel interaction in cochlear implant users: Evaluation of the tripolar electrode configuration”. The Journal of the Acoustical Society of America 121, 1642–1642. Bierer, JA, Bierer, SM, and Middlebrooks, JC (2010). “Partial tripolar cochlear implant stimu- lation: Spread of excitation and forward masking in the inferior colliculus.” Hearing research, 1–9. Broderick, MP, Anderson, AJ, and Lalor, EC (2019). “Semantic Context Enhances the Early Auditory Encoding of Natural Speech”. J Neurosci 39, 7564–7575. Brown, CJ, Etler, C, He, S, O’Brien, S, Erenberg, S, Kim, JR, Dhuldhoya, AN, and Abbas, PJ (2008). “The Electrically Evoked Auditory Change Complex: Preliminary Results from Nucleus Cochlear Implant Users”. Ear Hear 29, 704–717. Busby, PA and Clark, GM (2000). “Electrode Discrimination by Early-Deafened Subjects Using the Cochlear Limited Multiple-Electrode Cochlear Implant”. Ear and Hearing 21, 291–304. 24 Chandrasekaran, C, Trubanova, A, Stillittano, S, Caplier, A, and Ghazanfar, AA (2009). “The Natural Statistics of Audiovisual Speech”. PLoS Comput Biol 5. Ed. by KJ Friston, e1000436. Chatterjee, M and Peng, SC (2008). “Processing F0 with cochlear implants: Modulation frequency discrimination and speech intonation recognition”. Hearing Research 235, 143–156. Chen, H and Zeng, FG (2004). “Frequency modulation detection in cochlear implant subjects”. The Journal of the Acoustical Society of America 116, 2269–2277. Cohen, LT (2009). “Practical model description of peripheral neural excitation in cochlear implant recipients: 1. Growth of loudness and ECAP amplitude with current.” Hearing research 247, 87–99. Croghan, NBH, Duran, SI, and Smith, ZM (2017). “Re-examining the relationship between number of cochlear implant channels and maximal speech intelligibility”. The Journal of the Acoustical Society of America 142, EL537–EL543. Dajani, HR and Picton, TW (2006). “Human auditory steady-state responses to changes in interaural correlation.” Hearing research 219, 85–100. De Cheveigné, A and Simon, JZ (2008). “Denoising based on spatial filtering.” Journal of neuroscience methods 171, 331–9. Deprez, H, Gransier, R, Hofmann, M, van Wieringen, A, Wouters, J, and Moonen, M (2017). “Characterization of cochlear implant artifacts in electrically evoked auditory steady-state responses”. Biomedical Signal Processing and Control 31 (Supplement C), 127–138. Ding, N and Simon, JZ (2012). “Emergence of neural encoding of auditory objects while listening to competing speakers.” Proceedings of the National Academy of Sciences of the United States of America 109, 11854–9. Eisen, MD and Franck, KH (2005). “Electrode interaction in pediatric cochlear implant subjects.” Journal of the Association for Research in Otolaryngology : JARO 6, 160–70. Field, AP and Wilcox, RR (2017). “Robust statistical methods: A primer for clinical psychology and experimental psychopathology researchers”. Behaviour Research and Therapy. Best Practice Guidelines for Modern Statistical Methods in Applied Clinical Research 98, 19–38. Fu, Qj and Shannon, RV (2002). “Frequency mapping in cochlear implants.” Ear and hearing 23, 339–48. 25 Galvin, JJ and Fu, QJ (2005). “Effects of Stimulation Rate, Mode and Level on Modulation Detection by Cochlear Implant Users”. J Assoc Res Otolaryngol 6, 269–279. – (2009). “Influence of stimulation rate and loudness growth on modulation detection and intensity discrimination in cochlear implant users”. Hearing Research 250, 46–54. Garadat, SN, Zwolan, TA, and Pfingst, BE (2012). “Across-site patterns of modulation detection: relation to speech recognition”. J. Acoust. Soc. Am. 131, 4030–4041. Geier, LL and Norton, SJ (1992). “The effects of limiting the number of Nucleus 22 cochlear implant electrodes programmed on speech perception.” Ear and hearing 13, 340–8. Golding, M, Dillon, H, Seymour, J, and Carter, L (2009). “The detection of adult cortical auditory evoked potentials (CAEPs) using an automated statistic and visual detection”. International Journal of Audiology 48, 833–842. Gransier, R, David, W, Guerit, F, Richardson, M, Middlebrooks, JM, Carlyon, RP, and Wouters, J (2019). “Effect of Modulation Depth on the 40-Hz Auditory Steady-state Response and Auditory Change Complex in CI Users and Normal-Hearing Listeners.” In: The 2019 Conference on Implantable Auditory Prostheses (CIAP). The 2019 Conference on Implantable Auditory Prostheses (CIAP), Lake Tahoe, USA. Gransier, R, Deprez, H, Hofmann, M, Moonen, M, van Wieringen, A, and Wouters, J (2016). “Auditory steady-state responses in cochlear implant users: Effect of modulation frequency and stimulation artifacts”. Hearing Research 335, 149–160. Han, JH and Dimitrijevic, A (2020). “Acoustic Change Responses to Amplitude Modulation in Cochlear Implant Users: Relationships to Speech Perception”. Front. Neurosci. 14. Han, JH and Dimitrijevic, A (2015). “Acoustic change responses to amplitude modulation: a method to quantify cortical temporal processing and hemispheric asymmetry”. Front. Neurosci. 9. Harrison, XA, Donaldson, L, Correa-Cano, ME, Evans, J, Fisher, DN, Goodwin, CE, Robinson, BS, Hodgson, DJ, and Inger, R (2018). “A brief introduction to mixed effects modelling and multi-model inference in ecology”. PeerJ 6. Haywood, NR, Undurraga, JA, Marquardt, T, and McAlpine, D (2015). “A Comparison of Two Objective Measures of Binaural Processing: The Interaural Phase Modulation Following Response and the Binaural Interaction Component”. Trends Hear 19. 26 He, S, Grose, JH, Teagle, HF, and Buchman, CA (2014). “Objective Measures of Electrode Discrimination with Electrically-Evoked Auditory Change Complex and Speech Perception Abilities in Children with Auditory Neuropathy Spectrum Disorder”. Ear Hear 35, e63–e74. Henry, BA, McKay, CM, McDermott, HJ, and Clark, GM (2000). “The relationship between speech perception and electrode discrimination in cochlear implantees”. The Journal of the Acoustical Society of America 108, 1269–1280. Herdman, AT, Lins, O, Roon, PV, Stapells, DR, Scherg, M, and Picton, TW (2002). “Intracere- bral Sources of Human Auditory Steady-State Responses”. Brain Topogr 15, 69–86. Hofmann, M and Wouters, J (2010). “Electrically evoked auditory steady state responses in cochlear implant users.” Journal of the Association for Research in Otolaryngology : JARO 11, 267–82. Holden, LK, Finley, CC, Firszt, JB, Holden, TA, Brenner, C, Potts, LG, Gotter, BD, Vanderhoof, SS, Mispagel, K, Heydebrand, G, and Skinner, MW (2013). “Factors affecting open-set word recognition in adults with cochlear implants”. Ear Hear 34, 342–360. John, MS and Picton, TW (2000). “Human auditory steady-state responses to amplitude- modulated tones: phase and latency measurements.” Hearing research 141, 57–79. Joshi, SN, Dau, T, and Epp, B (2017). “Effect of Uncertainty in Site of Spike Generation on Envelope Coding in the Electrically Stimulated Auditory Nerve”. ARO. Kreft, HA, Nelson, DA, and Oxenham, AJ (2013). “Modulation Frequency Discrimination with Modulated and Unmodulated Interference in Normal Hearing and in Cochlear-Implant Users”. J Assoc Res Otolaryngol 14, 591–601. Kreft, HA, Oxenham, AJ, and Nelson, DA (2010). “Modulation rate discrimination using half-wave rectified and sinusoidally amplitude modulated stimuli in cochlear-implant users”. The Journal of the Acoustical Society of America 127, 656–659. Kuznetsova, A, Brockhoff, PB, and Christensen, RHB (2017). “lmerTest Package: Tests in Linear Mixed Effects Models”. Journal of Statistical Software 82, 1–26. Lewis, JW (2014). “ALAN CRUTTENDEN , Gimson’s Pronunciation of English, 8th edition. London & New York: Routledge, 2014. Pp. xxvi + 381. ISBN: 978-0-415-72174-5 (hbk), ISBN: 978-1-4441-8309-2 (pbk), ISBN: 978-0-203-78496-9 (ebk) Website: www.routledge.com/cw/”. Journal of the International Phonetic Association 44, 297–299. 27 Luke, R, Van Deun, L, Hofmann, M, van Wieringen, A, and Wouters, J (2015). “Assessing temporal modulation sensitivity using electrically evoked auditory steady state responses”. Hear. Res. 324, 37–45. Macherey, O, Carlyon, RP, van Wieringen, A, Deeks, JM, and Wouters, J (2008). “Higher sensitivity of human auditory nerve fibers to positive electrical currents.” Journal of the Association for Research in Otolaryngology : JARO 9, 241–51. Macherey, O and Carlyon, RP (2012). “Place-pitch manipulations with cochlear implants.” The Journal of the Acoustical Society of America 131, 2225–36. Macherey, O, Carlyon, RP, Chatron, J, and Roman, S (2017). “Effect of Pulse Polarity on Thresholds and on Non-monotonic Loudness Growth in Cochlear Implant Users”. JARO, 1–15. Mair, P and Wilcox, R (2020). “Robust statistical methods in R using the WRS2 package”. Behav Res 52, 464–488. Mathew, R, Undurraga, JA, Li, G, Meerton, L, Boyle, P, Shaida, A, Selvadurai, D, Jiang, D, and Vickers, D (2017). “Objective assessment of electrode discrimination with the auditory change complex in adult cochlear implant users”. Hear Res. Mathew, R, Vickers, D, Boyle, P, Shaida, A, Selvadurai, D, Jiang, D, and Undurraga, J (2018). “Development of electrophysiological and behavioural measures of electrode discrimination in adult cochlear implant users”. Hearing Research 367, 74–87. McAlpine, D, Haywood, N, Undurraga, JA, and Marquardt, T (2016). “Objective Measures of Neural Processing of Interaural Time Differences”. In: Physiology, Psychoacoustics and Cognition in Normal and Impaired Hearing. Ed. by Pv Dijk, D Başkent, E Gaudrain, Ed Kleine, A Wagner, and C Lanting. Advances in Experimental Medicine and Biology. Springer International Publishing, 197–205. McKay, CM and Henshall, KR (2010). “Amplitude modulation and loudness in cochlear im- plantees.” Journal of the Association for Research in Otolaryngology : JARO 11, 101–11. Middlebrooks, JC and Snyder, RL (2010). “Selective electrical stimulation of the auditory nerve activates a pathway specialized for high temporal acuity.” The Journal of neuroscience : the official journal of the Society for Neuroscience 30, 1937–46. 28 Moore, BCJ and Glasberg, BR (1993). “Simulation of the effects of loudness recruitment and threshold elevation on the intelligibility of speech in quiet and in a background of speech”. The Journal of the Acoustical Society of America 94, 2050–2062. Moore, BCJ, Glasberg, BR, and Vickers, DA (1995). “Simulation of the effects of loud ness recruitment on the intelligibility of speech in noise”. British Journal of Audiology 29, 131–143. Nakagawa, S and Schielzeth, H (2013). “A general and simple method for obtaining R2 from generalized linear mixed-effects models”. Methods in Ecology and Evolution 4, 133–142. Peelle, JE and Davis, MH (2012). “Neural Oscillations Carry Speech Rhythm through to Comprehension”. Front. Psychol. 3. Peelle, JE, Gross, J, and Davis, MH (2013). “Phase-Locked Responses to Speech in Human Auditory Cortex are Enhanced During Comprehension”. Cereb Cortex 23, 1378–1387. Pfingst, BE, Burkholder-Juhasz, RA, Zwolan, TA, and Xu, L (2008). “Psychophysical assessment of stimulation sites in auditory prosthesis electrode arrays.” Hearing research 242, 172–83. Picton, TW, Vajsar, J, Rodriguez, R, and Campbell, KB (1987). “Reliability estimates for steady-state evoked potentials”. Electroencephalography and Clinical Neurophysiology/Evoked Potentials Section 68, 119–131. Picton, TW, John, MS, Dimitrijevic, A, and Purcell, DW (2003). “Human auditory steady-state responses.” International journal of audiology 42, 177–219. R Development Core Team (2020). R: A Language and Environment for Statistical Computing. Vienna, Austria: Foundation for Statistical Computing. Rodriguez, R, Picton, TW, Linden, D, Hamel, G, and Laframboise, G (1986). “Human auditory steady state responses: effects of intensity and frequency.” Ear and hearing 7, 300–13. Rosen, S (1992). “Temporal information in speech: acoustic, auditory and linguistic aspects”. Phil. Trans. R. Soc. Lond. B 336, 367–373. Ross, B, Picton, TW, and Pantev, C (2002). “Temporal integration in the human auditory cortex as represented by the development of the steady-state magnetic field”. Hearing Research 165, 68–84. Shannon, RV, Zeng, FG, Kamath, V, Wygonski, J, and Ekelid, M (1995). “Speech recognition with primarily temporal cues.” Science (New York, N.Y.) 270, 303–4. Skrandies, W (1990). “Global field power and topographic similarity”. Brain Topogr 3, 137–141. 29 Undurraga, JA, Carlyon, RP, Macherey, O, Wouters, J, and Wieringen, AV (2012). “Spread of excitation varies for different electrical pulse shapes and stimulation modes in cochlear implants”. Hear. Res. 290, 21–36. Undurraga, JA, Carlyon, RP, Wouters, J, and van Wieringen, A (2013). “The polarity sensitivity of the electrically stimulated human auditory nerve measured at the level of the brainstem.” JARO 14, 359–77. Undurraga, JA, Haywood, NR, Marquardt, T, and McAlpine, D (2016). “Neural Representation of Interaural Time Differences in Humans—an Objective Measure that Matches Behavioural Performance”. JARO, 1–17. Undurraga, JA, van Wieringen, A, Carlyon, RP, Macherey, O, and Wouters, J (2010). “Polarity effects on neural responses of the electrically stimulated auditory nerve at different cochlear sites.” Hear. Res. 269, 146–161. Valderrama, JT, Torre, Adl, and Dun, BV (2018). “An automatic algorithm for blink-artifact suppression based on iterative template matching: application to single channel recording of cortical auditory evoked potentials”. J. Neural Eng. 15, 016008. Zeng, FG, Grant, G, Niparko, J, Galvin, J, Shannon, RV, Opie, J, and Segel, P (2002). “Speech dynamic range and its effect on cochlear implant performance”. The Journal of the Acoustical Society of America 111, 377–377. Zeng, FG and Shannon, RV (1992). “Loudness balance between electric and acoustic stimulation.” Hearing research 60, 231–5. Zhang, F, Underwood, G, McGuire, K, Liang, C, Moore, DR, and Fu, QJ (2019). “Frequency change detection and speech perception in cochlear implant users”. Hearing Research 379, 12–20. 30 5 Appendix 5.1 Cadaveric recordings In our previous studies (Mathew et al., 2017, 2018) as well as in the present study we removed electrical artefacts by means of interpolation and DSS. However, it may still be possible that some residual artefacts are present and confused with a biological response. Therefore, we assessed the effectiveness of the methods to subtract electrical artefacts used in this study by measuring these artefacts from the head of an implanted human specimen. This experiment was conducted at University of Cambridge and it was approved by the ethical board of Cambridge University (19/NE/0366) and complied with the Human Tissue Act (2004). 5.1.1 Implantation The cadaveric head of a 74 years old male human was implanted in the right cochlea with a Cochlear electrode array (CI522). The sample was fresh frozen, without any preservatives to preserve the tissue in optimal conditions, and defrosted within 24 hours of the experiment. The CI was fully inserted (round window approach) and the cochlea was flushed prior to implantation using a 1% saline solution in order to remove air bubbles. The quality of the electrodes was assessed by impedance measures which were ≈ 2 kΩ for the electrodes tested (15 and 16). Recordings were conducted using the same settings as described in Methods, with the only difference we used 32 instead of a 64 EEG recording electrodes. Voltage offsets were within ±20 mV and EEG data were referenced to the channel contralateral to the implant (T7). 5.1.2 Stimuli If not stated explicitly, all stimuli consisted of QP-C pulses with 43µs phase width and 7 µs interphase gap presented in monopolar mode (MP 1+2). Stimulation level was 180 CU in all recordings. A total of 241 epochs of ≈ 1 s were presented per condition. 31 5.1.3 Topographic maps at different CI return electrode locations To characterize the topographic maps as a function of the location of the extracochlear return electrode used with monopolar stimulation, we presented symmetric biphasic pulses with a long phase width at a low rate (35 pps, 400 µs per phase, and 7µs interphase gap) and recorded the electrical artefact of these pulses. The return electrode (MP1) was placed on three different locations: 1) the promontori, 2) the posterior medial of the temporal muscle, and 3) near orbicularis oculi muscle (lateral edge of orbicularis oculi right eye). As expected, the topgraphic maps changed with the location of the return electrode (Fig. 11), confirming that the using the topological distribution and the known location of the implanted CI are a good indication to identify residual artefacts. [Figure 11 about here.] 5.1.4 Effectiveness of interpolation and DSS in the removal of artefact in ASSRs To assess the extent to which artefact removal can be achieved via interpolation and DSS, we recorded artefacts produced by a 40 Hz 100% AM’ed pulse trains (between 0 and 180 CU) using pulse rates between 100 and 902 pps. First, we proceed by interpolating the electrical artefacts starting at two different time points (the interpolation length was fixed to the pulse rate period). In the first case, interpolation was applied between points located at the onset of the electrical pulse—likely contaminated by strong electrical artefacts—and, in the second case, interpolation was applied between points located before the onset of the pulse (one eight of the respective pulse rate period), where we expected an optimal artefact cancellation. In agreement with previous studies, interpolation was inefficient when the starting point was too close to the electrical pulse (Fig. 12A), but it was optimal when the starting point preceded the electrical pulse for pulse rates between 100 and 400 pps for electrodes located contralaterally to the CI, and in all electrodes for rates below 400 pps (Fig. 12B). To fully characterize the recovery of ASSRs from contaminated recordings, we simulated and added “neural responses” to the recorded artefacts. This allowed us to quantify the effect of artefact removal on otherwise highly temporally correlated conditions (Fig. 13A & B). The artefact free neural response was set so that large activity was found in frontal electrodes and small activity towards the back of the head, similar to the observed distribution of these 32 responses when referenced to left or right mastoids (Fig. 12C & Fig. 13A). The results showed that interpolation works well, i.e. with minimal distortions, for pulse rates below 400 pps across all recording electrodes (cf. Figures 12C & D). As the pulse rate increased from and above 400 pps, artefacts started dominate the recordings obstructing the detection of the neural source. Removing residual artefact components via DSS allowed the recovery of the ASSR across all pulse rates (Fig. 12E & Fig. 13C). Only a minor residual component was observed at 400 pps on the right side and it was caused by the similarity between neural and artefact amplitudes, which made their separation difficult. [Figure 12 about here.] [Figure 13 about here.] To further investigate the relative effect the ASSR phase on the recovered response at 902 pps via DSS, a total of 16 different starting phases equally separated in steps of 22.5° were simulated at the optimal interpolation point (one eight of the pulse rate period before the onset of the pulse). The effect of the electrical artefact (Fig. 14A, Artefact panel) on the target neural response (Fig. 14A, Neural panel) resulted in a distorted amplitude-phase response in which the electrical artefact “drags” the neural response (Fig. 14A, Artefact + Neural panel) towards its own direction. By removing the DSS component with the largest energy on the side of the CI device, the amplitude of recovered ASSR was reliably for many of the simulated neural phases across channels (Fig. 14B). However, the phase and amplitude of the recovered responses were distorted. For some specific neural phases, the recovered amplitudes had smaller amplitudes and their phase was biased in orthogonal directions to the artefact phase than those of the target neural source (Fig. 14A, Recovered panel, B, & D). The estimated GFP to each simulated condition (Fig. 14C) indicated that the average difference between the target (neural) (−16.5 dB ± 0.03 dB) and the recovered GFP (−21.3 dB ± 4.40 dB) was 4.8 dB. These results demonstrate that ASSRs can be recovered from highly contaminated recordings at low pulse rates via interpolation and at high pulse rates via interpolation and DSS artefact removal. However, the amplitude may result suppressed and the phase biased when DSS is applied, depending on the specific relationship between the neural and the phase of the artefact. 33 It should be noted that the recovered topographic map was similar to that of the neural source across all conditions and, therefore, it can be used as an indication for the presence of a genuine neural response. [Figure 14 about here.] 5.1.5 Effectiveness of interpolation and DSS in the removal of artefacts by AM frequency and electrode alternations To fully characterize the limitations of our artefact rejection method, we recorded the electrical artefact from the cadaveric specimen using the AM frequency alternating paradigm. The pulse train was presented at 902 pps and the AM frequencies (20 Hz and 40 Hz) were presented at an alternating rate of 1 Hz. As in the previous section, we simulated and added “neural responses”, both ACC and ASSRs, to the recorded data. A template ACC response was imposed after each AM transition. To assess the representation of relative neural contributions, the ACC amplitudes to each AM differed by half (Fig. 15A). We also simulated ASSRs to each AM frequency, both set with the same amplitude (Fig. 15A & D). As in the previous section, we investigated the relative effect of interpolation and ASSR phase on the recovered response, a total of of 16 different neural phases were simulated equally separated in steps of 22.5°, and 8 different equally spaced starting interpolation points (between 0 and half the pulse rate period back from the pulse onset), yielding a total of 128 different simulations. The recovered ACC responses were excellent across all simulations (cf. Figures 15B, E, C & F). The difference between target and recovered amplitudes or GFP (Fig. 16A & B) was small. The target N1-P2 and cN1-cP2 GFP were 3.04 dB ± 0.70 dB and −0.89 dB ± 0.20 dB, respectively (3.94 dB difference), whilst the recovered GFP at N1-P2 and cN1-cP2 were 2.68 dB ± 0.46 dB and −2.25 dB ± 0.95 dB, respectively (4.93 dB difference). This resulted in relative differences between target and recovered response of ≈ 1 dB and confirmed the reliability of our artefact removal method for ACC responses in CI listeners (Mathew et al., 2017). [Figure 15 about here.] [Figure 16 about here.] 34 Next, we investigated the extent to which ASSRs to each AM embedded on the response could be reliably recovered. This is illustrated in Fig. 17, where both the amplitude and phase to each AM frequency are shown at Cz. At 20 Hz (Fig. 17A), the electrical artefact (Artefact panel) was stronger than at 40 Hz (Fig. 17B). As in the previous section, it can be observed how the electrical artefact drags the neural response (Neural panel) towards its own direction, distorting the phase and amplitude of the neural response (Arefact + Neural panel). By interpolating and removing the largest DSS component on the side of the CI, the amplitude of the recovered 20 Hz ASSR was reliably recovered for many of the simulated neural phases across channels (Fig. 18A). However, the phase of the recovered responses was distorted and had a direction orthogonal to the artefact (Fig. 17A and Fig. 18C). At 40 Hz, both amplitude and phase (Fig. 18A & C) of the ASSR was reliably recovered across most conditions. Slight differences were only observed when the phase of the neural response was in the direction of that of the electrical artefact (Fig. 17B & Fig. 18A). The estimated GFP to each simulated condition (Fig. 18B) indicated that at 20 Hz, the average difference between the target (neural) (−23.3 dB ± 0.42 dB) and the recovered GFP (−28.8 dB ± 5.76 dB) was 5.5 dB, whilst at 40 Hz, the average difference between the target (neural) (−23.2 dB ± 0.18 dB) and the recovered GFP (−26.2 dB ± 2.75 dB) was 3.0 dB. [Figure 17 about here.] [Figure 18 about here.] As with the alternating AM frequency paradigm, we also evaluated the ability to extract ACC responses from artefact recordings to electrode changes at 902 pps. The results were identical, ACC responses were reliably recovered in all conditions. These results demonstrate that interpolation and DSS based artefact removal method allow for successfully recovery of ACC and ASSRs. However, ASSR amplitude and phase may result distorted (smaller amplitudes and wrong phase) as a result of the strong temporal correlation between the artefact and neural waveforms. 5.1.6 Effectiveness of DSS to recover AC-FR Finally, we investigated the ability to recover AC-FRs from contaminated recordings. Because we were unable to observe electrical artifacts related to the alternating rate for either electrode 35 or AM frequency in the cadaveric recordings, we recreated a highly contaminated AC-FR scenario using the data of the alternating AM frequency paradigm (20 and 40 Hz AM frequencies alternating at 1 Hz at 902 pps). Considering that potential residual artefacts (e.g. DC current caused be electrode changes, or accumulation of charge from one AM to another) should result in periodicities that are half the alternating rate, we simulated neural signals oscillating at twice the rate of the recorded AM frequency artefacts, i.e. we recreated a transposed version of the AC-FRs paradigm with neural sources oscillating at 40 and 80 Hz (Fig. 19A). As in the previous sections, we added the simulated steady-state neural data to the contaminated recordings (Fig. 19B). A total of four different interpolation points (between zero and one eight of the pulse rate period back from the pulse onset) and 16 different neural phases were simulated (64 different simulations). The results demonstrated that we were able to recover AC-FRs in all conditions (Fig. 19C). Both phase and amplitudes were similar to that of the target neural source (Fig. 20A-E). [Figure 19 about here.] The estimated GFP to each simulated condition (Fig. 20D) indicated that at 40 Hz, the average difference between the target (neural) (−22.3 dB ± 0.005 dB) and the recovered GFP (−19.6 dB ± 1.68 dB) was −2.7 dB, whilst at 80 Hz, the average difference between the target (neural) (−22.5 dB ± 0.005 dB) and the recovered GFP (−21.0 dB ± 1.22 dB) was −1.5 dB. [Figure 20 about here.] We also explored the ability to separate DC currents by means of a periodic squared pulse train as a “neural” source. The results were identical, and we could always recover amplitude and phase reliably. Our results demonstrate that AC-FR amplitude and phase can be recovered with minimal distortions. Deviations were observed only on the side of the CI (Fig. 20C), but these occurred when the phase of the artefact was similar to that of the neural response, an unlikely realistic expectation, since neural responses occur at a different timing than electrical artefacts, i.e. they have a different phase. 5.1.7 Summary and conclusions The results, where realistic artefacts were obtained from an implanted human head, provide strong evidence that combining interpolation and DSS can significantly reduce CI artefacts. 36 ACC and AC-FRs can always be recovered at high (via interpolation and DSS) or low (via interpolation) pulse rates, whilst ASSRs can be reliably recovered at low pulse rates (via interpolation). At high pulse rates, the recovered ASSR (via interpolation and DSS) had either similar or smaller in amplitudes than the neural source. The results also demonstrated that, regardless amplitude-phase distortions, topographic maps are well preserved. Thus, topographic maps provide valuable information to assess the nature of the recovered response. We also found that ASSR amplitude and phase should be carefully assessed. For example, if the phase of the recovered response were orthogonal to that of the contaminated response, this would suggest that the phase and amplitude have been distorted. Conversely, if the phase of the recovered response is not orthogonal to that of the contaminated response, the phase may have been properly recovered, but the amplitude could be smaller than for the neural source. However, it is very unlikely to expect that the neural response will have the same phase than the electrical artefact due to implicit neural delays associated to a response. In either case, our results indicate that the presence of the ASSR (after artefact removal) is indicative of a true neural response, whilst the lack of it does not proof its absence. 5.2 Artefact removal in CI listeners 5.2.1 Transient ACC responses to AM alternation An example from one participant is shown in Fig. 21 and it is observed how the interpolation successfully removed the electrical artefacts whilst keeping the overall envelope of the EEG recording. Transient ACC responses obtained with a 0.5 Hz alternating rate are also shown in Fig. 21 for one subject, demonstrating a typical P1-N1-P2 morphology. [Figure 21 about here.] 5.2.2 AC-FR artefact cancellation to electrode alternations Fig. 22 illustrates the effect of high rate electrode alternations (6.1 Hz) in the time- and frequency- domain for one participant. The left panel shows the projection of the first DSS component on the sensor space. A large artefact contamination following a zero-phase DC drift for each alternation can be observed. In order to minimize artefact contamination, we removed that component. 37 When artefacts were not removed (left panel), we observed a large sinusoidal component whose phase was almost zero (relative to a sinusoidal signal) or having an opposite polarity (180 degrees) depending on recording electrode. Note that the topographic map was largely biased towards the side of the implant (left hemisphere for this participant). The strength of the artefact component was strongly reflected in the frequency-domain (middle panel), which it is shown as a large peak at the frequency of a full cycle (3.05 Hz) and its odd harmonics. Removing the artefact component (right panel) reduced the influence of the 3.05 Hz component and its odd harmonics and we could observe a response to the high alternating rate (6.1 Hz). Now, the topographic map shows a distribution that is similar to those observed with the low alternating rates (see Fig. 3). [Figure 22 about here.] 38 Table 1: AM frequency and electrode discrimination experimental details for each CI participant (indicated by the ID). For the AM frequency discrimination, the stimulation rate (in pps), AM frequencies, AM depths (in percentage), thresholds (THRs) and comfortable levels (CLs) (in Current Units, the clinical unit used in Cochlear devices) for QP-A and QP-C pulse shapes, are indicated. The AMs were presented at electrode 16. The 50% modulation depth, in dB, is shown in the AM depths column with 0 dB corresponding to 100% modulation depth and −6 dB corresponding to half of the dynamic range measured in a linear scale (i.e. in µA). For the CI electrode discrimination the rate, electrode pairs, and CLs for electrodes tested are indicated. For all participants and experimental conditions pulses consisted of 43 µs pulse width, and 7 µs IPG. ID AM discrimination Electrode discriminationRate [pps] AM frequencies [Hz] AM depths [%] THR-CL [CU] Rate [pps] Electrode Pair CL [CU] S1 126 20-35 100 & 50 (−4.7 dB) QPA: 94-154 827 1: 14-15 14: 122 100 & 50 (−4.7 dB) QPC: 94-154 2: 15-16 15: 122 3: 16-17 16: 122 801 20-35 100 & N.A. QPC: 57-144 17: 122 S2 126 20-35 100 & 50 (−3.9 dB) QPA: 140-185 827 1: 14-15 14: 168 100 & 50 (−4.7 dB) QPC: 140-194 2: 15-16 15: 171 3: 16-17 16: 171 801 20-35 100 & 50 (−4.5 dB) QPC: 123-174 17: 171 S3 126 20-35 100 & 50 (−7.3 dB) QPA: 107-200 827 1: 14-15 14: 165 100 & 50 (−7.8 dB) QPC: 107-206 2: 15-16 15: 165 3: 16-17 16: 165 801 20-35 100 & 50 (−8.2 dB) QPC: 72-177 17: 165 S4 126 20-35 100 & 50 (−5.4 dB) QPA: 130-199 827 1: 14-15 14: 166 100 & 50 (−5.4 dB) QPC: 130-199 2: 15-16 15: 171 3: 21-22 16: 171 21: 185 801 20-35 100 & 50 (−7.1 dB) QPC: 117-207 22: 204 S5 126 20-28 100 & 50 (−4.7 dB) QPA: 113-173 827 1: 14-15 14: 172 100 & 50 (−4.7 dB) QPC: 113-173 2: 15-16 15: 172 3: 16-17 16: 169 801 N.A. N.A. N.A. 17: 163 39 Table 2: Summary of parameters used for the assessment of spectral and temporal processing in CI and NH listeners. Pulse rate, AM frequencies, AM depths, pulse shape, electrodes, carrier frequency (NH) and alternating rates are listed for each measure performed. The precise pulse rates and modulation frequencies are rounded for simplicity. For one participant, the most apical electrode pair consisted of electrodes 21 and 22. † For one participant, the AM frequency alternated between 20 and 28 Hz. Individual details for CI listeners are provided in Table 1. Spectral processing CI Rate [pps] Electrode Pair Pulse shape Alternating rate [Hz](ACC & AC-FR) 827 1: 14-15 QP-C 0.5 & 6.1 827 2: 15-16 QP-C 0.5 & 6.1 827 3: 16-17* QP-C 0.5 & 6.1 Temporal processing Rate [pps] Electrode AM frequencies [Hz](ACC & AC-FR) AM depths [%] Pulse shape Alternating rate [Hz] (ACC & AC-FR) 126 16 20-35 & 20-35 † 100 & 50 QP-A 0.5 & 6.9 126 16 20-35 & 20-35 100 & 50 QP-C 0.5 & 6.9 801 16 20-35 & 20-35 100 & 50 QP-C 0.5 & 6.9 Temporal processing NH Carrier [Hz] AM frequencies [Hz] (ACC & AC-FR) AM depths [%] Alternating rate [Hz] (ACC & AC-FR) 500 20-35 & 19.2-32 100 & 50 0.5 & 6.4 40 AB Time .... .... C Am pl itu de D 100.0 % depth Time Am pl itu de 50.0 % depth QP-C ISI QP-A 7μs IPG E Figure 1: Schematic of stimulation paradigm to discriminate between two electrodes for A standard ACC, B continuous alternating ACC, and C the steady-state AC-FR. Thin gray lines show onset and offset evoked responses associated with start and end of stimulation. Blue waveform illustrates cortical response (ACC and AC-FR) to the transitions. D Alternating AM stimuli showing (top) fully modulated (100%) stimulus and (bottom) half-wave modulated (50%) stimulus. Different AM frequencies are shown by the coloured lines. E CI stimuli pulse shape, on top, QP-C and bottom QP-A separated by a given ISI. 41 A B C Figure 2: Percentage of correct identification of target stimuli from all CI participants. A shows the electrode discrimination for the three adjacent electrode pairs (1: electrodes 14 and 15; 2: electrodes 15 and 16; and 3: electrodes 16 and 17 or 21-22; details in Table 1). B shows amplitude modulation frequency discrimination at two modulation depths using a 801 pps QP-C pulse train. C shows the discrimination of the amplitude modulation frequencies at two modulation depths using a 126 pps carrier and two polarities, anodic (QP-A) and cathodic (QP-C). The horizontal line denotes the 80% discrimination threshold. Participants are indicated by each marker shape. The boxes show the 25th to the 75th percentile range and the black horizontal shows the median. 42 Figure 3: Example of transient ACC responses to electrode changes from participant S5 on the three different electrode pairs. A Electrodes 14-15, B electrodes 15-16, and C electrodes 16-17 obtained at 0.5 Hz alternating rate. P1-N1-P2 peaks are shown in each plot by the vertical lines, respectively. Horizontal grey lines show the residual noise of the recording. Topographic maps at the time of each peak (vertical lines) are shown in each panel. Amplitudes at each channel are indicated by colour code (in µV). 43 50 N1 P2 Pair 1 5 0 Am pl itu de [ V] N1 P2 Pair 2 0.0 0.2 0.4 0.6 Time [s] 5 0 N1 P2 Pair 3 A S1 S2 S3 S4 S5 −10 −5 0 5 10 G FP [d B ref 1u V] B 90 120 150 180 210 Pair 1 Pair 2 Pair 3 Electrode pair N 1 La te nc y [m s] C Figure 4: A Transient ACC responses to electrode changes at 0.5 Hz alternating rate. Individual responses from the EEG channel with the largest SNR are shown in grey and the group average in magenta. B GFP (in dB ref 1µV) and C latencies for all electrode pairs (1: electrodes 14 and 15; 2: electrodes 15 and 16; and 3: electrodes 16 and 17 or 21-22; details in Table 1). Individual responses are shown by each marker. The boxes show the 25th to the 75th percentile range and the black horizontal shows the median. 44 20 2 Am pl itu de [ V] N1 P2 A 0.0 0.1 0.2 0.3 C 0.0 0.5 Time [s] 2 0 2 Am pl itu de [ V] N1 P2 B 0 50 Frequency [Hz] 0.0 0.1 0.2 0.3 D −7.5 −5.0 −2.5 0.0 2.5 5.0 G FP [d B ref 1u V] E 150 200 250 300 350 0.5 1 Modulation depth N 1 La te nc y [m s] F Figure 5: Transient ACCs evoked by continuously alternating at 0.5 Hz the AM frequency between 20 and 35 Hz in NH listeners. A and B time-domain, C and D frequency-domain responses at 100% and 50% modulation depth, respectively. Individual responses are shown in grey and average response in magenta. ASSRs evoked by the modulation frequencies are indicated by the dotted vertical lines at 20 and 35 Hz, respectively. E GFP (in dB ref 1µV) and F latencies for both modulation depths. Individual responses are shown by filled dots. The boxes show the 25th to the 75th percentile range and the black horizontal shows the median. 45 42 0 2 4 Am pl itu de [ V] N1 P2 A 0.0 0.2 0.4 C 0.0 0.5 Time [s] 4 2 0 2 4 Am pl itu de [ V] N1 P2 B 0 50 Frequency [Hz] 0.0 0.2 0.4 D 4 2 0 2 4 N1 P2 E 0.0 0.2 0.4 G 0.0 0.5 Time [s] 4 2 0 2 4 N1 P2 F 0 50 Frequency [Hz] 0.0 0.2 0.4 H 4 2 0 2 4 N1 P2 I 0.0 0.2 0.4 K 0.0 0.5 Time [s] 4 2 0 2 4 N1 P2 J 0 50 Frequency [Hz] 0.0 0.2 0.4 L S1 S2 S3 S4 S5 −15 −10 −5 0 5 M −15 −10 −5 0 5 G FP [d B ref 1u V] O −15 −10 −5 0 5 Q 150 200 250 300 350 50 100 Modulation depth N 150 200 250 300 350 50 100 Modulation depth N 1 La te nc y [m s] P 150 200 250 300 350 50 100 Modulation depth R Figure 6: Transient ACC evoked by continuously alternating (0.5 Hz) the AM frequency using pulse trains presented at 801 pps A-D, and 126 pps E-L. A and B time-domain, C and D frequency-domain responses at 100% and 50% AM depth using QP-C 801 pps pulse trains. E and F time-domain, G and H frequency-domain responses at 100% and 50% AM depth using QP-A 126 pps pulse trains. I and J time-domain, K and L frequency-domain responses at 100% and 50% AM depth using QP-C 126 pps pulse trains, respectively. Individual responses are shown in grey whilst average response in magenta. ASSRs evoked by the AM frequencies embedded on the ACC waveform are indicated by the vertical dotted lines at 20 and 35 Hz in the frequency-domain panels. The respective GFP (in dB ref 1µV) and latencies are shown in M and N for QP-C at a pulse rate of 801 pps, O and P for QP-A at a pulse rate of 126 pps, and Q and R for QP-C at a pulse rate of 126 pps, respectively. The modulation depth is indicated in the horizontal axis. Individual responses are shown by filled dots. The boxes show the 25th to the 75th percentile range and the black horizontal shows the median. 46 0.5 0.0 0.5 Am pl itu de [ V] A 0.0 0.2 0.4 C 0.0 0.5 Time [s] 0.5 0.0 0.5 Am pl itu de [ V] B 0 50 Frequency [Hz] 0.0 0.2 0.4 D 6.4 19.2 32 −30 −25 −20 −15 G FP d B [re f 1 uV ] E 6.4 19.2 32 50 100 50 100 50 100 50 75 100 Modulation depth La te nc y [m s] F Figure 7: NH listeners AC-FRs evoked by rapid AM alternating rates (6.4 Hz). A and B time-domain, C and D frequency-domain responses at 100% and 50% AM depth, respectively. Individual responses are shown in grey and average response in magenta. Frequencies correspond- ing to the alternating rate (6.4 Hz) and each AM (19.2 and 32 Hz) are indicated by the vertical dotted lines, in the frequency-domain panels. Vertical lines in time-domain panels indicate the moment of the AM transition. E GFP amplitudes and F latencies for the alternating rate (6.4 Hz), and the AM frequencies (19.2 and 32 Hz) as a function of the modulation depth. 47 0.5 0.0 0.5 Am pl itu de [ V] A 0.0 0.2 0.4 C 0.0 0.5 Time [s] 0.5 0.0 0.5 Am pl itu de [ V] B 0 50 Frequency [Hz] 0.0 0.2 0.4 D 0.5 0.0 0.5 E 0.0 0.2 0.4 G 0.0 0.5 Time [s] 0.5 0.0 0.5 F 0 50 Frequency [Hz] 0.0 0.2 0.4 H 0.5 0.0 0.5 I 0.0 0.2 0.4 K 0.0 0.5 Time [s] 0.5 0.0 0.5 J 0 50 Frequency [Hz] 0.0 0.2 0.4 L Figure 8: AC-FRs evoked by continuously alternating (6.9 Hz) the AM frequency in CI users. A and B time-domain, C and D frequency-domain responses at 100% and 50% AM depth using QP-C 801 pps pulse trains. E and F time-domain, G and H frequency-domain responses at 100% and 50% AM depth using QP-A 126 pps pulse trains. I and J time-domain, K and L frequency-domain responses at 100% and 50% AM depth using QP-C 126 pps pulse trains, respectively. Individual responses are shown in grey whilst average response in magenta. AC-FRs and ASSRs evoked by the AM frequencies are indicated by the vertical dotted lines at 6.9, 20, and 35 Hz in the frequency-domain panels. Vertical lines in time-domain panels indicate the moment of the AM transition. 48 0.5 0.0 0.5 Pair 1 0.0 0.2 0.4 Pair 1 0.5 0.0 0.5 Am pl itu de [ V] Pair 2 0.0 0.2 0.4 Pair 2 0.0 0.5 Time [s] 0.5 0.0 0.5 Pair 3 0 50 Frequency [Hz] 0.0 0.2 0.4 Pair 3 A l S1 S2 S3 S4 S5 l l l −27.5 −25.0 −22.5 −20.0 −17.5 G FP [d B ref 1u V] B l l l 40 80 120 160 Pair 1 Pair 2 Pair 3 Electrode pair La te nc y [m s] C Figure 9: A Time and frequency domain AC-FRs obtained at 6.1 Hz alternating rate after artefact removal. Individual responses are shown in grey whilst average response in magenta. Vertical lines in time-domain panels indicate the moment of the electrode transition. The vertical line at 6.1 Hz in frequency-domain responses indicates the alternating rate. B GFP (in dB ref 1µV) and C latencies for all electrode pairs. Individual responses are shown by each marker. The boxes show the 25th to the 75th percentile range and the black horizontal shows the median. 49 10 20 30 CI NH Ad ap ta tio n In de x [dB ] AM/100 AM/50 EL/Pair 1 EL/Pair 2 EL/Pair 3 Figure 10: Adaptation index for CI and NH listeners. The source of the adaptation index, i.e., CI or NH listener, is indicated in the horizontal axis. The AM depth or electrode pair are indicated by each color. The boxes show the 25th to the 75th percentile range and the black horizontal shows the median. Violin plots illustrate the full distribution of the data. 50 A B C Figure 11: Topographic maps at different reference electrode (MP1) locations. A On the promontori, B posterior medial of the temporal muscle, and C near orbicularis oculi muscle. Topographic maps were obtained for two time points, at the middle of each pulse phase, shown by vertical lines alongside the waveforms. 51 A 902 0.0 2.0 4.0 6.0 8.0 Amplitude [ V] 500 400 300 200 100 B 902 0.0 1.5 3.0 4.5 Amplitude [ V] 500 400 300 200 100 C 902 0.0 0.2 0.3 0.5 Amplitude [ V] 500 400 300 200 100 D 902 0.0 0.2 0.3 0.5 Amplitude [ V] 500 400 300 200 100 E 902 0.0 0.2 0.3 0.5 Amplitude [ V] 500 400 300 200 100 Figure 12: Topographic maps for ASSR amplitudes at different pulse rates and interpolation time points. A Electrical artefacts obtained by interpolating between pulses from the onset of each electrical pulse (strong artefact contamination), and B interpolating from one eight of the pulse rate period, before the electrical pulse onset (minimum artefact contamination). C Amplitudes of simulated ASSR; D topographic map of contaminated response (B + C); and E, topographic maps of recovered neural response. The amplitude of the 40 Hz frequency component at each recording electrode is colour coded and indicated by the colorbar in each panel. Note that the maximum value shown in the colour scales was limited to facilitate visualisation across conditions, however, residual artefacts exceeded 24µV in many cases. 52 A B C Figure 13: Topographic maps for ASSR at 902 pps. A Simulated neural response, B recorded electrical artefact plus simulated neural response, and C recovered neural response. Topographic maps were obtained for the ASSR frequency response (40 Hz). The response amplitude at each recording electrode is colour coded and indicated by the colorbar in each panel. 53 045 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 Artefa ct N eural Artefa ct + N eural R eco ve red 0.1 0.3 0.5 0.1 0.3 0.10.3 0.50.7 0.9 0.1 0.3 Am pl itu de [µ V] 40 HzA L M R 40 Hz 0 68 158 247 337 0 68 158 247 337 0 68 158 247 337 −40 −30 −20 −10 Neural phase [Degrees] Am pl itu de [d B ref 1µ V] Neural Recovered B −30 −20 −10 0 40 Hz Modulation Frequency G FP [d B ref 1µ V] Neural Recovered Artefact Artefact + Neural C 40 Hz 0 100 200 300 0 68 158 247 337 Recovered phase [Degrees] N eu ra l p ha se [D eg ree s] Neural Recovered D Figure 14: A Amplitude and phase of the 40 Hz ASSRs at Cz. Each row shows the amplitude and phase of: artefact only, target (neural), artefact plus neural, and recovered ASSRs, respectively. The different phases of the neural source are colour coded. B ASSR amplitudes for central and frontal recording electrodes on the left (L), middle (M), and right (R) side of the head. C GFP for all simulated ASSRs. Target (neural), recovered, artefact only, and artefact plus neural data are colour coded. D Target (neural) and recovered phase (in degrees) for simulated ASSRs at Cz. 54 A B C D E F Figure 15: Topographic maps for AM frequency alternating simulations. A Target neural response, B artefact plus neural response, and C recovered neural response. Topographic maps were obtained for the time points shown by vertical lines alongside the waveforms. D, E, and F same as before but in the frequency-domain. 55 Neural Recovered Artefact Artefact + Neural L M R cN1−cP2 N1−P2 cN1−cP2 N1−P2 cN1−cP2 N1−P2 −10 0 10 20 Am pl itu de [d B ref 1µ V] A −5 0 5 10 15 cN1−cP2 N1−P2 G FP [d B ref 1µ V] B Figure 16: Amplitude and GFP across all simulated conditions. A ACC amplitudes for the first (N1-P2) and second (cN1-cP2) simulated responses for central and frontal recording electrodes on the left (L), middle (M), and right (R) side of the head. B GFP for the first (N1-P2) and second (cN1-cP2) simulated ACC responses. Target (neural), recovered, artefact only, and artefact plus neural data are colour coded. 56 045 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 0.0625 0.125 0.1875 0.25 0.3125 0.375 0.4375 Artefa ct N eural Artefa ct + N eural R eco ve red 0.1 0.3 0.5 0.7 0.1 0.10.3 0.50.7 0.9 0.1 Am pl itu de [µ V] 20 HzA 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 0.0625 0.125 0.1875 0.25 0.3125 0.375 0.4375 Artefa ct N eural Artefa ct + N eural R eco ve red 0.1 0.3 0.1 0.1 0.3 0.5 0.1 Am pl itu de [µ V] 40 Hz B Figure 17: Amplitude and phase of ASSRs to each AM frequency at Cz. A for 20 and B fro 40 Hz. Each column corresponds to a different starting interpolation point (as a fraction of the pulse rate period). Each row shows the amplitude and phase of: artefact only, target (neural), artefact plus neural, and recovered ASSR, respectively. The different phases of the neural source are colour coded. 57 L M R 20 H z 40 H z 0 68 158 247 337 0 68 158 247 337 0 68 158 247 337 −50 −40 −30 −20 −10 −50 −40 −30 −20 −10 Neural phase [Degrees] Am pl itu de [d B ref 1µ V] Neural Recovered A −40 −30 −20 −10 0 20 Hz 40 Hz Modulation Frequency G FP [d B ref 1µ V] Neural Recovered Artefact Artefact + Neural B 20 Hz 40 Hz 0 100 200 300 0 100 200 300 0 68 158 247 337 Recovered phase [Degrees] N eu ra l p ha se [D eg ree s] Neural Recovered C Figure 18: A Amplitudes across all frontal and central electrodes for target (neural), and recovered response. B GFP across all simulated conditions. Individual points correspond to different interpolation starting points and different neural phases. Target (neural), recovered response, artefact only, and artefact plus neural data are colour coded. C Target (neural) and recovered phase (in degrees) for simulated ASSRs to each AM frequency (indicated above each panel) at Cz. Individual points correspond to different interpolation starting points. 58 A B C Figure 19: Topographic maps for AC-FR simulations at 902 pps. A Target neural response, B artefact plus neural response, and C recovered neural response. The amplitude is colour coded (in µV). Note that comodulations around the target frequencies are caused by the beating of each frequency (1 Hz) within the analysis window used to compute the frequency-domain response (≈ 4 seconds). 59 045 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 0.03125 0.0625 0.09375 Artefa ct N eural Artefa ct + N eural R eco ve red 0.1 0.1 0.1 0.3 0.1 0.3 Am pl itu de [µ V] 40 Hz AC−FRA 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 45 90 135 180 225 270 315 0 0.03125 0.0625 0.09375 Artefa ct N eural Artefa ct + N eural R eco ve red 0.1 0.1 0.1 0.3 0.1 80 Hz AC−FRB L M R 40 Hz 80 Hz 0 68 158 247 337 0 68 158 247 337 0 68 158 247 337 −25 −20 −15 −10 −5 0 −25 −20 −15 −10 −5 0 Neural phase [Degrees] Am pl itu de [µ V] C −20 −15 −10 40 Hz 80 Hz Modulation Frequency G FP [d B ref 1µ V] Neural Recovered Artefact Artefact + NeuralD 40 Hz 80 Hz 0 100 200 300 0 100 200 300 0 68 158 247 337 Recovered phase [Degrees] N eu ra l p ha se [D eg ree s] Neural RecoveredE Figure 20: A and B Amplitude and phase of 40 Hz and 80 Hz AC-FRs at Cz, respectively. Each row shows the amplitude and phase of: artefact only, target (neural), artefact plus neural, and recovered ASSRs, respectively. The different phases of the neural source are colour coded. C ASSR amplitudes for central and frontal recording electrodes on the left (L), middle (M), and right (R) side of the head. D GFP for all simulated ASSRs. Target (neural), recovered, artefact only, and artefact plus neural data are colour coded. E Target (neural) and recovered phase (in degrees) for simulated ASSRs at Cz. 60 (a) (b) (c) (d) Figure 21: Top, example traces of artefact interpolation recorded from one CI participant. The original trace in shown in blue, interpolation points in red, and resulting waveform in orange. Two different time scales are shown from a) left to b) right. Bottom, example transient ACC responses to AM frequency changes imposed on a 126 pps QP-C pulse train presented on electrode 16. c), full (100%) modulated, and d) halfway (50%) modulated. The respective P1-N1-P2 peaks are shown by the vertical lines. The topographic map of each peak is shown on top where the amplitude is colour coded (in µV). 61 A B C D Figure 22: Example electrical artefact at a high alternating rate in a CI participant. A Show the original waveform and B the recovered response after removing the first two DSS components shown in C. Topographic maps show the amplitude of each frequency component indicated by the triangular markers (in µV). D Illustrates the projection of first DSS component on the sensor space. Topographic maps at the beginning and the end of the waveform show clearly that this component is caused by the CI (located on the right ear). 62