Visual masking explained

Visual masking is a phenomenon of visual perception. It occurs when the visibility of one image, called a target, is reduced by the presence of another image, called a mask.^[1] The target might become invisible, or it might appear to have reduced contrast or lightness. There are three different timing arrangements for masking: forward masking, backward masking, and simultaneous masking. In forward masking, the mask precedes the target. In backward masking, the mask follows the target. In simultaneous masking, the mask and target are shown together. There are two different spatial arrangements for masking: pattern masking and metacontrast. Pattern masking occurs when the target and mask locations overlap. Metacontrast masking occurs when the mask does not overlap with the target location.
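The timing and spatial arrangements described above can be sketched as two small classification functions. This is only an illustrative sketch: the function names, the use of onset times in milliseconds, the treatment of "shown together" as identical onsets, and the rectangle representation of stimulus locations are all assumptions made for the example, not conventions taken from the masking literature.

```python
def timing_arrangement(target_onset_ms, mask_onset_ms):
    """Classify masking by the order of mask and target onsets.

    Assumption: "simultaneous" is simplified to identical onset times.
    """
    if mask_onset_ms < target_onset_ms:
        return "forward"       # mask precedes the target
    if mask_onset_ms > target_onset_ms:
        return "backward"      # mask follows the target
    return "simultaneous"      # mask and target shown together


def spatial_arrangement(target_box, mask_box):
    """Classify masking by whether target and mask locations overlap.

    Boxes are hypothetical (left, top, right, bottom) rectangles.
    """
    t_left, t_top, t_right, t_bottom = target_box
    m_left, m_top, m_right, m_bottom = mask_box
    overlaps = (t_left < m_right and m_left < t_right
                and t_top < m_bottom and m_top < t_bottom)
    return "pattern" if overlaps else "metacontrast"
```

For example, a mask that appears 50 ms after the target and sits beside it without overlapping would be classified as backward masking with a metacontrast arrangement.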

Factors affecting visual masking

Target-to-mask spatial separation

Suppression can be seen in both forward and backward masking when there is pattern masking, but not when there is metacontrast. Simultaneous masking, however, will produce facilitation of target visibility during pattern masking. Facilitation also comes about when metacontrast is combined with either simultaneous or forward masking.^[2] This is because it takes time for the mask to reach the target's location through lateral propagation. As the target gets further from the mask, the time required for lateral propagation increases. Thus, the masking effect will increase as the mask gets closer to the target.

Target-to-mask temporal separation

As the time difference between the target and the mask increases, the masking effect decreases. This is because the integration time of a target stimulus has an upper limit of 200 ms, based on physiological experiments.^[3] ^[4] ^[5] As the separation approaches this limit, the mask has less effect on the target, because the target has had more time to form a full neural representation in the brain.

Polat, Sterkin, and Yehezkel explained in detail the effect of temporal matching between target input and lateral propagation of the mask. Based on data from previous single-unit recordings, they concluded that the time window for efficient interaction with target processing is 210 to 310 ms after the target's appearance. A mask response arriving outside this window would fail to cause a masking effect. This explains why there is a masking effect when the mask is presented 50 ms after the target, but not when the inter-stimulus interval (ISI) between mask and target is 150 ms. In the first case, the mask response would propagate to the target location and be processed with a delay of 260 to 310 ms, whereas an ISI of 150 ms would result in a delay of 410 to 460 ms.
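The window comparison in this argument can be checked with a few lines of Python. This is a minimal sketch that uses only the figures quoted above (the 210 to 310 ms window and the two delay ranges); the function name and the overlap test are assumptions made for illustration and are not taken from Polat, Sterkin, and Yehezkel.

```python
# Interaction window quoted in the text: 210 to 310 ms after target onset.
WINDOW_MS = (210, 310)


def overlaps_window(delay_range_ms, window_ms=WINDOW_MS):
    """Return True if the mask's processing delay range overlaps the window."""
    delay_start, delay_end = delay_range_ms
    window_start, window_end = window_ms
    return delay_start <= window_end and delay_end >= window_start


# Mask presented 50 ms after the target: delay of 260 to 310 ms.
print(overlaps_window((260, 310)))  # True, so masking is expected

# ISI of 150 ms: delay of 410 to 460 ms.
print(overlaps_window((410, 460)))  # False, so no masking is expected
```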

Monoptic vs. dichoptic visual masking

In dichoptic visual masking, the target is presented to one eye and the mask to the other, whereas in monoptic visual masking, both eyes are presented with the target and the mask. It was found that the masking effect was just as strong in dichoptic as it was in monoptic masking, and that it showed the same timing characteristics.^[6] ^[7] ^[8]

Possible neural correlates

There are multiple theories surrounding the neural correlates of masking, but most of them agree on a few key ideas. First, backward visual masking comes about from suppression of the target's “after-discharge”,^[9] where the after-discharge can be thought of as the neural response to the target's termination. Impairments in backward masking have been consistently found in those with schizophrenia^[10] as well as in their unaffected siblings,^[11] ^[12] thus suggesting that the impairments might be an endophenotype for schizophrenia.^[13]

Forward masking, on the other hand, is associated with the suppression of the target's “onset-response”, which can be thought of as the neural response to the target's appearance.

Two-channel model

Proposed by Breitmeyer and Ganz in 1976,^[14] the original version of this model stated that there are two different visual information channels: one fast and transient, the other slow and sustained. The theory asserts that each stimulus travels up both channels, and that both channels are necessary for full processing of any given stimulus. It explained backward masking by saying that the neural representation of the mask travels up the transient channel and intercepts the neural representation of the target as it travels up the slower channel, suppressing the target's representation and decreasing its visibility. One problem with this model, as pointed out by Macknik and Martinez-Conde, is that it predicts that masking depends on how far apart in time the stimulus onsets are. However, Macknik and Martinez-Conde showed that backward masking actually depends more on how far apart in time the stimulus terminations are.

Retino-cortical dynamics model

Breitmeyer and Ögmen modified the two-channel model in 2006,^[15] renaming it the retino-cortical dynamics (RECOD) model in the process. Their main proposed modification was that the fast and slow channels are feedforward and feedback channels, rather than the magnocellular and parvocellular retino-geniculocortical pathways that had previously been proposed. According to this new model, backward masking is caused when feedforward input from the mask interferes with the feedback coming from the higher visual areas' response to the target, thus reducing visibility.

Lamme's recurrent feedback hypothesis of visual awareness and masking

This model proposes that backward masking is caused by interference with feedback from higher visual areas.^[16] In this model, target duration is irrelevant because masking is supposed to occur as a function of feedback, which is generated when the target appears on screen. Lamme's group further supported their model when they reported that the surgical removal of the extrastriate cortex in monkeys leads to a reduction of late responses in area V1.^[17]

Lateral inhibition circuits

Proposed by Macknik and Martinez-Conde in 2008, this theory holds that masking can be explained almost entirely by feedforward lateral inhibition circuits. The idea is that the edges of the mask, if positioned in close proximity to the target, may inhibit the responses caused by the edges of the target, and so inhibit perception of the target.

Coupled interactions between V1 and fusiform gyrus

Haynes, Driver, and Rees proposed this theory in 2005,^[18] stating that visibility derives from the feedforward and feedback interactions between V1 and the fusiform gyrus. In their experiment, they required subjects to attend actively to the target. As Macknik and Martinez-Conde point out, it is therefore possible that their results were confounded by the attentional aspect of the trials, and that the results may not accurately reflect the effects of visual masking.

Frontal lobe processing of visual masking

This was proposed by Thompson and Schall, based on experiments conducted in 1999^[19] and 2000.^[20] They concluded that visual masking is processed in the frontal eye fields, and that the neural correlate of masking lies not in the inhibition of the response to the target but in the “merging” of target and mask responses. One criticism of their experiment, however, is that their target was almost 300 times dimmer than the mask, so their results may have been confounded by the different response latencies one would expect from stimuli with such differences in brightness.

Evidence from monoptic and dichoptic visual masking

Macknik and Martinez-Conde^[21] recorded from neurons in the lateral geniculate nucleus (LGN) and V1 while presenting monoptic and dichoptic stimuli. They found that monoptic masking occurred in all the LGN and V1 neurons that were recorded, but dichoptic masking occurred only in some of the binocular neurons in V1. This supports the hypothesis that visual masking in monoptic regions is not due to feedback from dichoptic regions: if there had been feedback from higher visual areas, the early circuits would have “inherited” dichoptic masking from that feedback, and so would exhibit both dichoptic and monoptic masking. Although monoptic masking is stronger in the early visual areas, monoptic and dichoptic masking are perceptually equivalent in magnitude. Thus, dichoptic masking must become stronger as it proceeds down the visual hierarchy if the preceding hypothesis is correct. In fact, dichoptic masking was shown to begin downstream of area V2.

Notes and References

1. Breitmeyer B, Ogmen H (2007). Visual masking. Scholarpedia, 2(7), 3330. doi:10.4249/scholarpedia.3330
2. Polat U, Sterkin A, Yehezkel O (July 2008). Spatio-temporal low-level neural networks account for visual masking. Advances in Cognitive Psychology, 3(1–2), 153–65. doi:10.2478/v10053-008-0021-4
3. Albrecht DG (1995). Visual cortex neurons in monkey and cat: effect of contrast on the spatial and temporal phase transfer functions. Visual Neuroscience, 12(6), 1191–210. doi:10.1017/s0952523800006817
4. Mizobe K, Polat U, Pettet MW, Kasamatsu T (2001). Facilitation and suppression of single striate-cell activity by spatially discrete pattern stimuli presented beyond the receptive field. Visual Neuroscience, 18(3), 377–91. doi:10.1017/s0952523801183045
5. Polat U, Mizobe K, Pettet MW, Kasamatsu T, Norcia AM (February 1998). Collinear stimuli regulate visual responses depending on cell's contrast threshold. Nature, 391(6667), 580–4. doi:10.1038/35372
6. Crawford BH (March 1947). Visual adaptation in relation to brief conditioning stimuli. Proceedings of the Royal Society of London. Series B, Biological Sciences, 134(875), 283–302. doi:10.1098/rspb.1947.0015
7. Macknik SL, Livingstone MS (June 1998). Neuronal correlates of visibility and invisibility in the primate visual system. Nature Neuroscience, 1(2), 144–9. doi:10.1038/393
8. Macknik SL, Martinez-Conde S, Haglund MM (June 2000). The role of spatiotemporal edges in visibility and visual masking. Proceedings of the National Academy of Sciences of the United States of America, 97(13), 7556–60. doi:10.1073/pnas.110142097
9. Macknik SL, Martinez-Conde S (July 2008). The role of feedback in visual masking and visual processing. Advances in Cognitive Psychology, 3(1–2), 125–52. doi:10.2478/v10053-008-0020-5
10. Green MF, Horan WP, Lee J (June 2019). Nonsocial and social cognition in schizophrenia: current evidence and future directions. World Psychiatry, 18(2), 146–161. doi:10.1002/wps.20624
11. da Cruz JR, Shaqiri A, Roinishvili M, Favrod O, Chkonia E, Brand A, Figueiredo P, Herzog MH (January 2020). Neural Compensation Mechanisms of Siblings of Schizophrenia Patients as Revealed by High-Density EEG. Schizophrenia Bulletin, 46(4), 1009–1018. doi:10.1093/schbul/sbz133
12. Chkonia E, Roinishvili M, Makhatadze N, Tsverava L, Stroux A, Neumann K, Herzog MH, Brand A (December 2010). The shine-through masking paradigm is a potential endophenotype of schizophrenia. PLOS ONE, 5(12), e14268. doi:10.1371/journal.pone.0014268
13. Green MF, Lee J, Wynn JK, Mathis KI (July 2011). Visual masking in schizophrenia: overview and theoretical implications. Schizophrenia Bulletin, 37(4), 700–8. doi:10.1093/schbul/sbr051
14. Breitmeyer BG, Ganz L (January 1976). Implications of sustained and transient channels for theories of visual pattern masking, saccadic suppression, and information processing. Psychological Review, 83(1), 1–36. doi:10.1037/0033-295x.83.1.1
15. Breitmeyer B, Öğmen H (2006). Visual Masking: Time slices through conscious and unconscious vision (2nd ed.). Oxford, UK: Oxford University Press.
16. Lamme VA, Zipser K, Spekreijse H (October 2002). Masking interrupts figure-ground signals in V1. Journal of Cognitive Neuroscience, 14(7), 1044–53. doi:10.1162/089892902320474490
17. Lamme VA, Zipser K, Spekreijse H (1997). Figure-ground signals in V1 depend on extrastriate feedback. Investigative Ophthalmology & Visual Science, 38(4), S969.
18. Haynes JD, Driver J, Rees G (June 2005). Visibility reflects dynamic changes of effective connectivity between V1 and fusiform cortex. Neuron, 46(5), 811–21. doi:10.1016/j.neuron.2005.05.012
19. Thompson KG, Schall JD (March 1999). The detection of visual signals by macaque frontal eye field during masking. Nature Neuroscience, 2(3), 283–8. doi:10.1038/6398
20. Thompson KG, Schall JD (2000). Antecedents and correlates of visual detection and awareness in macaque prefrontal cortex. Vision Research, 40(10–12), 1523–38. doi:10.1016/s0042-6989(99)00250-3
21. Macknik SL, Martinez-Conde S (2004). The spatial and temporal effects of lateral inhibitory networks and their relevance to the visibility of spatiotemporal edges. Neurocomputing, 58–60, 775–782. doi:10.1016/j.neucom.2004.01.126