The two-streams hypothesis is a model of the neural processing of vision as well as hearing.[1] The hypothesis, given its initial characterisation in a paper by David Milner and Melvyn A. Goodale in 1992, argues that humans possess two distinct visual systems.[2] More recently, evidence has emerged for two distinct auditory systems as well. As visual information exits the occipital lobe, and as sound leaves the phonological network, it follows two main pathways, or "streams". The ventral stream (also known as the "what pathway") leads to the temporal lobe, which is involved in object identification and recognition. The dorsal stream (or "how pathway") leads to the parietal lobe, which is involved in processing the object's spatial location relative to the viewer and in speech repetition.
Several researchers had proposed similar ideas previously. The authors themselves credit as inspiration Weiskrantz's work on blindsight and earlier neuroscientific vision research. Schneider first proposed the existence of two visual systems, for localisation and identification, in 1969.[3] Ingle described two independent visual systems in frogs in 1973.[4] Ettlinger reviewed the existing neuropsychological evidence for such a distinction in 1990.[5] Moreover, Trevarthen had offered an account of two separate mechanisms of vision in monkeys as early as 1968.[6]
In 1982, Ungerleider and Mishkin distinguished the dorsal and ventral streams, processing spatial and visual features respectively, on the basis of their lesion studies in monkeys, thereby proposing the original "where" versus "what" distinction.[7] Though this framework was superseded by that of Milner and Goodale, it remains influential.[8]
One hugely influential source of information for the model has been experimental work exploring the extant abilities of the visual agnosic patient D.F. The first, and most influential, report came from Goodale and colleagues in 1991,[9] and work on her was still being published two decades later. This has been the focus of some criticism of the model, owing to a perceived over-reliance on findings from a single case.
Goodale and Milner amassed an array of anatomical, neuropsychological, electrophysiological, and behavioural evidence for their model. According to their data, the ventral 'perceptual' stream computes a detailed map of the world from visual input, which can then be used for cognitive operations, while the dorsal 'action' stream transforms incoming visual information into the requisite egocentric (head-centered) coordinate system for skilled motor planning. The model also posits that visual perception encodes spatial properties of objects, such as size and location, relative to other objects in the visual field; in other words, it uses relative metrics and scene-based frames of reference. Visual action planning and coordination, on the other hand, uses absolute metrics determined via egocentric frames of reference, computing the actual properties of objects relative to the observer. Thus, grasping movements directed towards objects embedded in scenes that induce size-contrast illusions have been shown to escape the effects of these illusions, as different frames of reference and metrics are involved in perceiving the illusion versus executing the grasping act.[10] Norman[11] proposed a similar dual-process model of vision and described eight main differences between the two systems, consistent with other two-system models.
| Factor | Ventral system ("what") | Dorsal system ("how") |
|---|---|---|
| Function | Recognition/identification | Visually guided behaviour |
| Sensitivity | High spatial frequencies (details) | High temporal frequencies (motion) |
| Memory | Long-term stored representations | Only very short-term storage |
| Speed | Relatively slow | Relatively fast |
| Consciousness | Typically high | Typically low |
| Frame of reference | Allocentric or object-centered | Egocentric or viewer-centered |
| Visual input | Mainly foveal or parafoveal | Across retina |
| Monocular vision | Generally reasonably small effects | Often large effects, e.g. motion parallax |
The dorsal stream is proposed to be involved in the guidance of actions and in recognizing where objects are in space. It projects from the primary visual cortex to the posterior parietal cortex. It was initially termed the "where" pathway, since the dorsal stream was thought to process information about the spatial properties of an object.[12] However, later research on the well-studied neuropsychological patient D.F. revealed that the dorsal stream processes the visual information needed to construct representations of objects one wishes to manipulate. Those findings led researchers to rename the dorsal stream the "how" pathway.[13][14] The dorsal stream is interconnected with the parallel ventral stream (the "what" stream), which runs downward from V1 into the temporal lobe.
The dorsal stream is involved in spatial awareness and guidance of actions (e.g., reaching). In this it has two distinct functional characteristics—it contains a detailed map of the visual field, and is also good at detecting and analyzing movements.
The dorsal stream commences with purely visual functions in the occipital lobe before gradually transferring to spatial awareness at its termination in the parietal lobe.
The posterior parietal cortex is essential for "the perception and interpretation of spatial relationships, accurate body image, and the learning of tasks involving coordination of the body in space".[15]
It contains individually functioning lobules. The lateral intraparietal area (LIP) contains neurons that produce enhanced activation when attention is moved onto a stimulus or when the animal saccades towards a visual stimulus, while the ventral intraparietal area (VIP) integrates visual and somatosensory information.
Damage to the posterior parietal cortex causes a number of spatial disorders.
The ventral stream is associated with object recognition and form representation. Also described as the "what" stream, it has strong connections to the medial temporal lobe (which is associated with long-term memories), the limbic system (which controls emotions), and the dorsal stream (which deals with object locations and motion).
The ventral stream receives its main input from the parvocellular (as opposed to magnocellular) layers of the lateral geniculate nucleus of the thalamus. These neurons project successively to V1 sublayers 4Cβ, 4A, 3B and 2/3a.[16] From there, the ventral pathway goes through V2 and V4 to areas of the inferior temporal lobe: PIT (posterior inferotemporal), CIT (central inferotemporal), and AIT (anterior inferotemporal). Each visual area contains a full representation of visual space; that is, it contains neurons whose receptive fields together cover the entire visual field. Visual information enters the ventral stream through the primary visual cortex and travels through the rest of the areas in sequence.
Moving along the stream from V1 to AIT, receptive field size, response latency, and the complexity of neuronal tuning all increase. For example, recent studies have shown that the V4 area is responsible for color perception in humans and the V8 (VO1) area for shape perception, while the VO2 area, located between these regions and the parahippocampal cortex, integrates information about the color and shape of stimuli into a holistic image.[17]
All the areas in the ventral stream are influenced by extraretinal factors in addition to the nature of the stimulus in their receptive field. These factors include attention, working memory, and stimulus salience. Thus the ventral stream does not merely provide a description of the elements in the visual world—it also plays a crucial role in judging the significance of these elements.
Damage to the ventral stream can cause inability to recognize faces or interpret facial expression.[18]
Along with the visual ventral pathway being important for visual processing, there is also a ventral auditory pathway emerging from the primary auditory cortex.[19] In this pathway, phonemes are processed posteriorly to syllables and environmental sounds.[20] The information then joins the visual ventral stream at the middle temporal gyrus and temporal pole. Here the auditory objects are converted into audio-visual concepts.[21]
The function of the auditory dorsal pathway is to map auditory sensory representations onto articulatory motor representations. Hickok & Poeppel claim that the auditory dorsal pathway is necessary because, "learning to speak is essentially a motor learning task. The primary input to this is sensory, speech in particular. So, there must be a neural mechanism that both codes and maintains instances of speech sounds, and can use these sensory traces to guide the tuning of speech gestures so that the sounds are accurately reproduced."[22] In contrast to the ventral stream's auditory processing, information enters from the primary auditory cortex into the posterior superior temporal gyrus and posterior superior temporal sulcus. From there the information moves to the beginning of the dorsal pathway, located at the boundary of the temporal and parietal lobes near the Sylvian fissure. The first step of the dorsal pathway is the sensorimotor interface, located in the left Sylvian parietal-temporal area (Spt), within the Sylvian fissure at the parietal-temporal boundary. The Spt is important for perceiving and reproducing sounds: evidence includes its role in acquiring new vocabulary, the disruption of speech production and of auditory feedback when it is lesioned, articulatory decline in late-onset deafness, and the non-phonological residue of Wernicke's aphasia (deficient self-monitoring). It also supports the basic neuronal mechanisms of phonological short-term memory; without the Spt, language acquisition is impaired. The information then moves on to the articulatory network, which is divided into two parts. Articulatory network 1, which processes motor syllable programs, is located in the left posterior inferior frontal gyrus and Brodmann's area 44 (pIFG-BA44). Articulatory network 2, for motor phoneme programs, is located in the left M1-vBA6.[23]
Conduction aphasia affects a subject's ability to reproduce speech (typically by repetition), though it has no influence on the subject's ability to comprehend spoken language. This suggests that conduction aphasia reflects an impairment not of the auditory ventral pathway but of the auditory dorsal pathway. Buchsbaum et al.[24] found that conduction aphasia can result from damage, particularly lesions, to the Spt. This is shown by the Spt's involvement in acquiring new vocabulary: experiments have shown that while most conduction aphasics can repeat high-frequency, simple words, their ability to repeat low-frequency, complex words is impaired. The Spt connects the motor and auditory systems by making auditory code accessible to the motor cortex. It appears that the motor cortex recreates high-frequency, simple words (like "cup") in order to access them more quickly and efficiently, while low-frequency, complex words (like "Sylvian parietal temporal") require more active, online regulation by the Spt. This explains why conduction aphasics have particular difficulty with low-frequency words, which require a more hands-on process for speech production. "Functionally, conduction aphasia has been characterized as a deficit in the ability to encode phonological information for production," namely because of a disruption in the motor-auditory interface.[25] Conduction aphasia has been more specifically related to damage of the arcuate fasciculus, which is vital for both speech and language comprehension, as it forms the connection between Broca's and Wernicke's areas.[25]
Goodale and Milner's innovation was to shift the perspective from an emphasis on input distinctions, such as object location versus object properties, to an emphasis on the functional relevance of vision to behaviour: for perception or for action. Contemporary perspectives, however, informed by empirical work over the past two decades, offer a more complex account than a simple separation of function into two streams.[26] Recent experimental work, for instance, has challenged these findings and suggested that the apparent dissociation between the effects of illusions on perception and action is due to differences in attention, task demands, and other confounds.[27][28] Other empirical findings, however, cannot be so easily dismissed and provide strong support for the idea that skilled actions such as grasping are not affected by pictorial illusions.[29][30][31][32]
Moreover, recent neuropsychological research has questioned the validity of the dissociation between the two streams that has provided the cornerstone of evidence for the model. The dissociation between visual agnosia and optic ataxia has been challenged by several researchers as not as strong as originally portrayed: Hesse and colleagues demonstrated dorsal stream impairments in patient D.F.,[33] and Himmelbach and colleagues reassessed D.F.'s abilities, applying more rigorous statistical analysis and demonstrating that the dissociation was not as strong as first thought.[34]
A 2009 review of the accumulated evidence for the model concluded that, whilst the spirit of the model has been vindicated, the independence of the two streams has been overemphasised.[35] Goodale and Milner themselves have proposed the analogy of tele-assistance, one of the most efficient schemes devised for the remote control of robots working in hostile environments. In this account, the dorsal stream is viewed as a semi-autonomous function that operates under the guidance of executive functions, which are themselves informed by ventral stream processing. Thus the emerging perspective within neuropsychology and neurophysiology is that, whilst a two-systems framework was a necessary advance that stimulated study of the highly complex and differentiated functions of the two neural pathways, the reality is more likely to involve considerable interaction between vision-for-action and vision-for-perception. Robert McIntosh and Thomas Schenk summarize this position as follows: