For the last few years I have been thinking a lot about a few different things: What specifically is our proposed dorsal stream doing? How does the motor system contribute to speech perception? What is the relation between sensorimotor processes used during speech production (e.g., feedback-based motor control models) and purported sensorimotor processes in speech perception? How do computational models of speech production (e.g., feedback control models, psycholinguistic models, neurolinguistic models) relate to neural models of speech processing? A new "Perspective" article, which just appeared today in Neuron and is currently free to download, summarizes the outcome of my thoughts on these questions, mixed with a heavy dose of input from my co-authors John Houde (UCSF) and Feng Rong (Talking Brains West postdoc). Yes, I'm very proud that the piece has been labeled a "perspective" rather than a "review" -- that means it is theoretically novel rather than a summary statement ;-)
The starting point for the article is the observation that there are two main lines of research on sensorimotor integration which, paradoxically, do not interact: the idea that the auditory system is critically involved in speech production (exemplified by motor control folks like Frank Guenther and John Houde) and the idea that the motor system is critically involved in speech perception (exemplified by folks like Stephen Wilson, Pulvermuller, and many others). We wondered whether these two lines of work could be integrated into one model. The answer, we propose, is yes.
The basic idea is that the dorsal stream sensorimotor integration circuit is built to support speech production via a state feedback control architecture of the sort that is common in the visual-manual motor control literature. But the computational properties of the system, particularly the generation of forward predictions of the sensory consequences of motor commands, provide a ready-made mechanism for the motor system to modulate (not drive) the perception of others' speech under some circumstances (e.g., when the acoustic signal is weak or ambiguous).
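For readers who think in code, here is a toy sketch of the state-feedback-control idea: control is computed from an internal state estimate, a forward model predicts the sensory consequences of each motor command from an efference copy, and the prediction error against actual feedback is used to correct the estimate. All names, gains, and the one-dimensional "articulatory state" are my own illustrative assumptions, not quantities from the article.

```python
# Toy sketch of a state feedback control loop for speech production.
# Illustrative only: the variable names, gains, and 1-D dynamics are
# assumptions for this sketch, not taken from Hickok, Houde & Rong (2011).

def simulate(target=1.0, steps=50, control_gain=0.5,
             plant_gain=1.2, correction_gain=0.8):
    """Drive a 1-D articulatory state toward an auditory target using
    an internal estimate that is corrected by sensory prediction errors."""
    state = 0.0     # actual state of the vocal tract (the "plant")
    estimate = 0.0  # the controller's internal estimate of that state
    for _ in range(steps):
        # Control is computed from the internal ESTIMATE, not directly
        # from slow, delayed sensory feedback.
        command = control_gain * (target - estimate)
        # Forward model: predict the sensory consequence of the command
        # from an efference copy (here it assumes a plant gain of 1).
        predicted = estimate + command
        # The real plant responds more strongly than the forward model
        # assumes, so the predictions are systematically off.
        state = state + plant_gain * command
        # Sensory prediction error: actual feedback minus prediction.
        error = state - predicted
        # Use the error to correct the internal state estimate.
        estimate = predicted + correction_gain * error
    return state, estimate

final_state, final_estimate = simulate()
print(f"state={final_state:.4f} estimate={final_estimate:.4f}")
```

Even though the forward model is miscalibrated here, production still converges on the target because feedback continually corrects the internal estimate. The point of the sketch is that the prediction machinery (the `predicted` line) exists for production; the article's claim is that this same machinery is what the motor system could lend to perception.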
In addition, we attempted to show how psycholinguistic models of speech production (e.g., Levelt, Dell) as well as neurolinguistic models (e.g., the concept of input and output phonological lexicons) relate to the proposed state feedback control model. I never liked the idea of there being two phonological "lexicons," but it actually makes a lot of sense within the framework of a state feedback control architecture.
The model also does a decent job of accounting for some of the key symptoms of conduction aphasia and stuttering, which emerge as different types of disruption of the same feedback control mechanism.
The graphic depiction of the model is below. I'm looking forward to your feedback on this!
Hickok, G., Houde, J., & Rong, F. (2011). Sensorimotor integration in speech processing: Computational basis and neural organization. Neuron, 69(3), 407-422. doi:10.1016/j.neuron.2011.01.019