Implante cerebral permite a mujer hablar 30 años después de un ictus

The scientists, who have published the discovery in ‘Nature Neuroscience’, managed to improve this device, known as a brain-computer interface (BCI), with artificial intelligence (AI) algorithms that decoded the phrases as the woman thought them and then spoke them out loud using a synthetic voice.

Unlike previous initiatives that only produced sounds when the user finished speaking a phrase, the current method can detect words simultaneously and convert them into voice in less than three seconds.

The participant in the study, Ann, lost the ability to speak after suffering a stroke in the brainstem in 2005. Eighteen years later, she underwent a surgical procedure to implant a thin rectangle with 253 electrodes on the surface of the cerebral cortex. The implant can record the combined activity of thousands of neurons at the same time.

The researchers customized the synthetic voice to sound like Ann’s voice before her injury, training artificial intelligence algorithms with recordings of her wedding video.

«We adopted continuous flow transducer techniques, similar to those used by popular ASR methods like Siri or Alexa, and repurposed them for personalized brain-voice synthesis,» said Kaylo Littlejohn, co-lead author of the study.

«This approach led to significant improvements in the decoding speed of the brain-voice neuroprosthesis compared to previous approaches with longer delays,» Littlejohn highlighted.

In the study, the scientists explain that natural oral communication is instantaneous, and delays in speech of more than a few seconds can disrupt the natural flow of conversation.

«This makes it difficult for people with paralysis to engage in meaningful dialogue, which can lead to feelings of isolation and frustration,» they point out.

Therefore, they designed and used recurrent deep learning neural network transducer models to achieve a personalized, intelligible, wide-vocabulary speech synthesis online according to the participant’s voice.

«Our findings introduce a speech-neuroprosthesis paradigm to restore naturalistic spoken communication in people with paralysis,» the scientists emphasize.

### TRAINING OF ANN

The researchers have designed a speech synthesis neuroprosthesis that allows Ann to synthesize the desired speech from neural signals acquired from a 253-channel ECoG matrix implanted on the surface of her sensorimotor cortex and a small portion of the temporal lobe.

To train the system, they recorded neural data while Ann attempted to pronounce individual phrases. She was presented with text on a monitor and asked to silently start speaking once a visual ‘GO’ appeared.

Additionally, the synthesized speech was transmitted through a nearby analog speaker, and the decoded text was displayed on the monitor. The neural decoders of the system were bimodal, as they were trained not only to synthesize speech but also to decode text simultaneously.

They also evaluated the system using a set of 50 reduced-vocabulary phrases and a set of extensive vocabulary phrases with 1,024 general words. The set of 50 phrases was designed as a set of predefined phrases to express the primary needs of caregivers.

In contrast, the set of 1,024 general words was designed as a set of high-vocabulary phrases containing 12,379 unique phrases composed of 1,024 unique words extracted from the X social network and movie transcripts.

FUENTE