File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.neuroimage.2017.06.042
- Scopus: eid_2-s2.0-85023200931
- PMID: 28648889
- WOS: WOS:000443268900022
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Mapping between fMRI responses to movies and their natural language annotations
Title | Mapping between fMRI responses to movies and their natural language annotations |
---|---|
Authors | |
Keywords | FMRI Multi-modal model Natural language processing Natural movie stimulus Shared response model Text annotations |
Issue Date | 2018 |
Citation | NeuroImage, 2018, v. 180, p. 223-231 How to Cite? |
Abstract | Several research groups have shown how to map fMRI responses to the meanings of presented stimuli. This paper presents new methods for doing so when only a natural language annotation is available as the description of the stimulus. We study fMRI data gathered from subjects watching an episode of BBCs Sherlock (Chen et al., 2017), and learn bidirectional mappings between fMRI responses and natural language representations. By leveraging data from multiple subjects watching the same movie, we were able to perform scene classification with 72% accuracy (random guessing would give 4%) and scene ranking with average rank in the top 4% (random guessing would give 50%). The key ingredients underlying this high level of performance are (a) the use of the Shared Response Model (SRM) and its variant SRM-ICA (Chen et al., 2015; Zhang et al., 2016) to aggregate fMRI data from multiple subjects, both of which are shown to be superior to standard PCA in producing low-dimensional representations for the tasks in this paper; (b) a sentence embedding technique adapted from the natural language processing (NLP) literature (Arora et al., 2017) that produces semantic vector representation of the annotations; (c) using previous timestep information in the featurization of the predictor data. These optimizations in how we featurize the fMRI data and text annotations provide a substantial improvement in classification performance, relative to standard approaches. |
Persistent Identifier | http://hdl.handle.net/10722/341211 |
ISSN | 2023 Impact Factor: 4.7 2023 SCImago Journal Rankings: 2.436 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Vodrahalli, Kiran | - |
dc.contributor.author | Chen, Po Hsuan | - |
dc.contributor.author | Liang, Yingyu | - |
dc.contributor.author | Baldassano, Christopher | - |
dc.contributor.author | Chen, Janice | - |
dc.contributor.author | Yong, Esther | - |
dc.contributor.author | Honey, Christopher | - |
dc.contributor.author | Hasson, Uri | - |
dc.contributor.author | Ramadge, Peter | - |
dc.contributor.author | Norman, Kenneth A. | - |
dc.contributor.author | Arora, Sanjeev | - |
dc.date.accessioned | 2024-03-13T08:41:02Z | - |
dc.date.available | 2024-03-13T08:41:02Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | NeuroImage, 2018, v. 180, p. 223-231 | - |
dc.identifier.issn | 1053-8119 | - |
dc.identifier.uri | http://hdl.handle.net/10722/341211 | - |
dc.description.abstract | Several research groups have shown how to map fMRI responses to the meanings of presented stimuli. This paper presents new methods for doing so when only a natural language annotation is available as the description of the stimulus. We study fMRI data gathered from subjects watching an episode of BBCs Sherlock (Chen et al., 2017), and learn bidirectional mappings between fMRI responses and natural language representations. By leveraging data from multiple subjects watching the same movie, we were able to perform scene classification with 72% accuracy (random guessing would give 4%) and scene ranking with average rank in the top 4% (random guessing would give 50%). The key ingredients underlying this high level of performance are (a) the use of the Shared Response Model (SRM) and its variant SRM-ICA (Chen et al., 2015; Zhang et al., 2016) to aggregate fMRI data from multiple subjects, both of which are shown to be superior to standard PCA in producing low-dimensional representations for the tasks in this paper; (b) a sentence embedding technique adapted from the natural language processing (NLP) literature (Arora et al., 2017) that produces semantic vector representation of the annotations; (c) using previous timestep information in the featurization of the predictor data. These optimizations in how we featurize the fMRI data and text annotations provide a substantial improvement in classification performance, relative to standard approaches. | - |
dc.language | eng | - |
dc.relation.ispartof | NeuroImage | - |
dc.subject | FMRI | - |
dc.subject | Multi-modal model | - |
dc.subject | Natural language processing | - |
dc.subject | Natural movie stimulus | - |
dc.subject | Shared response model | - |
dc.subject | Text annotations | - |
dc.title | Mapping between fMRI responses to movies and their natural language annotations | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1016/j.neuroimage.2017.06.042 | - |
dc.identifier.pmid | 28648889 | - |
dc.identifier.scopus | eid_2-s2.0-85023200931 | - |
dc.identifier.volume | 180 | - |
dc.identifier.spage | 223 | - |
dc.identifier.epage | 231 | - |
dc.identifier.eissn | 1095-9572 | - |
dc.identifier.isi | WOS:000443268900022 | - |