The Language Network Is Not Engaged in Object Categorization
The Language Network Is Not Engaged in Object Categorization
https://doi.org/10.1093/cercor/bhad289
Advance access publication date 9 August 2023
Original Article
*Corresponding author: Department of Psychology, Manchester Metropolitan University, Brooks Building, Birley Fields Campus, 53 Bonsall Street, Manchester M15
6GX, United Kingdom. Email: [email protected]
† Yael Benn and Anna A. Ivanova contributed equally
‡ Co-senior authors
The relationship between language and thought is the subject of long-standing debate. One claim states that language facilitates
categorization of objects based on a certain feature (e.g. color) through the use of category labels that reduce interference from other,
irrelevant features. Therefore, language impairment is expected to affect categorization of items grouped by a single feature (low-
dimensional categories, e.g. “Yellow Things”) more than categorization of items that share many features (high-dimensional categories,
e.g. “Animals”). To test this account, we conducted two behavioral studies with individuals with aphasia and an fMRI experiment with
healthy adults. The aphasia studies showed that selective low-dimensional categorization impairment was present in some, but not
all, individuals with severe anomia and was not characteristic of aphasia in general. fMRI results revealed little activity in language-
responsive brain regions during both low- and high-dimensional categorization; instead, categorization recruited the domain-general
multiple-demand network (involved in wide-ranging cognitive tasks). Combined, results demonstrate that the language system is not
implicated in object categorization. Instead, selective low-dimensional categorization impairment might be caused by damage to brain
regions responsible for cognitive control. Our work adds to the growing evidence of the dissociation between the language system and
many cognitive tasks in adults.
contribute to the observed results (above and beyond alleged patients with aphasia (and patients with Parkinson’s disease and
effects of category type). Finally, some have argued for a healthy adults as controls) and an functional Magnetic Resonance
relationship between categorization difficulties and conceptual- Imaging (fMRI) study. In Study 1, we use the setup from Lupyan
semantic rather than purely linguistic impairments (Caramazza and Mirman (2013; L&M henceforth) to determine whether their
et al. 1982; Whitehouse et al. 1978; cf. Le Dorze and Nespoulous findings can be replicated in a sample of participants with mod-
1989). erate aphasia. In Study 2, we adjust the experimental paradigm
to reduce task complexity by decreasing the amount of visual
The possible role of cognitive control information on the screen at any one time, and test whether
mechanisms in LD categorization the LD-selective categorization impairment holds in a sample
Even if individuals with aphasia consistently showed a selective of individuals with severe anomia. In the fMRI study, we collect
impairment in LD categorization, this result would not neces- data from neurotypical individuals to test the prediction that the
sarily implicate language as the source of the deficit. In par- language system is engaged during LD categorization more than
ticular, the language network in the left hemisphere, especially during HD categorization.
in the left frontal cortex, lies adjacent to the domain-general To foreshadow our results, the LD-selective categorization
TPO, time post onset; BNT, Boston Naming Test; HD, high dimension categories; LD, low dimension categories; SD, standard deviation
the 11 participants in the aphasia group (P1, P4, P6, P8, P9, were animals without stripes, and inanimate objects with stripes).
P10, and P11) were below the cut-off for normative naming A total of 1087 unique images were used (any given image
performance. Individuals with PD (n = 13 (8 F), age M = 68.08, appeared as a target in 0–2 categories and as a distractor in
SD = 8.20) were recruited from the Parkinson’s UK Research 0–2 categories). All photographs depicted objects on a white
Registry. For detailed participant information, see Table 1. All background. The materials and the experimental scripts for all
participants used English as their primary language. Patients were studies are available on OSF: https://osf.io/guwh8/.
offered a £10.00 reimbursement. Ethical approval was granted by To determine the extent of lexical impairment in the aphasia
the UCL Research Ethics panel, Project ID: LC/2013/05, and all group and to compare lexical abilities across the three groups,
volunteers gave informed consent to participate in the study. all participants completed the BNT (Goodglass et al. 1983), where
they were sequentially presented with up to 60 line drawings
Design and materials of objects and asked to overtly name each one. The standard
The critical categorization task was modeled closely on L&M’s discontinuation rule was applied, with testing stopped after eight
study, with two modifications. First, the original study used consecutive failed naming attempts. No semantic or phonological
34 unique categories (18 HD categories and 16 LD categories), cues were given.
with some repetition of categories in each condition. We chose
to not repeat any categories, so we limited the materials to
16 categories in each condition (dropping “BODY PARTS” and Experimental procedure
“FACIAL FEATURES” from the HD set). And second, we used a Testing was carried out individually either in a quiet well-lit room
different set of images. L&M used normed color drawings (Rossion at the UCL Aphasia clinic or at the participants’ home, using a
and Pourtois 2004), and we used high-quality color photographs MacBook Pro (Retina, 13-inch display) and an external computer
selected from the Hemera Photo Objects 5000 and Google Images. mouse. The study was set up using PsychoPy (Version 1.83), and
For each category, we selected 8–15 targets and 25–27 distractors. the procedure closely followed that used in L&M’s study, except
Distractors included some items which were related to the target where noted. On each trial (see Fig. 1A for a sample HD and LD
category (for example, for the category “DANGEROUS ANIMALS,” trial), participants were presented with a 4 x 5 grid of images. The
13 of the 26 distractors were animals that were not dangerous, and image sets for the individual trials—each consisting of 20 images
the category “ANIMALS WITH STRIPES” included distractors that (4 targets and 16 distractors)—were randomly selected from the
10384 | Cerebral Cortex, 2023, Vol. 33, No. 19
pool of targets/distractors for each participant separately. The cat- Lastly, due to a technical error, if participants accidently double-
egory was stated at the top of the screen in lower-case Arial bold clicked the “Done” button, the next set of images was skipped,
letters (e.g. “objects that hold water”) and remained on the screen and the software registered it as though no response was made
for the duration of the trial. Participants selected the objects that by participants. As a result, we excluded trials where no selection
belonged to the target category by clicking on each relevant image. was made and where the trial length was less than 5 seconds.
A gray frame appeared around an image once it was clicked; This resulted in the exclusion of 40 trials (out of 2,112; ∼ 2%),
clicking the image again de-selected it (removed the gray frame) spread randomly between participants, groups and categories.
to allow participants to modify responses. Once the participant The analysis code is available on OSF: https://osf.io/guwh8/.
had selected all of the images they deemed appropriate for the
target category, they clicked a large green button with the word Results
“Done” at the bottom of the screen (in the L&M version, the button Group profiles
said “click here when done”). Doing so triggered the next trial. As expected, the neurotypical, aphasia, and PD groups differed
Although each trial contained a fixed number of targets (four), significantly in their BNT scores (F(2,31) = 9.85, P < 0.001). Post-hoc
participants were not informed of the number of targets during pairwise comparisons showed that the BNT scores of participants
SE = 4.02, P = 0.044) and the PD group (M = 21.40, SD = 5.14; exists a relationship between the BNT score and categorization
β = −13.30, SE = 3.66, P < 0.001). The interactions between category performance, they do not support the LD-specific language
dimension and group were not significant (neurotypical>aphasia: recruitment hypothesis.
β = 0.82, SE = 1.29, P = 0.522; PD > aphasia: β = 1.98, SE = 1.17,
P = 0.091). Follow-up analyses showed no overall effect of category
dimension across groups (β = 3.81, SE = 2.19, P = 0.249), within the
Interim discussion
neurotypical group (β = 3.92, SE = 2.34, P = 0.271) or within the PD In Study 1, we use the setup from a previous study (Lupyan and
group (β = 2.76, SE = 2.27, P = 0.521). Mirman 2013, or L&M) to test the hypothesis that language is
selectively recruited to support LD categorization. To examine
Effect of naming performance the generality of the language-categorization link, we recruited a
To explore the effect of naming ability on the categorization task group of individuals with aphasia with diverse degrees of aphasia
performance, we fitted a logistic mixed effect linear regression severity. We found that the aphasia group performed comparably
model with the BNT score, category dimension, and their inter- to the control groups on the categorization tasks. Naming ability
action as fixed effects and participants (across the three groups) (as measured with the BNT) predicted overall categorization per-
and categories (e.g. “DANGEROUS ANIMALS”) as random effects. formance, but we observed no interaction between naming ability
Similar to L&M, we also included education level as a fixed effect. and category dimension (HD vs LD). In summary, Study 1 provides
We found that BNT was a significant predictor of accuracy no support for the hypothesis that language plays a special role
(β = 0.36, SE = 0.08, P < 0.001) and RT (β = −5.26, SE = 1.41, P < 0.001), in LD categorization.
such that higher BNT scores corresponded to more accurate Participants with aphasia performed object categorization as
and faster performance (Fig. 2C, D). There was no main effect accurately as the neurotypical controls. Participants with PD per-
of category dimension (accuracy: β = −0.24, SE = 0.26, P = 0.358; RT: formed better than the other groups, but this difference is likely
β = −3.73, SE = 2.14, P = 0.092) and no interaction between BNT and explained by the higher education level in this group. As in L&M,
category dimension (accuracy: β = −0.05, SE = 0.04, P = 0.271; RT: participants with aphasia were significantly slower to complete
β = 0.74, SE = 0.49, P = 0.131). Education was a significant predictor the categorization task compared with the neurotypical group,
for both accuracy (β = 0.23, SE = 0.07, P = 0.001) and RT (β = −2.95, and to our additional, PD control group. However, this slower per-
SE = 1.24, P = 0.024). Whereas these results indicate that there formance in the aphasia group can be explained by the presence
10386 | Cerebral Cortex, 2023, Vol. 33, No. 19
of motor impairments (e.g. right hemiplegia)—often more severe participants took part in Study 1. All participants used English as
than in participants with PD—which often necessitate use of their their primary language and were offered a £15.00 reimbursement.
non-preferred hand. This difference could also be explained by Ethical approval was granted by the UCL Research Ethics panel,
the fact that participants with aphasia may require longer to pro- Project ID: LC/2013/05, and all volunteers gave informed consent
cess the category descriptions, which are presented verbally and to participate in the study.
sometimes in lengthy phrases (e.g. “NON-FOOD THINGS FOUND
IN THE KITCHEN”). Thus, we are hesitant to place a lot of weight Design and materials
on the RT differences. The categories were identical to those of Study 1. The images were
Across groups, BNT scores significantly predicted performance also largely the same although some were replaced by better qual-
on all three outcome measures (although this effect did not differ ity photographs. Unlike Study 1, we presented the images sequen-
for LD and HD categorization). Although BNT scores may be a tially (Fig. 2B). Each block started with a category label, followed
proxy for the severity of linguistic impairment, they also might by 12 images presented one at a time. The category label remained
index the degree of executive function impairments (Higby et al. on the screen to minimize memory demands. The images for each
2019). Due to the proximity of language-specific and multiple- category block were randomly selected from the general set of
TPO, time post onset; BNT, Boston Naming Test; HD, high dimension categories; LD, low dimension categories; SD, standard deviation
significantly more years of education than participants with P = 0.021) and for the aphasia versus PD comparison (β = 0.32,
aphasia (P = 0.010 and 0.016, respectively). The neurotypical SE = 0.15, P = 0.037).
participants and participants with PD did not differ (P > 0.999).
Response times
Categorization task RT results were also consistent with the LD-specific language
Categorization results for Study 2 are summarized in Fig. 3. recruitment hypothesis. Participants with aphasia were slower
to respond during LD trials (M = 2.37, SD = 0.70) compared with
Accuracy HD trials (M = 2.22, SD = 0.64; LD > HD: β = 0.16, SE = 0.08, P = 0.044).
As in Study 1, participants with aphasia had similar accura- The overall RTs for participants with aphasia (M = 2.30, SD = 0.64)
cies for LD (M = 0.87, SD = 0.07) and HD categories (M = 0.90, were longer than for neurotypical participants (M = 1.48, SD = 0.34;
SD = 0.08; LD > HD: β = −0.24, SE = 0.22, P = 0.282). Participants β = −.81, SE = 0.19, P < 0.001) and participants with PD (M = 1.43,
with aphasia had overall lower accuracies (M = 0.88, SD = 0.07) SD = 0.29; β = −0.86, SE = 0.19, P < 0.001). We also observed an
compared with neurotypical participants (M = 0.98, SD = 0.01; interaction between group and category dimension for both the
neurotypical>aphasia: β = 1.70, SE = 0.28, P < 0.001) and partici- neurotypical versus aphasia comparison (β = −0.23, SE = 0.03,
pants with PD (M = 0.97, SD = 0.02; PD > aphasia: β = 1.44, SE = 0.28, P < 0.001) and the PD versus aphasia comparison (β = −0.19,
P < 0.001), which is consistent with the negative relationship SE = 0.03, P < 0.001), such that participants with aphasia had
between naming ability and categorization performance observed longer RTs for LD categories compared with HD categories.
in Study 1. We did not observe a reliable category dimension by
group interaction for the aphasia versus neurotypical comparison Effect of naming performance
(β = 0.44, SE = 0.26, P = 0.086), nor for the aphasia versus PD As in Study 1, BNT was a significant predictor of categorization
comparison (β = 0.42, SE = 0.23, P = 0.070). Critically, in accordance performance (accuracy: β = 0.50, SE = 0.11, P < 0.001; RT: β = −0.29,
with the LD-specific language recruitment hypothesis, we SE = 0.07, P < 0.001). There was no main effect of category
observed a category dimension by group interaction both for dimension (accuracy: β = 0.06, SE = 0.21, P = 0.787; RT: β = −0.02,
the aphasia versus neurotypical comparison (β = 0.37, SE = 0.16, SE = 0.07, P = 0.742); however, unlike Study 1, and as predicted by
10388 | Cerebral Cortex, 2023, Vol. 33, No. 19
the LD-specific language recruitment hypothesis, we observed an (with the goal of reducing executive demands). We found that, in
interaction between BNT and category dimension for accuracy accordance with the LD-selective language recruitment hypothe-
(β = 0.13, SE = 0.05, P = 0.007) and RT (β = −0.08, SE = 0.01, P < 0.001). sis, individuals with aphasia were impaired on LD categorization
Finally, education was not a significant predictor of performance more than on HD categorization. However, performance of indi-
in this dataset (accuracy: β = 0.12, SE = 0.13, P = 0.372; RT: β = −0.01, vidual participants offers a reason to be skeptical about a direct
SE = 0.08, P = 0.940). link between naming and categorization. Participants A4 and A5
demonstrated dissociation between these two tasks: despite very
Single case analysis low BNT scores (lower than 5/60), they performed similarly on
Although the effect of naming performance in Study 2 is in the HD and LD categorization trials, and their accuracy on both
line with L&M’s prediction, careful examination of individual conditions was well within the range of the control groups.
participants’ scores casts doubt on the causal relationship Dissociations observed in individual case studies are critical in
between naming ability and categorization performance. Specif- informing debates about cognitive architecture (e.g. Caramazza
ically, participants A4 and A5 in the aphasia group (Table 2) and McCloskey 1988; Badecker et al. 1991; Caramazza and
had very low BNT scores (1/60 and 4/60), but nonetheless Coltheart 2006). Naturally occurring brain lesions do not respect
performed well relative to both the neurotypical and PD groups the boundaries between functionally distinct brain areas, and
(accuracy: LD A4 = 95%; A5 = 91%; HD A4 = 96%; A5 = 96%). Using comorbidities or associations of impairments are common (e.g.
the Adjusted F Calculator for comparing single cases to groups Bates et al. 2003). For example, damage to the left inferior frontal
(Hulleman and Humphreys 2007), these two participants did gyrus (LIFG) is likely to cause multiple cognitive impairments due
not differ significantly from the combined neurotypical and PD to the high functional heterogeneity of that region (Fedorenko
groups for either the HD condition (A4: F[1,29] < 0.01, P (one- et al. 2012; Fedorenko and Blank 2020). Thus, a correlation that we
tailed) = 0.414; A5: F[1,29] < 0.01, P (one-tailed) = 0.414) or the LD observe between naming and categorization might be because the
condition (A4: F[1,29] = 0.02, P (one-tailed) = 0.337; A5: F[1,29] = 0.14, brain regions that support these functions are located nearby and
P (one-tailed) = 0.154). This dissociation indicates that naming thus are likely to be damaged together (rather than naming and
impairment is not necessarily accompanied by a decrement in categorization engaging the same brain region/mechanism). The
LD categorization. dissociation that we observe in participants A4 and A5 supports
this possibility: in both cases, severely limited lexical access
Interim discussion did not prevent success on the categorization task, revealing
In Study 2, we examined object categorization performance of that intact linguistic (naming) skills are not necessary for object
individuals with severe anomia using a modified task paradigm categorization.
Yael Benn et al. | 10389
As in Study 1, naming ability significantly predicted perfor- words (albeit no single region or voxel is sensitive just to word-
mance. Furthermore, possibly because in this study we recruited level or sentence-level meaning; Blank et al. 2016; Fedorenko
participants with aphasia who had extremely poor naming per- et al. 2020). Therefore, if a task requires activating verbal labels,
formance, we also observed a group difference: participants with we expect to observe activity in the regions identified with the
aphasia had lower accuracy and longer response times than the language localizer.
two control groups. This evidence points to a possible link between The multiple demand localizer identifies a set of brain regions
naming performance and categorization. As in Study 1, this link that respond to a wide range of cognitively demanding tasks.
might arise from the fact that task instructions are presented ver- Specifically, these regions are sensitive to general cognitive effort,
bally; thus, linguistic impairments might affect task performance exhibiting higher activity when the task is more difficult (Assem
simply because they make it more challenging to process the et al. 2020b; Duncan 2010; Fedorenko et al. 2013; Hugdahl et al.
instructions. Another explanation, also offered by L&M, is that LD 2015). The hard>easy response signature in the multiple demand
categorization is correlated with naming impairments because network holds across many diverse tasks, including spatial WM,
both tasks may be affected by damage to cognitive control logic, math, relational reasoning, and cognitive control (Fedorenko
mechanisms, which lay in close proximity to language areas, et al. 2013; Coetzee and Monti 2018; Shashidhara et al. 2019;
Language network localizer completed three runs. Across the three runs, any given participant
Participants read sentences (e.g. NOBODY COULD HAVE PRE- saw a random subset of the 32 categories, with some categories
DICTED THE EARTHQUAKE IN THIS PART OF THE COUNTRY) repeating (but never repeating within a run; see Appendix S1,
and lists of unconnected, pronounceable nonwords (e.g. U BIZBY Table 1 for details). Condition order was counterbalanced across
ACWORRILY MIDARAL MAPE LAS POME U TRINT WEPS WIBRON runs and participants.
PUZ) in a blocked design. Each stimulus consisted of twelve word-
s/nonwords. The sentences > nonword-lists contrast has been
fMRI data acquisition
previously shown to reliably activate high-level language process- Structural and functional data were collected on the whole-
ing regions and to be robust to changes in the materials, task, body, 3 Tesla, Siemens Trio scanner with a 32-channel head coil,
and modality of presentation (Fedorenko et al. 2010; Mahowald at the Athinoula A. Martinos Imaging Center at the McGovern
and Fedorenko 2016; Scott et al. 2017). For details of how the Institute for Brain Research at MIT. T1-weighted structural images
language materials were constructed, see Fedorenko et al. (2010). were collected in 176 sagittal slices with 1-mm isotropic voxels
The materials are available at http://evlab.mit.edu/funcloc. Stim- (TR = 2,530 ms, TE = 3.48 ms). Functional, blood oxygenation level
dependent (BOLD), data were acquired using an EPI sequence
condition (included to model variability in the HRF delays), as The responses to the localizer conditions (sentences and non-
well as nuisance regressors to control for the effect of slow linear words for language fROIs, hard and easy WM conditions for mul-
drifts, subject-motion parameters, and potential outlier scans on tiple demand fROIs, and LD and HD categorization for categoriza-
the BOLD signal. tion fROIs) were estimated using an across-runs cross-validation
procedure, where one run was used to define the fROI and the
Defining individual functional regions of interest other to estimate the response magnitudes, then the procedure
was repeated switching the runs used for fROI definition versus
Responses to the critical categorization experiment were extracted
response estimation, and finally the estimates were averaged to
from regions of interest that were defined functionally in each
derive a single value per condition per fROI per participant. This
individual participant (Saxe et al. 2006; Nieto-Castañón and
cross-validation procedure allows one to use all of the data for
Fedorenko 2012). Three sets of functional regions of interest
defining the fROIs as well as for estimating their responses (see
(fROIs) were defined—one for the language network, one for
Nieto-Castañón and Fedorenko 2012, for discussion), while ensur-
the multiple demand network, and one for the putative LD > HD
ing the independence of the data used for fROI definition and
categorization regions. To do so, we used the Group-constrained
response estimation (Kriegeskorte et al. 2009). Two participants
Fig. 4. Categorization responses within the language brain network. (A) Parcels used to define fROIs in individual participants. (B) Average responses
within the language network to four conditions of interest (sentence reading and nonword reading vs. LD and HD categorization). (C) fROI responses to
M = 1.52 s, SD = 0.25 s). Mixed effect models with condition as a extent. There was also an interaction between the Hard>Easy WM
fixed effect and participant as a random intercept showed that task and hemisphere, such that the effect was greater in right
both accuracy and RT effects were significant (accuracy: β = −1.41, hemisphere (β = 0.38, SE = 0.19, P = 0.040).
SE = 0.202, P < 0.001; RT: β = 0.33, SE = 0.027, P < 0.001). Follow-up analyses on individual fROIs (Appendix 2, Table 2)
showed that responses to categorization were significantly above
Critical categorization task 0 in all fROIs. However, they were weaker than the overall
The accuracies for the two categorization conditions did not responses to the WM task in almost all fROIs (except left middle
significantly differ (LD M = 95.73%, SD = 4.20%; HD M = 95.44%, frontal fROI). This result highlights the domain-general nature
SD = 4.11%; LD > HD β = 0.14, SE = 0.20, P = 0.454). Similarly, there of these responses. Further, none of the fROIs had significantly
was no significant difference between response times in the LD different responses to LD and HD categories, despite the presence
condition (RT = 0.81 s, SD = 0.1 s) and the HD condition (RT = 0.84 s, of this effect in the network-level analysis.
SD = 0.1 s; LD > HD β = −0.03, SE = 0.02, P = 0.156).
Whole-brain analyses
Functional response profile of the language network We also conducted a whole-brain analysis to identify fROIs that
There was no significant difference between language network might respond more strongly to LD or HD categorization but lie
responses to LD and HD categorization (β = −0.02, SE = 0.10, outside the language and multiple demand fROIs described above.
P = 0.848). Overall, responses to the categorization task were The GSS analysis (see Methods for details) revealed that no regions
barely above 0 (β = 0.42, SE = 0.19, P = 0.054; see Fig. 4), not exhibited consistent HD > LD responses across participants; how-
significantly different from responses to nonword reading, the ever, the LD > HD contrast revealed two parcels, both located in
control condition in the language localizer task (β = 0.13, SE = 0.09, left parietal lobe (Fig. 6). Further analysis of fROIs defined within
P = 0.144), and significantly weaker than responses to sentences these parcels showed that the LD > HD response only reached
(β = −1.49, SE = 0.09, P < 0.001). significance in fROI 2 (β = 0.43, SE = 0.17, P = 0.013), but not in fROI
Follow-up analyses in individual language fROIs (Appendix 2, 1 (β = 0.58, SE = 0.30, P = 0.060). The overall categorization response
Table 1) showed that responses to categorization were signifi- was significantly above 0 in fROI 1 (β = 0.65, SE = 0.19, P = 0.001) but
cantly above 0 in frontal fROIs (MFG, IFG, and IFGorb). However, not fROI 2 (β = −0.13, SE = 0.15, P = 0.389).
none of the responses were significantly higher than responses Importantly, both fROIs responded to the WM task more
during the control task, nonword reading, indicating that these strongly than to the categorization task (fROI 1: β = 1.66, SE = 0.21,
responses are not language-specific. Thus, our results suggest P < 0.001; fROI 2: β = 0.64, SE = 0.12, P < 0.001), indicating that these
that the language network does not support either LD or HD regions likely respond to general cognitive effort rather than to LD
categorization in neurotypical participants. categorization (or feature selection) specifically, and thus likely
belong to the MD network. Neither of the two fROIs exhibited a
Functional response profile of the multiple demand network sentences>nonwords effect; in fact, both showed a trend in the
Multiple demand network response to LD categorization was opposite direction (fROI 1: β = −0.51, SE = 0.30, P = 0.094; fROI 2:
higher than to HD categorization (β = 0.19, SE = 0.09, P = 0.025), β = −0.28, SE = 0.17, P = 0.098), which shows that these regions do
indicating that, as predicted, LD categorization is more effortful. not respond to linguistic input.
In general, multiple demand network responses to categorization The whole-brain analysis provides additional evidence against
were significantly above 0 (β = 1.07, SE = 0.21, P < 0.001; see Fig. 5) the LD-specific language recruitment hypothesis and shows that
and stronger than responses to control conditions from the lan- differences in LD versus HD categorization, if present, are likely
guage localizer task (categorization > sentences: β = 0.73, SE = 0.08, caused by domain-general mechanisms.
P < 0.001; categorization > nonwords: β = 0.41, SE = 0.08, P < 0.001).
However, they were weaker than responses to the spatial WM Interim discussion
task (β = −1.43, SE = 0.07, P < 0.001), indicating that the WM task In the fMRI Experiment, we examined neural responses to LD and
was more effortful. Responses to the categorization task were HD categorization. Our main goal was to evaluate the hypothesis
stronger in the left hemisphere (β = 0.24, SE = 0.09, P = 0.005). We that LD categorization relies more heavily on linguistic resources
also observed an interaction between the WM > categorization compared with HD categorization. For this purpose, we identi-
contrast and hemisphere (β = 0.29, SE = 0.13, P = 0.024), showing fied the language network individually in 14 healthy adults and
that the WM task engages the right hemisphere to a greater examined its responses during LD and HD categorization. The
Yael Benn et al. | 10393
A 4
B C
2
3
5
7
6
8
9
1 10
D 4
E F
2
10 1
Fig. 5. Categorization responses within the multiple demand brain network. (A) Left hemisphere parcels used to define fROIs in individual participants.
(B) Average responses within the left hemisphere fROIs to four conditions of interest (hard and easy WM tasks vs. LD and HD categorization). (C) Left
hemisphere fROI responses to the four conditions of interest. (D–F) Parcels, average responses, and fROI-level responses in the right hemisphere.
Fig. 6. Results of the whole-brain analyses. (A) Parcels defined with the LD > HD categorization contrast. (B) Responses to conditions of interest within
the two fROIs (defined as the top 10% of voxels within each parcel, sorted by the magnitude of the LD > HD response). WM, working memory task.
language network exhibited low responses to both categorization domain-general multiple demand regions and not on language-
tasks, which did not differ from activations elicited by reading of specific regions. Future work should examine whether the
nonword sequences (a low-level control condition). There was no small difference between LD and HD categories is driven by
difference between responses to LD and HD categories, contrary a small subset of categories or whether it indeed ref lects
to the prediction that the language network would be selec- greater domain-general cognitive demands associated with all LD
tively or preferentially engaged during LD categorization. Thus, categorization.
we conclude that (i) the neuroimaging results disconfirm the LD- Neuroimaging of healthy individuals provides a powerful
specific language recruitment hypothesis and (ii) the language complement to patient studies. Given the strong and selective
network is not at all engaged in object categorization, highlighting engagement of the language network during all behaviors
a dissociation between linguistic processing and non-linguistic requiring access to linguistic representations (Fedorenko et al.
semantic cognition. 2010; Fedorenko et al. 2011; Menenti et al. 2011; Scott et al.
Unlike the language network, the domain-general multiple 2017; Giglio et al. 2022; Hu et al. 2021, among others), the lack
demand network (also defined individually in each participant) of activity in the language regions during categorization strongly
was engaged during categorization, indicating that this task is suggests that they do not contribute to categorization (Mather
cognitively challenging. This network responded more strongly et al. 2013). The response to categorization within the multiple
to LD than HD categorization, but this effect was small. The demand network, on the other hand, indicates its involvement in
whole-brain analyses specifically aimed at identifying regions categorization, even though we note that fMRI evidence described
with stronger responses to LD than HD categorization confirmed here is correlational, not causal, and should be complemented
that the two identified fROIs, responded more strongly to a with patient studies or brain stimulation studies that specifically
WM task than to a categorization task, and the LD > HD effect target this hypothesis (that interfering with the activity in the
was small and/or not statistically significant. We conclude that multiple demand network or damage to this network should lead
categorization, and LD categorization in particular, relies on to impairments in categorization tasks). Neuroimaging evidence
10394 | Cerebral Cortex, 2023, Vol. 33, No. 19
is particularly helpful when patient studies do not produce object because the information required for categorization (e.g.
conclusive results, as in our case. color, length) is directly extractable from the image. For semantic
Whereas some previous work suggested that a region within categorization (e.g. danger level or typical location), however, the
left angular gyrus is involved in inhibiting irrelevant semantic identity of the object is important. The result of this re-coding is
information (Lewis et al. 2019), as may be required for LD cat- reported in Appendix 1. The rest of the analyses were the same as
egorization, the results of our study suggest that activation of those described for LD/HD category types.
the language-responsive portion of the left angular gyrus was
comparable during LD and HD categorization. If anything, this Results and discussion
language fROI showed numerically higher activation during HD The results are shown in Appendices S2 and S3. In both aphasia
categorization, suggesting that it may be recruited for recognizing studies, category type had no effect on accuracy, nor did it interact
and thinking about established sets more than for constructing with participant group or BNT. However, semantic categorization
novel sets that may require inhibition of object-irrelevant char- overall elicited longer response times compared with perceptual
acteristics. We also did not find significant differences in the categorization. This main effect of category type on response
engagement of the language fROIs in the left inferior frontal cor- times interacted with participant group for both studies, but
therefore concluded that lexical retrieval is not necessary for suc- Yet another possibility is that both naming and categorization
cessful categorization, including categorization based on single performance rely not only on domain-general, but also on
features. semantic control resources. Semantic control is a cognitive
In Study 3, we used a complementary approach and examined construct posited by several groups that investigate controlled
the engagement of the language network and a domain-general retrieval of conceptual information (e.g. Thompson-Schill et al.
multiple demand network in HD and LD categorization using fMRI 1997; Badre and Wagner 2002; Jefferies 2013; Lambon Ralph et al.
in neurotypical adults. The language network was not engaged 2017). Although the location of the putative regions responsible
during either LD or HD categorization: its responses did not for semantic control (or, more neutrally, semantic demand)
significantly differ from responses during the control, nonword resembles that of the language regions, precise localization
reading, task. This observation goes against the hypothesis that approaches in individual brains indicate that language, multiple
categorization (either LD or HD) relies on linguistic resources. demand, and semantic demand regions are spatially distinct
In contrast, the multiple demand network was recruited dur- (Ivanova et al. in prep). If semantic demand regions support
ing the categorization task, consistent with prior evidence of deliberate, controlled semantic tasks, damage to these regions
its involvement in diverse cognitively challenging tasks (Duncan might explain both categorization and naming difficulties in
under other accounts (e.g. Langland et al. 2021), “f lying” might low-level verbal/phonological rehearsal appears to rely on lower-
be a feature in and of itself, uniting objects that are otherwise level speech processing mechanisms (e.g. Scott and Perrachione
highly diverse. The lack of clarity on what exactly constitutes an 2019) and the domain-general multiple-demand network (e.g.
HD category makes it hard to generalize the results beyond the Fedorenko et al. 2011; Shashidhara et al. 2020), not on the lan-
specific categories used in the study. guage network. In any case, the verbal rehearsal account is quite
Furthermore, not all LD categories as defined by Lupyan different from L&M’s original LD-specific language recruitment
and Mirman (2013) necessarily involve conceptual processing. hypothesis.
For instance, many are based on color: e.g. “THINGS THAT
ARE YELLOW”. Although color is often encoded as part of
Relationship to other work on language and
the conceptual representation of an object, this conceptual
categorization
representation was not required for the task in question:
participants were simply asked to indicate whether the object Other results from psycho- and neurolinguistics also support the
they were viewing was yellow, and decisions could be made on view that linguistic resources do not typically mediate catego-
the basis of surface perceptual features alone. Thus, even if “true” rization in humans. If access to linguistic representations were
CRediT taxonomy Baayen RH, Davidson DJ, Bates DM. Mixed-effects modeling with
crossed random effects for subjects and items. J Mem Lang.
Yael Benn (Conceptualization, Formal analysis, Methodology,
2008:59(4):390–412.
Project administration, Software, Validation, Writing—original
Badecker W, Caramazza A. On considerations of method and theory
draft, Writing—review and editing), Anna Ivanova (Conceptual-
governing the use of clinical categories in neurolinguistics and
ization, Data curation, Formal analysis, Validation, Visualization,
cognitive neuropsychology: the case against agrammatism. Cog-
Writing—original draft, Writing—review and editing), Oliver
nition. 1985:20(2):97–125.
Clark (Formal analysis, Software, Validation), Zachary Mineroff
Badecker W, Nathan P, Caramazza A. Varieties of sentence compre-
(Formal analysis, Investigation, Methodology), Chloe Seikus
hension deficits: a case study. Cortex. 1991:27(2):311–321.
(Investigation, Methodology), Jack Santos Silva (Investigation,
Badre D, Wagner AD. Semantic retrieval, mnemonic control, and
Methodology), Rosmary Varley (Conceptualization, Investigation,
prefrontal cortex. Behav Cogn Neurosci Rev. 2002:1(3):206–218.
Methodology, Supervision, Writing—original draft, Writing—
Bain A. The senses and the intellect. London: John W. Parker & Son; 1855.
review and editing), Evelina Fedorenko (Conceptualization,
Baldo JV, Bunge SA, Wilson SM, Dronkers NF. Is relational reasoning
Methodology, Supervision, Writing—original draft, Writing—
dependent on language? A voxel-based lesion symptom mapping
Burger RA, Muma JR. Cognitive distancing in mediated categorization Fedorenko E, Duncan J, Kanwisher N. Language-selective and
in aphasia. J Psycholinguist Res. 1980:9(4):355–365. domain-general regions lie side by side within Broca’s area. Curr
Caramazza A, Badecker W. Patient classification in neuropsycholog- Biol. 2012:22(21):2059–2062.
ical research. Brain Cogn. 1989:10(2):256–295. Fedorenko E, Duncan J, Kanwisher N. Broad domain generality in
Caramazza A, Coltheart M. Cognitive neuropsychology twenty years focal regions of frontal and parietal cortex. Proc Natl Acad Sci U
on. Cogn Neuropsychol. 2006:23(1):3–12. S A. 2013:110(41):16616–16621.
Caramazza A, McCloskey M. The case for single-patient studies. Cogn Fedorenko E, Blank IA, Siegelman M, Mineroff Z. Lack of selectivity
Neuropsychol. 1988:5(5):517–527. for syntax relative to word meanings throughout the language
Caramazza A, Berndt RS, Brownell HH. The semantic deficit hypoth- network. Cognition. 2020:203:104348.
esis: perceptual parsing and object classification by aphasic Ferguson B, Waxman S. Linking language and categorization in
patients. Brain Lang. 1982:15(1):161–189. infancy. J Child Lang. 2017:44(3):527–552.
Carruthers P. The cognitive functions of language. Behav Brain Sci. Franklin A, Drivonikou GV, Clifford A, Kay P, Regier T, Davies IRL. Lat-
2002:25(6):657–674 discussion 674-725. eralization of categorical perception of color changes with color
Chen X, Affourtit J, Norman-Haignere S, Jouravlev O, Malik-Moraleda term acquisition. Proc Natl Acad Sci. 2008:105(47):18221–18225.
both lexical access and sentence generation during language Lewis GA, Poeppel D, Murphy GL. Contrasting semantic versus
production. Biorxiv. 2021:2021–09. inhibitory processing in the angular gyrus: an fMRI study. Cereb
Hugdahl K, Raichle ME, Mitra A, Specht K. On the existence of Cortex. 2019:29(6):2470–2481.
a generalized non-specific task-dependent network. Front Hum Luo X, Sexton NJ, Love BC. A deep learning account of how
Neurosci [Internet]. 2015:9:430. language affects thought. Language, Cognition and Neuroscience.
Hulleman J, Humphreys GW. Maximizing the power of comparing 2023:38(4):499–508.
single cases against a control sample: an argument, a program for Lupyan G. Extracommunicative functions of language: verbal inter-
making comparisons, and a worked example from the Pyramids ference causes selective categorization impairments. Psychon Bull
and Palm Trees Test. Cogn Neuropsychol. 2007:24(3):279–291. Rev. 2009:16(4):711–718.
Ivanova AA, Hofer M. Linguistic overhypotheses in category learning: Lupyan G. Linguistically modulated perception and cognition: The
explaining the label advantage effect. In: Proceedings of the 42nd label-feedback hypothesis. Front Psychol. 2012:3:54.
Annual Conference of the Cognitive Science Society. Cognitive Science Lupyan G, Casasanto D. Meaningless words promote meaningful
Society; 2020, p. 723–729. categorization. Lang Cogn. 2015:7(2):167–193.
Ivanova AA, Mineroff Z, Zimmerer V, Kanwisher N, Varley R, Lupyan G, Mirman D. Linking language and categorization: evidence
Petersen SE, Posner MI. The attention system of the human brain: 20 Sloutsky VM. From perceptual categories to concepts: what devel-
years after. Annu Rev Neurosci. 2012:35:73–89. ops? Cogn Sci. 2010:34(7):1244–1286.
Pinel P, Dehaene S. Beyond hemispheric dominance: brain regions Sloutsky VM, Fisher AV. Induction and categorization in young chil-
underlying the joint lateralization of language and arithmetic to dren: a similarity-based model. J Exp Psychol Gen. 2004:133(2):
the left hemisphere. J Cogn Neurosci. 2009:22(1):48–66. 166–188.
Plunkett K, Hu J-F, Cohen LB. Labels can override perceptual cate- Smith LB, Heise D. Perceptual similarity and conceptual structure.
gories in early infancy. Cognition. 2008:106(2):665–681. In: Burns B, editor. Advances in psychology. Vol. 93. North-Holland:
Pontillo DF, Salverda AP, Tanenhaus MK. 2015. Flexible use of phono- Elsevier; 1992. pp. 233–272.
logical and visual memory in language-mediated visual search. Smith EE, Medin DL. Categories and concepts. Cambridge MA: Harvard
In: Proceedings of the 37th Meeting of the Cognitive Science Society. University Press; 1981.
Pasadena, California. Thompson-Schill SL, D’Esposito M, Aguirre GK, Farah MJ. Role of left
Posner MI, Petersen SE. The attention system of the human brain. inferior prefrontal cortex in retrieval of semantic knowledge: a
Annu Rev Neurosci. 1990:13:25–42. reevaluation. Proc Natl Acad Sci. 1997:94(26):14792–14797.
Potter MC, Faulconer BA. Time to understand pictures and words. Vallila-Rohter S, Kiran S. An examination of strategy implementa-