Task Selection and Variation Criteria¶
Date: 2026-04-18 (updated) Purpose: Reference document specifying how the 103-task HED catalog was scoped, how tasks were selected, and the rules for classifying variations vs. aliases vs. non-variations.
1. Task Selection Criteria¶
What counts as a task in this catalog?¶
A task is a structured experimental paradigm that:
Produces event-based data. The paradigm generates a sequence of discrete, time-stamped events (stimulus presentations, participant responses, feedback) that can be annotated with HED tags.
Has a specific, reproducible procedure. A researcher reading the task description could implement it. Generic categories like “memory task” or “attention task” are not tasks — they are domains.
Is widely used in cognitive/behavioral neuroscience. The paradigm appears across multiple labs, has a recognized name, and is cited in the literature as a standard method.
Engages identifiable cognitive processes. The task is designed to isolate or measure specific cognitive, perceptual, motor, or affective processes.
What is excluded?¶
Self-report instruments: Personality inventories, mood questionnaires, symptom checklists (e.g., BDI, STAI). These do not produce event-structured data.
Clinical screening tools: Instruments designed for diagnostic classification rather than experimental measurement (e.g., MMSE as a screening tool, though components like digit span are included as standalone tasks).
Broad task categories: “Working memory task” as a generic label. The catalog includes specific instantiations (N-Back, Operation Span, Change Detection, etc.).
One-off experimental designs: Paradigms used in a single study without broader adoption.
Neurological examinations: Reflex testing, cranial nerve exams, etc.
The 103-task scope¶
The catalog targets approximately 100 canonical tasks spanning the major domains of cognitive and behavioral neuroscience: attention, memory, executive function, language, perception, motor control, emotion, social cognition, decision-making, and learning. The number 103 is not a hard constraint — it reflects the current state of consolidation from the Cognitive Atlas plus gap-filling. Tasks may be added or removed as the catalog matures.
2. Naming Conventions¶
Each task has a canonical_name (the standard reference name) and zero or more aliases (alternative names used in the literature).
Canonical name rules¶
Use the most widely recognized name in the literature
Include “Task” as a suffix (e.g., “Stroop Color-Word Task”, not just “Stroop”)
For named paradigms, the standard abbreviation goes in the
aliasesarray, not in the canonical name (e.g., canonical name “Cambridge Face Memory Task”, alias “CFMT”)Prefer the specific paradigm name over the generic process name when both exist (e.g., “Eriksen Flanker Task” not “Response Conflict Task”)
Alias rules¶
An alias is an alternative name for the same task. Aliases are:
Abbreviations (e.g., “AGL” for Artificial Grammar Learning Task)
Eponymous names (e.g., “Galton-Crovitz Task” for the cue-word variant of Autobiographical Memory Task)
Common shortened forms (e.g., “Antisaccade” for Anti-Saccade Task)
Instrument-specific names (e.g., “IAPS Viewing” for Affective Picture Viewing Task)
Aliases are not variations. An alias refers to the same procedure with a different name. A variation refers to a different procedure under the same task umbrella.
Identifier formats¶
Every term in the catalog is typed by its identifier prefix, so any ID encountered anywhere in the documents is unambiguously classifiable:
Task IDs have the form
hedtsk_<slug>(e.g.,hedtsk_eriksen_flanker,hedtsk_change_detection,hedtsk_balloon_analog_risk). The slug is mechanically derived from thecanonical_nameby stripping the trailing word “Task” or “Test”, lowercasing, removing apostrophes, replacing/,-, and spaces with_, collapsing runs of_, and prependinghedtsk_. It is therefore a stable, deterministic identifier once the canonical name is fixed. Thehedtsk_prefix is mandatory.Process IDs have the form
hed_<slug>(e.g.,hed_response_inhibition,hed_visual_working_memory). Thehed_prefix is mandatory. Thehed_prefix is this catalog’s working prefix and is not a claim about HED-schema membership (see §2.5).Canonical names for tasks always end in the word
TaskorTest.Taskis the default. The canonical name contains no parenthetical abbreviations and no spaced slashes — abbreviations and alternative names go inaliases. Examples: “Balloon Analog Risk Task” with alias “BART”; “False Belief Task” with alias “Theory of Mind Task”.Process names are sentence case (first word capitalized, remainder lowercase except proper nouns).
A reader who sees hedtsk_balloon_analog_risk knows it is a task; hed_response_inhibition is a process; BART is a human-readable abbreviation not an ID. Any cross-reference between the two axes (task-to-process, or process-to-task) must preserve these prefixes verbatim.
3. The Inclusion Test¶
Each task has a three-part inclusion_test that defines the boundary of the task:
Structure¶
{
"procedure": "What the participant does — the observable sequence of events",
"manipulation": "What the experimenter varies — the independent variables",
"measurement": "What is recorded — the dependent variables and signals"
}
How to use the inclusion test¶
A specific experiment is an instance of a given task if and only if:
Its procedure matches the procedure field (the participant performs the described sequence of actions)
It manipulates at least one variable from the manipulation field
It records at least one measure from the measurement field
The procedure field is the most important. Two experiments with different manipulations or measurements but the same procedure are instances of the same task. Two experiments with different procedures are different tasks, even if they target the same cognitive construct.
4. Variation Criteria¶
What is a variation?¶
A variation is a named version of a task that changes what the participant experiences or does such that the event structure in the recorded data would differ. The key test:
If you compared the event logs from the standard version and the variation, would there be structural differences in the types of events, their temporal organization, or the participant’s required responses?
Examples of genuine variations¶
Different response modalities: Verbal naming vs. button press (Picture Naming Task)
Different stimulus configurations: Simultaneous masking vs. forward masking vs. backward masking (Auditory Masking Task)
Different cognitive demands: 1-choice vs. 2-choice vs. go/no-go versions of the same paradigm
Different task structures: Blocked vs. interleaved conditions when this changes what the participant must track (e.g., interleaved pro/anti-saccade trials require task-set switching)
Additional participant actions: Production version of AGL (participant generates strings, not just judges them)
Different stimulus-response mappings: Prosaccade vs. antisaccade (same cue, opposite required response)
Named sub-paradigms: Sally-Anne, Smarties Task, Animated Triangles as distinct False Belief implementations
What is NOT a variation? (12 active DROP categories + 1 retired)¶
The following do not count as task variations because they do not change what the participant experiences or does (EMOT was retired 2026-04-17; see §5.1):
Code |
Category |
Description |
Examples |
|---|---|---|---|
MEAS |
Measurement modality |
Adding or changing recording equipment |
“with fMRI,” “with EEG,” “with eye-tracking,” “with SCR/startle probe,” “with MEG,” “with TMS recording” |
ANAL |
Analysis method |
Post-hoc data analysis technique |
Computational modeling, DDM, ROC analysis, Gratton effect analysis, process dissociation, clustering analysis, strategy analysis, locus-of-slack logic, reverse correlation |
DESG |
Design/parameter choice |
Experimenter decisions about trial structure |
Blocked vs. event-related design, varying ISI/SOA (as generic sweep), parametric difficulty, variable set size (as parameter sweep), speed-accuracy tradeoff instructions |
STIM |
Stimulus substitution |
Swapping one stimulus set for an equivalent one |
IAPS→OASIS/NAPS, race-specific face sets with identical procedures, alternative word lists, translated stimuli |
POPL |
Population/clinical use |
Testing a different population |
“in amnesia,” “for children” (unless procedure changes), “clinical screening,” “developmental version,” “in schizophrenia,” cross-cultural comparison |
IDIF |
Individual differences |
Using a task as a correlational measure |
“as individual differences measure,” correlational studies |
TRAN |
Training/intervention |
Using repeated task exposure as an intervention |
Bias modification training, working memory training, practice/training interventions, memory specificity training |
PHAR |
Pharmacological challenge |
Administering drugs during the task |
“under pharmacological challenge,” medication on/off designs |
SCOR |
Scoring method |
How performance is quantified |
Partial-credit scoring, B-minus-A derived scores, B/A ratio, trial-level bias scores |
ALIA |
Alias/redundant |
Restates the canonical task or duplicates another variation |
“Standard X” that merely restates the canonical definition, near-duplicate entries from dual-source generation |
MOTI |
Motivation/reward add-on |
Adding incentives without changing task structure |
“Rewarded X,” reward modulation, mood induction before task |
DUAL |
Dual-task add-on |
Adding a concurrent secondary task |
Articulatory suppression, working memory load, concurrent secondary task (unless it creates a recognized named paradigm) |
EMOT |
Emotional stimulus swap — RETIRED 2026-04-17 |
~~Replacing neutral stimuli with emotional ones~~ |
This category was retired by user decision on 2026-04-17. All attested Emotional-X paradigms are now kept as variations under their parent tasks. No emotional variations are dropped on EMOT grounds. See §5.1 for the resolution. |
Applying the criteria consistently¶
The DROP categories are applied uniformly across all 103 tasks. If “with fMRI” is dropped for one task, it must be dropped for all tasks. If “in children” is dropped when only stimuli change for age-appropriateness, it must be dropped everywhere that applies.
The one exception is when a modification creates a recognized named paradigm with its own literature. For example, the Emotional Stroop Task is kept as a separate top-level task entry because it has a distinct, established identity in the literature. Note that the EMOT category was retired entirely on 2026-04-17 (see §5.1) — all attested Emotional-X paradigms are now kept as variations, whether or not they have independent literatures.
5. Borderline Cases and Policy Questions¶
These areas required explicit policy decisions. §5.1 was resolved on 2026-04-17; the remaining five are open. The variation audit (applied 2026-04-18) applied the default rules listed below:
5.1 Emotional variants — RESOLVED 2026-04-17¶
Decision: The EMOT DROP category was retired on 2026-04-17. All attested Emotional-X paradigms are kept as variations under their parent tasks. No emotional variations are promoted to top-level tasks (decided 2026-04-18).
What this means in practice: The following emotional variations are kept as variations (not top-level tasks): Emotional Anti-Saccade, ANT with Emotional Stimuli, CPT with Emotional Distractors, Emotional Directed Forgetting, Emotional Flanker, Emotional Navon, Emotional Oddball, Emotional Simon Task, Emotional Go/No-Go, Emotional N-Back, Emotional TNT, Emotion-Induced Blindness, Emotional Attentional Blink. The Emotional Stroop Task, Emotion Regulation Task, and Facial Emotion Recognition Task remain top-level tasks (they were always independent entries, not promoted from variations).
Rationale: Emotional content does change what the participant experiences, and these are attested paradigms in the literature. Keeping them as variations rather than promoting them avoids proliferating task entries for every stimulus-domain cross.
5.2 Proportion-congruent manipulations¶
Current rule: KEPT for Flanker, Stroop, and Simon tasks because proportion-congruent manipulations change the cognitive control context substantially (participants adjust their strategies based on conflict frequency).
Open question: Could be argued as a design choice (DESG) since it’s an experimenter-controlled parameter.
5.3 Child/developmental versions¶
Current rule: KEPT when the procedure is genuinely simplified or uses different stimuli that change the task. DROPPED when only stimuli are swapped for age-appropriateness without changing the procedure.
Currently kept child/developmental variations (5): Child ANT (fish stimuli, simplified flanker), Developmental / Child IAT (picture-based categories), Iowa Gambling Task Child Versions (simplified decks), RMET Child Version (reduced item set, simplified vocabulary), Child-Adapted SART (modified timing and stimuli).
Open question: Should there be a uniform rule for all child versions?
5.4 Dual-task add-ons — RESOLVED 2026-04-18¶
Rule: Uniformly DROPPED. Adding a concurrent secondary task (working memory load, articulatory suppression, etc.) is a design choice (DUAL), not a task variation, regardless of how widely used the combination is. No exceptions for “well-established” dual-task pairings — popularity does not change the structural argument.
Applied: All DUAL-coded entries were removed in the original audit. The one borderline survivor from that pass — Concurrent Cognitive Load (Affective Picture Viewing Task) — was dropped 2026-04-18 for consistency.
5.5 Confidence ratings / subjective reports — RESOLVED 2026-04-18¶
Rule: Adding a confidence rating or subjective report to an otherwise unchanged task is a measurement add-on (MEAS), not a variation. The fact that it adds a participant response to the event stream is not sufficient — by that logic any bolted-on scale would create a variation. Confidence-rating add-ons are uniformly DROPPED.
Exception: A confidence-related variation is kept only when the confidence judgment defines a recognized named paradigm whose construct of interest is the metacognitive correspondence, not the base task’s performance. The sole current example is the Confidence-Accuracy Paradigm (Heartbeat Detection Task), where Garfinkel et al. (2015) defined a three-dimensional model of interoception in which confidence-accuracy correspondence is one of the core measured dimensions.
Dropped in the original audit (6): AGL with Confidence Ratings, ROC Analysis (Old/New Recognition), Confidence Judgment Version (Random Dot Kinematogram), ROC Analysis (Remember/Know), Backward Masking with Confidence Ratings (Visual Masking), Source Confidence Ratings (Source Memory). Dropped 2026-04-18 (2): Confidence-Rated Recognition (Old/New Recognition), Remember/Know with Confidence Ratings (Remember/Know). These two had initially survived the audit but were dropped for consistency — adding a graded scale to old/new or R/K judgments is the same kind of add-on as the six already dropped.
5.6 Computerized vs. manual versions — RESOLVED 2026-04-18¶
Rule (refined 2026-04-18): A computerized version is a genuine variation only if it changes the sensory modality or motor activity the participant uses. Manipulating physical objects vs. clicking on a screen qualifies. Handwriting vs. key-pressing qualifies. Stylus-on-tablet vs. pencil-on-paper does not (same motor modality). Automated scoring or administration software alone is never a variation (that is MEAS or DESG).
Kept computerized/digital variations (6): Computerized (eCorsi) — physical block tapping → screen tapping changes spatial/motor demands; Computerized DSST/SDMT — handwriting → button pressing is a different fine motor skill; Computerized Mirror Tracing — loss of haptic edge feedback from physical template; Computerized Adaptive Version (Raven’s) — adaptive algorithm changes which items are presented, a procedural change beyond interface; Computerized vs. Physical Versions (Tower of London) — grasping/placing 3D beads → drag-and-drop; PD against Computer Opponents (Prisoner’s Dilemma) — not an interface change but a structural one: deterministic program opponent vs. human.
Dropped 2026-04-18 (4): Computerized RAVLT (digital administration with automatic scoring — MEAS/DESG), Digital TMT/dTMT (stylus on tablet vs. pencil on paper — same motor modality; kinematic recording is MEAS), Computerized UFOV (UFOV is inherently screen-based; this is commercial software with automated staircase — DESG), Computerized WCST/WCST-CV (digital version with automated scoring — DESG).