How can you use HED?¶
HED (Hierarchical Event Descriptors) serves different needs throughout the research lifecycle. Whether you’re collecting data, annotating datasets, or analyzing results, HED provides tools and frameworks to make your work more efficient and your data more valuable.
New to HED?
If this is your first time learning about HED, start with the Introduction to HED for basic concepts and quick start paths. This guide provides detailed workflows for each research role.
Choose your path
Experimenters: Event logging and collection
Data annotators: Annotation workflows
Data analysts: Search and analysis
Tool developers: Integration guidance
Schema builders: Library schema development
Research roles and HED workflows¶
HED serves researchers in different capacities throughout the data lifecycle. Choose your primary role to find relevant tools and guidance:
Experimenters: Planning experiments, collecting data with proper event logging, and preparing data for analysis.
Key needs: Event logging best practices, data collection standards, log-to-event conversion
Data annotators: Adding HED annotations to existing datasets, curating data for sharing, BIDS/NWB integration.
Key needs: Annotation tools, validation workflows, standardized formats
Data analysts: Searching and analyzing HED-annotated data, extracting design matrices, cross-study comparisons.
Key needs: Search tools, programming APIs, analysis workflows
Tool developers: Building software that integrates with HED, creating analysis pipelines, extending HED functionality.
Key needs: APIs, schema specifications, integration guidelines
Schema builders: Developing domain-specific vocabularies (HED library schemas) to capture specialized concepts in metadata.
Key needs: Schema development tools, community coordination, validation frameworks
🧪 As an experimenter¶
Planning experiments, collecting data, and preparing for analysis
You’re designing and running experiments to test hypotheses and study behavior. HED helps you capture what actually happened during data collection in a way that maximizes downstream usability and enables powerful analyses.
Challenges for experimenters¶
Without HED:
Meaningless event codes (1, 2, 3)
Log files require constant documentation
Analysis code breaks when events change
Difficult to compare across experiments
Data sharing requires extensive explanation
With HED:
Self-documenting event annotations
Standardized vocabulary across studies
HED-enabled analysis works automatically
Easy cross-experiment comparisons
Data ready for sharing in BIDS/NWB
🎯 Data collection¶
Planning and running experiments
Key questions to address
What events should be logged during data collection?
How will experimental design and conditions be recorded?
How will logs be synchronized with neuroimaging data?
What participant responses need to be captured?
Critical principle: Data that isn’t recorded is lost forever!
Event logging best practices
Mark ALL events visible to participants (stimuli, cues, instructions, feedback)
Record participant responses with precise timing
Include experimental control markers (trial/block boundaries, condition changes)
Capture incidental events that might affect neural responses
Plan for pilot testing to identify missing events
Example event types to capture:
Sensory presentations: Visual stimuli, auditory cues, tactile feedback
Participant actions: Button presses, eye movements, verbal responses
Experimental control: Trial starts, condition changes, break periods
Environmental events: Equipment issues, interruptions, calibration
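The sketch below shows what such events might look like once converted to a tabular log. The file name, column names, and values are hypothetical, and the pandas-based conversion is just one way to produce a BIDS-style tab-separated file.

```python
import pandas as pd

# Hypothetical rows covering the event types above: a sensory presentation,
# a participant action, feedback, and an experimental control marker.
events = pd.DataFrame({
    "onset": [1.25, 1.90, 2.15, 30.00],        # seconds from recording start
    "duration": [0.50, 0.00, 0.00, 60.00],
    "event_type": ["show_face", "key_press", "feedback", "break_period"],
    "response": ["n/a", "left", "n/a", "n/a"],  # BIDS uses "n/a" for missing
})

# BIDS event files are tab-separated values (.tsv).
events.to_csv("sub-01_task-facecheck_events.tsv", sep="\t", index=False)
```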
🔄 Post-processing¶
Post-processing and data transformation
After data collection, raw logs need processing before analysis. The table-remodeler tools help transform experimental logs into analysis-ready event files.
Common transformations
Log summarization: Get overview of collected events
Code expansion: Convert numeric codes to meaningful categories
Column restructuring: Create BIDS-compliant event files
Data validation: Check for missing or invalid events
Ideally, your experiment control software should be configured to produce event files in a tabular format suitable for BIDS or NWB. The table-remodeler lets you perform these transformations efficiently. The tools can be run over an entire BIDS dataset using the table-remodeler command-line interface (CLI) or on a single file using the HED online tools with the Execute remodel script action for events. The table-remodeler lets you assemble a list of operations to execute by specifying them in JSON, as illustrated by the following example:
Workflow to summarize column values
[{
    "operation": "summarize_column_values",
    "description": "Get overview of log contents",
    "parameters": {
        "summary_name": "Log_summary",
        "skip_columns": ["onset"],
        "value_columns": ["description"]
    }
}]
📋 Data sharing¶
Standardizing data format for sharing
An important aspect of data collection is organizing your data in a standardized format so that analysis tools can read and manipulate the data without special-purpose reformatting code. BIDS and NWB are the most widely-used standards for organizing brain and behavioral data in neuroscience.
BIDS (Brain Imaging Data Structure)
BIDS is a widely used data organization standard for neuroimaging and behavioral data. BIDS focuses on file organization with appropriate experimental metadata.
Learn BIDS: The BIDS Starter Kit provides comprehensive introductions
File organization: Folders and Files explains BIDS directory structure
Metadata: The Annotating a BIDS dataset tutorial covers required metadata
Specification: See BIDS specification for detailed rules
Conversion tools: BIDS Tools lists available converters
HED in BIDS
There are two strategies for incorporating HED annotations in a BIDS dataset:
Method 1: Use a JSON (sidecar) file to hold the annotations (recommended)
Method 2: Annotate each line in each event file using the HED column
Method 1 is typical for most annotations. The HED online tools generate annotation templates. The BIDS annotation quickstart walks through this process.
Method 2 is usually used for instrument-generated annotations or manual marking (bad sections, special features).
When using HED in BIDS, specify HED schema versions in dataset_description.json in the dataset root directory. See HED schema versions for examples.
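As a concrete illustration of Method 1, the sketch below builds a small JSON sidecar in Python and notes the HEDVersion field in dataset_description.json. The column names, category values, and HED strings are illustrative rather than taken from a real dataset.

```python
import json

# Hypothetical sidecar: each categorical value of the "event_type" column
# gets its own HED annotation (Method 1).  Value columns instead use a
# single HED string containing a "#" placeholder.
sidecar = {
    "event_type": {
        "HED": {
            "show_face": "Sensory-event, Visual-presentation, (Image, Face)",
            "key_press": "Agent-action, Participant-response, (Press, Push-button)"
        }
    }
}

with open("task-facecheck_events.json", "w") as fp:
    json.dump(sidecar, fp, indent=4)

# The schema version goes in dataset_description.json at the dataset root:
#   { "Name": "...", "BIDSVersion": "...", "HEDVersion": "8.4.0" }
```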
HED in NWB (Neurodata Without Borders)
NWB is a data standard for neurophysiology, providing specifications for storing cellular, intracellular, and extracellular physiology data, along with experimental metadata and behavior. A single NWB file holds all session data synchronized to a common timeline.
Learn NWB: PyNWB documentation provides tutorials and API references
HED extension: ndx-hed extension enables HED integration in NWB files
Guide: HED annotation in NWB provides detailed information
Examples: ndx-hed examples demonstrate real-world usage
HED annotations in NWB use the ndx-hed extension with three main classes:
HedLabMetaData: Required for all HED-annotated NWB files; specifies HED schema version
HedTags: Used for row-specific HED annotations in any NWB DynamicTable
HedValueVector: Used for column-wide HED templates with value placeholders
Installation: pip install -U ndx-hed
BIDS allows the NWB file format to be used for recording sessions within a BIDS-organized experiment, and HED is well integrated into both standards.
Resources for experimenters¶
📚 Guide: Actionable event annotation and analysis in fMRI - Practical guidance with sample data
🛠️ Tools: Table remodeler - Transform logs to event files
🌐 Online: Event processing in the HED online tools - Process files without installation
📝 As a data annotator¶
Curating datasets, adding HED annotations, and validating data quality
You’re adding meaningful annotations to event data, ensuring consistency and completeness, and validating that datasets meet quality standards. HED provides tools and workflows to make your data FAIR (Findable, Accessible, Interoperable, Reusable).
Annotator challenges¶
Without HED:
Events meaningless without docs
Each dataset needs custom interpretation
Hard to validate metadata completeness
Manual coding for every analysis
Hard to find similar datasets
With HED:
Self-documenting event annotations
Standardized vocabulary across datasets
Automated validation and quality checks
Analysis tools work out of the box
Easy cross-dataset search and comparison
📚 Basic background¶
Getting started resources
Overview: Introduction to HED - Basic concepts and framework
Quick start: BIDS annotation quickstart - Generate annotation templates from your data
Annotation guide: HED annotation quickstart - Step-by-step annotation process
Best practices: HED annotation semantics - Guidelines for meaningful annotations
Vocabulary: HED schema browser - Explore standard and library schema tags
Research paper: “Capturing the nature of events…” - Practical examples and best practices
Specification: HED specification - Complete annotation rules, especially Chapter 4: Basic annotation and Chapter 5: Advanced annotation
✏️ Adding HED annotations¶
Annotation strategies
This section discusses strategies for adding HED annotations to event data. The primary method for BIDS datasets is using JSON sidecar files. For NWB, annotations are added using the ndx-hed extension classes.
Basic annotation workflow
HED annotations vary widely in level of detail and complexity. Start simple and incrementally improve annotations:
Generate template: Use HED online tools to create a JSON sidecar template from your event file
Add descriptions: Replace the template dummy descriptions with actual descriptions.
Add core tags: Annotate sensory presentations, agent actions, and experimental structure
Validate frequently: Use the online tools to validate after each addition to catch errors early
Expand gradually: Add temporal scope, conditions, and definitions as needed
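The strings below illustrate the start-simple-then-expand idea. The event and its tags are hypothetical; the exact tags you choose depend on your experiment.

```python
# Illustrative annotations of the same hypothetical event at increasing detail.
first_pass = "Sensory-event, Visual-presentation"
with_object = "Sensory-event, Visual-presentation, (Image, Face)"
with_description = ("Sensory-event, Visual-presentation, (Image, Face), "
                    "Description/A face image appears at the center of the screen.")
```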
Key annotation principles
Start with descriptions: A good description clarifies what information to capture in tags
Be specific: Use the most specific tags that accurately describe events
Include context: Capture participant actions, stimuli properties, and experimental conditions
Beware of semantics: Follow guidelines in HED annotation semantics
A proper HED annotation can be translated back into a meaningful sentence.
Viewing available tags
The HED vocabulary is hierarchically organized. Use the HED schema browser to explore standard and library schema tags interactively.
Cross-format workflows
For NWB users, you can convert an NWB Events table to a BIDS _events.tsv file, generate templates, annotate, and transform back. See the BIDS to NWB conversion example.
Advanced annotation concepts
For complete experimental descriptions, HED supports several advanced features:
Definitions: Define complex, reusable concepts that can be referenced throughout annotations
Temporal scope: Annotate processes extending over time using Duration, Delay, Onset, Offset, and Inset tags
Enables detection of everything happening at any time point
Supports complex, interleaved event processes
See Temporal scope
Experimental design: Express conditions, tasks, and temporal organization
Enables automatic design matrix extraction
The Advanced annotation chapter explains complete rules and usage patterns.
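For example, a definition can bundle a reusable concept, and Onset/Offset groups referencing that definition mark where the process starts and stops. The definition name below is hypothetical.

```python
# A reusable definition (declared once, e.g., in the sidecar or a definitions file).
definition = "(Definition/Show-face, (Sensory-event, Visual-presentation, (Image, Face)))"

# Temporal scope: the process starts at one event marker and ends at another.
start_marker = "(Def/Show-face, Onset)"
end_marker = "(Def/Show-face, Offset)"
```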
✓ Validating HED annotations¶
Validation workflow and resources
Validation workflow
Checking for errors is an ongoing, iterative process. Build complex annotations on a foundation of valid annotations by validating frequently as you add tags.
Validation resources
Guide: HED validation guide - Different validator types and usage
Error reference: HED errors - Error types and causes
Online validation: HED online tools - Validate strings, JSON sidecars, and tabular files easily
Python validation: Use the hedtools package for programmatic validation
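A minimal sketch of programmatic validation with hedtools is shown below. The file names are placeholders, and the exact class and method names may differ between hedtools versions, so check the HED Python tools documentation for your installed release.

```python
from hed.schema import load_schema_version
from hed.models import Sidecar, TabularInput

# Load the schema version declared for the dataset.
schema = load_schema_version("8.4.0")

# Validate the JSON sidecar first -- it holds most of the annotations.
sidecar = Sidecar("task-facecheck_events.json")
issues = sidecar.validate(schema)

# Then validate an event file assembled with that sidecar.
events = TabularInput("sub-01_task-facecheck_events.tsv", sidecar=sidecar)
issues += events.validate(schema)

for issue in issues:
    print(issue)
```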
Validation strategy
Validate JSON sidecars first: Contains most annotations; easier to fix than full dataset
Use standalone validation: Validate HED separately before full BIDS validation
Fix errors incrementally: Address errors as you build annotations
Validate after changes: Revalidate whenever you modify annotations
Common validation issues
Misspelled tags
Improper use of values or units
Duplicate tags or tag groups (particularly from multiple rows)
Invalid tag combinations or groupings (tagGroup and topLevelTagGroup)
Misunderstanding of how temporal tags such as Onset are used
🔍 Checking for consistency¶
Consistency checking tools and strategies
Consistency checking tools
Beyond syntactic validation, check that annotations are logically consistent and complete.
Summary-based checking
Column value summary: Check all unique values in event columns (see column value summary)
HED tag summary: Review which event types are annotated (see HED tag summary)
Design summary: Verify experimental conditions and structure (see experimental design summary)
These summaries help detect:
Missing or unexpected event codes
Incomplete annotations
Data entry errors
Inconsistent trial structures
Experiment-specific checks
Some consistency issues require domain knowledge:
Example: Trial sequence validation
Suppose each trial should follow: stimulus → key-press → feedback
Participants may:
Forget to press the key
Press the wrong key
Press multiple times
Press before feedback
Annotators should mark these anomalies in the event file so downstream analysis handles them correctly.
Checking strategies
Use summary tools to identify anomalies
Write custom code for experiment-specific checks, as in the sketch below (work is underway to add standard checks to the table-remodeler)
Mark bad trials or unusual events explicitly
Consider reorganizing event files using remodeling tools (see Remap columns)
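The sketch below illustrates one such experiment-specific check in Python with pandas. The column names, value codes, and expected trial structure are hypothetical.

```python
import pandas as pd

# Check that every trial contains the expected stimulus -> key-press -> feedback
# sequence and report trials that deviate (missed, extra, or out-of-order events).
events = pd.read_csv("sub-01_task-facecheck_events.tsv", sep="\t")
expected = ["stimulus", "key_press", "feedback"]

for trial, group in events.groupby("trial_number"):
    observed = list(group.sort_values("onset")["event_type"])
    if observed != expected:
        print(f"Trial {trial}: expected {expected}, found {observed}")
```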
Resources for data annotators:¶
📚 Guides: HED annotation quickstart, BIDS annotation quickstart
🛠️ Tools: HED online tools - Validation, templates, and conversion
🌐 Browser: HED schema browser - Explore available vocabularies
📖 Paper: “Capturing the nature of events…” - Annotation best practices
📊 As a data analyst¶
Using HED-annotated data for scientific discovery and cross-study analysis
Whether you are analyzing your own data or using shared data produced by others to answer a scientific question, fully understanding the data and its limitations is essential for accurate and reproducible analysis. HED annotations and tools enable powerful analysis workflows that work consistently across different experiments.
Challenges for data analysts¶
Without HED:
Each dataset requires custom code
Event meanings buried in documentation
Cannot compare across studies
Manual data inspection for every analysis
Difficult to validate assumptions
With HED:
Standardized search across datasets
Self-documenting event structure
Automatic design matrix extraction
Consistent analysis workflows
Built-in data quality summaries
📊 Understanding the data¶
Data quality and content assessment
Before running any analysis, you need to understand what events are actually present in your data. Most shared datasets have minimal documentation, so HED summary tools help you quickly assess data quality and content without manual inspection.
Column value summaries
The column value summary compiles a summary of the values in the various columns of the event files in a dataset.
Does not require HED annotations
Shows all unique values and their frequencies
Helps identify missing or unexpected event codes
Useful for detecting data entry errors
For debugging, you can generate this summary with the HED online tools by uploading a single event file (e.g., a BIDS _events.tsv) and its associated JSON sidecar.
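If you just want a quick look at a single event file without any HED tooling, a few lines of pandas give a rough equivalent of the column value summary. The file and column names below are placeholders.

```python
import pandas as pd

# List the unique values and their counts for each non-timing column.
events = pd.read_csv("sub-01_task-facecheck_events.tsv", sep="\t")

for column in events.columns:
    if column in ("onset", "duration"):   # skip numeric timing columns
        continue
    print(f"\nColumn '{column}':")
    print(events[column].value_counts(dropna=False))
```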
HED tag summaries
The HED tag summary creates a summary of the HED tags used to annotate the data.
Shows what types of events are present
Works across different event coding schemes
Enables comparison between datasets
Requires HED annotations in the dataset
Experimental design summaries
The experimental design summary gives a summary of the condition variables or other structural tags relating to experimental design, task, or temporal layout of the experiment.
Extracts experimental conditions automatically
Shows task structure and organization
Identifies temporal markers (Onset/Offset/Duration)
Requires HED annotations with condition variables
Additional resources:
The table-remodeler documentation gives an overview of the remodeling tools and how to use them.
The HED conditions and design matrices guide explains how experimental design information is encoded in HED and how to interpret the summaries of this information.
🛠️ Preparing the data¶
Data transformation and restructuring
After understanding your data, you may need to transform or reorganize event files to support your analysis goals.
Data transformation
HED remodeling tools allow you to transform event files without writing custom code:
Remap columns: Reorganize column structure for analysis tools
Factor extraction: Create binary factor vectors for event selection
Column merging: Combine information from multiple columns
Value renaming: Standardize event codes across sessions
Event file restructuring
Common restructuring tasks include:
Converting wide-format to long-format event files
Extracting condition variables into separate columns
Adding computed columns (e.g., reaction times, trial types)
Filtering events based on criteria
See the table-remodeler documentation for detailed examples and operation descriptions.
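As an illustration of the kind of restructuring involved, the pandas sketch below adds a computed reaction-time column to hypothetical stimulus events. The remodeling tools provide comparable operations without custom code; the column names and trial structure here are assumptions.

```python
import pandas as pd

events = pd.read_csv("sub-01_task-facecheck_events.tsv", sep="\t")

# Assume every trial has exactly one stimulus followed by one key press.
stimuli = events[events["event_type"] == "stimulus"].reset_index(drop=True)
presses = events[events["event_type"] == "key_press"].reset_index(drop=True)

# Reaction time = key-press onset minus stimulus onset for each trial.
analysis = stimuli.assign(reaction_time=presses["onset"].values - stimuli["onset"].values)
print(analysis[["onset", "event_type", "reaction_time"]].head())
```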
📈 Analyzing the data¶
Event selection and analysis workflows
HED enables powerful, flexible analysis through standardized event selection and design matrix extraction. The key advantage is that HED queries work consistently across different experiments using different event codes.
Factor vectors and selection
The most common analysis application is to select events satisfying particular criteria and to compare some measure computed on the signals containing those events with a control.
HED facilitates this selection through factor vectors. A factor vector for an event file has the same number of rows as the event file (each row corresponding to an event marker). Factor vectors contain 1’s for rows in which a specified criterion is satisfied and 0’s otherwise.
Types of factor operations:
factor column operation: Creates factor vectors based on the unique values in specified columns. This operation does not require HED annotations.
factor HED tags: Creates factor vectors based on a HED tag query. Enables flexible, generalizable event selection.
factor HED type: Creates factors based on HED tags representing structural information such as Condition-variable, Task, or Temporal-marker.
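The idea behind the factor column operation can be sketched directly in pandas, as below. The HED-based factor operations produce the same kind of 0/1 vectors but select rows by tag query or by structural tags instead of by column value. Column names and codes here are hypothetical.

```python
import pandas as pd

events = pd.read_csv("sub-01_task-facecheck_events.tsv", sep="\t")

# One entry per event marker: 1 where the criterion holds, 0 otherwise.
events["face_factor"] = (events["event_type"] == "show_face").astype(int)
print(events[["onset", "event_type", "face_factor"]].head())
```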
HED search queries
HED search queries allow you to find events based on their semantic properties rather than specific event codes. This enables:
Cross-study analysis: Same query works on different datasets
Flexible criteria: Complex boolean logic with AND, OR, NOT
Hierarchical matching: Search at any level of tag hierarchy
Temporal context: Find events within ongoing processes
The HED search guide explains the HED query structure and available search options in detail.
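The strings below give a feel for what such queries look like. They are illustrative only; the HED search guide defines the exact syntax and the full set of operators.

```python
# Example HED query strings (syntax per the HED search guide).
queries = [
    "Sensory-event AND Visual-presentation",  # both tags must match
    "Face OR Building",                       # either tag, or any descendant
    "Agent-action AND NOT Correct-action",    # actions that were not correct
]
```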
HED analysis in EEGLAB
EEGLAB, the interactive MATLAB toolbox for EEG/MEG analysis, supports HED through the EEGLAB HEDTools plugin.
Key capabilities:
Event search and epoching based on HED queries
Automated design matrix extraction
Integration with EEGLAB’s analysis pipeline
Support for both GUI and scripting workflows
Getting started with HED in EEGLAB:
The End-to-end processing of EEG with HED and EEGLAB book chapter (https://doi.org/10.1007/978-1-0716-4260-3_6) works through the entire analysis process, including porting the analysis to high-performance computing platforms. The accompanying site includes sample data for running the examples.
HED support in other tools
Work is underway to integrate HED support in other analysis packages:
FieldTrip: Search and epoching integration in development
MNE-Python: Planned support for HED search and summary
NEMAR/EEGNET: Platform integration for large-scale analysis
If you are interested in helping with HED integration in other tools, please email hed.maintainers@gmail.com.
Resources for data analysts¶
📚 Guides: HED search guide, HED summary guide, HED conditions and design matrices
🛠️ Tools: Table remodeler - Transform and analyze event files
🧪 EEGLAB: HED and EEGLAB - HED integration in EEGLAB
📖 Book chapter: End-to-end EEG processing - Complete analysis workflow
🛠️ As a tool developer¶
Building software that integrates with HED and expanding the HED ecosystem
The HED ecosystem relies on tools that read, understand, and incorporate HED as part of analysis. Whether you’re adding HED support to existing software or creating new analysis tools, HED provides well-documented APIs and validation libraries to make integration straightforward.
Challenges for tool developers¶
Without HED:
Custom event parsing for each dataset
Hard-coded event interpretations
Difficult to support multiple studies
Manual validation of event structure
Limited cross-tool compatibility
With HED:
Standardized event parsing
Semantic event interpretation
Works across any HED-annotated dataset
Automated validation with clear errors
Interoperable with HED ecosystem
🚀 Getting started with HEDTools¶
Understanding HED structure and choosing your platform
Before diving into code, understanding HED’s structure and choosing the right platform will save time and ensure your integration follows HED best practices.
Understanding HED schemas
HED schemas define the vocabulary available for annotations:
HED standard schema: Contains basic terms common across neuroimaging experiments
HED library schemas: Domain-specific vocabularies (e.g., SCORE for clinical EEG)
Schema versions: Schemas evolve with numbered versions (e.g., 8.3.0)
Key concepts for developers
Schema loading: Most HED tools require a schema version (e.g., 8.4.0) in order to operate. The most recent versions are cached internally so that a given version of the schema is only loaded once.
Validation: Use HED validator libraries rather than implementing your own
Short vs long form: Tags can be written in short form (a single tag, e.g., Face) or long form (the full path from the root of the schema hierarchy, e.g., Item/Biological-item/Anatomical-item/Body-part/Head/Face). Tools should be able to handle all forms, including intermediate partial paths, and should rely on the HED tools to transform between them when needed. The long form is needed mostly for search: searching for Body-part should also return more specific annotations such as Face.
See HED schemas guide for detailed information about the HED schemas.
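A minimal sketch of these concepts with the Python hedtools package is shown below. The schema version is an example, and the HedString usage reflects recent hedtools releases, so consult the API documentation for your installed version.

```python
from hed.schema import load_schema_version
from hed.models import HedString

# Load a specific schema version; hedtools caches it, so a second request
# for "8.4.0" reuses the already-loaded copy instead of fetching it again.
schema = load_schema_version("8.4.0")

# Tags may be given in short form; tools expand them against the loaded
# schema when the long form is needed (e.g., for search).
hed_string = HedString("Face, Sensory-event", schema)
print(hed_string)
```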
Choosing your platform
HED provides official tools in three languages:
Python: Most comprehensive, includes validation, analysis, and schema development tools
JavaScript: Focuses on validation and is used by the BIDS validator. A fully browser-based validator can validate an entire dataset directory.
MATLAB: Analysis-focused, integrates with EEGLAB - uses the Python tools underneath.
Decision guide:
Web applications: Use JavaScript validator or call REST services
Analysis pipelines: Use Python for maximum flexibility
MATLAB workflows: Use HEDTools plugin or MATLAB wrappers
Custom needs: Consider calling REST services from any language
Note: HED REST services are available at HED online tools.
💻 Working with code bases¶
Open-source HED implementations and tools
The GitHub HED standard organization maintains open-source code projects in Python, MATLAB, and JavaScript. All repositories welcome contributions, issue reports, and pull requests.
Core resources
Project page: https://www.hedtags.org
Documentation: https://www.hedtags.org/hed-resources
Annotated datasets: hed-examples repository
Schemas: hed-schemas repository
Python HEDTools
Focus: Core HED technology including validation, analysis, and schema development
Repository: hed-python
Package installation: pip install hedtools
Documentation: HED Python tools
JavaScript HEDTools
Focus: HED validation for web applications and BIDS validator
Repository: hed-javascript
Package installation: npm install hed-validator
Documentation: HED JavaScript tools – includes a browser-based HED validator for BIDS
MATLAB HEDTools
Focus: Analysis, annotation, and validation with EEGLAB integration
Repository: hed-matlab
Installation: Via EEGLAB plugin manager
Documentation: HED MATLAB tools
CTagger: a standalone, Java-based annotation tool, also available as an EEGLAB plugin.
Online tools and REST services
Focus: Web-based validation, debugging, and programmatic access
Production: hedtools.org/hed
Development: hedtools.org/hed_dev
Repository: hed-web
Documentation: HED online tools
🤝 Contributing to HED development¶
The HED project welcomes contributions from the community. Whether you’re reporting bugs, suggesting features, or contributing code, your input helps improve HED for everyone.
Where to report:
General questions: hed-schemas/issues
Python HEDTools: hed-python/issues
JavaScript HEDTools: hed-javascript/issues
MATLAB HEDTools: hed-matlab/issues
Schema issues: hed-schemas/issues
Contributing code¶
All HED repositories welcome pull requests. See the CONTRIBUTING.md file in each repository for specific guidelines. If you have ideas or want to contribute to these efforts, please post an issue or email hed.maintainers@gmail.com.
Long-term vision¶
The HED community is actively working on expanding HED’s capabilities and integration:
Develop more sophisticated analysis methods
Leverage AI to reduce the cognitive burden on users
Better integrate experimental control software with annotation workflows
Capture event relationships for complex automated analysis
Expand library schemas for specialized domains
📚 Resources for tool developers¶
🔧 Code repositories: hed-python, hed-javascript, hed-matlab
📖 Documentation: Python tools, JavaScript tools, MATLAB tools
🌐 Online tools: hedtools.org - REST services and web validation
📋 Schemas: hed-schemas - Standard and library schemas
💬 Support: GitHub Discussions or email hed.maintainers@gmail.com
🏗️ As a schema builder¶
Extending HED vocabulary to support specialized domains and applications
HED annotations use terms from a hierarchically structured vocabulary called a HED schema. The HED standard schema contains basic terms common across most experiments, while HED library schemas extend the vocabulary for specialized domains like clinical EEG, movie annotation, or language studies.
Challenges for schema builders¶
Without HED:
Inconsistent terminology across studies
Duplicate or conflicting terms
No validation of vocabulary structure
Difficult to share domain vocabularies
Hard to maintain backward compatibility
With HED:
Structured, hierarchical vocabulary
Automated validation of schema structure
Version control and compatibility tracking
Community review process
Standardized schema attributes and rules
🔍 Exploring existing schemas¶
Understanding existing HED vocabularies
Before proposing changes or creating a new schema, familiarize yourself with existing HED vocabularies to avoid duplication and understand HED’s organizational principles.
Viewing available schemas
The HED schema browser provides an interactive view of the HED standard schema, showing:
Hierarchical tag structure
Tag descriptions and attributes
Value classes and unit classes
Schema version history
All of the versions and prereleases of the schemas are available through the viewer.
| Schema | Type | Description | Status |
|---|---|---|---|
| 8.4.0 | standard | Basic infrastructure and vocabulary needed to annotate experiments | released |
| score_2.1.0 | library | Clinical EEG annotation vocabulary | released |
| lang_1.1.0 | library | Language stimulation annotations | released |
| slam_1.0.0 | library | Sensor positions vocabulary for EMG | prerelease |
| mouse_1.0.0 | library | Annotations for mouse experiments | prerelease |
| media | library | Tags for annotating images and video | proposed |
Understanding schema structure
HED schemas have structured elements including:
Tags: Hierarchically organized terms (e.g., Sensory-event)
Attributes: Properties of tags (e.g., relatedTag, reserved, topLevelTagGroup)
Value classes: Allowed value types (e.g., textClass, numericClass)
Unit classes: Measurement units and conversions (e.g., timeUnits, angleUnits)
Tags are the main elements of interest. Tags have attributes that specify their behavior. For example, the topLevelTagGroup attribute indicates a special tag that can only appear inside parentheses at the top level of an annotation. The Definition tag in the standard schema has both the reserved and topLevelTagGroup attributes.
The value classes and unit classes are mainly relevant for tags that take values. These tags have a # placeholder in the schema hierarchy, indicating that they can be specified with a value. For example, the Temperature tag’s # child has the numeric value class and the temperatureUnits unit class, allowing annotations such as Temperature/32 oC.
Key principles:
Is-a relationship: Every tag has an “is-a” relationship with its parents
Uniqueness: Each term must be unique and not redundant with existing terms
Self-contained: Tag descriptions should be understandable without external context
See the HED specification for the rules that govern HED tag syntax and usage. See HED schema developer guide for detailed information on schema structure and design principles.
✏️ Contributing to existing schemas¶
Proposing improvements to HED schemas
The HED community welcomes suggestions for improving existing schemas. Whether you’ve found an error, need additional terms, or want to clarify descriptions, your input helps improve HED for everyone.
Improving an existing schema
If you see a need for additional terms in an existing schema, post an issue to hed-schemas/issues on GitHub.
Proposing changes to HED schemas
For new tags, include:
The name of the schema (standard or library name)
Proposed name of the new term
Brief and informative description of its meaning
Suggested location in the schema hierarchy
Explanation of why this term is needed and how it will be used
Examples of usage in annotations
For modifications to existing terms:
Current term name and location
Proposed changes (description, attributes, hierarchy)
Rationale for the change
Impact on existing annotations (if applicable)
For schema attributes or structure:
Description of the proposed new attribute, value class, or unit class
Use cases and examples
How it relates to existing schema features
Schema review process
Schema changes go through a community review process:
Issue posted: You describe the proposed change on GitHub
Community discussion: Others comment on the proposal
Refinement: The proposal is refined based on feedback
Implementation: Approved changes are implemented in the schema
Version release: Changes are included in the next schema version
Important considerations:
Is-a relationship: New terms must satisfy the “is-a” relationship with parent tags
Uniqueness: Terms must not duplicate existing vocabulary
Clarity: Descriptions should be self-contained and unambiguous
Backward compatibility: Changes should minimize breaking existing annotations
All suggested changes should be reported using the hed-schemas/issues mechanism on GitHub.
🏛️ Creating new schemas¶
Developing new HED library schemas
If your domain requires extensive specialized vocabulary not appropriate for the standard schema, you may want to create a new HED library schema.
Creating a new library schema
Creating a library schema is a collaborative process that requires community engagement.
Getting started:
Post an issue on hed-schemas/issues to start the discussion.
Proposing a new HED library schema
Initial proposal should include:
Proposed name for the HED library schema
Brief description of the library’s purpose and scope
Example use cases and target audience
List of potential collaborators (GitHub handles)
Initial thoughts on vocabulary organization
Relationship to existing HED schemas (what’s missing from standard schema)
Requirements:
GitHub account: All schema development uses GitHub Pull Request mechanism
Community engagement: Willingness to participate in review discussions
Documentation: Commitment to documenting terms clearly
Long-term maintenance: Plan for maintaining the schema over time
Schema development workflow
The schema development process follows these general steps:
Proposal and discussion: Post issue and gather community feedback
Planning: Define scope, structure, and initial term list
Initial schema creation: Develop first version following HED schema rules
Validation: Use schema validation tools to check structure
Community review: Gather feedback through GitHub discussions
Iteration: Refine based on feedback
Release: Publish to hed-schemas repository
Essential reading:
The HED specification is the definitive source for HED syntax and rules. If you are interested in developing a library schema, the HED schema developer's guide provides comprehensive information on:
Schema XML structure and syntax
Attribute definitions and usage
Value class and unit class design
Versioning and compatibility
Testing and validation
Best practices for schema development
Private vocabularies and extensions
Can I create my own private HED schema?
While technically possible, private schemas are not recommended and have limited tool support.
Why standardized schemas are preferred:
Tool compatibility: Many HED tools assume standardized schemas from the hed-schemas repository
Schema fetching: Tools automatically fetch/cache official schemas
Cross-dataset comparison: Standardized vocabularies enable meta-analyses
Community benefit: Shared vocabularies benefit the entire research community
Long-term maintainability: Official schemas are maintained by the community
Alternatives to private schemas:
Propose library schema: Work with the community to develop a standardized library
Extend existing schema: Suggest additions to standard or library schemas
Use definitions: Create complex concepts using existing vocabulary
Temporary extensions: Use descriptive tags (e.g., Description/...) until proper terms are available
Decision rationale:
The HED Working Group decided to prioritize standardized schemas after observing that unvetted private vocabularies would compromise HED’s ability to enable standardized dataset summaries and cross-study comparisons.
If you have a use case that genuinely requires a private vocabulary, please email hed.maintainers@gmail.com to discuss options.
Resources for schema developers¶
🌐 Schema browser: HED schema browser - Interactive schema viewer
📖 Documentation: HED specification, Schema developer’s guide
📋 Repository: hed-schemas - All official HED schemas
💬 Discussions: hed-schemas/issues - Propose changes or new schemas
📧 Contact: hed.maintainers@gmail.com - Direct schema questions