Tutorial 1: Simple Motor Imagery#

In this example, we will go through all the steps to make a simple BCI classification task, downloading a dataset and using a standard classifier. We choose the dataset 2a from BCI Competition IV, a motor imagery task. We will use a CSP to enhance the signal-to-noise ratio of the EEG epochs and a LDA to classify these signals.

# Authors: Pedro L. C. Rodrigues, Sylvain Chevallier
#
# https://github.com/plcrodrigues/Workshop-MOABB-BCI-Graz-2019

import warnings

import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns
from mne.decoding import CSP
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis as LDA
from sklearn.pipeline import make_pipeline

import moabb
from moabb.datasets import BNCI2014_001
from moabb.evaluations import WithinSessionEvaluation
from moabb.paradigms import LeftRightImagery


moabb.set_log_level("info")
warnings.filterwarnings("ignore")

Instantiating Dataset#

The first thing to do is to instantiate the dataset that we want to analyze. MOABB has a list of many different datasets, each one containing all the necessary information for describing them, such as the number of subjects, size of trials, names of classes, etc.

The dataset class has methods for:

  • downloading its files from some online source (e.g. Zenodo)

  • importing the data from the files in whatever extension they might be (like .mat, .gdf, etc.) and instantiate a Raw object from the MNE package

Accessing EEG Recording#

As an example, we may access the EEG recording from a given session and a given run as follows:

sessions = dataset.get_data(subjects=[1])

This returns a MNE Raw object that can be manipulated. This might be enough for some users, since the pre-processing and epoching steps can be easily done via MNE. However, to conduct an assessment of several classifiers on multiple subjects, MOABB ends up being a more appropriate option.

Choosing a Paradigm#

Once we have instantiated a dataset, we have to choose a paradigm. This object is responsible for filtering the data, epoching it, and extracting the labels for each epoch. Note that each dataset comes with the names of the paradigms to which it might be associated. It would not make sense to process a P300 dataset with a MI paradigm object.

imagery

For the example below, we will consider the paradigm associated to left-hand/right-hand motor imagery task, but there are other options in MOABB for motor imagery, P300 or SSVEP.

We may check the list of all datasets available in MOABB for using with this paradigm (note that BNCI2014_001 is in it)

[<moabb.datasets.bnci.bnci_2014.BNCI2014_001 object at 0x7f9de84b19f0>, <moabb.datasets.bnci.bnci_2014.BNCI2014_004 object at 0x7f9de5cb7fa0>, <moabb.datasets.beetl.Beetl2021_A object at 0x7f9de84b14b0>, <moabb.datasets.beetl.Beetl2021_B object at 0x7f9de5cb7dc0>, <moabb.datasets.brandl2020.Brandl2020 object at 0x7f9de5cb7f40>, <moabb.datasets.chang2025.Chang2025 object at 0x7f9de5cb7d60>, <moabb.datasets.gigadb.Cho2017 object at 0x7f9de5cb7c70>, <moabb.datasets.dreyer2023.Dreyer2023 object at 0x7f9de5cf32b0>, <moabb.datasets.dreyer2023.Dreyer2023A object at 0x7f9de5cb7fd0>, <moabb.datasets.dreyer2023.Dreyer2023B object at 0x7f9de5cf3580>, <moabb.datasets.dreyer2023.Dreyer2023C object at 0x7f9de5cf3610>, <moabb.datasets.forenzo2023.Forenzo2023 object at 0x7f9de5cf3550>, <moabb.datasets.mpi_mi.GrosseWentrup2009 object at 0x7f9de5cf35b0>, <moabb.datasets.guttmann_flury2025.GuttmannFlury2025_ME object at 0x7f9de5cf3430>, <moabb.datasets.guttmann_flury2025.GuttmannFlury2025_MI object at 0x7f9de5cf36d0>, <moabb.datasets.hefmi_ich2025.HefmiIch2025 object at 0x7f9de5cf36a0>, <moabb.datasets.kaya2018.Kaya2018 object at 0x7f9de5cf38e0>, <moabb.datasets.kumar2024.Kumar2024 object at 0x7f9de5cf30a0>, <moabb.datasets.Lee2019.Lee2019_MI object at 0x7f9de5cf2bc0>, <moabb.datasets.liu2024.Liu2024 object at 0x7f9de5cf33a0>, <moabb.datasets.physionet_mi.PhysionetMI object at 0x7f9de5cf0d90>, <moabb.datasets.schirrmeister2017.Schirrmeister2017 object at 0x7f9de5cf3820>, <moabb.datasets.bbci_eeg_fnirs.Shin2017A object at 0x7f9de5cf3880>, <moabb.datasets.stieger2021.Stieger2021 object at 0x7f9de5cf3760>, <moabb.datasets.wairagkar2018.Wairagkar2018 object at 0x7f9de5cf3790>, <moabb.datasets.Weibo2014.Weibo2014 object at 0x7f9de5cf3310>, <moabb.datasets.wu2020.Wu2020 object at 0x7f9de5cf2e60>, <moabb.datasets.yang2025.Yang2025 object at 0x7f9de5cf0bb0>, <moabb.datasets.Zhou2016.Zhou2016 object at 0x7f9de5cf32e0>, <moabb.datasets.zhou2020.Zhou2020 object at 0x7f9de5cf10f0>]

The data from a list of subjects could be preprocessed and return as a 3D numpy array X, follow a scikit-like format with the associated labels. The meta object contains all information regarding the subject, the session and the run associated to each trial.

Create Pipeline#

Our goal is to evaluate the performance of a given classification pipeline (or several of them) when it is applied to the epochs from the previously chosen dataset. We will consider a very simple classification pipeline in which the dimension of the epochs are reduced via a CSP step and then classified via a linear discriminant analysis.

pipeline = make_pipeline(CSP(n_components=8), LDA())

Evaluation#

To evaluate the score of this pipeline, we use the evaluation class. When instantiating it, we say which paradigm we want to consider, a list with the datasets to analyze, and whether the scores should be recalculated each time we run the evaluation or if MOABB should create a cache file.

Note that there are different ways of evaluating a classifier; in this example, we choose WithinSessionEvaluation, which consists of doing a cross-validation procedure where the training and testing partitions are from the same recording session of the dataset. We could have used CrossSessionEvaluation, which takes all but one session as training partition and the remaining one as testing partition.

evaluation = WithinSessionEvaluation(
    paradigm=paradigm, datasets=[dataset], overwrite=True, hdf5_path=None
)

We obtain the results in the form of a pandas dataframe

[codecarbon WARNING @ 15:49:54] Multiple instances of codecarbon are allowed to run at the same time.
2026-04-27 15:51:30,988 INFO MainThread moabb.evaluations.base csp+lda | BNCI2014-001 | 2 | 1test: Score 0.649
2026-04-27 15:51:30,988 INFO MainThread moabb.evaluations.base csp+lda | BNCI2014-001 | 2 | 0train: Score 0.658
2026-04-27 15:51:30,988 INFO MainThread moabb.evaluations.base csp+lda | BNCI2014-001 | 1 | 0train: Score 0.923
2026-04-27 15:51:30,988 INFO MainThread moabb.evaluations.base csp+lda | BNCI2014-001 | 1 | 1test: Score 0.956
2026-04-27 15:51:30,988 INFO MainThread moabb.evaluations.base csp+lda | BNCI2014-001 | 3 | 0train: Score 0.995
2026-04-27 15:51:30,988 INFO MainThread moabb.evaluations.base csp+lda | BNCI2014-001 | 3 | 1test: Score 0.994

The results are stored in locally, to avoid recomputing the results each time. It is saved in hdf5_path if defined or in ~/mne_data/results otherwise. To export the results in CSV:

results.to_csv("./results_part2-1.csv")

To load previously obtained results saved in CSV

results = pd.read_csv("./results_part2-1.csv")

Plotting Results#

We create a figure with the seaborn package comparing the classification score for each subject on each session. Note that the ‘subject’ field from the results is given in terms of integers, but seaborn accepts only strings for its labeling. This is why we create the field ‘subj’.

fig, ax = plt.subplots(figsize=(8, 7))
results["subj"] = results["subject"].apply(str)
sns.barplot(
    x="score", y="subj", hue="session", data=results, orient="h", palette="viridis", ax=ax
)
plt.show()
tutorial 1 simple example motor imagery
2026-04-27 15:51:31,023 INFO MainThread matplotlib.category Using categorical units to plot a list of strings that are all parsable as floats or dates. If these strings should be plotted as numbers, cast to the appropriate data type before plotting.
2026-04-27 15:51:31,026 INFO MainThread matplotlib.category Using categorical units to plot a list of strings that are all parsable as floats or dates. If these strings should be plotted as numbers, cast to the appropriate data type before plotting.

Total running time of the script: (1 minutes 49.937 seconds)

Gallery generated by Sphinx-Gallery