Statistical Analysis

MOABB ships with convenience plotting utilities and a small set of statistical tests. This tutorial shows what they are and how to use them.

# Authors: Vinay Jayaram <vinayjayaram13@gmail.com>
#
# License: BSD (3-clause)
# sphinx_gallery_thumbnail_number = -2

import matplotlib.pyplot as plt
from mne.decoding import CSP
from pyriemann.estimation import Covariances
from pyriemann.tangentspace import TangentSpace
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis as LDA
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

import moabb
import moabb.analysis.plotting as moabb_plt
from moabb.analysis.meta_analysis import (  # noqa: E501
    compute_dataset_statistics,
    find_significant_differences,
)
from moabb.datasets import BNCI2014_001
from moabb.evaluations import CrossSessionEvaluation
from moabb.paradigms import LeftRightImagery


moabb.set_log_level("info")

print(__doc__)

Results Generation

First we need to set up a paradigm, a dataset list, and some pipelines to test. These steps are covered in more detail in the other examples; here we use the left versus right imagery paradigm with a single bandpass filter. Only one dataset is used here, but any number can be added without changing the workflow.

Create Pipelines

Pipelines must be given as a dict that maps a name to a scikit-learn pipeline.

The CSP implementation from MNE is used. We select 8 CSP components, as is common in the literature.

The Riemannian geometry pipeline consists of covariance estimation, tangent space mapping, and finally a logistic regression for classification.

pipelines = {}

pipelines["CSP+LDA"] = make_pipeline(CSP(n_components=8), LDA())

pipelines["RG+LR"] = make_pipeline(Covariances(), TangentSpace(), LogisticRegression())

pipelines["CSP+LR"] = make_pipeline(CSP(n_components=8), LogisticRegression())

pipelines["RG+LDA"] = make_pipeline(Covariances(), TangentSpace(), LDA())

Evaluation

We define the paradigm (LeftRightImagery) and the dataset (BNCI2014_001). The evaluation will return a DataFrame containing a single AUC score for each subject / session of the dataset, and for each pipeline.

Results are saved to a database, so when you add a new pipeline the evaluation is not run again for the existing ones unless a parameter has changed. Results can be overwritten if necessary.

paradigm = LeftRightImagery()
dataset = BNCI2014_001()
dataset.subject_list = dataset.subject_list[:4]
datasets = [dataset]
overwrite = True  # set to False if we want to use cached results
evaluation = CrossSessionEvaluation(
    paradigm=paradigm, datasets=datasets, suffix="stats", overwrite=overwrite
)

results = evaluation.process(pipelines)

BNCI2014-001-CrossSession: 100%|██████████| 4/4 [00:35<00:00,  8.78s/it]
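
The caching behaviour described above can be pictured with a small standard-library sketch. It is a toy illustration of the idea only, not MOABB's storage code: the `ResultCache` class, the `pipeline_digest` helper, and the string descriptions of the pipelines are all hypothetical names invented for this example.

```python
import hashlib


def pipeline_digest(description: str) -> str:
    """Hash a textual description of a pipeline, parameters included."""
    return hashlib.sha256(description.encode("utf-8")).hexdigest()


class ResultCache:
    """Toy in-memory cache keyed by (pipeline digest, dataset, subject)."""

    def __init__(self):
        self._store = {}
        self.n_computed = 0  # how many times the evaluation really ran

    def get_or_compute(self, description, dataset, subject, compute, overwrite=False):
        key = (pipeline_digest(description), dataset, subject)
        # Recompute only if asked to overwrite or if nothing is stored yet.
        if overwrite or key not in self._store:
            self._store[key] = compute()
            self.n_computed += 1
        return self._store[key]


def expensive_evaluation():
    """Stand-in for a real evaluation; returns a made-up score."""
    return 0.9


cache = ResultCache()
# First call computes, second call with identical parameters is a cache hit.
cache.get_or_compute("CSP(n_components=8)+LDA()", "BNCI2014-001", 1, expensive_evaluation)
cache.get_or_compute("CSP(n_components=8)+LDA()", "BNCI2014-001", 1, expensive_evaluation)
# A changed parameter produces a different digest, so the evaluation runs again.
cache.get_or_compute("CSP(n_components=6)+LDA()", "BNCI2014-001", 1, expensive_evaluation)
# overwrite=True forces a recomputation even though a result is stored.
cache.get_or_compute(
    "CSP(n_components=8)+LDA()", "BNCI2014-001", 1, expensive_evaluation, overwrite=True
)
print(cache.n_computed)  # 3
```

Keying on a digest of the full parameter description is what makes "unless a parameter has changed" work in this sketch: the same pipeline name with different settings is treated as a different entry.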

MOABB Plotting

Here we plot the results using some of the convenience methods within the toolkit. The score_plot visualizes all the data with one score per subject for every dataset and pipeline.

fig = moabb_plt.score_plot(results)
plt.show()

For a comparison of two algorithms, there is the paired_plot, which plots performance in one versus the performance in the other over all chosen datasets. Note that there is only one score per subject, regardless of the number of sessions.

fig = moabb_plt.paired_plot(results, "CSP+LDA", "RG+LDA")
plt.show()
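
To make "one score per subject" concrete, the sketch below collapses several session scores into a single value per subject and pipeline and then builds the (x, y) pairs that a paired comparison would plot. It assumes the sessions are combined by a plain average, which may not match every detail of `paired_plot`. The rows, the session labels `s0` and `s1`, and the helper names are made up for illustration and are not MOABB output.

```python
from collections import defaultdict
from statistics import mean

# Made-up scores for illustration only; they are not MOABB output.
rows = [
    {"subject": 1, "session": "s0", "pipeline": "CSP+LDA", "score": 0.75},
    {"subject": 1, "session": "s1", "pipeline": "CSP+LDA", "score": 0.875},
    {"subject": 1, "session": "s0", "pipeline": "RG+LDA", "score": 0.875},
    {"subject": 1, "session": "s1", "pipeline": "RG+LDA", "score": 0.9375},
    {"subject": 2, "session": "s0", "pipeline": "CSP+LDA", "score": 0.5},
    {"subject": 2, "session": "s1", "pipeline": "CSP+LDA", "score": 0.625},
    {"subject": 2, "session": "s0", "pipeline": "RG+LDA", "score": 0.625},
    {"subject": 2, "session": "s1", "pipeline": "RG+LDA", "score": 0.75},
]


def per_subject_scores(rows):
    """Average the session scores of each (subject, pipeline) pair."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[(row["subject"], row["pipeline"])].append(row["score"])
    return {key: mean(scores) for key, scores in grouped.items()}


def paired_points(rows, pipeline_x, pipeline_y):
    """Return one (x, y) point per subject for a paired comparison."""
    averaged = per_subject_scores(rows)
    subjects = sorted({row["subject"] for row in rows})
    return [(averaged[(s, pipeline_x)], averaged[(s, pipeline_y)]) for s in subjects]


points = paired_points(rows, "CSP+LDA", "RG+LDA")
print(points)  # [(0.8125, 0.90625), (0.5625, 0.6875)]
```

Each point lies above the diagonal when the second pipeline scores higher for that subject, which is how a paired plot is usually read.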

Statistical Testing and Further Plots

If the statistical significance of the results is of interest, compute_dataset_statistics produces the statistics needed for a meta-analysis style plot. For an overview of how all algorithms compare with each other, find_significant_differences can be used together with summary_plot.

stats = compute_dataset_statistics(results)
P, T = find_significant_differences(stats)
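
For intuition about how a paired comparison between two pipelines can yield a p-value with only a handful of subjects, here is a generic exhaustive sign-flip permutation test written with the standard library. It is an illustration of the technique, not a copy of what `compute_dataset_statistics` does internally; the function name and the scores are invented for the example.

```python
from itertools import product
from statistics import mean


def paired_permutation_pvalue(scores_a, scores_b):
    """One-sided p-value for 'A scores higher than B' over every sign flip."""
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    observed = mean(diffs)
    count = 0
    total = 0
    # Under the null hypothesis the sign of each paired difference is arbitrary,
    # so every combination of sign flips is an equally likely outcome.
    for signs in product((1, -1), repeat=len(diffs)):
        total += 1
        if mean(s * d for s, d in zip(signs, diffs)) >= observed:
            count += 1
    return count / total


# Made-up per-subject scores for two pipelines (four subjects).
scores_a = [0.875, 0.75, 0.8125, 0.9375]
scores_b = [0.75, 0.625, 0.75, 0.875]
p_value = paired_permutation_pvalue(scores_a, scores_b)
print(p_value)  # 0.0625
```

With four subjects there are only 2**4 = 16 sign patterns, so the smallest p-value this particular test can return is 1/16 = 0.0625. That is worth keeping in mind when reading significance results computed on a small subject list like the one used here.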

The meta-analysis style plot shows the standardized mean difference within each tested dataset for the two algorithms in question, in addition to a meta-effect and significance both per-dataset and overall.

fig = moabb_plt.meta_analysis_plot(stats, "CSP+LDA", "RG+LDA")
plt.show()

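[Figure: meta-analysis plot. The horizontal axis runs from "RG+LDA better" on the left to "CSP+LDA better" on the right, with a p-value column.]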
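
The two quantities named above, a standardized mean difference per dataset and an overall significance, can be sketched with the standard library. The code below is written under stated assumptions rather than taken from MOABB: it uses the sample standard deviation of the paired differences, combines p-values with Stouffer's weighted z-score method, and weights each dataset by the square root of its number of subjects. The function names, scores, and p-values are all illustrative.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def standardized_mean_difference(scores_a, scores_b):
    """Mean of the paired differences divided by their sample standard deviation."""
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    return mean(diffs) / stdev(diffs)


def stouffer_combined_pvalue(pvalues, weights):
    """Combine one-sided p-values with Stouffer's weighted z-score method.

    Each p-value must lie strictly between 0 and 1.
    """
    norm = NormalDist()
    z_scores = [norm.inv_cdf(1.0 - p) for p in pvalues]
    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    return 1.0 - norm.cdf(combined_z)


# Made-up per-subject scores for two pipelines on one dataset.
scores_a = [0.875, 0.75, 0.8125, 0.9375]
scores_b = [0.75, 0.625, 0.75, 0.875]
smd = standardized_mean_difference(scores_a, scores_b)

# Made-up per-dataset p-values and subject counts for two datasets.
pvalues = [0.05, 0.05]
n_subjects = [4, 9]
weights = [sqrt(n) for n in n_subjects]  # assumption: weight by sqrt(n subjects)
overall_p = stouffer_combined_pvalue(pvalues, weights)
print(smd, overall_p)
```

Two datasets that each sit at p = 0.05 combine into a smaller overall p-value, which is the reason a meta-analysis can detect an effect that no single dataset establishes on its own.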

The summary plot shows the effect and significance related to the hypothesis that the algorithm on the y-axis significantly outperformed the algorithm on the x-axis over all datasets.

moabb_plt.summary_plot(P, T)
plt.show()

Total running time of the script: (0 minutes 37.867 seconds)

Estimated memory usage: 716 MB