HRV Analysis Guidelines¶

Practical guidelines for using RRational in line with current scientific standards.

Which guidelines does RRational follow?¶

RRational implements recommendations from:

Quigley et al. (2024) — Current consensus on HRV measurement and reporting in psychophysiology
Task Force (1996) — Foundational standards for HRV measurement
Quintana et al. (2016) — GRAPH checklist for reporting HRV studies
Lipponen & Tarvainen (2019) — State-of-the-art artifact detection algorithm

Data Quality¶

Artifact Detection¶

RRational uses the Lipponen-Tarvainen algorithm (via NeuroKit2's Kubios implementation) which classifies beats into six categories: ectopic, long, short, missed, extra, and normal.

flowchart TD
  A[Run artifact detection] --> B{Artifact rate?}
  B -->|≤ 2%| C[Excellent\nUse as-is]
  B -->|2-5%| D[Good\nCorrect + analyze]
  B -->|5-10%| E[Moderate\nCorrect, time-domain only]
  B -->|> 10%| F[Poor\nExclude segment]

  style C fill:#28a745,color:#fff
  style D fill:#2E86AB,color:#fff
  style E fill:#ffc107,color:#000
  style F fill:#dc3545,color:#fff

Recommended Workflow

Run artifact detection on all validated sections (same segments)
Review per-segment quality grades
Exclude segments with > 10% artifacts
Correct segments with 2–10% artifacts
Report artifact rates in your publication

Quality Grades¶

RRational assigns quality grades following Quigley et al. (2024):

Grade	Artifact Rate	Valid Metrics	Action
Excellent	≤ 2%	All (time, frequency, nonlinear)	Use as-is
Good	2–5%	All	Correct artifacts, then analyze
Moderate	5–10%	Time-domain only	Correct, but avoid frequency metrics
Poor	> 10%	None reliably	Exclude from analysis

Segments with fewer than 50 beats are excluded automatically regardless of artifact rate (too short for any reliable HRV metric).

Minimum Data Requirements¶

Analysis	Minimum	Recommended	Why
Time-domain (RMSSD, SDNN)	100 beats	300+ beats	Statistical reliability
Frequency-domain (LF, HF)	300 beats	500+ beats	Spectral estimation needs sufficient data
Recording duration for frequency	2 minutes	5 minutes	5 min is the Task Force (1996) short-term window; the 2 min minimum is RRational's practical floor

Metric Interpretation¶

Recommended Metrics¶

Metric	What it reflects	Recommendation
RMSSD	Parasympathetic (vagal) activity	Primary metric for short-term studies
SDNN	Total autonomic variability	Use with consistent segment durations
HF Power	Parasympathetic activity (0.15–0.4 Hz)	Corroborates RMSSD
SD1	Short-term variability (Poincare)	Mathematically related to RMSSD

Metrics to Use with Caution¶

LF/HF Ratio

The LF/HF ratio should not be interpreted as "sympathovagal balance." This interpretation is rejected by current consensus (Quigley et al., 2024; Billman, 2013). The LF band reflects both sympathetic and parasympathetic activity plus baroreflex function. RRational includes LF/HF for legacy compatibility, but we recommend against using it as your primary outcome.

SDNN Across Different Durations

SDNN scales with recording duration. Never compare SDNN values from segments of different lengths. Use consistent window sizes (e.g., always 5 minutes) within a study.

Statistical Testing (Group & Sequence Comparison)¶

When you compare groups or conditions, RRational selects an appropriate test automatically and reports an effect size alongside each p-value.

Test selection is driven by the number of groups and a normality check (Shapiro-Wilk) on each metric:

Comparison	Distribution	Test	Effect size
2 groups	normal	Welch's t-test (unequal variances)	Cohen's d
2 groups	non-normal	Mann-Whitney U	—
3+ groups	normal	one-way ANOVA	η² (eta-squared)
3+ groups	non-normal	Kruskal-Wallis	η²

Why effect sizes? A p-value indicates whether a difference is unlikely under the null hypothesis; it does not say how large the difference is. Always report an effect size (Cohen's d or η²) so readers can judge practical significance — current reporting guidelines (Quigley et al., 2024) require this.

Multiple comparisons. Testing many metrics at once inflates the false-positive rate. RRational adjusts p-values across the metrics in a comparison using Holm (default), Bonferroni, or Benjamini-Hochberg FDR. State which correction you used.

Log-normal metrics. Frequency-domain powers (LF, HF, VLF, Total Power) are right-skewed and approximately log-normal. RRational log-transforms them before parametric testing so the normality assumption holds, in line with common HRV practice.

Normality on small samples

The Shapiro-Wilk test has low power with very small groups, so a "normal" verdict on a handful of participants is weak evidence. With small or clearly skewed samples, prefer the non-parametric option and interpret parametric results cautiously.

Reporting Checklist¶

Following the GRAPH guidelines (Quintana et al., 2016), your publication should report:

Recording device and sampling rate
Recording duration per condition
Artifact detection method and parameters
Mean artifact rate per condition (with SD)
Artifact correction algorithm
Number of excluded segments and criteria
HRV metrics computed, with window duration
Beat count per analyzed segment
Software and version used

RRational makes this easy

The Analysis Documentation panel (in Single Participant analysis) auto-generates a report with all these details. Export as HTML or Markdown and include in your supplementary materials.

Common Pitfalls¶

Pitfall	Why it's a problem	What to do instead
Comparing SDNN across different durations	SDNN scales with recording length	Use identical segment lengths
Interpreting LF as "sympathetic"	Not supported by evidence	Report LF but don't over-interpret
Using LF/HF as sympathovagal balance	Rejected by current consensus	Focus on RMSSD and HF
Over-correcting artifacts (> 10%)	Distorts the signal	Exclude the segment instead
Frequency metrics on < 2 min segments	Insufficient spectral resolution	Use time-domain only
Ignoring respiratory confounds	Breathing rate affects HF	Note if breathing was uncontrolled
Not reporting artifact rates	Reviewers can't assess data quality	Always report per condition

Key References¶

Quigley, K.S., et al. (2024). Publication guidelines for human heart rate and heart rate variability studies in psychophysiology. Psychophysiology, 61(9), e14604. doi:10.1111/psyp.14604
Task Force of ESC and NASPE (1996). Heart rate variability: Standards of measurement. Circulation, 93(5), 1043–1065. doi:10.1161/01.CIR.93.5.1043
Quintana, D.S., et al. (2016). Guidelines for Reporting Articles on Psychiatry and Heart rate variability (GRAPH). Translational Psychiatry, 6(5), e803. doi:10.1038/tp.2016.73
Lipponen, J.A., & Tarvainen, M.P. (2019). A robust algorithm for heart rate variability time series artefact correction. Journal of Medical Engineering & Technology, 43(3), 173–181. doi:10.1080/03091902.2019.1640306
Makowski, D., et al. (2021). NeuroKit2: A Python toolbox for neurophysiological signal processing. Behavior Research Methods, 53, 1689–1696. doi:10.3758/s13428-020-01516-y
Billman, G.E. (2013). The LF/HF ratio does not accurately measure cardiac sympatho-vagal balance. Frontiers in Physiology, 4, 26. doi:10.3389/fphys.2013.00026