Version: 1.0.0 | Published: 24 Mar 2026 | Updated: 16 days ago

White Swan UK Cardiovascular Online Patient & Public Conversations Dataset

Dataset

Summary

Description:
The dataset contains anonymised patient and public conversations that took place online in the UK about 17 cardiovascular disease areas, including arrhythmia and tachycardia, cardiopulmonary disease, and heart failure. The dataset is segmented by cardiovascular condition and by topic of conversation, such as testing, imaging and emotional impact.
Health Theme:
Noncommunicable diseases – metabolic & cardiopulmonary
Health Category:
  • Electronic Health Records (EHRs)
  • Data on factors impacting on health, including socio-economic, environmental & behavioural determinants of health
  • Data from clinical trials, clinical studies & clinical investigations
Number of Unique Individuals:
64613

Documentation

Associated Media:
on request
Documentation:
The dataset contains anonymised patient and public conversations that took place online in the UK about 17 cardiovascular disease areas, including arrhythmia and tachycardia, cardiopulmonary disease, and heart failure. The dataset is segmented by cardiovascular condition and by topic of conversation, such as testing, imaging and emotional impact.

Coverage

Spatial

Spatial Coverage:
United Kingdom

Temporal

Start Date:
01 February 2023
Frequency:
IRREGULAR
Date of First Release:
13 February 2025

Provenance

Origin

Purpose:
Research cohort
Collection Situation:
Other
Image Contrast:
Not stated
Method of Collection:
Free text NLP

Access and Governance

Usage

Data Use Requirements:
Project specific restriction

Access

Access Rights:
In Progress
Jurisdiction:
United Kingdom of Great Britain and Northern Ireland
Data Controller:
White Swan
Data Processor:
White Swan
Delivery Lead Time:
1-2 months
Legal Basis:
Project-specific restrictions
Health Data Access Body:
White Swan is a registered charity in England and Wales (1176486) improving health and wellbeing through AI technology and analytics.

Format and Standards

Language:
English
Format:
  • csv
  • xlsx
  • web page explorer
Conforms To:
  • OTHER
  • LOCAL
Coding System:
  • LOCAL
  • HPO
  • OTHER

Data Distribution

Data Status:
Available
Distribution:
On Request

Observations

Name:
(blank)
Population Type:
Persons
Value:
64613
Description:
The number of persons in this dataset is the number of unique display names found in the data. The count is calculated per source (Reddit, reviews, other forums) and the per-source counts are then summed. On review sites and other forums, persons may choose to post as anonymous; in that case, anonymous users are counted once per domain (for example, once for 'healthunlocked.com/vasculitis-uk').
Variable Measured:
Unique online names indicating number of persons
Unit Code:
(blank)
Observation Date:
01 February 2025
Number of Records:
64613
Minimum Typical Age:
0
Maximum Typical Age:
112
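
The person-counting rule described in the observation above can be sketched in Python. This is a minimal illustration, not the data controller's actual pipeline: the function name `count_persons`, the `(source, domain, display_name)` record shape, the `ANONYMOUS_LABELS` set and all sample names and domains other than 'healthunlocked.com/vasculitis-uk' are assumptions made for the example.

```python
from collections import defaultdict

# Assumption: labels that mark an anonymous poster. The real label set is not documented.
ANONYMOUS_LABELS = {"anonymous", "anon", ""}


def count_persons(posts):
    """Count persons from (source, domain, display_name) records.

    Unique display names are counted per source; anonymous posters are
    counted once per domain; the per-source counts are then summed.
    """
    names_by_source = defaultdict(set)
    anon_domains_by_source = defaultdict(set)

    for source, domain, display_name in posts:
        name = display_name.strip()
        if name.casefold() in ANONYMOUS_LABELS:
            # All anonymous posts on one domain collapse to a single person.
            anon_domains_by_source[source].add(domain)
        else:
            names_by_source[source].add(name)

    sources = set(names_by_source) | set(anon_domains_by_source)
    per_source = {
        s: len(names_by_source[s]) + len(anon_domains_by_source[s])
        for s in sources
    }
    return per_source, sum(per_source.values())


# Made-up sample records, for illustration only.
posts = [
    ("reddit", "reddit.com", "heart_walker"),
    ("reddit", "reddit.com", "heart_walker"),
    ("reddit", "reddit.com", "afib_2023"),
    ("other forums", "healthunlocked.com/vasculitis-uk", "Anonymous"),
    ("other forums", "healthunlocked.com/vasculitis-uk", "Anonymous"),
    ("other forums", "healthunlocked.com/vasculitis-uk", "heart_walker"),
    ("reviews", "example-reviews.test", "anonymous"),
]

per_source, total = count_persons(posts)
print(per_source, total)
```

Because names are counted per source before summing, the same display name appearing on two sources (here `heart_walker`) contributes two persons, so the total is best read as an upper-bound style estimate of distinct individuals rather than an exact headcount.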