Version: 1.0.0 | Published: 24 Mar 2026 | Updated: 16 days ago
Summary
Description:
Cancer diagnoses before and after hospitalised cerebrovascular events. A dataset of more than 16,000 stroke patients including granular ethnicity, multi-morbidity, serial physiology, blood biomarkers, intervention and outcome data.
Access Tier:
Controlled
Contact Point:
Health Theme:
Cancer
Health Category:
- Electronic Health Records (EHRs)
- Data from clinical trials, clinical studies & clinical investigations
Number of Unique Individuals:
15908
Documentation
Documentation:
Common causes of cerebrovascular events include arrhythmias such as atrial fibrillation, damage to the small vessels of the brain termed ‘small vessel disease’, large vessel disease and haemorrhage.
Anecdotally, clinicians have described an increased prevalence of newly diagnosed cancers in people presenting with cerebrovascular disease. However, there is limited information on how common cancer is associated with stroke, what types of cancers are most commonly diagnosed, and how this effects prognosis both in relation to the stroke and the cancer.
Furthermore, it is unclear how people with cancer-related strokes should be treated; including if standard treatments are still beneficial or whether a more tailored approach is required.
This is a highly granular dataset of >16,000 patients with a confirmed cerebrovascular event including hospital presentation, serial physiology, every treatment prescribed and administered, and outcomes for the subsequent 12 months. It differentiates patients into those with a known or newly diagnosed malignancy and those without, and cancer types can be linked to pathology staining information, if needed.
PIONEER geography: The West Midlands (WM) has a population of 5.9 million and includes a diverse ethnic and socio-economic mix.
UHB is one of the largest NHS Trusts in England, providing direct acute services and specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary and secondary care record (Your Care Connected) and a patient portal “My Health”.
Scope: Investigating the relationship between cancer and stroke and whether a cancer related stroke is associated with a worse clinical outcome compared with patients with non-cancer related stroke. Longitudinal and individually linked, so that the preceding and subsequent health journey can be mapped and healthcare utilisation prior to and after admission understood. The dataset includes highly granular patient demographics, co-morbidities taken from ICD-10 and SNOMED-CT codes. Serial, structured data pertaining to process of care (timings and admissions), presenting complaint, procedures, physiology readings (blood pressure, respiratory rate, heart rate, oxygen saturations, swallow screening), Lab analysis results (blood sodium level, estimated Glomerular filtration rate (GFR), urea, albumin, cholesterol, full blood counts and others), drug administered and all outcomes.
Available supplementary data: Matched controls; ambulance, OMOP data, synthetic data.
Available supplementary support: Analytics, Model build, validation and refinement; A.I.; Data partner support for ETL (extract, transform and load) process, Clinical expertise, Patient and end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services.
Coverage
Spatial
Spatial Coverage:
- United Kingdom
- England
- West Midlands
Temporal
Start Date:
28 October 2004
End Date:
30 July 2021
Frequency:
QUARTERLY
Date of Latest Release:
20 December 2021
Date of First Release:
08 October 2024
Temporal Aggregation:
1 - 10 Years
Provenance
Origin
Purpose:
Care
Collection Situation:
- Secondary care - Accident and Emergency
- Secondary care - In-patients
- Secondary care - Outpatients
Image Contrast:
Not stated
Method of Collection:
EPR
Access and Governance
Usage
Data Use Requirements:
Project specific restriction
Access
Jurisdiction:
England
Data Controller:
University Hospitals Birmingham NHS Foundation Trust
Delivery Lead Time:
1-2 months
Legal Basis:
General research use
Health Data Access Body:
This publication uses data from PIONEER, an ethically approved database and
analytical environment (East Midlands Derby Research Ethics 20/EM/0158)
Format and Standards
Language:
English
Format:
SQL
Conforms To:
LOCAL
Coding System:
- SNOMED CT
- OPCS4
- ICD10
Data Distribution
Data Status:
Available
Distribution:
Trusted Research Environments (TRE) are built using Microsoft Azure services and
hosted in the UK to provide research teams a safe, secure and agile environment
which allows users to quickly analyse, interpret and form an enriched view of
primary care information through a range of integrated datasets. Health data
collated from multiple sources is ingested into a secure data lake which will
then allow subsets of data to be made available to research teams on approval of
a data request. Once approved a customer specific TRE is made available with a
standard set of leading analytical tools from Microsoft including Azure
Databricks, Azure Machine Learning, Azure SQL and Azure Synapse (for large-scale
data warehouses). Specific tools can be provided at an additional cost over the
standard platform data access charge and the PIONEER team will work with you to
determine your exact needs. Access to the TRE is managed using the latest
virtual desktop technology to provide a safe and secure end-user experience. By
utilising leading edge design PIONEER are able to create TREs rapidly to enable
us to service any customer requirement.,
www.pioneerdatahub.co.uk/data/data-services-costs/
Observations
Name
Population Type
Value
Description
Variable Measured
Unit Code
Observation Date
Number of Records
Minimum Typical Age
Maximum Typical Age
Persons
15908
15,908 cancer related stroke admissions between 29/10/2004 and 31/07/2021
Count
20 December 2021
15908
18
110
Origin
Name:
Data Catalogue