Version: 1.0.0 | Published: 24 Mar 2026 | Updated: 16 days ago

Demography, interventions & outcomes of patients with Cerebrovascular Disease

Dataset

Summary

Description:
Patients admitted with a cerebrovascular event. Granular detail of care pathways. Multi-morbidity, investigations, interventions and treatments. Serial physiology, blood biomarkers, physiotherapy, outcome. Deeply phenotyped.
Access Tier:
Controlled
Contact Point:
Health Theme:
Mental, neurological & substance use
Health Category:
  • Electronic Health Records (EHRs)
  • Data from clinical trials, clinical studies & clinical investigations
Number of Unique Individuals:
8885

Documentation

Documentation:
PIONEER geography: The West Midlands (WM) has a population of 5.9 million & includes a diverse ethnic & socio-economic mix. There is a higher-than-average percentage of minority ethnic groups. WM has a large number of elderly residents but is the youngest population in the UK. There are particularly high rates of physical inactivity, obesity, smoking & diabetes. WM has a high prevalence of COPD, reflecting the high rates of smoking and industrial exposure. Each day more than 100,000 people are treated in hospital, see their GP or are cared for by the NHS. This is the SAMBA dataset from 4 NHS hospitals.
EHR: University Hospitals Birmingham NHS Foundation Trust (UHB) is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & 100 ITU beds. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”.
Scope: All patients from 2015 onwards, curated to focus on Stroke. The data are longitudinal & individually linked, so that the preceding & subsequent health journey can be mapped & healthcare utilisation prior to & after admission understood. The dataset includes highly granular patient demographics & co-morbidities taken from ICD-10 & SNOMED-CT codes. It also includes serial, structured data pertaining to process of care (admissions, wards and discharge outcomes), presenting complaints, therapies, all physiology readings (pulse, temperature, blood pressure), screening for dysphagia, all sample analysis results (urine specimens, blood specimens), all prescribed & administered treatments and all outcomes.
Available supplementary data: More extensive data including granular serial physiology, bloods, conditions, interventions, treatments. Ambulance, 111, 999 data, synthetic data.
Available supplementary support: Analytics, Model build, validation & refinement; A.I.; Data partner support for ETL (extract, transform & load) process, Clinical expertise, Patient & end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services
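The description of longitudinal, individually linked records can be illustrated with a minimal sketch of how healthcare utilisation before and after an index stroke admission might be counted. The table name, column names and sample rows below are hypothetical and are not the PIONEER data model; the only external fact relied on is that ICD-10 block I60-I69 covers cerebrovascular diseases. An in-memory SQLite database stands in for the SQL environment.

```python
import sqlite3

# Hypothetical schema and rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE admissions (
    patient_id    TEXT,
    admit_date    TEXT,   -- ISO 8601 date, so text comparison orders correctly
    primary_icd10 TEXT
);
INSERT INTO admissions VALUES
    ('P1', '2016-03-01', 'J44.1'),
    ('P1', '2017-06-15', 'I63.9'),
    ('P1', '2018-01-20', 'N39.0'),
    ('P2', '2019-09-09', 'I61.0');
""")

# Index event: each patient's first admission coded in ICD-10 block I60-I69.
# Other admissions are then counted as falling before or after that date.
QUERY = """
WITH index_event AS (
    SELECT patient_id, MIN(admit_date) AS index_date
    FROM admissions
    WHERE substr(primary_icd10, 1, 3) BETWEEN 'I60' AND 'I69'
    GROUP BY patient_id
)
SELECT a.patient_id,
       SUM(a.admit_date < i.index_date) AS admissions_before,
       SUM(a.admit_date > i.index_date) AS admissions_after
FROM admissions AS a
JOIN index_event AS i ON a.patient_id = i.patient_id
GROUP BY a.patient_id
ORDER BY a.patient_id;
"""


def utilisation_around_index(connection):
    """Return (patient_id, admissions_before, admissions_after) tuples."""
    return connection.execute(QUERY).fetchall()


rows = utilisation_around_index(conn)
for patient_id, before, after in rows:
    print(patient_id, before, after)
```

In a real extract the index event would more likely be defined from the curated stroke spell itself, so the ICD-10 filter here is only a stand-in for that step.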

Coverage

Spatial

Spatial Coverage:
  • United Kingdom
  • England
  • West Midlands

Temporal

Start Date:
01 January 2015
End Date:
31 December 2020
Frequency:
QUARTERLY
Date of Latest Release:
16 February 2020
Date of First Release:
08 October 2024
Temporal Aggregation:
Other

Provenance

Origin

Purpose:
Care
Collection Situation:
  • Secondary care - Accident and Emergency
  • Secondary care - In-patients
Image Contrast:
Not stated
Method of Collection:
EPR

Access and Governance

Usage

Data Use Requirements:
Project specific restriction

Access

Data Controller:
University Hospitals Birmingham NHS Foundation Trust
Delivery Lead Time:
1-2 months
Legal Basis:
General research use
Health Data Access Body:
This publication uses data from PIONEER, an ethically approved database and analytical environment (East Midlands Derby Research Ethics 20/EM/0158)

Format and Standards

Language:
English
Format:
SQL
Conforms To:
LOCAL
Coding System:
  • OPCS4
  • SNOMED CT
  • ICD10
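
As an illustration of working with the coding systems listed above, the sketch below checks whether an ICD-10 code falls in block I60-I69 (cerebrovascular diseases). The helper name and the accepted code formats (with or without the dot, any letter case) are assumptions made for the example; they do not describe how codes are stored in this dataset.

```python
import re

# Letter, two-digit category, then an optional subcategory with or without a dot.
_ICD10_PATTERN = re.compile(r"^([A-Z])(\d{2})(?:\.?[0-9A-Z]{1,4})?$")


def is_cerebrovascular(code: str) -> bool:
    """Return True if an ICD-10 code lies in block I60-I69 (hypothetical helper)."""
    match = _ICD10_PATTERN.match(code.strip().upper())
    if match is None:
        return False  # not recognisable as an ICD-10 code
    letter, category = match.group(1), int(match.group(2))
    return letter == "I" and 60 <= category <= 69


for example in ["I63.9", "I639", "i61.0", "J44.1", "I10"]:
    print(example, is_cerebrovascular(example))
```

OPCS4 and SNOMED CT codes use different structures and would need their own lookups; this check applies to ICD-10 only.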

Data Distribution

Data Status:
Available
Distribution:
Trusted Research Environments (TREs) are built using Microsoft Azure services and hosted in the UK to provide research teams with a safe, secure and agile environment in which users can quickly analyse, interpret and form an enriched view of primary care information through a range of integrated datasets. Health data collated from multiple sources is ingested into a secure data lake, from which subsets of data are made available to research teams on approval of a data request. Once a request is approved, a customer-specific TRE is made available with a standard set of analytical tools from Microsoft, including Azure Databricks, Azure Machine Learning, Azure SQL and Azure Synapse (for large-scale data warehouses). Specific tools can be provided at an additional cost over the standard platform data access charge, and the PIONEER team will work with you to determine your exact needs. Access to the TRE is managed using virtual desktop technology to provide a safe and secure end-user experience. This design allows PIONEER to create TREs rapidly to service any customer requirement. Data request process: https://www.pioneerdatahub.co.uk/data/data-request-process/

Observations

Name:
Population Type:
Persons
Value:
8885
Description:
8885 spells in this dataset from 01.01.2015 to 31.12.2020
Variable Measured:
Count
Unit Code:
Observation Date:
16 February 2020
Number of Records:
8885
Minimum Typical Age:
15
Maximum Typical Age:
115