Cookies help us deliver our services. By using our services, you agree to our use of cookies. Learn more

The Get Data Out Programme > Data

In order to fulfil its duty as a public health agency responsible for cancer prevention and control in England, the National Cancer Registration and Analysis Service of Public Health England is expected to produce evidence about cancer incidence, diagnosis, treatment and survival. We fulfil this critical function with a range of outputs, including official statistics, reports, and support for public health research on cancer.

As part of this broader mission, the Get Data Out (GDO) programme has produced key cancer statistics for small groups of patients; these outputs are meant for use by patients, the public and any general user, and anonymisation standards are designed in to these outputs by aggregation at the outset.

Get Data Out (GDO) tables are currently produced for four statistical areas (incidence, treatment, survival and routes to diagnosis) and for four cancer tumour groups:

  • Brain, meningeal and other primary CNS tumours
  • Ovary, fallopian tube and primary peritoneal carcinomas
  • Pancreatic cancers
  • Testicular tumours including post-pubertal teratomas
We are working to expand this output in the near future.


Download Get Data Out (GDO) tables

Download Description
GDO_data_wide.csv A collated .csv file of all the latest statistics in wide format (one row for each group of patients, with all statistics for that group as columns)
GDO_data_thin.csv A collated .csv file of all the latest statistics in thin format (one row for each statistic, with each group of patients having many rows of data)
GDO_data.json
[coming soon]
A collated .json file of all the latest statistics [coming soon]
GDO_metadata.csv A .csv file with the metadata for each statistic including full descriptions and units
GDO_releases.csv A .csv file listing the releases that make up the Get Data Out table, including release dates and lists of documentation
GDO_releases.json
[coming soon]
A .json file listing the releases that make up the Get Data Out table, including release dates and lists of documentation [coming soon]
GDO_structure.csv A .csv file defining the tree structure of the partition
GDO_units.csv A .csv file with the metadata for each unit
GDO_missing.csv A .csv file containing the look ups for the missing data codes

The Get Data Out tables were last updated on 2019-02-18. A list of all data releases is available at the bottom of the page.

Using the Data

The data can be downloaded from the links above. Alternatively tools and webpages can be pointed directly to the data at our static URLs. Click here for more detail for developers about the data structures and accessing the data.

The data is signed off as non-disclosive and is released under an Open Government Licence. You are free to copy, publish, distribute and transmit the information, and to adapt it and include it in your own products. The attribution statement that must be included with any reuse of the data is:

Data for this [study/ project/ report/tool] is based on patient-level information collected by the NHS, as part of the care and support of cancer patients. The data is collated, maintained and quality assured by the National Cancer Registration and Analysis Service, which is part of Public Health England (PHE). The data is taken from the Get Data Out tables.

Understanding the Get Data Out groupings

The Get Data Out programme partitions diagnoses of cancer into many small groups, where each group contains approximately 100 people with the same characteristics.

The grouping process can be imagined as a rooted branching tree, where the first node is the group 'all tumours', and each branch point divides by a dimension of interest (e.g. age, region, sex). If a node contains too few tumours then it cannot be divided further without making groups of less than 100, and so the tree terminates there. If the node has enough tumours it branches again by the next dimension of interest. We have visualised the trees for each tumour type, and you can view them on the pages dedicated to each tumour type.

Currently the Get Data Out programme has grouped four tumour types. These groupings are explained in more detail in the documents below:

Statistics available in the Get Data Out table

There are currently four statistical releases available in the Get Data Out table.

Incidence. Statistics are provided on the number of new tumours diagnosed in each group and the incidence rate of cancer in this group with upper and lower confidence intervals.

Treatment. Statistics are provided on the number of tumours treated with surgery, chemotherapy, radiotherapy and all combinations of these treatments in each group, the % of tumours treated, and the upper and lower confidence intervals around the percentage.

Survival. Statistics are provided on the number of tumours included in the survival calculation and the net and crude survival rates in each group at 3, 6, 9, 12, 24, 36 and 48 months after diagnosis, with upper and lower confidence intervals.

Routes to Diagnosis. Statistics are provided on the number of tumours diagnosed by each 'route to diagnosis' and the % of tumours diagnosed by each route with the upper and lower confidence intervals. The eight standard diagnostic routes - two week wait; GP referral; screening; other outpatient; inpatient elective; emergency presentation; death certificate only and unknown - are provided, along with a 'not classified' group. Please visit: http://ncin.org.uk/publications/routes_to_diagnosis to find out more.

Contact Us

If you have feedback on this pilot, or any other queries about the Get Data Out tables, please email us here. It will help us to get your query to the right people if you mention 'Get Data Out' in your email.

Data Releases

All the data in the tables below has been incorporated into the full Get Data Out table found above in the first table. The data is also provided separately for each release of each statistic.
Dataset Date released Release ID Wide csv Thin csv Json Metadata Documentation
Brain, meningeal and other primary CNS tumours, Ovary, fallopian tube and primary peritoneal carcinomas, Pancreas and Testicular tumours including post-pubertal teratomas, survival, 2013-2016 2019-02-18 GDO_0011 GDO_brain_surv_13-16_w.csv, GDO_ova_surv_13-16_w.csv, GDO_pan_surv_13-16_w.csv, GDO_test_surv_13-16_w.csv [coming soon] [coming soon] GDO_surv_13-16_m.csv GDO_survival_SOP.docx, Cancer Survival SOP v11_0.docx
Brain, Ovary, Pancreas and Testis, Routes to Diagnosis, 2013-2016 2018-12-12 SOT0010 SOT_brain_RtD_13-16_w.csv, SOT_ova_RtD_13-16_w.csv, SOT_pan_RtD_13-16_w.csv, SOT_test_RtD_13-16_w.csv [coming soon] [coming soon] SOT_RtD_13-16_m.csv GDO SOP - RtD v1.1.doc, Routes_to_Diagnosis_2006_2015_technical_document.pdf, Routes to Diagnosis for cancer - Elliss-Brookes et al.pdf
Brain, Ovary, Pancreas and Testis, treatments, 2013-2015 2018-12-12 SOT0009 SOT_brain_treat_13-15_w.csv, SOT_ova_treat_13-15_w.csv, SOT_pan_treat_13-15_w.csv, SOT_test_treat_13-15_w.csv [coming soon] [coming soon] SOT_treat_13-15_m.csv SOT SOP - Treatment_251018.doc, CAS-SOP_#4.4_linking_treatment_tables.pdf
Ovary, Pancreas and Testis, incidence rates, 2013-2016 2018-12-12 SOT0008 SOT_ova_Inc_13-16_w.csv, SOT_pan_Inc_13-16_w.csv, SOT_test_Inc_13-16_w.csv [coming soon] [coming soon] SOT_Inc_13-16_m.csv SOT SOP - Incidence v2.0.doc, CASSOP #5 - Crude incidence rates 1.1.docx, CASSOP #5 - Crude incidence rates 1.1.xlsx, CASSOP #1 Counting Cases.docx, GDO_ova_cohort_13-16_sql.txt, SOT_ova_Inc_13-16_sql.txt, SOT_pan_Inc_13-16_sql.txt, SOT_test_Inc_13-16_sql.txt
Brain, incidence rates, 2013-2016 2018-05-08 SOT0007 SOT_brain_Inc_13-16_w.csv [coming soon] [coming soon] SOT_brain_Inc_13-16_m.csv SOT SOP - Incidence 2016 v1.0.doc, CASSOP #5 - Crude incidence rates 1.0.docx, CASSOP #5 - Crude incidence rates 1.0.xlsx, CASSOP #1_Counting_cancer_cases.pdf

Note: Standard Output Table (SOT) is the previous name used for Get Data Out (GDO) releases.