Data Dictionary

The PanCAN Know Your Tumor Data Dictionary (the embedded excel file) is a guide to understanding the details of each csv file, the column (i.e field) names and the renaming of filterable vs downloaded field names.

šŸ“˜

Note:

Some fields that have PanCAN Field Name’s are marked as non-filterable – in these cases you should ignore the PanCAN Field Name as it is only there for possible future enhancements to the set of filterable data.

It is divided by each csv file component (ex. adverse events, assessments, etc) and contains the following columns:

a. Table Name – this is the name of the csv file which will contain the data for which you are downloading. Example, when downloading ā€˜adverse events’ for a specific defined cohort the download file will be named ā€˜adverse_events.csv’
b. Column Name – this is the column name/header for each column of the referenced csv file. Example, when downloading ā€˜adverse events’ in the file named ā€˜adverse_events.csv’ you will see columns named patient_id, care_plan_id, adverse_event_id…..
c. PanCAN Field Name – This the name of the field which, if renamed, you will when filtering cohorts. If this field is blank it means that the ā€˜CSV Column Name’ will be shown when filtering. Example, when filtering on comorbidity information, you would see ā€˜Comorbidity Name’ instead of ā€˜comobidity_concept_canonical_name’.
d. Filterable – as previously note, for each of use and understanding not every field is available as a filter. This column indicates which are (denoted with a Y) which are not (denoted with a ā€œNā€ or blank) available for a cohort filter. As an example, the ā€˜Histopathological Grade Description’ is available to filter however ā€˜observationb_concept_system’ of the histopath file is not.
e. Data Type – the general characteristic if the data type (ex. character, numeric).
f. Description – a basic description of the data field as provided by Tempus.