From the CORGIS Dataset Project

By Ryan WhitcombVersion 1, created 6/17/2016

Tags: cancer, death, states, gender, race, population, crude rate

From the United States Cancer Statistics as part of the U.S. Center for Disease Control, the following data set focuses on the crude rate for all types of cancer reported for different demograpic groups. Significant groupings include age, gender, race and geographical area.

*http://www.cdc.gov/cancer/npcr/uscs/download_data.htm*

Download all of the following files.

JSON Path | Type | Comment | Example Value |
---|---|---|---|

[0].Age.Age Adjusted Rate | float | A number representing the expected cancer rate, adjusted for the age of the participants. An age-adjusted rate is a weighted average of the age-specific rates, where the weights are the proportions of persons in the corresponding age groups of a standard population. The potential confounding effect of age is reduced when comparing age-adjusted rates computed using the same standard population. | 165.5 |

[0].Age.Age Adjusted CI Lower | float | A number representing the expected lower bound for the cancer rate. It is unlikely that the actual rate is lower than this number. CI means "Confidence Interval". | 160.6 |

[0].Age.Age Adjusted CI Upper | float | A number representing the expected upper bound for the cancer rate, adjusted for the age of the participants. It is unlikely that the actual rate is higher than this number. CI means "Confidence Interval". | 170.5 |

[0].Age | dict | {u'Age Adjusted Rate': 165.5, u'Age Adjusted CI Lower': 160.6, u'Age Adjusted CI Upper': 170.5} | |

[0].Year | int | The 4-digit year that this report was created for. | 1999 |

[0].Data | dict | {u'Count': 4366, u'Crude Rate': 190.4, u'Crude CI Upper': 196.1, u'Crude CI Lower': 184.8, u'Sex': u'Female', u'Race': u'All Races', u'Event Type': u'Mortality', u'Population': 2293259} | |

[0].Area | unicode | The area of the country (typically the name of the state) for this report. | Alabama |

[0].Data.Count | int | The number of incidences of cancer in this particular group. | 4366 |

[0].Data.Crude Rate | float | The estimated number of people with cancer adjusted by the population. This adjustment makes it easy to compare cancer rates between different locations and over time. | 190.4 |

[0].Data.Crude CI Upper | float | A number representing the upper bound for the Crude Rate. It is unlikely that the actual rate is higher than this number. | 196.1 |

[0].Data.Crude CI Lower | float | A number representing the lower bound for the Crude Rate. It is unlikely that the actual rate is lower than this number. | 184.8 |

[0].Data.Sex | unicode | The gender of people in this particular report. | Female |

[0].Data.Race | unicode | The races reported in this particular report. | All Races |

[0].Data.Event Type | unicode | The type of event reported here - whether the participants lived or died. | Mortality |

[0].Data.Population | int | The number of people present in this report. | 2293259 |