Overview

From the United States Cancer Statistics as part of the U.S. Center for Disease Control, the following data set focuses on the crude rate for all types of cancer reported for different demograpic groups. Significant groupings include age, gender, race and geographical area.

http://www.cdc.gov/cancer/npcr/uscs/download_data.htm

Downloads

Download all of the following files.

Field Descriptions

JSON Path Type Comment Example Value
[0].Age.Age Adjusted Rate float A number representing the expected cancer rate, adjusted for the age of the participants. An age-adjusted rate is a weighted average of the age-specific rates, where the weights are the proportions of persons in the corresponding age groups of a standard population. The potential confounding effect of age is reduced when comparing age-adjusted rates computed using the same standard population. 165.5
[0].Age.Age Adjusted CI Lower float A number representing the expected lower bound for the cancer rate. It is unlikely that the actual rate is lower than this number. CI means "Confidence Interval". 160.6
[0].Age.Age Adjusted CI Upper float A number representing the expected upper bound for the cancer rate, adjusted for the age of the participants. It is unlikely that the actual rate is higher than this number. CI means "Confidence Interval". 170.5
[0].Age dict {u'Age Adjusted Rate': 165.5, u'Age Adjusted CI Lower': 160.6, u'Age Adjusted CI Upper': 170.5}
[0].Year int The 4-digit year that this report was created for. 1999
[0].Data dict {u'Count': 4366, u'Crude Rate': 190.4, u'Crude CI Upper': 196.1, u'Crude CI Lower': 184.8, u'Sex': u'Female', u'Race': u'All Races', u'Event Type': u'Mortality', u'Population': 2293259}
[0].Area unicode The area of the country (typically the name of the state) for this report. Alabama
[0].Data.Count int The number of incidences of cancer in this particular group. 4366
[0].Data.Crude Rate float The estimated number of people with cancer adjusted by the population. This adjustment makes it easy to compare cancer rates between different locations and over time. 190.4
[0].Data.Crude CI Upper float A number representing the upper bound for the Crude Rate. It is unlikely that the actual rate is higher than this number. 196.1
[0].Data.Crude CI Lower float A number representing the lower bound for the Crude Rate. It is unlikely that the actual rate is lower than this number. 184.8
[0].Data.Sex unicode The gender of people in this particular report. Female
[0].Data.Race unicode The races reported in this particular report. All Races
[0].Data.Event Type unicode The type of event reported here - whether the participants lived or died. Mortality
[0].Data.Population int The number of people present in this report. 2293259