Overview

The Occupational Safety and Health Administration (OSHA) collected work-related injury and illness data from employers within specific industry and employment size specifications from 2002 through 2011. This data collection is called the OSHA Data Initiative or ODI. The data provided is used by OSHA to calculate establishment specific injury and illness incidence rates. This searchable database contains a table with the name, address, industry, and associated Total Case Rate (TCR), Days Away, Restricted, and Transfer (DART) case rate, and the Days Away From Work (DAFWII) case rate for the establishments that provided OSHA with valid data for calendar years 2002 through 2011. This data has been sampled down from its original size to 4%. In addition, the original dataset only has data from a small portion of all private sector establishments in the United States (80,000 out of 7.5 million total establishments). Therefore, these data are not representative of all businesses and general conclusions pertaining to all US business should not be overdrawn. Data quality: While OSHA takes multiple steps to ensure the data collected is accurate, problems and errors invariably exist for a small percentage of establishments. OSHA does not believe the data for the establishments with the highest rates on this file are accurate in absolute terms. Efforts were made during the collection cycle to correct submission errors, however some remain unresolved. It would be a mistake to say establishments with the highest rates on this file are the 'most dangerous' or 'worst' establishments in the Nation. Rate Calculation: An incidence rate of injuries and illnesses is computed from the following formula: (Number of injuries and illnesses X 200,000) / Employee hours worked = Incidence rate. The Total Case Rate includes all cases recorded on the OSHA Form 300 (Column G + Column H + Column I + Column J). The Days Away/Restriced/Transfer includes cases recorded in Column H + Column I. The Days Away includes cases recorded in Column H. For further information on injury and illness incidence rates, please visit the Bureau of Labor Statistics' webpage at http://www.bls.gov/iif/osheval.htm State Participation: Not all state plan states participate in the ODI. The following states did not participate in the 2010 ODI (collection of CY 2009 data), establishment data is not available for these states: Alaska; Oregon; Puerto Rico; South Carolina; Washington; Wyoming.

https://www.osha.gov/pls/odi/establishment_search.html

Explore Structure




Index Type Example Value
0 dict { }
... ... ...
Key Type Example Value Comment
"address" dict { }
"industry" dict { }
"statistics" dict { }
"business" dict { }
"year" int 2002
Key Type Example Value Comment
"name" str "United States Cold Storage"
"second name" str "US Cold"
Key Type Example Value Comment
"city" str "Cherry Hill"
"state" str "NJ"
"street" str "100 Dobbs Ln Ste 102"
"zip" int 8034
Key Type Example Value Comment
"major_group" str "Motor Freight Transportation And Warehousing"
"division" str "Transportation, Communications, Electric, Gas, And Sanitary Services"
"id" int 4222
"label" str "Refrigerated Warehousing and Storage"
Key Type Example Value Comment
"total case rate" float 0.0
"days away" float 0.0
"days away/restricted/transfer" float 0.0

Downloads

Download all of the following files.

Usage

This library has 1 function you can use.
import injuries
list_of_report = injuries.get_reports()
Additionally, some of the functions can return a sample of the Big Data using an extra argument. If you use this sampled Big Data, it may be much faster. When you are sure your code is correct, you can remove the argument to use the full dataset.
import injuries
# These may be slow!
list_of_report = injuries.get_reports(test=True)

Documentation

 injuries.get_reports(test=False)

Returns injury reports for companies in the US.