injuries icon

Injuries Python Library

From the CORGIS Dataset Project

By Austin Cory Bart acbart@vt.edu
Version 2.0.0, created 3/22/2016
Tags: osha, united states, us, usa, government, health, safety, injury, injuries, hurt, pain, job, work, industry, career, occuptation, safety, hazard, health

Overview

The Occupational Safety and Health Administration (OSHA) collected work-related injury and illness data from employers within specific industry and employment size specifications from 2002 through 2011. This data collection is called the OSHA Data Initiative or ODI. The data provided is used by OSHA to calculate establishment specific injury and illness incidence rates. This searchable database contains a table with the name, address, industry, and associated Total Case Rate (TCR), Days Away, Restricted, and Transfer (DART) case rate, and the Days Away From Work (DAFWII) case rate for the establishments that provided OSHA with valid data for calendar years 2002 through 2011. This data has been sampled down from its original size to 4%. In addition, the original dataset only has data from a small portion of all private sector establishments in the United States (80,000 out of 7.5 million total establishments). Therefore, these data are not representative of all businesses and general conclusions pertaining to all US business should not be overdrawn. Data quality: While OSHA takes multiple steps to ensure the data collected is accurate, problems and errors invariably exist for a small percentage of establishments. OSHA does not believe the data for the establishments with the highest rates on this file are accurate in absolute terms. Efforts were made during the collection cycle to correct submission errors, however some remain unresolved. It would be a mistake to say establishments with the highest rates on this file are the ‘most dangerous’ or ‘worst’ establishments in the Nation. Rate Calculation: An incidence rate of injuries and illnesses is computed from the following formula: (Number of injuries and illnesses X 200,000) / Employee hours worked = Incidence rate. The Total Case Rate includes all cases recorded on the OSHA Form 300 (Column G + Column H + Column I + Column J). The Days Away/Restriced/Transfer includes cases recorded in Column H + Column I. The Days Away includes cases recorded in Column H. For further information on injury and illness incidence rates, please visit the Bureau of Labor Statistics’ webpage at http://www.bls.gov/iif/osheval.htm State Participation: Not all state plan states participate in the ODI. The following states did not participate in the 2010 ODI (collection of CY 2009 data), establishment data is not available for these states: Alaska; Oregon; Puerto Rico; South Carolina; Washington; Wyoming.

https://www.osha.gov/pls/odi/establishment_search.html

Explore Structure

Each row represents $MISSING_FIELD.

Index Type Example Value
0 dict { }
1 dict (same structure)
2 dict (same structure)
... ... ...
Key Type Example Value Description
"year" int 2002 $MISSING_FIELD
"address" dict { }
"business" dict { }
"industry" dict { }
"statistics" dict { }
Key Type Example Value Description
"city" str "Cherry Hill" $MISSING_FIELD
"state" str "NJ" $MISSING_FIELD
"street" str "100 Dobbs Ln Ste 102" $MISSING_FIELD
"zip" int 8034 $MISSING_FIELD
Key Type Example Value Description
"name" str "United States Cold Storage" $MISSING_FIELD
"second name" str "US Cold" $MISSING_FIELD
Key Type Example Value Description
"division" str "Transportation, Communications, Electric, Gas, And Sanitary Services" $MISSING_FIELD
"id" int 4222 $MISSING_FIELD
"label" str "Refrigerated Warehousing and Storage" $MISSING_FIELD
"major_group" str "Motor Freight Transportation And Warehousing" $MISSING_FIELD
Key Type Example Value Description
"days away" float 0.0 $MISSING_FIELD
"days away/restricted/transfer" float 0.0 $MISSING_FIELD
"total case rate" float 0.0 $MISSING_FIELD

Downloads

Download all of the following files.

  1. injuries.py
  2. injuries.data

Usage

import injuries
report = injuries.get_report()

Documentation

get_report()
Returns a list of dictionaries representing report.