Story Name: Smoking and Cancer


Abstract: Government statisticians in England conducted a study of the
relationship between smoking and lung cancer. The data concern 25 occupational
groups and are condensed from data on thousands of individual men. The
explanatory variable is the number of cigarettes smoked per day by men in each
occupation relative to the number smoked by all men of the same age. This smoking
ratio is 100 if men in an occupation are exactly average in their smoking, it is
below 100 if they smoke less than average, and above 100 if they smoke more than
average. The response variable is the standardized mortality ratio for deaths
from lung cancer. It is also measured relative to the entire population of men of
the same ages as those studied, and is greater or less than 100 when there are
more or fewer deaths from lung cancer than would be expected based on the
experience of all English men.

A scatterplot of the data shows a moderately strong linear association, with a
correlation coefficient of 0.716. Residuals from a regression of mortality on
smoking are randomly scattered with no outliers or influential observations.

Reference: Moore, David S., and George P. McCabe (1989). Introduction to the
Practice of Statistics. Original source: Occupational Mortality: The Registrar
General's Decennial Supplement for England and Wales, 1970-1972, Her Majesty's
Stationery Office, London, 1978. Authorization: Description: Data summarizes a
study of men in 25 occupational groups in England. Two indices are presented for
each occupational group. The smoking index is the ratio of the average number of
cigarettes smoked per day by men in the particular occupational group to the
average number of cigarettes smoked per day by all men. The mortality index is
the ratio of the rate of deaths from lung cancer among men in the particular
occupational group to the rate of deaths from lung cancer among all men. Number
of cases: 25 Variable Names:

1.Occupational_Group: Occupational Group
2.Smoking: Smoking index (100 = average)
3.Mortality: Lung cancer mortality index (100 = average)

The Data:

Occupational_Group Smoking Mortality
Farmers, foresters, and fisherman 77 84
Miners and quarrymen 137 116
Gas, coke and chemical makers 117 123
Glass and ceramics makers 94 128
Furnace, forge, foundry, and rolling mill workers 116 155
Electrical and electronics workers 102 101
Engineering and allied trades 111 118
Woodworkers 93 113
Leather workers 88 104
Textile workers 102 88
Clothing workers 91 104
Food, drink, and tobacco workers 104 129
Paper and printing workers 107 86
Makers of other products 112 96
Construction workers 113 144
Painters and decorators 110 139
Drivers of stationary engines, cranes, etc. 125 113
Laborers not included elsewhere 133 146
Transport and communications workers 115 128
Warehousemen, storekeepers, packers, and bottlers 105 115
Clerical workers 87 79
Sales workers 91 85
Service, sport, and recreation workers 100 120
Administrators and managers 76 60
Professionals, technical workers, and artists 66 51