CramX Logo
Analysis of Infection Risk in Hospitals: A Multiple Regression Approach Using Stepwise and Subset Regression Methods - Document preview page 1

Analysis of Infection Risk in Hospitals: A Multiple Regression Approach Using Stepwise and Subset Regression Methods - Page 1

Document preview content for Analysis of Infection Risk in Hospitals: A Multiple Regression Approach Using Stepwise and Subset Regression Methods

Analysis of Infection Risk in Hospitals: A Multiple Regression Approach Using Stepwise and Subset Regression Methods

An analysis of infection risk in hospitals using multiple regression methods for risk prediction.

Zoey Taylor
Contributor
4.7
0
12 months ago
Preview (4 of 12 Pages)
100%
Log in to unlock
Page 1 of 4
Analysis of Infection Risk in Hospitals: A Multiple Regression Approach Using Stepwise and Subset Regression Methods - Page 1 preview imageAnalysis of Infection Risk in Hospitals: A Multiple Regression ApproachUsing Stepwise and Subset Regression MethodsFor this assignment use the data setsenic.xlsx.This data set consists of a random sample of 113hospitals. The objective is to study the infection risk and what factors influence it.The variables from thedata set are:Variable NameDescriptionIdentification number1-113Length of stayAverage length of stay in hospital (in days)AgeAverage age of patients (in years)Infection riskAverage estimated probability of acquiring infection in hospital (inpercent)Routing culturing ratioRatio of number of cultures performed tonumber of patientswithout signs or symptoms of pneumonia, times 100Routine chest X-rayratioRatio of number of X-rays performed to number of patients withoutsigns or symptoms of pneumonia, times 100Number of bedsAverage number of beds in hospitalMedical schoolaffiliation0 = Yes, 1 = NoAverage daily censusAverage number of patients in hospital per dayNumber of nursesAverage number of full-time licensed practical nursesAvailable facilities andservicesPercent of 35 potentialfacilities and services that are provided bythe hospitalThe goal is to fit the best multiple regression model to the response (infection risk).Do an analysis usingthefirst 108 observations.Use the stepwise regression method to see which model is the best. Repeat using subset regression. Dothey agree?Are there any outliers in the data? Look forx-outliers,y-outliers, and high-influence points.Come up with one model that you think best describes the data and can be used for future predictions.Showthe residual plot for this one.Does the model seem appropriate?Use this model to predict (using prediction interval)yfor the last 5 observations of the data and see ifthe model is doing well.
Page 2 of 4
Analysis of Infection Risk in Hospitals: A Multiple Regression Approach Using Stepwise and Subset Regression Methods - Page 2 preview image
Page 3 of 4
Analysis of Infection Risk in Hospitals: A Multiple Regression Approach Using Stepwise and Subset Regression Methods - Page 3 preview imageSolution:First we tried to find whether there are any outliers in the data considering allvariables and first 108observations.There are few outliers as shown by the below tables with classification of outliers:Also the same can be seen from the below boxplots:stayageculturingxraybedsschoolcensusnursesfacilitiesriskcount108108108108108108108108108108empirical rulemean - 1s7.813548.6605.43661.95064.180.5144.8734.9228.0452.981mean + 1s11.333057.62725.569100.732432.731.21329.57307.6257.9275.670percent in interval (68.26%)75.9%75.9%78.7%64.8%79.6%86.1%75.9%79.6%72.2%69.4%mean - 2s6.053844.176-4.63042.558-120.100.17-97.48-101.4313.1041.637mean + 2s13.092762.11135.636120.123617.011.56471.93443.9772.8687.015percent in interval (95.44%)97.2%93.5%95.4%95.4%94.4%86.1%95.4%94.4%95.4%93.5%mean - 3s4.294039.692-14.69723.167-304.38-0.18-239.84-237.78-1.8370.292mean + 3s14.852566.59545.702139.514801.281.90614.28580.3187.8098.359percent in interval (99.73%)99.1%99.1%97.2%100.0%98.1%100.0%100.0%98.1%100.0% 100.0%low extremes0000000000low outliers0200000003high outliers2341605603high extremes1010000100579111315171921stayBoxPlot30354045505560657075ageBoxPlot
Page 4 of 4
Analysis of Infection Risk in Hospitals: A Multiple Regression Approach Using Stepwise and Subset Regression Methods - Page 4 preview image010203040506070culturingBoxPlot020406080100120140160180xrayBoxPlot02004006008001000bedsBoxPlot00.20.40.60.811.2schoolBoxPlot
Preview Mode

This document has 12 pages. Sign in to access the full document!