Regression Critiques
Banker, R. J., Field, M. M, Schroeder, R. G., & Sinha, K. K. (1996). Impact of work teams on manufacturing performance: A longitudinal field study. Academy of Management Journal, 39, 867-890.
Setting and research question. They followed the productivity of four production lines in an electromechanical assembly plant over time. They wanted to know if the implementation of teams increased productivity.
DVs. The dependent variables were quality and labor productivity. Quality was measured by a manufacturing defect rate (inspection). There were 84 weeks' data points. Labor productivity was measured by the ratio of units produced to total production hours. Data were available for 21 months.
IVs. The main independent variable was time since the implementation of teams. It was measured in weeks for quality and months for productivity. There were also control variables that were measured and used to help rule out alternative explanations of the results. Variables included time before the implementation, workforce policies, and policies affecting confusion in the factors. Workforce policy variables were overtime, headcount additions and headcount deletions. Confusion variables were product diversity, capacity utilization (both overutilization and underutilizaiton), and engineering change orders. For one of the production lines, an experiment was conducted for engineering purposes during the study (not related to the study purpose). A variable called "adhesive experiment period on the gear train line" was used to identify the duration of this experiment. Because there were four different lines, each line was identified using dummy coding.
Analyses. The authors ran two different models: a fixed-effects model and a seemingly unrelated regressions (SUR) model. The fixed-effects model is a regression model that shows the effects of changes in time for all four lines. All four lines are expected to show identical slopes for time (and the control variables), but are allowed to have different intercepts. In the SUR model, each line is analyzed independently, so that both intercepts and the slope for time (as well as the effects of the control variables) are allowed to vary among lines. The SUR model is less powerful because there are fewer observations (1/4 as many) for the SUR model than the fixed-effects model. However, the SUR model allows a check on the similarity of the lines to one another in terms of the models (the "robustness" of the fixed-effects model).
Diagnostics. The authors report checking for collinearity and completing a residuals analysis. They used data transformations to reduce the time-series autocorrelations. They reported that collinearity was not a problem according to their diagnostics. They also worried about heteroscedasticity. They reported problems of heteroscedasticity and nonlinearity of the residuals, and transformations to correct the problems. They deleted outliers (based on residuals) from the analysis and reran the analyses. Because there were no differences in the interpretations of the results, they only report results based on the full dataset. The data are summarized in a table (Table 1) that shows all the correlations, means and standard deviations. Regression results for both the fixed-effects and SUR models for both quality and quantity are presented in tables.
Critique.
Overall. This is an excellent article in terms of the use of regression. I'd give them an "A." They were very sensitive about modeling issues (the use of various diagnostics, transformations, control variables, and the comparison of fixed-effects and SUR models). The models that they employed were appropriate for the data and research question. The main drawback is that they really had a very small sample for the production data. We do not study time series regression in this class because it is used so infrequently in psychology. I included this article so that you could see what a thorough job looks like and how they wrote it up.
Keller, R. T. (1994). Technology-information processing fit and the performance of R&D project groups: A test of contingency theory. Academy of Management Journal, 37, 167-179.
Setting and research question. Research and development (R&D) project teams from four industrial organizations were surveyed (total N=683 people). The main hypothesis was that fit between the teams' information processing (communication process) and task characteristics would predict team performance. Team with a good match between the type of task and the amount of information processing were expected to perform better than teams with a lesser match.
IVs. A ten-item scale was used to measure perceptions of the task. Five items measured Nonroutinesss of the task; the other five measured Unanalyzability of the task. A survey was also used to measure the Information Processing of each team. Fit was defined by taking the absolute value of the difference between the score on information processing and each of the other two scales. First individual scores were averaged to create a team score for each scale. Then the team scores were standardized to have a mean of zero and a standard deviation of 1.0 across teams. Finally the absolute value of the difference between the information processing variable and each of the task variables was computed, yielding a total of five IVs.
Dvs. Project group performance was measured by managerial ratings of five criteria. Each of the criteria ( technical quality, budget and cost performance, schedule adherence, value to the company, overall performance) were rated using a five-point scale. Ratings were taken at two different times, approximately one year apart. A panel of managers was used to make each of the ratings. A factor analysis of the ratings cause the researchers to lump the Dvs into two new variables, project quality and budget-schedule performance. Thus, there were four dependent variables, two at each time.
Analyses and Results. The correlations and descriptive statistics for all variables were reported (Table 1). The authors note that the maximum VIF was 3, and that therefore collinearity did not appear to be a problem. The results include standardized regression weights (betas) for the full model using simultaneous regression. The R-square value from some hierarchical regressions are also reported to determine the unique variance of some variables. The authors report that the fit between nonroutineness and information processing was a good predictor of project quality, but not budget-schedule performance. The fit of unanalyzability and information processing did not predict either performance variable. Neither did the task variables by themselves. The authors concluded that the hypothesis that the fit of nonroutineness and information processing predicted performance was supported but that the hypothesis that the fit between unanalyzability and performance was not supported.
Critique.
Overall, this is okay but not outstanding. I'd give it a "B." The sample size is marginal, I'm not convinced that the measurement of fit is the best way to go, and they didn't look at the fit of the model to the data (residuals: linearity, homoscedasticity, outliers).
Kinsella, G., Ong, B., Murtah, D., Prior, M. & Sawyer, M. (1999). The role of the family for behavioral outcome in children and adolescents following traumatic brain injury. Journal of Consulting and Clinical Psychology, 67, 116-123.
Setting and research question. Children and adolescents with traumatic brain injury (TBI) were assessed at three months, one year and two years post injury for behavioral problems. The research question was whether family characteristics predicted the severity of behavioral problems following TBI.
IVs. Parents completed a standardized behavioral problem checklist that indicated behavioral problems in the child or adolescent prior to the TBI. This was done prior to the assessment of the severity of the current injury to the child. The severity of the injury was coded into either mild, moderate, or severe using medical convention. They coded family status variables including the age of the child, the socio-economic status (SES) of the family, and whether the primary caretaker lived alone or with a partner. They coded family environment variables for well being of the primary care parent and overall pathology of the family (obtained by self-report surveys).
Dvs. Parents completed the same behavior problem checklist that they used for pre-injury status at 3 months, one year, and two years after the injury.
Analyses and Results. The correlations and descriptive statistics for all variables were not reported. The authors note that the sample size is small and report problems with missing data. Regarding missing data, they reported that "pairwise exclusion of missing data was used in the regression analyses." They concluded that "Multicollinearity among predictors was ruled out, given that there were no significant correlations between any two predictor variables within the same regression model." The hierarchical regression analyses proceeded in blocks. The authors' explanation for their procedure was that they were worried about adjusting the significance level for multiple predictors but also about the small sample size. The analyses are presented in three parts of the same table, one analysis for each time (3 months, one year, two years). The authors describe the sequence of steps in each block. For example, they note that "In step 2, prediction of residual behavior problems scores by severity was not significant..." The authors note that the significance of the predictors changes over time and discuss the findings as showing that the relations between problem behavior and the independent variables change over time following the injury.
Critique.
Overall, this is bad. I'd give it a "C - or D." The sample size is too small, although just as in the teams/ groups literature, it's hard to get large numbers in this type of research. However, the reporting of the results is just inadequate. Further, they do not really know what they are doing from a statistical standpoint, so the analyses and results do not really support the conclusions that the authors want to make.