Document Jv87g6KmXK5oEBko5pwE0mBr

39 ff UNITED STATES ENVIRONMENTAL PROTECTION AGENCY WASHINGTON, D.C. 20460 1133 OFFICE OF PREVENTION, PESTICIDES AND TOXIC SUBSTANCES 21 October 2002 MEMORANDUM: SUBJECT: Focussed statistical analyses of APFO data on >and preweaning loss g?* FROM: Elizabeth H. Margosches, Ph.D., Statistician Existing Chemicals Assessment Branch J . Risk Assessment Division (7403M) / CD CD -rn TO " O O cn *~4fn TO: Katherine Anitole, Ph.D., Toxicologist Existing Chemicals Assessment Branch Risk Assessment Division (7403M) *TJ CD fK> cn no On 30 August 2002 several members of industry met with EPA staff to discuss results seen in the study by York (2002) and some other studies that might have bearing on them. It was agreed that certain additional analyses would be carried out. York (2002) reported there were no statistically significant differences among the treatment groups in numbers of dams delivering litters, the duration of gestation, averages for implantation sites per delivered litter, gestation index, numbers of dams with stillborn pups, dams with all pups dying, liveborn and stillborn pups, viability index, pup sex ratios and body weights for either the FO or FI parental generations and their litters. According to p. 45 of the report, these analyses were carried out using analysis of variance or, if the group variances were not comparable, the Kmskal-Wallis test (nonparametric), or, for count data, a chi-square for homogeneity. The additional analyses related to an index of prenatal mortality (taken as (implantations minus liveborn)/implantations) and one of preweaning mortality (taken as (liveborn minus those alive on day 22)/liveborn). These quantities are proportions within each litter and the variance of a sample proportion is related to the true proportion. When the denominators are nearly equal, the analyses can be carried out using data transformed in a standard fashion, using arcsin(.)l/2(see Gaylor, 1978). Analysis of variance of the numbers of implantations, and those numbers properly transformed for being counts ((i)'/2+(i+l)14) assured that this condition was pretty well fulfilled. To illustrate the effect of the transformation of the proportions, plots and analyses are provided both with and without the transformation, although the untransformed statistics should not be used. All analyses were implemented using the computer program Statistix (Statistix 7, Analytical Software, 2000). In light of the other tests already carried out by York (2002), the high dose prenatal mortality index was statistically compared only to the control. Plots of the index for control and four positive doses (p. 1 of the 36 analysis pages) showed that, in both F0 (red squares) and FI (blue circles), the treatments did not appear to differ. The comparison was carried out using Student's t-test. No difference was seen between high dose and control (p. 3). The preweaning mortality index was examined across all dose groups, within generation. Here again a plot of the index across all treatments (p. 12)suggested no differences. This comparison was carried out using Analysis of Variance (p. 20 for P or F0, p. 26 for FI), followed by a Scheffe comparison of the individual treatment means (p. 21 and p. 27, respectively). Using the Scheffe basis permitted all contrasts of means to be considered in follow-up comparisons while controlling for an overall error rate. While the order o f treatment-group means appeared to differ between F0 and F I, in neither case were there any significant pairwise differences between the dose groups and the responses did not have any trend in dose. Gaylor, D.W. 1978. Methods and Concepts of Biometrics Applied to Teratology. In: J.G. Wilson and F.C. Fraser, ed., Handbook of Teratology. Voi. 4. New York: Plenum Press, pp. 429-444. Statistix Version 7.0 [computer software]. Analytical Software. 1985, 2000. cc: J. Seed TRPREB(arcsin(sqrt( 1-live/implant)) Data (proportions lost prior to birth) transformed to stabilize variances Scatter Plot of TRPREB vs DOSE oo o e 8 1! 81 i oa 0 o 6 O 1s a 12 DOSE 18 24 O GEN O F1 8 P 8 O I a 30 Scatter Plot of PREBIRTH vs DOSE NI3 I GEN O F1 P DOSE STATISTIX 7 .0 TWO-SAMPLE T TESTS FOR TRPREB BY DOSE CONT-HIGHONLY, 10/15/02, 4:03:39 PM P DOSE MEAN SAMPLE SIZE S. D. S.E. 0 30 DIFFERENCE 0.2020 0.2825 -0.0805 28 29 0.1941 0.2945 0.0367 0.0547 NULL HYPOTHESIS: DIFFERENCE = 0 ALTERNATIVE HYP: DIFFERENCE <> 0 ASSUMPTION T DF P 951 Cl FOR DIFFERENCE EQUAL VARIANCES UNEQUAL VARIANCES -1.21 -1.22 55 0.2299 48.6 0.2273 (-0.2134, 0.0524) (-0.2129, 0.0518) TESTS FOR EQUALITY OF VARIANCES F 2.30 NUM DF 28 DEN DF 27 P 0.0166 CASES INCLUDED 57 MISSING CASES 3 TRPREB = transformed (loss prior to birth) In parents (p. 3) and FI (p. 8), control and high dose groups not significantly different under assumption of unequal variances (or equal ones). The test for variance equality is sensitive to normality assumptions, and these data are not normally distributed. The indication the variances are unequal may not be correct for P (see the box and whiskers plot p. 4) although it's likely for FI (see p. 9) -3- TRPREB Box and Whisker Plot P 0 30 DOSE 57 cases 3 missing cases -4- The Box and Whisker Plot procedure computes box plots that graphically present measurements o f central tendency and variability. A series o f box plots can be displayed side by side, which can dramatically illustrate differences between groups. Each box plot is composed o f a box and two whiskers. The box encloses the middle half of the data. The box is bisected by a line at the value for the median. The vertical lines at the top and the bottom o f the box are called the whiskers, and they indicate the range o f "typical" data values. Whiskers always end at the value o f an actual data point and can't be longer than 114 times the size of the box. Extreme values are displayed as for possible outliers and "O" for probable outliers. Possible outliers are values that are outside the box boundaries by more than 114 times the size o f the box. Probable outliers are values that are outside the box boundaries by more than 3 times the size of the box. Copyright 2000 Analytical Software -5- STATISTIX 7.0 TWO-SAMPLE T TESTS FOR PREBIRTH BY DOSE 10/15/02, 3:34:53 PM P DOSE MEAN SAMPLE SIZE S.D. S.E. 0 30 DIFFERENCE 0.0727 0.1274 -0.0547 28 29 0.0826 0.1942 0.0156 0.0361 NULL HYPOTHESIS: DIFFERENCE = 0 ALTERNATIVE HYP: DIFFERENCE <> 0 ASSUMPTION T DF P 95% Cl FOR DIFFERENCE EQUAL VARIANCES UNEQUAL VARIANCES -1.38 -1.39 55 0.1747 38.1 0.1719 (-0.1344, 0.0250) (-0.1342, 0.0248) TESTS FOR EQUALITY OF VARIANCES F -- 5.52 NUM DF -- 28 DEN DF -- 27 P -- 0.0000 CASES INCLUDED 57 MISSING CASES 3 PREBIRTH = loss prior to birth -6- PREBIRTH Box and Whisker Plot P 0 30 DOSE 57 cases 3 missing cases -7- STATISTIX 7.0 TWO-SAMPLE T TESTS FOR TRPREB BY DOSE CONT-HIGHONLY, 10/15/02, 4:05:52 PM R DOSE MEAN SAMPLE SIZE S.D. S.E. 0 30 DIFFERENCE 0.2497 0.2521 -2.41E-03 28 29 0.1483 0.2223 0.0281 0.0413 NOLL HYPOTHESIS: DIFFERENCE = 0 ALTERNATIVE HYP: DIFFERENCE <> 0 ASSOMPTION T DF P 95% CI FOR DIFFERENCE EQUAL VARIANCES UNEQUAL VARIANCES -0.05 -0.05 55 0.9619 49.1 0.9616 (-0.1032, 0.0984) (-0.1028, 0.0980} TESTS FOR EQUALITY OF VARIANCES F -- 2.23 NUM DF -- 28 DEN DF -- 27 P -- 0.0201 CASES INCLUDED 57 MISSING CASES 3 -8- TRPREB Box and Whisker Plot & 1. 0 - 0.8 0.6 0.4 0.2 0.0 0 DOSE 57 cases 3 missing cases -9- STATISTIX 7.0 TWO-SAMPLE T TESTS FOR PREBIRTH BY DOSE 10/15/02, 3:27:53 PM Fl DOSE MEAN SAMPLE SIZE S.D. S.E. 0 30 DIFFERENCE 0.0801 0.0998 -0.0197 28 29 0.0600 0.1258 0.0113 0.0234 NULL HYPOTHESIS: DIFFERENCE = 0 ALTERNATIVE HYP: DIFFERENCE <> 0 ASSUMPTION T DF P 95% CI FOR DIFFERENCE EQUAL VARIANCES UNEQUAL VARIANCES -0.75 -0.76 55 0.4563 40.4 0.4524 (-0.0723, 0.0329) (-0.0722, 0.0328) TESTS FOR EQUALITY OF VARIANCES F ------ 4.39 NUM DF -- 28 DEN DF -- 27 P -- 0.0001 CASES INCLUDED 57 MISSING CASES 3 -10- PREBIRTH Box and Whisker Plot Pi 0.8 0.6 - 0.4- 0. 2 - 0.0- 30 DOSE 57 cases 3 missing cases -ll- T R P R E W (arcsin(sqrt(1-d ay22 /live)) Data (proportions of live births lost prior to weaning) transformed to stabilize variances Scatter Plot of TRPREW vs DOSE GEN O F1 P TRPREW 1.6 o 1.2 0.8 0.4 0.0 0 Box and Whisker Plot o o 1 3 10 DOSE 283 cases 16 missing cases 30 p. 12 and pp. 13-15 illustrate two ways of looking at the transformed preweaning loss, to clarify the conclusions of the analysis of variance on pp. 20 and 26. -13- TRPREW Box and Whisker Plot 0 1 3 10 30 DOSE 142 cases Parent(FO) 8 missing cases -14- TRPREW Box and Whisker Plot 0 1 3 10 30 DOSE 141 casesFl8 missing cases -15- Scatter Plot of PREWEAN vs DOSE GEN O F1 P P R E W E A N (1-day22/live) DOSE PREWEAN 1.2 o 0.8 0.4 0.0 0 Box and Whisker Plot o 0 * 0* o * 1 3 10 DOSE 283 cases F0&F1 16 missing cases o o o * 30 -17- PREWEAN Box and Whisker Plot 0 1 3 10 30 DOSE 142 cases Parent(FO) 8 missing cases -18- PREWEAN 0.35 0.28 0.21 0.14 0.07 0.00 Box and Whisker Plot 3 DOSE 141 casesFi8 missing cases 10 30 -19 STATISTIX 7.0 ONE-WAY AOV FOR TRPREW BY DOSE SOURCE DF SS MS F P BETWEEN 4 WITHIN 137 TOTAL 141 0.31524 9.85073 10.1660 0.07881 0.07190 1.10 0.3611 CHI-SQ DF BARTLETT'S TEST OF ------ ---- EQUAL VARIANCES 32.85 4 P 0.0000 COCHRAN'S Q LARGEST VAR / SMALLEST VAR 0.3314 6.0991 COMPONENT OF VARIANCE FOR BETWEEN GROUPS 2.433E-04 EFFECTIVE CELL SIZE 28.4 DOSE MEAN SAMPLE SIZE GROUP STD DEV 0 1 3 10 30 TOTAL 0.1378 0.1532 0.1162 0.0661 0.2089 0.1362 28 27 29 29 29 142 0.3110 0.3150 0.1632 0.1403 0.3465 0.2681 CASES INCLUDED 142 MISSING CASES 8 10/16/02, 3:23:20 PM r? -20- STATISTIX 7.0 .. SCHEFFE COMPARISON OF MEANS OF TRPREW BY DOSE 10/16/02, 3:21:29 PM F DOSE MEAN HOMOGENEOUS GROUPS 30 0.2089 I 1 0.1532 I 0 0.1378 I 3 0.1162 I 10 0.0661 I THERE ARE NO SIGNIFICANT PAIRWISE DIFFERENCES AMONG THE MEANS. CRITICAL F VALUE 2.438 REJECTION LEVEL 0.050 STANDARD ERRORS AND CRITICAL VALUES OF DIFFERENCES VARY BETWEEN COMPARISONS BECAUSE OF UNEQUAL SAMPLE SIZES. -21- Residuals One-Way AOV Residual Plot Using dose alone somewhat underestimates the pattern of preweaning loss. -22- STATISTIX 7.0 ONE-WAY AOV FOR PREWEAN BY DOSE ALLDOSESD22, 10/17/02, 8:51:04 AM r SOURCE DF SS MS F P BETWEEN 4 WITHIN 137 TOTAL 141 0.10272 3.38109 3.48380 0.02568 0.02468 1.04 0.3887 CHI-SQ DF BARTLETT 'S TEST OF ----- -- EQUAL VARIANCES 70.93 4 P -- 0.0000 COCHRAN'S Q LARGEST VAR / SMALLEST VAR 0.3627 15.017 COMPONENT OF VARIANCE FOR BETWEEN GROUPS 3.520E-05 EFFECTIVE CELL SIZE 28.4 DOSE MEAN SAMPLE SIZE GROUP STD DEV 0 1 3 10 30 TOTAL 0.0591 0.0638 0.0372 0.0221 0.1005 0.0564 28 27 29 29 29 142 0.1889 0.1914 0.0641 0.0549 0.2126 0.1571 CASES INCLUDED 142 MISSING CASES 8 -23- STATISTIX 7.0 ALLDOSESD22, 10/17/02, 8:52:43 AM SCHEFFE COMPARISON OF MEANS OF PREWEAN BY DOSE P DOSE MEAN HOMOGENEOUS GROUPS 30 0.1005 I 1 0.0638 I 0 0.0591 I 3 0.0372 I 10 0.0221 I THERE ARE NO SIGNIFICANT PAIRWISE DIFFERENCES AMONG THE MEANS. CRITICAL F VALUE 2.438 REJECTION LEVEL 0.050 STANDARD ERRORS AND CRITICAL VALUES OF DIFFERENCES VARY BETWEEN COMPARISONS BECAUSE OF UNEQUAL SAMPLE SIZES. -24- Residuals One-Way AOV Residual Plot -25- STATISTIX 7.0 ALLDOSESD22 10/17/02, 9:01:05 AM ONE-WAY AOV FOR TRPREW BY DOSE SOURCE DF SS MS F P BETWEEN 4 WITHIN 136 TOTAL 140 0.04543 2.65791 2.70334 0.01136 0.01954 0.58 0.6768 CHI-SQ DF BARTLETT'S TEST OF ----- ---- EQUAL VARIANCES 6.38 4 P 0.1726 COCHRAN'S Q LARGEST VAR / SMALLEST VAR 0.3339 2.3935 COMPONENT OF VARIANCE FOR BETWEEN GROUPS -2.903E-04 EFFECTIVE CELL SIZE 28.2 DOSE MEAN SAMPLE SIZE GROUP STD DEV 0 1 3 10 30 TOTAL 0.0593 0.0710 0.1114 0.0878 0.0721 0.0803 28 28 28 28 29 141 0.1168 0.1299 0.1807 0.1308 0.1323 0.1398 CASES INCLUDED 141 MISSING CASES 8 -26- STATISTIX 7.0 ALLDOSESD22, 10/17/02, 9:02:27 AM SCHEFFE COMPARISON OF MEANS OF TRPREW BY DOSE Fi DOSE MEAN HOMOGENEOUS GROUPS 3 0.1114 I 10 0.0878 I 30 0.0721 I 1 0.0710 I 0 0.0593 I THERE ARE NO SIGNIFICANT PAIRWISE DIFFERENCES AMONG THE MEANS. CRITICAL F VALUE 2.438 REJECTION LEVEL 0.050 STANDARD ERRORS AND CRITICAL VALUES OF DIFFERENCES VARY BETWEEN COMPARISONS BECAUSE OF UNEQUAL SAMPLE SIZES. -27- Residuals One-Way AOV Residual Plot 0.6 0.3 0.0 -0.3 - 0.6 0.058 0.067 0.076 0.085 0.094 Fitted values:arcsin(sqrt(1 -day22/live)) F1 dams; F2 pups 0.103 0.112 -28- STATISTIK 7.0 ONE-WAY AOV FOR PREWEAN BY DOSE ALLDOSESD22, 10/17/02, Fl SOURCE DF SS MS F P BETWEEN 4 WITHIN 136 TOTAL 140 0.01010 0.33704 0.34714 0.00253 0.00248 1.02 0.3999 CHI - DF P BARTLETT'S TEST OF --- EQUAL VARIANCES 32.45 4 0.0000 COCHRAN'S Q LARGEST VAR / SMALLEST VAR 0.5270 6.0289 COMPONENT OF VARIANCE FOR BETWEEN GROUPS 1.665E-06 EFFECTIVE CELL SIZE 28.2 DOSE MEAN SAMPLE SIZE GROUP STD DEV 0 1 3 10 30 TOTAL 0.0162 0.0205 0.0408 0.0236 0.0214 0.0245 28 28 28 28 29 141 0.0329 0.0425 0.0809 0.0358 0.0413 0.0498 CASES INCLUDED 141 MISSING CASES 8 4 AM -29- STATISTIX 7.0 ALLDOSESD22, 10/17/02, 8:57:42 AM SCHEFFE COMPARISON OF MEANS OF PREWEAN BY DOSE Fi DOSE MEAN HOMOGENEOUS GROUPS 3 0.0408 I 10 0.0236 I 30 0.0214 I 1 0.0205 I 0 0.0162 I THERE ARE NO SIGNIFICANT PAIRWISE DIFFERENCES AMONG THE MEANS. CRITICAL F VALUE 2.438 REJECTION LEVEL 0.050 STANDARD ERRORS AND CRITICAL VALUES OF DIFFERENCES VARY BETWEEN COMPARISONS BECAUSE OF UNEQUAL SAMPLE SIZES. -30- Residuals CD CO ii T- q od 0.3 0.2- "P + -0.1 -0.2 -0.3- 1.6 One-Way AOV Residual Plot + + + ++ + + 2ll 2^6 1 Fitted valuesX10E-2:1-day22/live F1 generation dams; F2 pups + + P + ------------ 1-- ^ 4.1 -31- Statistix offers a number o f procedures to test hypotheses about the central values o f the population distributions from which the samples are drawn. These procedures are often referred to as tests o f location. Several of these tests are parametric and require the assumption that the data are normally distributed, Nonparametric tests are provided for situations where the assumption of normality is not appropriate. When their assumptions are appropriate, parametric tests are generally more powerful than their nonparametric equivalents, although nonparametric tests often compare quite well in performance. The parametric versions test hypotheses concerning the group means. The nonparametric procedures test central value hypotheses based on measures other than the mean. Copyright 2000 Analytical Software -32- STATISTIX 7.0 ALLDOSESD22, 10/17/02, 2:28:50 PM SCHEFFE COMPARISON OF MEANS OF TRIMPLANT BY DOSE P DOSE MEAN HOMOGENEOUS GROUPS 10 8.0142 I 1 8.0054 I 30 7.9451 I 0 7.9304 I 3 7.9130 I THERE ARE NO SIGNIFICANT PAIRWISE DIFFERENCES AMONG THE MEANS. CRITICAL F VALUE 2.438 REJECTION LEVEL 0.050 STANDARD ERRORS AND CRITICAL VALUES OF DIFFERENCES VARY BETWEEN COMPARISONS BECAUSE OF UNEQUAL SAMPLE SIZES. pp. 33 and 35 show the numbers of implantations (transformed because they are counts) are essentially the same across treatment groups. -33- STATISTIX 7.0 ALLDOSESD22, 10/17/02, 2:27:40 PM SCHEFFE COMPARISON OF MEANS OF IMPLANT BY DOSE P DOSE MEAN HOMOGENEOUS GROUPS 10 15..621 I 1 15..593 I 30 15..345 I 0 15..286 I 3 15..241 I THERE ARE NO SIGNIFICANT PAIRWISE DIFFERENCES AMONG THE MEANS. CRITICAL F VALUE 2.438 REJECTION LEVEL 0.050 STANDARD ERRORS AND CRITICAL VALUES OF DIFFERENCES VARY BETWEEN COMPARISONS BECAUSE OF UNEQUAL SAMPLE SIZES. -34- STATISTIX 7.0 ALLDOSESD22, 10/17/02, 4:03:39 PM SCHEFFE COMPARISON OF MEANS OF TRIMPLANT BY DOSE R DOSE MEAN HOMOGENEOUS GROUPS 1 8.0345 I 10 7.9177 I 30 7.8713 I 0 7.8501 I 3 7.8387 I THERE ARE NO SIGNIFICANT PAIRWISE DIFFERENCES AMONG THE MEANS. CRITICAL F VALUE 2.404 REJECTION LEVEL 0.050 STANDARD ERRORS AND CRITICAL VALUES OF DIFFERENCES VARY BETWEEN COMPARISONS BECAUSE OF UNEQUAL SAMPLE SIZES. -35- STATISTIX 7.0 ALLDOSESD22, 10/17/02, 4:05:19 PM SCHEFFE COMPARISON OF MEANS OF IMPLANT BY DOSE P' DOSE MEAN HOMOGENEOUS GROUPS 1 15.709 I 10 15.298 I 30 15.086 I 0 15.036 I 3 14.982 I THERE ARE NO SIGNIFICANT PAIRWISE DIFFERENCES AMONG THE MEANS. CRITICAL F VALUE 2.404 REJECTION LEVEL 0.050 STANDARD ERRORS AND CRITICAL VALUES OF DIFFERENCES VARY BETWEEN COMPARISONS BECAUSE OF UNEQUAL SAMPLE SIZES. -36-