
Statistical Methods for the Analysis of Repeated Measurements PDF

433 Pages·2003·1.804 MB·English


Springer Texts in Statistics

Advisors: George Casella, Stephen Fienberg, Ingram Olkin

Springer
New York Berlin Heidelberg Barcelona Hong Kong London Milan Paris Singapore Tokyo

Alfred: Elements of Statistics for the Life and Social Sciences
Berger: An Introduction to Probability and Stochastic Processes
Bilodeau and Brenner: Theory of Multivariate Statistics
Blom: Probability and Statistics: Theory and Applications
Brockwell and Davis: An Introduction to Time Series and Forecasting
Chow and Teicher: Probability Theory: Independence, Interchangeability, Martingales, Third Edition
Christensen: Log-Linear Models and Logistic Regression, Second Edition
Christensen: Plane Answers to Complex Questions: The Theory of Linear Models, Second Edition
Christensen: Advanced Linear Modeling: Multivariate, Time Series, and Spatial Data; Nonparametric Regression and Response Surface Maximization, Second Edition
Creighton: A First Course in Probability Models and Statistical Inference
Davis: Statistical Methods for the Analysis of Repeated Measurements
Dean and Voss: Design and Analysis of Experiments
du Toit, Steyn, and Stumpf: Graphical Exploratory Data Analysis
Durrett: Essentials of Stochastic Processes
Edwards: Introduction to Graphical Modelling, Second Edition
Finkelstein and Levin: Statistics for Lawyers
Flury: A First Course in Multivariate Statistics
Jobson: Applied Multivariate Data Analysis, Volume I: Regression and Experimental Design
Jobson: Applied Multivariate Data Analysis, Volume II: Categorical and Multivariate Methods
Kalbfleisch: Probability and Statistical Inference, Volume I: Probability, Second Edition
Kalbfleisch: Probability and Statistical Inference, Volume II: Statistical Inference, Second Edition
Karr: Probability
Keyfitz: Applied Mathematical Demography, Second Edition
Kiefer: Introduction to Statistical Inference
Kokoska and Nevison: Statistical Tables and Formulae
Kulkarni: Modeling, Analysis, Design, and Control of Stochastic Systems
Lehmann: Elements of Large-Sample Theory
Lehmann: Testing Statistical Hypotheses, Second Edition
Lehmann and Casella: Theory of Point Estimation, Second Edition
Lindman: Analysis of Variance in Experimental Design
Lindsey: Applying Generalized Linear Models
Madansky: Prescriptions for Working Statisticians
(continued after index)

Charles S. Davis
Statistical Methods for the Analysis of Repeated Measurements
With 20 Illustrations

Charles S. Davis
Senior Director, Clinical Operations and Biostatistics
Elan Pharmaceuticals
7475 Lusk Boulevard
San Diego, CA 92121
USA
[email protected]

Editorial Board

George Casella, Department of Biometrics, Cornell University, Ithaca, NY 14853-7801, USA
Stephen Fienberg, Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213-3890, USA
Ingram Olkin, Department of Statistics, Stanford University, Stanford, CA 94305, USA

Library of Congress Cataloging-in-Publication Data
Davis, Charles S. (Charles Shaw), 1952–
Statistical methods for the analysis of repeated measurements / Charles S. Davis.
p. cm. -- (Springer texts in statistics)
Includes bibliographical references and index.
ISBN 0-387-95370-1 (alk. paper)
1. Multivariate analysis. 2. Experimental design. I. Title. II. Series.
QA278.D343 2002
519.5'35--dc21 2001054913

Printed on acid-free paper.

© 2002 Springer-Verlag New York, Inc.

All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden.
The use of general descriptive names, trade names, trademarks, etc., in this publication, even if the former are not especially identified, is not to be taken as a sign that such names, as understood by the Trade Marks and Merchandise Marks Act, may accordingly be used freely by anyone.

Production managed by Frank McGuckin; manufacturing supervised by Jerome Basma. Camera-ready copy prepared from the author’s LaTeX2e files using Springer’s svsing2e.sty file. Printed and bound by Edwards Brothers, Inc., Ann Arbor, MI. Printed in the United States of America.

9 8 7 6 5 4 3 2 1
ISBN 0-387-95370-1
SPIN 10853625

Springer-Verlag New York Berlin Heidelberg
A member of BertelsmannSpringer Science+Business Media GmbH

Preface

I have endeavored to provide a comprehensive introduction to a wide variety of statistical methods for the analysis of repeated measurements. I envision this book primarily as a textbook, because the notes on which it is based have been used in a semester-length graduate course I have taught since 1991. This course is primarily taken by graduate students in biostatistics and statistics, although students and faculty from other departments have audited the course. I also anticipate that the book will be a useful reference for practicing statisticians. This assessment is based on the positive responses I have received to numerous short courses I have taught on this topic to academic and industry groups.

Although my intent is to provide a reasonably comprehensive overview of methods for the analysis of repeated measurements, I do not view this book as a definitive “state of the art” compendium of research in this area. Some general approaches are extremely active areas of current research, and it is not feasible, given the goals of this book, to include a comprehensive summary and list of references. Instead, my focus is primarily on methods that are implemented in standard statistical software packages. As a result, the level of detail on some topics is less than in other books, and some more recent methods of analysis are not included. One particular example is the topic of nonlinear mixed models for the analysis of repeated measurements (Davidian and Giltinan, 1995; Vonesh and Chinchilli, 1996). With respect to some of the more recent methods of analysis, I do attempt to mention some of the areas of current research.

The prerequisites for a course based on this book include knowledge of mathematical statistics at the level of Hogg and Craig (1995) and a course in linear regression and ANOVA at the level of Neter et al. (1985). Individuals without these prerequisites who have audited the graduate course or attended short courses have also been able to benefit from much of the material.

Because a wide variety of methods are covered, knowledge of topics such as multivariate normal distribution theory, categorical data analysis, and generalized linear models would also be useful. However, my philosophy is not to assume any particular knowledge of these areas and to present the necessary background material in the book.

When I began to develop my graduate course on the analysis of repeated measurements, no suitable text was available for the course as I envisioned it, and I made the decision to prepare my own notes. Since then, multiple books on the analysis of repeated measurements have been published. I regularly refer to the following books (listed chronologically): Hand and Taylor (1987), Crowder and Hand (1990) [updated as Hand and Crowder (1996)], Diggle (1990), Jones (1993), Diggle et al. (1994), Kshirsagar and Smith (1995), Vonesh and Chinchilli (1996), and Lindsey (1999), among others.

Although some of the existing books are reasonably comprehensive in their coverage, others are more narrowly focused on specialized topics. This book is more comprehensive than many and is targeted at a lower mathematical level and focused more on applications than most. In summary, it is more oriented toward statistical practitioners than to statistical researchers.

Two obvious distinctions of this book are the extensive use of real data sets and the inclusion of numerous homework problems. Eighty real data sets are used in the examples and homework problems. These data sets are available from the website www.springer-ny.com (click on “author websites”). Because many of the data sets can be used to demonstrate multiple methods of analysis, instructors can easily develop additional homework problems and exam questions based on the data sets provided.

The inclusion of homework problems makes this book especially well-suited as a course text. Approximately 85% of the homework problems involve data analysis. The focus of these problems is not on providing a definitive analysis of the data but rather on providing the reader with experience in knowing when, and learning how, to select and apply appropriate methods of analysis. Although many of the examples and homework problems have a biomedical focus, the principles and methods apply to other subject areas as well.

My graduate course and short course notes include numerous examples of the use of, and output from, statistical software packages, primarily SAS (SAS Institute, 1999). I have purposely chosen not to include programming statements or computer output in the book. I do provide the raw data for nearly all examples as well as the key results of all analyses. In this way, readers will be able to carry out and verify the results of their own analyses using their choice of software.

The notes on which this book is based are in the form of overhead transparencies produced using TeX (Knuth, 1986). This format is well-suited for instructors. The course notes also include programming statements and computer output for the examples, prepared primarily using SAS. Course instructors interested in obtaining this supplemental material, as well as solutions to homework problems, should contact Springer-Verlag.
I would like to thank John Kimmel of Springer-Verlag for initially encouraging me to write this book and for his support and advice during its preparation. I am also grateful to the graduate students who have participated in my course since 1991 and to the attendees at external short courses; both groups have motivated me to develop and expand the notes on which this book is based. I also thank Michelle Larson for her assistance in the preparation of solutions to the homework problems and Kathy Clark for her careful review of the manuscript. Finally, I thank my wife, Ruth, and our children, Michael, Carrie, and Nathan, for their understanding and support during this endeavor.

San Diego, California
Charles S. Davis
November 2001

Contents

Preface
List of Tables
List of Figures

1 Introduction
  1.1 Repeated Measurements
  1.2 Advantages and Disadvantages of Repeated Measurements Designs
  1.3 Notation for Repeated Measurements
  1.4 Missing Data
  1.5 Sample Size Estimation
  1.6 Outline of Topics
  1.7 Choosing the “Best” Method of Analysis

2 Univariate Methods
  2.1 Introduction
  2.2 One Sample
  2.3 Multiple Samples
  2.4 Comments
  2.5 Problems

3 Normal-Theory Methods: Unstructured Multivariate Approach
  3.1 Introduction
  3.2 Multivariate Normal Distribution Theory
    3.2.1 The Multivariate Normal Distribution
    3.2.2 The Wishart Distribution
    3.2.3 Wishart Matrices
    3.2.4 Hotelling’s T² Statistic
    3.2.5 Hypothesis Tests
  3.3 One-Sample Repeated Measurements
    3.3.1 Methodology
    3.3.2 Examples
    3.3.3 Comments
  3.4 Two-Sample Repeated Measurements
    3.4.1 Methodology
    3.4.2 Example
    3.4.3 Comments
  3.5 Problems

4 Normal-Theory Methods: Multivariate Analysis of Variance
  4.1 Introduction
  4.2 The Multivariate General Linear Model
    4.2.1 Notation and Assumptions
    4.2.2 Parameter Estimation
    4.2.3 Hypothesis Testing
    4.2.4 Comparisons of Test Statistics
  4.3 Profile Analysis
    4.3.1 Methodology
    4.3.2 Example
  4.4 Growth Curve Analysis
    4.4.1 Introduction
    4.4.2 The Growth Curve Model
    4.4.3 Examples
  4.5 Problems

5 Normal-Theory Methods: Repeated Measures ANOVA
  5.1 Introduction
  5.2 The Fundamental Model
  5.3 One Sample
    5.3.1 Repeated Measures ANOVA Model
    5.3.2 Sphericity Condition
    5.3.3 Example
  5.4 Multiple Samples
    5.4.1 Repeated Measures ANOVA Model
    5.4.2 Example
  5.5 Problems

6 Normal-Theory Methods: Linear Mixed Models
  6.1 Introduction
  6.2 The Linear Mixed Model
    6.2.1 The Usual Linear Model
    6.2.2 The Mixed Model
    6.2.3 Parameter Estimation
    6.2.4 Background on REML Estimation
  6.3 Application to Repeated Measurements
  6.4 Examples
    6.4.1 Two Groups, Four Time Points, No Missing Data
    6.4.2 Three Groups, 24 Time Points, No Missing Data
    6.4.3 Four Groups, Unequally Spaced Repeated Measurements, Time-Dependent Covariate
  6.5 Comments
    6.5.1 Use of the Random Intercept and Slope Model
    6.5.2 Effects of Choice of Covariance Structure on Estimates and Tests
    6.5.3 Performance of Linear Mixed Model Test Statistics and Estimators
  6.6 Problems

7 Weighted Least Squares Analysis of Repeated Categorical Outcomes
  7.1 Introduction
  7.2 Background
    7.2.1 The Multinomial Distribution
    7.2.2 Linear Models Using Weighted Least Squares
    7.2.3 Analysis of Categorical Data Using Weighted Least Squares
    7.2.4 Taylor Series Variance Approximations for Nonlinear Response Functions
  7.3 Application to Repeated Measurements
    7.3.1 Overview
    7.3.2 One Population, Dichotomous Response, Repeated Measurements Factor Is Unordered
    7.3.3 One Population, Dichotomous Response, Repeated Measurements Factor Is Ordered
    7.3.4 One Population, Polytomous Response
    7.3.5 Multiple Populations, Dichotomous Response
  7.4 Accommodation of Missing Data
    7.4.1 Overview
    7.4.2 Ratio Estimation for Proportions
