Data analysis of agroforestry experiments 22 by Lecture notes Richard Coe Roger Stern Eleanore Allan edited by Jan Beniest Janet Awimbo The World Agroforestry Centre (ICRAF) is the international leader in Agroforestry – the science and practice of integrating ‘working trees’ on smallholder farms and in rural landscapes. Agroforestry is an effective and innovative means to reduce poverty, create food security, and improve the environment. The Centre and its many partners provide improved, high quality tree seeds and seedlings, and the knowledge needed to use them effectively. We combine excellence in scientific research and development to address poverty, hunger and environmental needs through collaborative programs and partnerships that transform lives and landscapes, both locally and globally. © World Agroforestry Centre 2002 ISBN 92 9059 145 5 ICRAF The World Agroforestry Centre United Nations Avenue PO Box 30677 Nairobi, Kenya Tel: + 254 2 524 000 Fax: + 254 2 524 001 Contact via the USA Tel: + 1 650 833 6645 Fax: + 1 650 833 6646 E-mail: [email protected] Internet: www.worldagroforestrycentre.org Design: Mariska Koornneef Printed by: Kul Graphics Ltd, Nairobi, Kenya Table of contents Session 1. Review of experimental design 5 Session 2. Objectives and steps in data analysis 13 Session 3. Software familiarization 19 Session 4. Descriptive analysis and data exploration 25 Session 5. Analysis of variance as a descriptive tool 43 Session 6. Ideas of simple inference 53 Session 7. An introduction to statistical modelling 63 Session 8. An introduction to multiple levels 81 Session 9. Writing up and presenting results 91 Session 10. Where are we now?- Review of basic statistics 95 Session 11. Design and analysis complexity 97 Session 12. Dealing with categorical data 119 Session 13. Getting more out of on-farm trials and multilevel problems 135 Session 14. Complications in agroforestry trials 153 Session 15. Complications in data 163 Session 16. Data analysis 173 nn oo RReevviiReeewwv iooeffw ee oxxfpp eeexrriipmmeeerinmnttaeanll tddaeel ssdiiggensnign sisi eses 11 SS R. Coe, R. D. Stern, E. Allan Introduction It is necessary to understand both the design of a trial, and some of the ideas behind the design, before attempting to analyse the data. This session provides a brief review of these design ideas. The key element in the design of a trial is the ‘Statement of Trial Objectives’. We will use this session to show that the objectives of the trial determine the treatments to be applied in the e t o experiment, as well as the measurements that are to be taken. Satisfying these objectives will n e r also require an appropriate layout of the plots (or other units) in the experiment. In the remaining u t c sessions we will often breakdown a trial into its component parts of treatments, layout and e L measurements. 55 Later, in Session 2, we will use these overall objectives of the trial to construct the objectives n g of the analysis. The idea that the analysis of the data must be able to satisfy the objectives of the si e d trial is the dominant theme of this whole course. al t n e m Objectives eri p x e f Objectives drive the whole of the design of any experiment. For this reason the design o w e process has to start by determining objectives. vi e R 1. The objectives for a trial must be: (cid:123) Clear. If the objectives are vague it will not be possible to decide on the rest of the design. (cid:123) Complete. Often the statement of objectives is incomplete so that when designing the experiment many questions can not be answered. Example An example of an incomplete objective statement would be something like: “To evaluate the effectiveness of improved fallows for restoring soil fertility”. A trial cannot be designed until the objective is completed with such details as: (cid:122) Where is the evaluation needed? Over what range of environmental conditions? (cid:122) What range of ‘improvements’ need to be evaluated? (cid:122) What are the criteria for evaluation? Whose criteria are these? Over what time scale? A more complete statement of objectives might be something like: e t o n e ur “ Planted fallows, using Sesbania sesban seedlings, planted at 1m x 1m and grown t c e for two years before clearing (removing wood and incorporating other biomass) L have improved maize yields, compared to those obtained following a two-year 66 natural fallow in on-station trials in E. Zambia. Now we wish to carry out a trial to determine: n g si e d (cid:122) Grain yield increase (relative to natural fallows) in the first, second and third al t cropping season achieved using this technology on degraded (i.e. about to n e m be put into fallow) clay and sandy soil maize fields in Eastern province of eri p Zambia. x e f (cid:122) Change in soil organic matter and N pools (compared to natural fallow) at o w the end of the improved fallow and after 1, 2 and 3 following maize crops.” e vi e R 1. (cid:123) Relevant. In the area of development research, experiments are carried out to help solve development problems. The objectives of the experiment must be relevant to solving the problem. It must be clear how we will be a step nearer solving the problem once we have the results from the experiment. (cid:123) Capable of being met by an experiment. Not every research question needs an experiment. Treatments The objectives will determine the treatments or conditions to be compared. The key concepts that will determine the choice of treatments are: (cid:123)(cid:123)(cid:123)(cid:123)(cid:123) Contrast or comparison between treatments Many objectives simply require the comparison of the mean results of two treatments. In certain cases, more complex comparisons are needed. In general, each hypothesis corresponds to a contrast. The hypothesis or question determines the contrast and the contrast determines the treatments that will be incorporated in the experiment and not the reverse, as often happens. For example, a trial is conducted to determine whether farmers find the addition of mulch from hedgerow species a useful addition to the management of fertility in vegetable plots, and if so, which of two common hedgerow species (Lantana or Tithonia) is superior. The first e objective requires a contrast ‘mulch vs no mulch’. The second requires a contrast ‘Lantana vs ot n Tithonia’. Together these define three treatments which are; e r u t c e L 1. Lantana mulch, 2. Tithonia mulch, 77 3. no mulch. n g si (cid:123)(cid:123)(cid:123)(cid:123)(cid:123) Controls e d al t Very often, treatments require some sort of standard against which they can be compared. en m Choice of these control treatments is determined by the experimental objectives. Control does eri p not necessarily mean zero input or do nothing or use farmer practices. Any of these might be x e f useful for certain objectives, but others may be equally or more appropriate. o w e vi e The Tithonia and Lantana mulch example refers to ‘mulch as addition to management of R 1. fertility’, meaning the appropriate control, should be whatever the farmer usually does. (cid:123)(cid:123)(cid:123)(cid:123)(cid:123) Factorial treatment structure Factorial treatments refer to sets of treatments defined by the combination of two or more treatment factors, each with two or more levels. For example, in a fodderbank trial, two cutting heights combined with three plant densities gives a total of six treatments. Such sets of treatments often arise naturally from the proposed hypotheses but they can also be used to test several unrelated hypotheses in the same experiment more efficiently than in separate ones. Understanding factorial structure and the idea of interaction can further suggest other hypotheses that should be tested. Complications may arise if: (cid:123) not all factorial combinations are distinct or can be included (they may not be possible), (cid:123) control treatments have to be included in addition to a factorial set, (cid:123) the total number of treatments will be too large if all possible combinations are included in the experiment. (cid:123)(cid:123)(cid:123)(cid:123)(cid:123) Quantitative factors When treatment factors are quantitative (e.g. amount of manure, tree density), the aspects e to be decided are the range of levels, the number of different levels, the spacing of the levels and t o n the replication at each level. There are many options in the choice of levels when two or more e r tu quantitative factors are involved. The choice of levels is not usually defined by objectives that c e L may refer more to the identification of rates, slopes or optima. 88 Layout The layout of the trial describes both the ‘objects’ (plots) of the experiment and the way n g si the treatments are allocated to them. In agroforestry experiments the basic ‘objects’ or units are e d al usually plots of land. However the same ideas apply to other types of experiments, such as those t n e on animals. m eri p ex When describing experiments on plots of land there is a hierarchy in the layout. At the f o top level are the sites where the trial is conducted. There might be one or several. A single site w e vi may be chosen to match the conditions in the objectives. Alternatively a number of sites may be e R required, for example covering a range of environments. If the trial is done on-farm, then each 1. farm is a site, and they may be grouped according to criteria relevant to the objectives. Within sites there will often be known (or suspected) patterns of variation that define blocks. The size and shape of the blocks should be determined by the variation expected. Within each block will be a number of plots. The layout of a single plot (its size, shape, position of trees and crops within it, the size of guard areas and so on) will depend on objectives and practical considerations of conducting the trial.
Description: