LAB EXERCISE A: Nearest-Neighbor Analysis
Examine the distribution of adult fetid lilac shrubs, Ceanothus foetidus (which I made up, by the way, so don't go running to the Munz flora in the excitement of learning about a new chaparral plant!) in Figure 1. The area mapped is a hillside in the Sulphur Mountain area of Ventura County (which does exist and, ooooo, is it stinky!). Employ nearest-neighbor analysis to ascertain whether, at this scale, this "species" has a clumped, uniform, or random distribution.
Calculating the nearest-neighbor co-efficient (R) entails the tedious process of measuring the distance between each point in a given space and the point that is its nearest neighbor. It should be noted that point a may well have point b as its nearest neighbor, but point b may have another point entirely, say, c, as its nearest neighbor. Anyhow, having measured all those nearest neighbor distances, you figure out the mean nearest-neighbor distance and then create an expected mean nearest-neighbor distance from the density of the points in your study area. You then create R as the ratio of the mean observed nearest-neighbor distance to this expected mean nearest-neighbor distance.
R can vary from 0 to 2.149 (I've always liked that extra .149 bit!). A score of 0 means perfect clustering: All points are found at the exact same point in space (which is, of course, a physical impossibility if your study entails point data collected at one time). A score of 2.149 means perfect uniformity. A score of 1 represents perfectly random distribution of points in space. So, you can use this descriptive statistic to characterize a distribution as more clustered or more uniform or just random. Unfortunately, I know of no significance test for the R ratio, so there's no way to test whether a given distribution departs significantly from randomness.
n = ________
A = ________
p = ________
# r # r # r 1) ________ 11) ________ 21) ________ 2) ________ 12) ________ 22) ________ 3) ________ 13) ________ 23) ________ 4) ________ 14) ________ 24) ________ 5) ________ 15) ________ 25) ________ 6) ________ 16) ________ 26) ________ 7) ________ 17) ________ 27) ________ 8) ________ 18) ________ 28) ________ 9) ________ 19) ________ 29) ________ 10) ________ 20) ________ 30) _________
_ ro = _________
_ 1 1 re = ______ _____ 2\/ n/A 2\/ p _ re = ________
_ ro R = _____ _ re = ________
_________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________
LAB EXERCISE B: Chi-Squared Quadrat Analysis
You may recall that quadrat-based techniques of spatial analysis involve the division of an area into equal-sized plots, usually through a grid of squares. This permits the use of statistical techniques to analyze quantitative data with no more measurement sophistication than mere frequencies by category (nominal data). Nearest-neighbor analysis, by contrast, required the collection of data at the ratio level of measurement, which is the highest level.
For your reference pleasure, the definitional formula for Chi-squared is:
r k __ __ (Oij - Eij)2 X2 =\ \ ____________ /_ /_ Eij i=1 j=1You'll be comforted to know I'll walk you through a much easier computational process.
_________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________
_____ yes _____ noGiven that, would you set alpha at the high end (larger prob-value) or low end (smaller prob-value) of the scientific continuum of common alphas?
_____ larger alpha _____ smaller alphaOn the other side, this is clearly not a marketing type of study, where a Type II error (missing a significant and exploitable relationship, if it exists) would have the more serious consequences. So, there is not much pressure to increase alpha: It is more important for theory that you not delude yourself into seeing significant relationships where there might be none. So, you want to select an alpha that is on the smaller side compared to, say, the marketing continuum and yet on the larger end of the scientific continuum. Balancing these concerns, which of the commonly used alpha levels is best able to suit your purposes?
_____ 0.10 _____ 0.05 _____ 0.01
| SALVIA | | | | | present | absent | row totals _________________________________________________________________________ |(a) |(b) |-e- present | | | | | | AVENA _________________________________________________________________ |(c) |(d) |-f- absent | | | | | | _________________________________________________________________________ |-g- |-h- |-i- column totals | | | | | | n =
That done, examine the expected frequencies. Chi-square should not be used if any expected frequencies are below 2 (or, irrelevantly in this case, if more than 20 percent of the data cells have fewer than 5 actual cases). You will note that there are no such problems with your contingency table, so you can safely proceed through Chi-square.
________________________________________________________________________ DATA CELL | O | O2 | O2/E ________________________________________________________________________ (a) | | | ________________________________________________________________________ (b) | | | ________________________________________________________________________ (c) | | | ________________________________________________________________________ (d) | | | ________________________________________________________________________ | sum(O2/E) = ________________________________________________________________________ | sum(O2/E) - n = X2 = ________________________________________________________________________
DF = (r - 1)(k - 1) where r = number of rows and k = number of columnsSo, you will enter the table at the intersection of:
the column headed ________ and the row corresponding to ________ degrees of freedom.What, then, is your critical Chi-squared value?
X2crit = ________
_____ reject Ho _____ do not reject Ho
________ prob-value of Ho
To calculate Yule's Q, multiply cells a and d and also cells b and c. Then, enter these multiplications into the following formula:
ad - bc Q = _______ ad + bcSo, what is the Q value for this lab? ________
Please interpret the results of Lab B, taking into consideration both Chi-squared and Yule's Q. What sort of ecological relationship, if any, exists between Salvia apiana and Avena barbata at this scale of analysis?
_________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________ _________________________________________________________________________And that's that for another lab, folks!
Figure 1 -- Map of Ceanothus foetidus Plants (or why they don't let me teach cartography!)
Figure 2 Map of Oats and Sage (you might want to recopy these figures at 120 percent or so)
Figure 3: p-Values for X2
X2 1 DF X2 1 DF X2 1 DF X2 1 DF 3.2 .0736 4.4 .0359 5.6 .0180 6.8 .0091 3.3 .0692 4.5 .0339 5.7 .0170 6.9 .0086 3.4 .0652 4.6 .0320 5.8 .0160 7.0 .0082 3.5 .0614 4.7 .0302 5.9 .0151 7.1 .0077 3.6 .0578 4.8 .0285 6.0 .0143 7.2 .0073 3.7 .0544 4.9 .0268 6.1 .0135 7.3 .0669 3.8 .0513 5.0 .0254 6.2 .0128 7.4 .0065 3.9 .0483 5.1 .0239 6.3 .0121 7.5 .0062 4.0 .0455 5.2 .0226 6.4 .0114 7.6 .0058 4.1 .0429 5.3 .0213 6.5 .0108 7.7 .0055 4.2 .0404 5.4 .0201 6.6 .0102 7.8 .0052 4.3 .0381 5.5 .0190 6.7 .0096 >7.8 <.0050
first placed on the web: 11/26/98
last revised: 11/30/98
© Dr. Christine M.
Rodrigue