Semivariance: Difference between revisions
Appearance
Content deleted Content added
→Controversy: extension of explanation on degrees of freedom |
fix double redirect Tag: Redirect target changed |
||
(44 intermediate revisions by 29 users not shown) | |||
Line 1: | Line 1: | ||
#REDIRECT [[Variogram#Semivariogram]] |
|||
{{POV}} |
|||
In [[spatial statistics]], the '''semivariance''' is described by |
|||
:<math>\gamma(h)=\sum_{i=1}^{n(h)}\frac{(z(x_i+h)-z(x_i))^2}{n(h)}</math> |
|||
where ''z'' is a datum at a particular location, ''h'' is the distance between ordered data, and ''n''(''h'') is the number of paired data at a distance of ''h''. A plot of semivariances versus distances between ordered data in a graph is known as a [[semivariogram]] rather than a [[variogram]], in which the sum of squared differences is divided by 2n(h). |
|||
The semivariance is calculated in the same manner as the [[variance]] but only those observations that fall below the mean are included in the calculation. It is sometimes described as a measure of downside risk in an investments context. For [[skewed distribution]]s, the semivariance can provide additional information that a variance does not. |
|||
==Controversy== |
|||
In mathematical statistics, sets of '''n''' measured values give ''df=n-1'' degrees of freedom whereas ''in situ'' or temporally ordered sets give ''df(o)=2(n-1)'' degrees for the first variance term. The semivariance is an invalid measure for variability, precision and risk because the sum of squared differences between x and x+h is divided by n, the number of data in the set, but it ought to be divided by ''df(o)=2(n-1)'', the degrees of freedom for the first variance term (see Ref 2). |
|||
The statement that only measured values below the mean are included in the semivariance makes no statistical sense (see Ref 4). Clark, in her ''Practical Geostatistics'', which can be downloaded from her website, proposed that the factor 2 be moved for mathematical convenience and berates those who refer to variograms rather than semi-variograms. |
|||
Added comment 17/12/2006: in mathematical statistics, the degrees of freedom are n-1 when the true population mean is unknown. If the mean is known or can be realistically hypothesised, no degrees of freedom are lost and the variance may be divided by n. When defining the semi-variogram, Matheron showed theoretically that the true mean difference would be zero in the absence of significant drift in the values. If this assumption is valid, then no degrees of freedom are lost. This assumption can be verified by standard statistical techniques. |
|||
==See also== |
|||
* [[variance]] |
|||
* [[sampling variogram]] |
|||
* [[geostatistics]] |
|||
* [[kriging]] |
|||
== References == |
|||
* Clark, I, 1979, Practical Geostatistics, Applied Science Publishers |
|||
* David, M, 1978, Geostatistical Ore Reserve Estimation, Elsevier Publishing |
|||
* Hald, A, 1952, Statistical Theory with Engineering Applications, John Wiley & Sons, New York |
|||
* Journel, A G and Huijbregts, Ch J, 1978 Mining Geostatistics, Academic Press |
|||
* [http://www.geostatscam.com/Adobe/Precision_reserves.pdf Merks, J W and Merks E A T], Precision Estimates for Ore Reserves, Erzmetall, Vol 44, Nov 1991. |
|||
* [http://www.geostatscam.com/Adobe/Abuse_stats.pdf Merks, J W], Abuse of Statistics, CIM Bulletin, Jan 1993. |
|||
* [http://www.geostatscam.com/Adobe/Stats_exploration.pdf Merks, J W], Applied Statistics in Mineral Exploration, Mining Engineering, Feb 1997 |
|||
* [http://www.geostatscam.com/Adobe/Borehole_stats.pdf Merks, J W], Borehole Statistics with Spreadsheet Software, SME Transactions 2000, Vol 308 |
|||
==External links== |
|||
* Shine, J.A., Wakefield, G.I.: A comparison of supervised imagery classification using analyst-chosen and geostatistically-chosen training sets, 1999, http://www.geovista.psu.edu/sites/geocomp99/Gc99/044/gc_044.htm |
|||
* [http://www.geostatscam.com This website explains why the true variance of a single distance-weighted average should not be replaced with the false kriging variance of a set of kriged estimates.] |
|||
[[Category:Geostatistics]] |
|||
[[Category:Statistical deviation and dispersion]] |
|||
[[de:Semivarianz]] |
Latest revision as of 04:25, 16 March 2021
Redirect to: