Hacker News new | past | comments | ask | show | jobs | submit login

I happen to work with NOAA and NASA scientists on coupling Earth System Models. NOAA does understand elementary descriptive statistics (so does NASA). Sometimes average temperature distributions are skewed and do not have a finite standard deviation. This is the case for the distribution of annual change in global average temperature, which deviates from a normal distribution (skewed to the right). Sample estimates of standard deviations of global average annual temperature change are meaningless because the standard deviation isn't finite. Instead you'll see measures that are meaningful (mean, min, max). Don't assume that all distributions have finite variance.



I understand what you're saying, but it's also kind of begging the question. We can't calculate a standard deviation because temperatures are rising, but we also can't test if temperatures are unusually warm because we don't have a standard deviation?


I think the message is simple: standard deviations don't provide insight when your main concern is extreme values (max/min temperatures, in this case).


That's equally circular. How do you know it's an extreme value?


It's at either end of a set of values?


Set of values: [ 80.1, 80.2, 79.8, 80.1, 80.0, 79.9, 80.1, 80.1 ]

New value: 80.3. It's a record high, but is it an extreme value? i.e., are record highs extreme values by definition?


I'm not expert with these data, but your example is not a fair analog. Let's go back to the actual problem.

From a casual glance at the graphs showing past hot years (each one shows 5), NOAA appears to have records for most cities going back to the 1950s (probably 100 years in some cases).

So, in a set of 63 (1950-2012) values, which are themselves average temperatures (and thus representative of some kind of trend, not just a single hot day), the procedure picks out the 5 highest values of average temperature.

It's reasonable to expect that the sequences corresponding to these 5 highest values will be unusual when compared to the other 58. Nothing weird about that.

Of course, if they had only 10 years data, as in your example above, or if they weren't using monthlong averages, it would be a different story.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: