Do You Need Statistics to Understand Your Data?

As I think each of us in this group last week established, it is important to gain a strong statistical background. Most of us stressed that this was of particular importance within the study of psychology where new studies and theories are being researched and published regularly. Within these studies it is usual for raw data (eg. IQ scores, reaction times etc.) to be gathered. Now, the question is, what use is statistics with regard to this data?

I am going to address this question using two interpretations of the word statistics (because I am a ridiculously pedantic person who has even quibbled pedantically over the use of the word pedantic on more than one occasion). Last year we discussed the differences between qualitative research – which tends to produce narrative reports – and quantitative research which studies variables of varying quantities (remember me?). It is in the latter field that statistics is of vital importance for understanding the data that is produced. Statistics is not merely a mathematical subject, it is a tool for organising, summarising and simplifying data (descriptive statistics) and for analysing sample data in order to infer generalisations about the corresponding population (inferential statistics). No other tool is as useful as statistics for interpreting our findings, while also ensuring that said findings are both empirical and replicable.

However, an ability to use and apply the statistical method is not enough. Statistical outputs, as presented by SPSS, are indeed well organised and relatively concise; and with training and experience, one can read them easily enough. This, however, does not always result in researchers making correct and reasonable inferences about what their data reveals. Take the example of Herrnstein and Murray’s book, The Bell Curve, wherein it was claimed that inherited intelligence alone determined success and that social partition in America was caused by the separation, by IQ, of the “cognitive elite” and the “cognitive inferior”. According to them, the majority of the cognitively inferior were of African or Latin descent. How is it that statistically educated researchers came to this conclusion? Quite simply, they either misused statistical methods or they attempted to be too parsimonious. For, you see, they decided that success in life was determined solely by intelligence, neglecting the impact of environment and upbringing. Too much emphasis and importance was placed on one variable to the point of incorrect exclusion of potentially confounding variables. (Click here for James Heckman’s critique of The Bell Curve) In cases such as these, however, it is often further studies – where statistical analysis is used appropriately – that expose the weakness and problems of the previous research. Therefore, the moral of this tale is that while statistics are needed to interpret data, your statistical understanding of what the data is telling you may not always be correct. Statistics are essential for gaining a better understanding, but will only help you to do so, if used correctly.

To end on a less serious (and definitely nit-picky point), Gravetter and Wallnau taught us that while statistics is often used as shorthand to refer to statistical procedures, the word statistics on its own can refer to something else. A statistic is derived from a sample and is a descriptive characteristic of a sample. These statistics, if you remember, have a corresponding population parameter. More often than not, in behavioural and medical sciences, one cannot gain access to the full target population within a study. This is why we use samples as representative groups from the population. Without the characteristics/statistics of these samples to study, very few projects could be completed due to insufficient access to the population. Therefore, in the case of studies done using samples, statistics (as defined in this manner) are indeed vital in that they form our data. And then, how might we interpret and understand this data? Why, by using statistical procedures, of course!

If you bore with me through my fussiness there, I am extremely grateful 🙂

Speak to you all next week.

8 thoughts on “Do You Need Statistics to Understand Your Data?

  1. I like how you have linked this weeks and last weeks blogs as well as related the topic of this week with our statistics sessions last year with the ‘remember me’ link – its pretty cool! Also your the first blog I’ve read today including how stats effects populations and samples which did interest me greatly. Further more I like how you have included the bell curve and how statistics should be used correctly for understanding.One point I would suggests is possibly talking more of how perhaps understanding is not just from a P value or a statistical significance, it is what we can interpret from any result whether the null is accepted or rejected that increases are understanding of that particular area. Good blog! 🙂

    • Thank you for not being driven off by my pedantic meandering! You are absolutely correct in saying that understanding statistics means far more than merely being able to report the relevant details of an SPSS output. To correctly infer things from our data, we need to understand the significance of each result – not least of all understanding the significance of significant results! Without comprehending stats and merely knowing that the lower the p value the better, one might overlook important validity issues (confounding variables, external validity, small sample sizes etc.) as occurred in the case of the Bell Curve. We really do need to rely on our own statistical ability and understanding of the subject and use it IN CONJUNCTION with SPSS for quantitative data.

  2. Hi! 😀 Comment time! Just a quickie to say well done.
    Right – proper comment:

    I agree with your blog that samples are used to represent the population. Just a little thought that popped up to me was that we are always told samples, if done properly, i.e. with fully random sampling methods, can actually fully represent the population. Therefore even though we cannot actually test the entire target population, the sample pretty much tells us what would happen if we did. However (this is where i rant- bear with) I don’t actually think we should just assume this as researchers. How do we know that this particular sample is the same as everyone? Say if we were getting IQ’s, our sample might seem representative… but there are bound to be people in the population with higher or lower IQ’s. Take the mensa organisation (http://www.mensa.org), proper genius people. The sample would probably not include them, but are they not too in the population?
    I reckon you get the point (lol~) So I shall end my rant now 🙂
    Enjoyed the blog again, looking forward to next weeks
    Abby

  3. It seems that we both hold similar views with regards to stats especially that no other tool is helpful as statistics when interpreting findings from experiments. In my opinion, statistics is essential in the guidance of data and research, as without it raw data would be greatly confusing and difficult to comprehend. 
I would even go as fat as saying statistical analysis can be vital when applying research in real life situations. The analysis of a raw set of data scores assists in discovering a precise and meaningful result of research, which can then be positively applicated to real people in real life situations. Consider the following example; Gottdiener (2000) conducted a meta-analysis of 37 studies and found that outpatients who received psychotherapy for schizophrenia improved at a significantly higher rate than inpatients. Here, the application of this statistical analysis increases scientific understanding of how schizophrenia is best treated and as a result, demonstrates how important statistics is in the understanding of data and the many benefits that come with doing so.

  4. Pingback: Comments I’ve made on week 2 blogs (for TA) « psychjs1

  5. Pingback: Attention TA: Here’s my homework :) « Not just ANOVA blog~

  6. An excellent post. I liked your use of the bell curve as evidence, and agree with your point that while statistics are essentia (and in a practical sense, the best tool for the job)l, they’re only useful when implemented correctly.

    Looking forward to reading your next post (which I sadly admit had me giggling at the title far too much.)

  7. Pingback: TA: Posts that I have commented on this week (7th-14th October) « So…what do you think?

Leave a comment