PlantPeople

PLANT // PEOPLE

Pondering plants and the people who study them.

R makes my blood boil and it's Stack Exchange's fault

2/10/2017

2 Comments

 
[Anna writes...] ​​I spend a lot of time on Stack Exchange. It’s an online forum where people ask questions about how to do statistics in the program R. Like Yahoo Answers for nerds.

​A visit to Stack Exchange is just about the only guaranteed way to ruin my day.
Picture

​When I have an R question and am perusing Stack Exchange, I am always left appalled at the rudeness and lack of empathy of the posters, further confused by their suggestions, and almost always still wanting the answer to my original question. Yes, I understand that often the question-ers haven’t given enough information, and that this must be terribly frustrating to the regular answer-ers, but come on. I usually find my exact question already posted, but with comments that are mostly “omg I can’t believe you didn’t give this information I can’t possibly help you even a little bit” and never “here’s a hint but I can help more if you tell me more.”

R people will be thinking right about now, “but every situation is unique, so we need 100% of the information before we can give perfect advice so that the data analysis is perfect.”

I challenge that these people need to be more flexible and willing to answer the damn question, even if the person asking it might be making mistakes otherwise. Is that too much to ask?

It's okay to still correct the mistakes! But I get the feeling no one has considered that maybe the other people coming to the page actually want the question answered. 

For instance, if someone posted a question like: “what’s the R code for a ANOVA?”

For which an appropriate answer might be: “aov(y ~ x)"

I bet you a million dollars that there would be 10 comments like:

I can’t possibly be expected to give you statistical advice when not provided with the details of your experimental design, hypotheses, and life story. You have written “a anova” which is grammatically incorrect. When you use an indefinite article before a word starting with a vowel, you absolutely must use ‘an’ rather than ‘a.’ Please use proper grammar when posting on this forum. The analysis of variance is only appropriate under certain circumstances. What are your degrees of freedom? Please fax me your research proposal. You’re clearly an idiot. 
​
Picture

Too extreme?

Then why does my blood boil whenever I spend more than about 5 minutes reading these threads?

For instance, I recently had some trouble with a mixed effect model. I’ll skip the details, but a few colleagues suggested that I try a non-parametric model.

I couldn’t remember what a non-parametric framework even looked like in R, but assumed it would be easily Google-able. I searched for something like “mixed effect non-parametric model R” and found a Stack Exchange page that seemed perfect.

Posted 2 years ago: “Is there any model that includes random effects with non-parametric data distribution?”

Bingo. This is literally a yes or no question that I wanted to know the answer to.

I went to the page, and the question was very well set up (to me). It said:

I have a non-parametric (by which I mean non-normal) data distribution. I tried several transformations, but none were helpful. Now, I want to find a model where I can include random effects with the non-normally distributed data. I know the Kruskal-Wallis test, but I couldn't find any hints if I can include random effects. Does anyone know of an appropriate model?

I’m thinking, jackpot.

The poster goes on to describe their dataset and even includes some histograms to show their data distribution and some code.

Bring on the answers to my question.

Comment 1. If you give more information about your data (and research question or hypotheses), someone might point out parametric alternatives (such as a GLMM). That you have non-normal residuals (the distribution of data is not relevant here) doesn't mean that you must use non-parametric methods. 

Picture

​Um, okay. It's good advice… unfortunately the question I wanted answered was whether there is or is not a non-parametric option for a mixed effect model.

Comment 2. Please say more about your data. I do not believe that "non-parametric data distribution" actually means anything in statistics. Also note that the data do not have to be normally distributed for a standard model, only the residuals should be, but even those can be somewhat non-normal if you have enough data. Please clarify your situation, your data & your goals more fully.

Let’s remember that the original post said “I have a non-parametric (by which I mean non-normal) data distribution.” Although it is true that saying “non-parametric data” is incorrect, I think it is very clear that this person actually meant "non-normal." Because they said that. It’s one thing to answer the question and gently correct the error. It’s another to point out the error and then disregard the question because of it. But okay… let’s keep going. Comments have been noted. Still no word about whether there’s a non-parametric option for a mixed effect model.

Next the original poster responded with more description of their data, as requested.

Now for comment 3. Data is data. The adjectives 'parametric' and 'nonparametric' apply to models or methodologies, not to data. Do you simply mean 'not normal' when you say 'nonparametric', or do you intend to imply something more than that? Can you describe your data in more detail? Is it Likert scale, for example? How are you assessing normality? With images, upload them somewhere (say, imgur.com, which stackexchange uses), and give us a link in your post, and someone will fix it for you so we can see your image(s) in your post.

It’s around this point when my blood begins to boil with a silent rage. The poster has already been chastised for saying “non-parametric data” and it’s already been clarified what they meant. Thank you for correcting them, again.  And not answering the question, again. And what the **** is a Likert scale? You really think that someone who said “non-parametric data” knows if their data is “Likert scale”?​
​
Picture

The user then responded to all the requests from commenter 3 (including uploading stuff to imgur, etc… which was probably a lot of work.)

There are no further comments.

This page has been viewed 1477 times and not one of us found out if you can do a non-parametric mixed model in R. Nor does the original poster know how to analyze their data (which, in defense of the commenters, probably didn’t need a non-parametric model anyway BUT STILL.)

Please, people responding to questions on Stack Exchange, would it kill you to answer a question even if the person is wrong to be asking it? 

And could you be less of a dick?

***

OMG I wasn’t sure whether to call it “Stack Exchange” or “Stack Overflow” in this post, so I Googled “stack exchange stack overflow” and found this question posted to Stack Exchange:

What is the difference between Stack Overflow and Stack Exchange?

And this answer:

I don't understand: how do you not know what stackoverflow is about? you've been on the site for three months, and seem to have an idea of what it's for, judging by your questions (this one being the exception)... And if you're still confused, please read the below migration message for more info...
​

THIS IS WHAT I’M TALKING ABOUT
Picture
2 Comments
Nash Turley link
2/11/2017 04:56:24 pm

I've spent a lot of time of these types of sites, and I definitely remember the demands for 'reproducible examples' which can be annoying but actually quite logical if people want to provide specific code the solve the problem. I don't really remember the rudeness, not because I disagree that it is there, but perhaps because I've learned to mostly ignore the written text and just snatch code that might be helpful and start playing with it. Kinda the nerdy version of 'don't read the comments' haha.

Reply
Paul A. Rubin link
2/15/2017 03:51:37 pm

In partial defense of the Stack Exchange responders, posting an answer to what you guess the originator meant to ask can have unintended side effects: someone else comes along later, reads the answer, doesn't realize it is inapplicable (or incorrect) for them because the answer does not address the (fuzzy) question as they interpret it, and Bad Things Happen. That's no excuse for pedantry, but maybe some justification for asking the OP to clarify things.
That said, if you find yourself looking for R help and don't want to risk Stack Exchange, you might look at the Statistics and R community on Google+ (https://plus.google.com/u/0/communities/117681470673972651781). Folks there seem to be pretty helpful, and I don't recall any flames spouting from orifices.

Reply



Leave a Reply.


    Andy on Twitter
    Anna on Twitter
    Nate on Twitter

    This site is no longer regularly updated, but if you're an early-career plant scientist with something to say, you're welcome (and encouraged!) to submit a post. Contact Anna Funk using this form for information.

    Archives

    July 2020
    March 2018
    February 2018
    December 2017
    October 2017
    September 2017
    August 2017
    July 2017
    June 2017
    May 2017
    February 2017
    December 2016
    November 2016
    October 2016
    September 2016
    July 2016
    June 2016
    May 2016
    April 2016
    December 2015
    August 2015
    June 2015
    May 2015
    April 2015
    March 2015

    Categories

    All
    Academia
    Anna's Research
    Birds
    Blog Updates
    Careers
    Climate Change
    Data & Statistics
    Ecological Restoration
    Ecology
    ESA
    Evolution
    External Posts
    Faith+Science
    Field Work
    Food
    Gardening
    GMOs
    Grad School
    Guest Post
    Meyers-Briggs
    Miley Cyrus
    Natural History
    Nature
    Outreach
    Plant Breeding
    Plants
    Pokémon
    Pokémon Go
    Published Research
    Science Communication
    Science News
    Scientific Meetings
    Scientists
    Teaching
    The Legend Of Zelda
    Twitter
    UpGoer Five
    Video Games
    Wildflowers
    Wine

Powered by Create your own unique website with customizable templates.