I'd like to estimate the number of days a year when the high temperature is likely to be below a particular threshold, e.g. below freezing. This turns out to be harder than expected.
[more inside]
posted by jon1270
on Jan 24, 2013 -
9 answers
How do I elegantly present tabular, statistical data online and automatically?
I'd love some examples of beautifully presented tabular data online - something that works natively in a browser, ideally also on a tablet and mobile as well. Some interactivity (sorting, filtering) also OK but priority is usability and elegance like you'd find in printed statistical abstracts. Bonus points for open source web tools / frameworks that could help automate this from a database!
[more inside]
posted by tkbarbarian
on Aug 30, 2012 -
9 answers
What is the best way for me to learn R? In particular, what is the best website or online tutorials for learning to deal with large datasets.
[more inside]
posted by aleatorictelevision
on Jun 20, 2012 -
13 answers
I'd like to learn about data science. Things like predictive modelling, regression and classification and so on. What would be good books or online courses to start with?
posted by gwynp
on Jun 8, 2012 -
9 answers
I'm looking for an infographic (?) about how lucky it is to have been born in the developed world. I remember a format similar to "If you are also literate, then you are already in the top x% of the world," with various characteristics substituted in for "literate."
[more inside]
posted by glass origami robot
on May 27, 2012 -
7 answers
Can you point me to the best resources to learn about these new-fangled things they call
data science and
big data? I just started a new job as a
data scientist and need to get up to speed.
[more inside]
posted by Tooty McTootsalot
on May 8, 2012 -
10 answers
Nutshell: I need a reputable, authoritative source of data about how many times the average victim of abuse/domestic violence/intimate partner violence returns to the relationship.
[more inside]
posted by prefpara
on Apr 27, 2012 -
6 answers
Given a set of columnar data, some of which are categorical and others that are numerical, how can I identify which category columns are responsible for signficant changes in the one or more of the numerical columns?
[more inside]
posted by mulligan
on Apr 24, 2012 -
7 answers
What's that quote? Something about each statistic in a news article reducing readership by x%. I can't for the life of me remember the exact words and who said it.
posted by tavegyl
on Sep 7, 2011 -
4 answers
I'm trying to figure out interesting ways to slice/combine/aggregate a ton of data into useful or interesting statistics. Where do I begin?
[more inside]
posted by zain
on Dec 11, 2010 -
10 answers
Where can I find data on US federal income tax brackets by congressional district? My goal is making a fairly accurate statement of the form "Congressperson X represents a district where Y people are in tax bracket Z."
posted by ecurtz
on Dec 10, 2010 -
6 answers
Merging datasets in R. I seem to have forgotten how to do some things.
[more inside]
posted by proj
on Nov 19, 2010 -
3 answers
Likert scale survey with 3 matrices, each examining a different factor by asking 8 questions to be ranked. I have the 110 responses I wanted, broken down by some demographics. Now what?
[more inside]
posted by ttyn
on Nov 16, 2010 -
9 answers
I'm looking for data to use in a statistics class. The data set needs to have less than 25 variables and at least 200 individual records. Any fun/unique/silly ideas? I need to right a paper analyzing this data -I'd like it to be interesting. I need access to raw data - it is okay if it has already been analyzed but I need to show that I can analyze it.
posted by turtlefu
on Nov 7, 2010 -
11 answers
Textbooks on data mining techniques / statistical analysis on large data sets?
[more inside]
posted by wooh
on Oct 22, 2010 -
5 answers
Hello MeFites. Lately, I've become interested in the idea of "Quantified Self", essentially, self tracking. A diary of daily events, such as when I woke up, how much coffee I've had, my productivity level and so forth. I'd like to collect some of this data to see if I can deduce some patterns. Maybe when I drink coffee, I go to bed later, and I'm not as productive for example. To achieve this, I think I need a piece of software.
[more inside]
posted by aeighty
on May 6, 2010 -
13 answers
I have a wonderfully large dataset that I'm working with for a long-term project. I am analyzing a small section the dataset for my masters thesis. In meeting with my thesis advisor last week, she suggested I run some statistical tests of significance on the 4 tables I'm working with. She knows that I am yet to be versed in quantitative analysis methods (I've done solely qualitative work thus far) and that I'm under a massive time crunch to get this done. She suggested I seek help from others, as she doesn't want me to get bogged down with figuring out this step, and would rather I concentrate on analyzing the other aspects of this data. To this end, I'm wondering if somebody might be able to suggest the best type of test of significance to run, the easiest way to run it, and a good, simple resource for what the resultant values mean?
[more inside]
posted by iamkimiam
on Apr 17, 2010 -
19 answers
Help me find useful statistics and information on the Kibera slum in Nairobi.
[more inside]
posted by micklaw
on Mar 4, 2010 -
3 answers
What interesting statistical information can I dig out of this medical billing data?
[more inside]
posted by neilkod
on Nov 19, 2009 -
8 answers
There's no special place to turn up if you think you've got the swine flu to be tested or otherwise counted--hospitals and clinics tell people to just stay home unless they are having actual health complications. How is the CDC able to say that
22 million people have been infected with H1N1 when if you don't have to be hospitalized, nobody will even test you for it?
posted by autoclavicle
on Nov 13, 2009 -
18 answers
Statistics question: is it possible to test sets of cumulative data for significant differences in rate?
[more inside]
posted by rollick
on Nov 4, 2009 -
9 answers
Lots of interrelated data, little idea of how to analyze it. What method or type of software is best for collecting complex information for future analysis?
[more inside]
posted by newlyminted
on Aug 20, 2009 -
6 answers
I have some friends collecting movie & music data for the month of August - what are some innovative/interesting things I can do with the results?
[more inside]
posted by troika
on Aug 3, 2009 -
3 answers
How to import a SAS dataset into R (with, unfortunately, one extra degree of difficulty...)?
[more inside]
posted by docgonzo
on Jan 23, 2009 -
5 answers
I'm trying to lead a crusade for government to publish it's statistics and data in a way that is mashable. Is it possible to define a standard digital format that could apply to a diverse array of data sets?
[more inside]
posted by baggymp
on May 29, 2008 -
12 answers
Statistics-filter: I need to establish to what extent student performance on a particular standardized test is predicted by each of the following: GPA, standardized test scores and a couple of other miscellaneous numerical factors. How do I go about this?
[more inside]
posted by perissodactyl
on Mar 6, 2008 -
11 answers
How do I account for twins in statistical analysis? By 'twins' I mean two people born at the same time, rather than any unknown-to-me technical meaning for the word 'twins' in statistics.
[more inside]
posted by teem
on Sep 18, 2007 -
14 answers
SPSS-filter: How can I change the same property field of all my variables simultaneously?
[more inside]
posted by lioness
on Aug 16, 2007 -
4 answers
I need a good resource for conducting statistical tests. I've taken stats courses, but it's been a while. Something SPSS-centric would be ideal...
[more inside]
posted by electric_counterpoint
on Nov 29, 2006 -
4 answers
Clicking Tracking: there is
MeasureMap,
PMetrics,
BlogBeat,
Crazy Egg, but
Google Analytics is apparently the best. What other tools and services for tracking your visitors would you suggest? I am about to start a new web design blog, but I don't know which tool I should use. Which are better? Which would you use for blogs and which for "normal" web-sites?
posted by volandmast
on Apr 17, 2006 -
2 answers
I want to take a bunch of numerical data (say, survey results) found in Word documents in roughly tabular form (though not in tables per se) and run simple statistical analyses on them. What's the best way to do this?
[more inside]
posted by shivohum
on Apr 17, 2006 -
10 answers
A statistics / scientific convention question. I've noticed in scientific journals that often when a set of data is presented with values normalized to one of the sample groups, and the value for that sample group is arbitrarily set to 1, 10, 100 or whatever, to simplify interpretation, the variability/error data for that one sample group is left out. Is there a good statistical reason for that or is it just some random convention with no good reason?
[more inside]
posted by shoos
on Jan 24, 2006 -
17 answers
How many people in the United States make over $25 million a year in pre-tax income? How many over some other threshold? Google is failing me, and census.gov is resisting my navigational efforts.
posted by Aaorn
on Dec 3, 2005 -
10 answers
I'm working on a data graphic with a linear scale horizontally but a logarithmic scale vertically. The y-axis (count scale) values range from 0 to 200, with the mode average in the lower third of that range. I'd like the graph to take up only a third of the size vertically that it currently does.
I can recognize a logarithmic plot when I see one, and I know what I want generally, but I don't have enough of a handle on the Math to actually calculate and plot such a scale myself... at least accurately.
I'd appreciate any help in understanding the Math. It's one thing to see a formula, and another to know how to use it. I'm also not 100% sure that this is the best way to handle such a graph, so alternate strategies or confirmation are welcome.
posted by Jeff Howard
on Oct 14, 2005 -
14 answers