Say that I have a bag which contains 100 balls and every ball in the bag should be red, but it's possible that one or more of these balls is the wrong colour. How many balls should I look at to be 90% sure that all the balls are red? Or 95%? Or 99.9%? Talk me through how to work this out, please?
What great books or resources are there for practicing probability word problems such as for standardized tests like the GRE? [more inside]
Statisticsfilter: Given available information about the distribution of self-selected 4-digit passwords (specifically banking PINs), is it possible to calculate the probability of two randomly selected individuals having the same PIN? If so, what're the odds? [more inside]
I'm looking for the percentage of out-of-wedlock births per capita in the United States in 1880 (or, failing that, 1890). So far I'm coming up empty-handed. Specific stats for the District of Columbia will be even better, but I'm fairly certain that they're not available online.
Statistics filter: How can I categorize time series curves into pattern categories? [more inside]
Please help me figure out potential careers based on my interests and the best paths to obtain them. Psychology, economics, statistics? Market research? Psychometrics? [more inside]
I'm looking to learn how to calculate probabilities for a multi-round dice game. I've researched this question some, and it looks like I might need to know how to use the multinomial distribution, but I can't find any good introductions. Please point me to the most layman-accessible educational material on this subject, and help me to help myself. [more inside]
Grad programs--I've just heard (for the first time) that conditional admissions are A Thing. Would I have a snowball's chance with a good GRE score alone or will I have to take pre-req undergrad classes first? [more inside]
What after bio-statistics software experience is most attractive to future employers? [more inside]
Putting on the math signal: calling the statistics-literate. Trying to change my weight/body composition and track progress in a useful way, but I'm having trouble separating normal daily variation from actual real change. [more inside]
How do I elegantly present tabular, statistical data online and automatically? I'd love some examples of beautifully presented tabular data online - something that works natively in a browser, ideally also on a tablet and mobile as well. Some interactivity (sorting, filtering) also OK but priority is usability and elegance like you'd find in printed statistical abstracts. Bonus points for open source web tools / frameworks that could help automate this from a database! [more inside]
Recommendations for great books about probability and risk. [more inside]
Considering the huge amount of oil/energy expended driving trucks full of food all over the country, would it make more sense to increase our train infrastructure (is that even possible?) to move more food? That's the question, but not why I've come to Metafilter. [more inside]
StatisticsFilter, non-parametric edition: I'd like to test if my non-normally distributed outcome is significantly different between two groups adjusting for a third variable. [more inside]
Does Facebook publish statistics? I'm specifically interested in statistics about photo uploads and camera metadata / popularity. [more inside]
Are there any reliable statistics for the number of American high school seniors who apply to colleges each year? [more inside]
What are the *approximate* download numbers for a Top 20 podcast episode? [more inside]
Say I'm an industrial designer (I'm not), and I have to design a car seat, desk chair, climbing harness, body armor, or some other piece of equipment that has to be able to interact with a whole bunch of differently-shaped humans on a regular basis. What resource do I use to find out things like average foot width, or average knee circumference, average jaw length, or any other specific anatomical measurements? [more inside]
What is the best way for me to learn R? In particular, what is the best website or online tutorials for learning to deal with large datasets. [more inside]
Is there a way to find out approximately when I left this wad of money in my (very old) jacket pocket, using statistics and date/number info from the bills? What would be the minimum number of bills needed for such a thing?
I'd like to learn about data science. Things like predictive modelling, regression and classification and so on. What would be good books or online courses to start with?
What is some recent important or interesting research done in your field? [more inside]
I'm looking for an infographic (?) about how lucky it is to have been born in the developed world. I remember a format similar to "If you are also literate, then you are already in the top x% of the world," with various characteristics substituted in for "literate." [more inside]
Morbid question about Louis CK's statistical analysis of the crowd at last year's Beacon Theater show. [more inside]
I am interested in seeing some comparative data on the civilian casualties of warfare, particularly in the 20th and 21st centuries. Are there any great analyses of this topic, perhaps on DVD, YouTube, or illustrated in books with graphs? (I have a hard time envisioning the magnitude of these things when they are presented only as bald numbers.) I started thinking about this topic after watching The Fog of War, which is a documentary about Robert McNamara. He presented some amazing comparative information about World War II and the Cold War. I would love to acquire a similar critical context for other conflicts, including the recent wars in Iraq and Afghanistan.
Besides Central Americans, what are other countries that have major first-generation diaspora communities in the United States? India? Armenia? Cambodia? Other Asian countries? Just wild guesses here. [more inside]
Statistics, machine learning, and image analysis/processing. Two months. Self-study. No other obligations. Recommendations? [more inside]
I'd like to read about unexpected statistical correlations that begin to emerge when companies or academics use data mining to analyze behaviour in large groups of people. I'm looking for articles/sites that are a little more serious than this (found while searching Metafilter for this topic). More like this (NYT: Target is able to predict that some customers are pregnant before their children are born). Thanks for your recommendations! [more inside]
I have a blogger/blogspot blog, and I get very different information about my stats from the "stats" page, Google Analytics, and statcounter.com. Who is right? [more inside]
Can you point me to the best resources to learn about these new-fangled things they call data science and big data? I just started a new job as a data scientist and need to get up to speed. [more inside]
How could I approach modeling the number of accidents on a certain stretch of road? I've got some semi-ecological approaches in mind but no idea if they're workable. Are there standard approaches to this sort of problem? [more inside]
Looking for cancer survival statistics by individual hospitals in England. [more inside]
Calling geographers: has anyone applied the idea of topological prominence to population density? [more inside]
Nutshell: I need a reputable, authoritative source of data about how many times the average victim of abuse/domestic violence/intimate partner violence returns to the relationship. [more inside]
Given a set of columnar data, some of which are categorical and others that are numerical, how can I identify which category columns are responsible for signficant changes in the one or more of the numerical columns? [more inside]
Statistics make my head go wobbly. But I need to understand something about odds ratios. Actually, I don't need to understand -- I just need to make sure I'm not screwing up this particular figure. Please help! [more inside]
Looking for a web based survey tool that can give feedback to survey takers at the end [more inside]
My workplace has 3 small bathrooms. It seems to me that if I go to use one of them, and it's occupied, there is a statically higher chance the 2nd and even 3rd bathroom will be occupied (vs the chance that they would be occupied if the 1st one was not). Is it true? [more inside]
I'm trying to find the primary source of a much-repeated, but somewhat dubious, statistic. [more inside]
Could someone help me to analyze statistical relationship between two dimensions of a frequency matrix - preferably in Excel? [more inside]
Stats filter. I am doing multivariate regression for the first time and I want to understand what I'm doing, having gone beyond my formal training. I have many possible ways I could formulate the regression (different variables to include) and I want to find a model that is both the best possible fit while using the fewest number of variables. How? [more inside]
SPSS-filter: changing systems-missing values to user-missing values to match survey skip pattern. [more inside]
What are the most mathematically 'advanced' RPG systems? Pen & paper and otherwise? [more inside]
[StatisticsFilter] I’m trying to understand the extent that the statics of a particular experiment can be evaluated and trying to relate that to a 95% confidence interval. The experiment is considered “a standard”, but I want to keep the discussion as “high level” as I can so that we focus on only the relevant details. Any help or pointers in the right direction are greatly appreciated. [more inside]
Where can I find information about the rates of automobile traffic (per minutes/day/month) per address or intersection?
I have a research project in mind. It involves the use of statistics. I am a typical humanities grad with no real background in the quantitative social sciences, and need some guidance. What I'm interested in is: tracking, tallying, and correlating some of the everyday events recorded in personal diaries from a particular historical period. [more inside]
Where can I find the income distribution for India? My google-fu is weak on this one. [more inside]
I need help finding statistics on the effectiveness of various teaching methods. [more inside]
"Fewer persons alive at 70 today survive until 90 than forty years ago." True or False? [more inside]