## Too Many Strata To Sample

I have multiple independent variables that I want to stratify my sampling across, but the result would be way more strata than I can sample. Is there a way to stratify across different character states for each independent variable without creating strata for the interactions of each? Example inside.
posted by agentofselection on Sep 21, 2016 - 6 answers

## I was told there would be no math.

I've crossed a bunch of plants with one another, and now have a bunch of berries on those plants. Each berry can contain way more seeds than I have room to pot up and grow out individually, so I'm trying to find the point of diminishing returns, where potting up additional seedlings stops giving me new and interesting results, but my math background is inadequate.

## Math problem alert: NCAA Magic or Unsurprising?

We had a NCAA March Madness contest at work, where about 40 of us all filled out brackets on ESPN's site. One of us - the winner of our group - did very well and ended up ranked 6000th out of all 13 million ESPN brackets - top .05%. My question is, was this really statistically unlikely or, given that there were 40 of us, is it unsurprising one of us did that well (e.g., the birthday problem)? What mathematical tools would one use evaluate this question? Thanks!
posted by slide on Apr 6, 2016 - 9 answers

## Statistical framing of the engineering and extremism article

Reading this article on the blue got me thinking about conditional probabilities, prediction and causality. I came up with an analytical framing of what I think the article is saying and would be grateful if stats/social science Mefites could tell me if it seems accurate or else set me right.
posted by mister_kaupungister on Mar 25, 2016 - 3 answers

## Can an exact measurement ever be statistically insignificant?

I am dubious of a colleagueâ€™s assertions about statistics.
posted by lakeroon on Oct 14, 2015 - 23 answers

## Probability distributions and their practical applications

I am looking for a resource that lists probability distributions and their common real-world applications. For example, I'd expect to see: Lognormal - daily returns in the stock market. Poisson - failure rates for mechanical equipment, ...
posted by wivy on Feb 19, 2015 - 6 answers

## Math Problem

posted by BeaverTerror on Dec 4, 2014 - 9 answers

## Help me understand the concept of "85th Percentile."

I am part of a group working on a policy document for the mitigation of traffic issues (e.g. speeding). Stakeholders are having a hard time with a particular criteria which reads as follows: "85th percentile speed is in excess of the signed speed limit by 5 mph or more."
posted by M.C. Lo-Carb! on Nov 26, 2014 - 22 answers

## Math, sweet sweet math

I have a set of data: D(t). 5000 samples. Scatter-graphing makes some patterns clear (D-mean increases with t, for instance). D and t are always positive. I want to characterize these, statistically.
posted by IAmBroom on Oct 16, 2014 - 11 answers

## Statistics Book for an interested teen

My teen aged niece has suddenly found a strong interest in statistics. What book would you recommend for a 14 year old who has good, but not advanced, math skills?
posted by honestcoyote on Aug 15, 2014 - 18 answers

## Help! Can I normalize statistical coefficients?

With deadline looming, stats consultant has bailed. Simple queries need resolution. Help? I am working on a data graphic that involves statistical calculations about survival rates for startup businesses, correlated with certain tangible and intangible factors. The raw data (about survival/closure/merger outcomes) has already been investigated, and the original researchers (who are awesome) have generated some interesting correlations using univariate regressions and Cox regressions. For my output I am relying on their statistically significant findings, wanting to create comparisons among the univariate coefficients. Not sure my methods are kosher and would appreciate consultation. Avalanche inside.
posted by GrammarMoses on Aug 13, 2014 - 2 answers

## What are inter-disciplinary and atypical uses for math MSc?

I have just completed my MSc in mathematics in Europe. I do enjoy math, but I spent my uni years feeling like a autodidact hippie marooned on an island full of Mr and Mrs I-Want-A-Good-Job. My main interests revolve around humanities (literature/history/anthropology) and economics (but not finance), and instead of starting a "stable" well-paying career I dream about something inter-disciplinary. I am very open to earning little money and relocating just to do kind of work that engages those skills. What are some random uses of my degree?
posted by desultory_banyan on Jul 28, 2014 - 12 answers

## 2 problems of combinations and permutations.

How many unique ways are there to put X rocks into Y boxes? (Given two different sets of attributes for both the rocks and the boxes.)
posted by CustooFintel on Jun 4, 2014 - 18 answers

## Chance of event with small sample size, based on larger related sample?

Can/how can one improve the estimate for a chance of an event with a small historical sample size by utilizing the chance of a related event with a large historical sample size? Example and half-assed guess inside.
posted by Flunkie on Mar 20, 2014 - 16 answers

## Becoming Fluent in Predictive Analytics

I work in a University managing the broad based direct mail, email and calling programs. I have zero undergrad or graduate experience with math, business or the social sciences. (Aka, I can write a really nice essay...) I would like to chart a path to being recognized as an expert in predictive analytics.
posted by meta x zen on Nov 26, 2013 - 3 answers

## Career advice for math major (also competent programmer)?

I like math. Programming is OK, but I don't want to make it my thing. What careers should I be looking at? (Special snowflake details inside.)
posted by sqrtofpi on Nov 25, 2013 - 15 answers

## Find standard deviation in Excel, but first develop a labor-saving trick

Help me with statistics and Excel. Especially help me if you know any labor saving methods. I want the median, mean and standard deviation for the average price of all items sold, but my spreadsheet-full-of-data doesn't tell me the price of each sale -- just the average price per store, and the number sold at that store. Something like this:
posted by croutonsupafreak on Jun 21, 2013 - 10 answers

## Returning to grad school after 2 year hiatus

So I'm beginning a statistics PhD program this fall and I'm concerned that my math skills have gotten rusty since I haven't done anything related for the past two years. I've been working as an actuary since I graduated college but I don't do that much math--mostly a lot of programming. Has anybody been in a similar situation to me? How was the adjustment for you? I'm considering retaking advanced calculus and linear algebra during my first year (probably next summer before I take 2nd year advanced courses) just to refresh myself again. I'm aware some people may think this is kind of pathetic but I'd rather be safe than sorry. Besides, it's only my first year. Is this frowned upon?
posted by molamola on Jun 17, 2013 - 15 answers

## Learning Statistics for the Social Sciences 101 paperlessly

I'm running into trouble with my statistics course. I'm just getting up to t statistics for independent measures research design. My problems are: 1. I'm going through a lot of paper 2. I need to keep all my calculations better organized as a do them 3. I'm flipping back and forth between my book, an online version of the book, and another screen so that I can reference as much material as possible at once. I'm thinking some kind of basic statistics calculator spreadsheet (or any other format) would be in order. Can anybody direct me towards one?
posted by Che boludo! on Jun 8, 2013 - 9 answers

## Fallacies and Errors in Inferential Statistics

I have recently been introduced to the concept of pseudoreplication as a mistake that people often make when using inferential statistics to evaluate treatment outcomes. My field (evolutionary and conservation biology) makes heavy use of inferential statistics, including techniques that are vulnerable to pseudoreplication, yet nowhere in my formal education have I been taught about how poor experimental design and lack of statistical rigor can lead to fallacies like this. My personal statistical proficiency is poor, but I am working to remedy that. To that end, could folks help me by identifying and ideally explaining whatever other potential pitfalls you can think of, and explaining how they can be avoided through careful experimental design and data-analysis?
posted by Scientist on Jan 26, 2013 - 5 answers

## What does a professional statistician do?

(Good) jobs involving probability and statistics other than math teacher or actuary?
posted by CustooFintel on Dec 3, 2012 - 15 answers

## How to solve a complex statistics problem with a script?

In this game, you roll a number of six-sided dice to get a total. The total is either the highest single die result, or the sum of any multiples rolled, whichever is higher. For example: If I roll three dice and get a 3, 4, and 6, my total is 6. But if I roll a 4, 4, and 6, my total is 8, the sum of the two 4s. What I want to find out is the mean, median, mode, and standard deviation of the possible totals given N dice. How might I create a simple script to compute this?
posted by j0hnpaul on Nov 30, 2012 - 24 answers

## What should my sample size be?

Say that I have a bag which contains 100 balls and every ball in the bag should be red, but it's possible that one or more of these balls is the wrong colour. How many balls should I look at to be 90% sure that all the balls are red? Or 95%? Or 99.9%? Talk me through how to work this out, please?
posted by xchmp on Nov 12, 2012 - 14 answers

## Practice Probability Word Problems

What great books or resources are there for practicing probability word problems such as for standardized tests like the GRE?
posted by Mr. Papagiorgio on Nov 8, 2012 - 3 answers

## What are the odds that two randomly selected people share the same bank PIN?

Statisticsfilter: Given available information about the distribution of self-selected 4-digit passwords (specifically banking PINs), is it possible to calculate the probability of two randomly selected individuals having the same PIN? If so, what're the odds?
posted by myrrh on Oct 27, 2012 - 15 answers

## How do I calculate the probability of a specific sum of repeated die rolls?

I'm looking to learn how to calculate probabilities for a multi-round dice game. I've researched this question some, and it looks like I might need to know how to use the multinomial distribution, but I can't find any good introductions. Please point me to the most layman-accessible educational material on this subject, and help me to help myself.
posted by Richard Daly on Sep 28, 2012 - 6 answers

## Too much noise in my data

Putting on the math signal: calling the statistics-literate. Trying to change my weight/body composition and track progress in a useful way, but I'm having trouble separating normal daily variation from actual real change.
posted by ctmf on Sep 1, 2012 - 15 answers

## Pocket Anthropology

Is there a way to find out approximately when I left this wad of money in my (very old) jacket pocket, using statistics and date/number info from the bills? What would be the minimum number of bills needed for such a thing?
posted by nile_red on Jun 17, 2012 - 10 answers

## How to model the accident rate on a certain stretch of road

How could I approach modeling the number of accidents on a certain stretch of road? I've got some semi-ecological approaches in mind but no idea if they're workable. Are there standard approaches to this sort of problem?
posted by col_pogo on May 7, 2012 - 9 answers

## Identifying urban centers using an algorithm based on topological prominence

Calling geographers: has anyone applied the idea of topological prominence to population density?
posted by miyabo on Apr 27, 2012 - 7 answers

## Bathrooms and Statistics

My workplace has 3 small bathrooms. It seems to me that if I go to use one of them, and it's occupied, there is a statically higher chance the 2nd and even 3rd bathroom will be occupied (vs the chance that they would be occupied if the 1st one was not). Is it true?
posted by lohmannn on Apr 13, 2012 - 30 answers

## When calculating a statistical average, is it reasonable to remove outliers?

When calculating a statistical average, is it reasonable to remove outliers?
posted by JPigford on Jan 13, 2012 - 31 answers

## Probability Question

Probability Question
posted by nickhb on Sep 3, 2011 - 15 answers

## I'm a rank amateur

What are some basic math concepts I can use to understand and develop a ranking system?
posted by rhizome on Jul 26, 2011 - 7 answers

## Need help with a probability question

Can someone help with a probability question?
posted by caroljean63 on Jun 25, 2011 - 22 answers

## Shouldn't a point estimate be the midpoint of its confidence interval?

Shouldn't a point estimate be in the center of its confidence interval?
posted by lakeroon on Jun 22, 2011 - 12 answers

## Calculus? Is that it?

Why can't I read this? And what sort of classes would I need to make sense of this?
posted by Tennyson D'San on Jun 21, 2011 - 12 answers

## What statistical analysis of historical prices would be useful for a buyer?

What statistical analysis of historical prices would be useful for a buyer?
posted by Theta States on May 12, 2011 - 1 answer

## Math skills for 300-level statistics?

I'm about to start a 300-level Statistics course, but my math skills are very rusty. How can I quickly rejuvenate my old skills?
posted by double block and bleed on Jan 8, 2011 - 5 answers

## When do I look for non-academic jobs?

When do I look for non-academic jobs if I'm trying to transition out of academia, and which jobs do I look for? (I have a math PhD.)

## How do I fairly assign partial credit?

How do I assign partial credit consistently?

## Textbooks on data mining techniques / statistical analysis on large data sets?

Textbooks on data mining techniques / statistical analysis on large data sets?
posted by wooh on Oct 22, 2010 - 5 answers

## Clustering Discrete Data

Help me figure out how to do cluster analysis on discrete data.
posted by fremen on Sep 8, 2010 - 10 answers

## A variation on the birthday paradox

I have 232 facebook friends, and six of them share a birthday in common (not 6 different pairs of people on the same birthday, but 6 out of the 232 were born on the 17th of July. Now, I know it only takes 23 random people to get a 50% chance of at least ONE birthday collision, but how the heck do I figure out the odds on this one? Is this a significant anomaly, or reasonably expected? My one stats class was entirely too long ago....
posted by um_maverick on Jul 14, 2010 - 25 answers

## How do I go about finding a plateau within a set of numbers?

How do I go about finding a plateau within a set of numbers?
posted by SamuelF on Apr 23, 2010 - 21 answers

I have a wonderfully large dataset that I'm working with for a long-term project. I am analyzing a small section the dataset for my masters thesis. In meeting with my thesis advisor last week, she suggested I run some statistical tests of significance on the 4 tables I'm working with. She knows that I am yet to be versed in quantitative analysis methods (I've done solely qualitative work thus far) and that I'm under a massive time crunch to get this done. She suggested I seek help from others, as she doesn't want me to get bogged down with figuring out this step, and would rather I concentrate on analyzing the other aspects of this data. To this end, I'm wondering if somebody might be able to suggest the best type of test of significance to run, the easiest way to run it, and a good, simple resource for what the resultant values mean?
posted by iamkimiam on Apr 17, 2010 - 19 answers

## What's higher level statistics like?

Math-related career advice requested: is work in higher level statistics compatible with an imaginative-type thinker or would it be total hell?
posted by pinside on Apr 6, 2010 - 8 answers

## What basic statistics concepts do I need to understand web analytics?

I'm registered for a Web Analytics class. A prerequisite is "Familiarity with basic concepts of statistics." I'm not a math person and never studied statistics. Which basic statistics concepts should learn before this class? Are there any comic books, videos, or fun resources that would help me learn?
posted by valannc on Feb 3, 2010 - 11 answers

## Help me sample accurately!

Please provide me with an *effective*, and statistically sound, table/chart listing suggested sample size guidance for associated population size
posted by thankyoumuchly on Jan 14, 2010 - 11 answers

## Math/Stats: help me analyze a data set and determine the values that created it

Mathematics / Statistics Filter: I have some pairs of numbers that are the result of a process. Given just that data set, and a rule that relates them, can you determine the integer values that could have resulted in those sets?
posted by intermod on Dec 23, 2009 - 20 answers

Page: 1 2