Can anyone point me to actual data on how often motorists receive traffic tickets? I'm looking for where the top of the bell curve is, so to speak - obviously, individuals vary widely, which is kind of the point of my question - how often does the "median" driver receive traffic tickets? [more inside]
Can you recommend free (or cheap) graphing software for Windows? [more inside]
Is it true that it is always better to estimate parameters jointly, even when they are completely unrelated? [more inside]
I am planning on teaching a course on data visualization to some Ph.D. students in the social sciences, and could use some ideas about designing the syllabus. If you have taken or taught such a course, I would especially value your input, but I would also love to hear from any among you who have experience doing data visualization. [more inside]
Statistics filter: If I asses the same paired variables for the same population at multiple points in time, can I integrate the relation into an overall correlation? [more inside]
Would you be kind to someone who is culturally proficient, but statistically deficient? Specifically, what is going on in the tables at the end of this paper? [more inside]
So I want to calculate rate of events as a function of time... [more inside]
I've been focusing on getting a tenure-track academic position for a long time, but I feel like it's foolish not to be looking elsewhere for work, too. What jobs might I be a (reasonably) good applicant for? (relevant personal details below the fold) [more inside]
I'm in charge of inviting people to Facebook events for a small nonprofit. We've started tracking the number of people that say they are going and maybe going on Facebook vs. the number of people who actually show up to the event. It looks like of the people that say they are going or maybe going to an event about 50% actually show up. Is this a normal show up rate? Has anyone done statistical study on this kind of data? [more inside]
I want to create a "crime map" of a city over time. I know the dates of the crimes, and I know the physical address. I'd like for the crime map to be something that I could show a crowd in a presentation such that the audience members could witness for themselves how a particular part of the city changed as time went on. I was thinking of Gapminder as the ideal, but spatially and dynamic instead of simply numbers. I suppose it'd be possible to get something up and going with GIS, but I was wanting to do it in a way that was idiot-proof as this is only valuable to me if I can do it easily without a ton of time learning some new program.
When calculating a statistical average, is it reasonable to remove outliers? [more inside]
How can I locate sources for statistics on the outcomes of Canadian criminal court cases, that include such data as type of defence counsel (ie Legal Aid, from large firm, high profile, new in field, or preferably cost of counsel), amount of time before trial, income/profession of defendant, sentence given, and sentence actually carried out? Does such data even exist? I'm only asking out of curiosity, not for any professional or education project.
How do I go about gathering statistics regarding vegan/vegetarians? [more inside]
Sports nerds of MeFi: where do you go for your downloadable sports statistics database needs? [more inside]
I am looking for a little translation help on an inflation statistics paper in German language (look in more inside for the link). This official paper is quite interesting for students of statistics and social students interested in perceived cost of living vs actual cost of living, in Europe as well as in any other country, as it allegedly shows that indeed there were some excessive rouding of prices during the conversion to euro in Germany. [more inside]
Statistics help - What test should I use? Help needed for a novice. [more inside]
Can you point me to recent surveys of reading habits in the United States? I'm hoping to find statistics from the last 3-4 years about anything from the average number of books read, time spent reading, digital reading, to whatever else you know about. [more inside]
Are there more arriving or departing flights in the United States? Bonus: previous years, other countries, domestic vs. international, etc.
Who are the writers who take the statistics of sports and look at them in new interesting ways and can write a bit, too? [more inside]
What does "posthoc" mean? [more inside]
Excel statistics question (linear regression with weighted data points) [more inside]
What tools are available for determining patterns in short, semi-random sequences of variables? [more inside]
What would be a good statistical approach to find manipulated test results? [more inside]
Anyone know of good online or in-person (in the Philadelphia) training options for R statistical programming? [more inside]
The presence of A is a good sign B exists, but as people learn about the A trick they take pains to display it so it isn't as reliable a sign of B anymore. What's the term for this, is its rate of adoption reliably based on other variables, and who writes about it? [more inside]
How do I calculate the confidence interval for the mean response of some general nonlinear fit? [more inside]
(StatisticsFilter) I'd like to compare two binomial variables from the same population... [more inside]
What is the breakdown of millionaires by profession? (Statistics with sources). [more inside]
How much money do you have to make to be in "the top 1 percent", in world terms? [more inside]
What sort of survey are my colleagues planning? It would involve questionnaires dropped off at a variety of businesses (tour agencies and hotels, mostly), aimed at capturing details about their clients. This doesn't seem to fit the examples I see discussed in online resources on survey methods. To top it off, both the time frame and budget are limited. Where can I find best practices for this sort of survey--and how can I determine which best practices should be given highest priority? [more inside]
[StatsFilter] I have a data set which consists of 1000 groups. I'd like to perform the same commands on each group using the statistical software R. How can I write a loop function to perform this task? Or would any of the functions from the *apply family do a better job? [more inside]
Please clear up the use of statistics by two people trying to compare crime rates in one huge city (NYC) to Charleston, SC. a much smaller city. [more inside]
Where is religiosity growing and where is it shrinking? [more inside]
What are your beginner/intermediate SAS handbook recommendations? [more inside]
How do I interpret the crime statistics on a given neighborhood in my city? And is it then appropriate to make a judgment call on whether or not it's safe to put my child in a daycare there using that information? [more inside]
What professional sport has the most criminals? [more inside]
I'm currently performing a mock research comparative analysis for a class. I'm researching the financials of 'our' metrics versus our highest competitors. One of the questions I'm attempting to answer asks if the variances (or standard deviations) of 'our' profit margins are statistically similar to the variances in profit margins of our competitors over a five year period. I have 11 companies representing the profit margins of our competitors, and of course the reports on our profit margins for these five years. Now it's been a while since my last stats course... so I'm wondering, would I use an F or Levene's Test to answer this? Or am I completely off base... and should look to another form of analysis.
I'm trying to find the number of food products that are sold in the U.S. that contain high fructose corn syrup. I've done some preliminary searches and have found some lists, but I've come up dry when trying to find actual numbers. I'd like a reliable source that I can cite, but I'll take almost anything at this point. I am at a university and have access to a large number of subscription database resources, so I'll keep looking. But I thought metafilter might be able to help out here.
How can I test if one predictor variable in a regression is significantly better than another predictor variable? i.e. If I regress X against A I get an r^2 of .99 and when I regress X against B I get .98. I need a test to see whether this difference is statistically significant. [more inside]
Librarians: Do you know of any current publications that rank borrowing performance - specifically ILL turnaround time - for academic libraries? [more inside]
What's that quote? Something about each statistic in a news article reducing readership by x%. I can't for the life of me remember the exact words and who said it.
Probability Question [more inside]
How can I find the average of multiple columns in excel without selecting each individual row that I want averaged?
How can I find the average of multiple columns in excel without selecting each individual row that I want averaged? [more inside]
Where can I find statistical information on the effects of piracy on American shipping in the nineteenth century?
Where can I find statistical information on the effects of piracy on American shipping in the nineteenth century? [more inside]
How do TV channels figure out how many people watched a particular episode (e.g. XX million watched American Idol)?
How do TV channels figure out how many people watched a particular episode (e.g. XX million watched American Idol)? I often hear things like XX million viewers watched American Idol or some other show. How do the news guys / broadcasters come up with these numbers?? Surely they are not able to track what is being watched in peoples homes. So I guess this is some kind of math or projection. Any idea how this is done?
How do I figure out what the size of this password's keyspace is? [more inside]
I think I need to learn about time series analysis. My statistics background is so mediocre that I'm not even sure this is the thing I actually need to learn. How far in over my head am I? [more inside]
What are some basic math concepts I can use to understand and develop a ranking system? [more inside]
If I have a dependent categorical variable with two values (say, basketball fans vs non-fans) and an independent numerical variable (say, height in millimeters) what statistical tests could I use to analyze my data, besides a t-test? [more inside]
Statistics Filter: help me set up my coding data to calculate Cohen's Kappa as a measure of interrater reliability. Taking data from CAT to SPSS. [more inside]