<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel>
	  <title>Ask MetaFilter questions tagged with Stats</title>
      <link>http://ask.metafilter.com/tags/Stats</link>
      <description>Questions tagged with 'Stats' at Ask MetaFilter.</description>
	  <pubDate>Thu, 12 Nov 2009 19:19:02 -0800</pubDate> <lastBuildDate>Thu, 12 Nov 2009 19:19:02 -0800</lastBuildDate>

      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>	  
	<item>
	<title>Linear regression is... uh... something to do with residuals? (Shit.)</title>
	<link>http://ask.metafilter.com/137993/Linear%2Dregression%2Dis%2Duh%2Dsomething%2Dto%2Ddo%2Dwith%2Dresiduals%2DShit</link>	
	<description>I have some interviews coming up. Anybody with a statistics-related profession.... please help me out with answering technical questions in my job interviews! (A bit on the long side...) I am in my school&apos;s Co-op program and I am currently pursuing a bachelor degree in Statistics. The number of entry level jobs for statisticians are really next to none, and I really want to do well on this next interview. &lt;br&gt;
&lt;br&gt;
I have already had three interviews for statistics related jobs with different companies and I pretty much flunked each one. I can&apos;t seem to wrap my head around the technical questions that are asked during interviews. Most of the time, my mind draws a blank and useless garble comes out. All the generic questions about me, my interests, etc. are easy, but when it comes to solving technical questions on the fly, I lose all confidence and the interview goes downhill. Sometimes I just don&apos;t know the answer, sometimes I just can&apos;t process the question, and other times I just say &quot;I don&apos;t know&quot; and doom myself joblessness.&lt;br&gt;
&lt;br&gt;
So now I have a phone interview with company X tomorrow morning. X is a rather large company (one that I&apos;ve interviewed for before, actually, for a different position by a different person) and some students (myself included) were recommended for the position by my school&apos;s Co-op office. The position I am being interviewed for is &lt;em&gt;Statistical Methods Analyst&lt;/em&gt;, and doing a quick google search gives me a &lt;a href=&quot;http://www.workopolis.com/FR/job/10024555&quot;&gt;similar job description&lt;/a&gt; to the one I received. &lt;br&gt;
&lt;br&gt;
In more detail, my job description says that I will need to be responsible for collaborating with stakeholders to gather and analyze requirements, produce process analysis reports, adhoc reports and data queries, validate and analyze data, develop tools and communicate analysis results.&lt;br&gt;
&lt;br&gt;
Under essential skills, it says I should be adept at MS Excel, especially data analysis and process analysis, experienced in VBA Macro or similar programming languages, have knowledge of Statistical tools (such as Minitab, Stata, SAS) and the ability to correctly interpret statistical test results, valuing the nature of the data. Also, familiarity with basic statistical concepts and ability to relate them to industry settings, able to detect and diagnose process problems. Knowing SQL was noted as an asset.&lt;br&gt;
&lt;br&gt;
I feel under-qualified for this job and doomed for failure at yet another job interview, but my co-op coordinator assures me that I will be able to pick up the skills once I am on the job. Soo... please help me land this job! Please make my years of studying statistics useful! &lt;br&gt;
&lt;br&gt;
My questions are (yes, I know. Finally, eh?) :&lt;br&gt;
&lt;br&gt;
What kind of technical questions should I be expecting?&lt;br&gt;
What should I study before going into the interview tomorrow?&lt;br&gt;
If I don&apos;t know the answer to a technical question, what should I do?&lt;br&gt;
Any job interview tips?&lt;br&gt;
&lt;br&gt;
Many thanks!</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.137993</guid>
	<pubDate>Thu, 12 Nov 2009 19:19:02 -0800</pubDate>
	<category>interview</category>
	<category>jobinterview</category>
	<category>stats</category>
	<dc:creator>veol</dc:creator>
	</item>
	<item>
	<title>How many pugs and pug owners are there both in the U.S. and worldwide?</title>
	<link>http://ask.metafilter.com/135728/How%2Dmany%2Dpugs%2Dand%2Dpug%2Downers%2Dare%2Dthere%2Dboth%2Din%2Dthe%2DUS%2Dand%2Dworldwide</link>	
	<description>How many pugs and pug owners are there both in the U.S. and worldwide? I&apos;m trying to get a rough estimation on both the number of pugs and pug owners in the U.S. and worldwide.&lt;br&gt;
&lt;br&gt;
Any ideas on sources for that sort of info?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.135728</guid>
	<pubDate>Sat, 17 Oct 2009 12:24:49 -0800</pubDate>
	<category>dogs</category>
	<category>pets</category>
	<category>pugs</category>
	<category>stats</category>
	<dc:creator>JPigford</dc:creator>
	</item>
	<item>
	<title>Where to find good state-by-state savings statistics?</title>
	<link>http://ask.metafilter.com/133685/Where%2Dto%2Dfind%2Dgood%2Dstatebystate%2Dsavings%2Dstatistics</link>	
	<description>Where can I find solid nationwide savings statistics/data? I am trying to make a map illustrating income &amp;amp; savings in the united states. However, the only data I am finding related to savings is based on measuring the entire United States as one unit. Does anyone know if I could find a state-by-state breakdown? &lt;br&gt;
&lt;br&gt;
I have been looking through the &quot;Saving and Investment&quot; section of the BEA website (&lt;a href=&quot;http://www.bea.gov/national/nipaweb/SelectTable.asp?Selected=N&quot;&gt;BEA NIPA Tables&lt;/a&gt;) to no avail.&lt;br&gt;
&lt;br&gt;
If this information is not available, any advice on how to show a geographic breakdown of income/savings would be appreciated.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.133685</guid>
	<pubDate>Thu, 24 Sep 2009 07:06:46 -0800</pubDate>
	<category>government</category>
	<category>investment</category>
	<category>savings</category>
	<category>stats</category>
	<dc:creator>ejfox</dc:creator>
	</item>
	<item>
	<title>Where is a website where I can download an excel-readable file showing the monthly impressions of popular websites?</title>
	<link>http://ask.metafilter.com/132384/Where%2Dis%2Da%2Dwebsite%2Dwhere%2DI%2Dcan%2Ddownload%2Dan%2Dexcelreadable%2Dfile%2Dshowing%2Dthe%2Dmonthly%2Dimpressions%2Dof%2Dpopular%2Dwebsites</link>	
	<description>I want to compare the monthly hits of popular websites, where could I download a spreadsheet or similar data for making a graph out of? I am looking to make a graph showing the amount of impressions various sites have been getting. If I could download this information as a .csv or something similar, so I could then import it into illustrator to graph it, that would be very helpful.&lt;br&gt;
&lt;br&gt;
Hopefully from a respectable source, please.&lt;br&gt;
&lt;br&gt;
Any resources or tricks are welcome!</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.132384</guid>
	<pubDate>Wed, 09 Sep 2009 11:46:07 -0800</pubDate>
	<category>analytics</category>
	<category>stats</category>
	<dc:creator>ejfox</dc:creator>
	</item>
	<item>
	<title>How can I get college football box scores as a feed or in a flat file?</title>
	<link>http://ask.metafilter.com/131994/How%2Dcan%2DI%2Dget%2Dcollege%2Dfootball%2Dbox%2Dscores%2Das%2Da%2Dfeed%2Dor%2Din%2Da%2Dflat%2Dfile</link>	
	<description>Is there an easy (or at least relatively straightforward) way to get all of the college football box scores for a weekend (or more) as a feed or in a flat file (txt, csv, xls, etc.)? I&apos;m running an offline fantasy football league, and I would like to automate the scoring if possible. (Solutions that require scripting or macros would be fine.)</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.131994</guid>
	<pubDate>Fri, 04 Sep 2009 10:14:49 -0800</pubDate>
	<category>boxscores</category>
	<category>collegefootball</category>
	<category>fantasyfootball</category>
	<category>stats</category>
	<dc:creator>gpen</dc:creator>
	</item>
	<item>
	<title>Parvovirus: odds of a puppy getting it?</title>
	<link>http://ask.metafilter.com/131750/Parvovirus%2Dodds%2Dof%2Da%2Dpuppy%2Dgetting%2Dit</link>	
	<description>I understand the danger parvovirus poses to puppies, but what are the &lt;i&gt;odds&lt;/i&gt; of a puppy contracting the disease in the US (specifically Alameda County, California)? I have been reading about parvovirus in dogs (including &lt;a href=&quot;http://ask.metafilter.com/91038/Roger-baby-its-a-wild-world&quot;&gt;this discussion&lt;/a&gt;), and understand how serious the illness is.&lt;br&gt;
&lt;br&gt;
What I can&apos;t seem to find is any indication of risk or prevalance. What are the odds a dog will get parvo, and how many cases of it are there a year in my area?&lt;br&gt;
&lt;br&gt;
The more mathematical and bounded the answer, the better. I know I can&apos;t be assured to the fifth decimal place about anything, but I want to know: Parvo, this terrible disease, are the odds 1%, 10%, or 100%?&lt;br&gt;
&lt;br&gt;
More details below, in the hope that they may allow more exact bounding of the answer.&lt;br&gt;
&lt;br&gt;
My dog is five weeks old. He was one of the larger dogs in the litter (with two or three brothers and a sister), which I understand tends to confer longer maternal immunity. I intend to start him on a full vaccine series for parvo.&lt;br&gt;
&lt;br&gt;
He&apos;s 3/4 Australian Cattle Dog, 1/4 Fox Terrier. He was born in a remote rural area of Humboldt County, California, and as of a few days ago now lives in a semi-urban area in Alameda County.&lt;br&gt;
&lt;br&gt;
I keep him mostly indoors, with trips to the back and front yard for exercise. I understand that completely preventing exposure to parvo is impossible (as the virus hardy and survives for long periods in the soil), but also that minimizing exposure to parvo greatly reduces the chances for infection.&lt;br&gt;
&lt;br&gt;
I would like to know: &lt;br&gt;
&lt;br&gt;
How common is parvo in Humbolt County and in Alameda County? Or, if these specific numbers aren&apos;t available, then whatever numbers are available for California or the US. A link to numbers of cases per year would be ideal.&lt;br&gt;
&lt;br&gt;
What are the odds of a puppy getting parvo between the ages of 5 and 16 weeks if he&apos;s allowed to socialize with a: known dogs (with shots), or b: occasionally visit parks and meet other non-wild dogs.&lt;br&gt;
&lt;br&gt;
Links to scholarly papers are fine, and links to the dog equivalent to the CDC would also appreciated.&lt;br&gt;
&lt;br&gt;
If this is too specific, or if there isn&apos;t enough information, please let me know. Also, I do know how bad the illness itself is.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.131750</guid>
	<pubDate>Tue, 01 Sep 2009 23:34:45 -0800</pubDate>
	<category>alamedacounty</category>
	<category>australiancattledog</category>
	<category>berkeley</category>
	<category>berkeleyca</category>
	<category>blueheeler</category>
	<category>california</category>
	<category>canine</category>
	<category>cattledog</category>
	<category>disease</category>
	<category>dog</category>
	<category>dogs</category>
	<category>humboltcounty</category>
	<category>odds</category>
	<category>parvo</category>
	<category>parvovirus</category>
	<category>puppies</category>
	<category>puppy</category>
	<category>resolved</category>
	<category>rural</category>
	<category>semi-urban</category>
	<category>statistics</category>
	<category>stats</category>
	<category>urban</category>
	<category>usa</category>
	<dc:creator>zippy</dc:creator>
	</item>
	<item>
	<title>Good websites for tennis stats</title>
	<link>http://ask.metafilter.com/123783/Good%2Dwebsites%2Dfor%2Dtennis%2Dstats</link>	
	<description>What are good websites for tennis statistics? What&apos;s a good website for search-able tennis stats? For instance, I was looking for a complete list of Rafael Nadal&apos;s (pro-) career losses. Where should I go for something like that? I googled but couldn&apos;t find anything with search-able stats. &lt;br&gt;
&lt;br&gt;
(Asking for a friend)</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.123783</guid>
	<pubDate>Wed, 03 Jun 2009 11:42:34 -0800</pubDate>
	<category>statistics</category>
	<category>stats</category>
	<category>tennis</category>
	<dc:creator>peacheater</dc:creator>
	</item>
	<item>
	<title>Who&apos;s reading?</title>
	<link>http://ask.metafilter.com/122213/Whos%2Dreading</link>	
	<description>How can I juice up a &lt;em&gt; foo.wordpress.com&lt;/em&gt; site?  Specifically I want a better statcounter, but any other cool stuff you recommend would be nice, too. I have a blog at &lt;em&gt;nameofblog.wordpress.com&lt;/em&gt;.  I&apos;m planning to move it to &lt;em&gt;nameofblog.com&lt;/em&gt;, but that&apos;s not high on my priority list at the moment.  In the meantime, I&apos;d like to know more about my stats, but I&apos;m not sure which is the best free stat tracker for a Wordpress.com site.  Right now all I see on the default tracker are pageviews, referring links, and the posts that get the most clicks.   But those &quot;most clicks&quot; numbers only add up to about 20% of my daily pageviews, which... huh?&lt;br&gt;
&lt;br&gt;
Specifically, I&apos;d like to know:&lt;br&gt;
How many people are following on RSS &lt;br&gt;
Stats by city &amp;amp; country&lt;br&gt;
Average length of visit&lt;br&gt;
Any other stats you&apos;d think I might find interesting.&lt;br&gt;
&lt;br&gt;
All this stuff used to be easy to track on Blogger blogs with StatCounter, but I don&apos;t understand how to put a different stat tracker on Wordpress (I&apos;m using the &lt;a href=&quot;http://www.raven.za.net/wp-themes/contempt-wordpress-theme&quot;&gt;Contempt&lt;/a&gt; template, and I don&apos;t know how to edit CSS).  Anyone have a suggestion of an easy, free way for me to satisfy my stats curiosity until I have time to customize the whole site?  Ease of use is key, though- my knowledge of programming is scant, by which I mean I know maybe 10 HTML tags and that&apos;s about it.&lt;br&gt;
&lt;br&gt;
While we&apos;re at it, I guess I&apos;d also be interested in hearing about any Wordpress.com site goodies you know about that are cool &amp;amp; easy to use, but not widely known.&lt;br&gt;
Thanks!</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.122213</guid>
	<pubDate>Fri, 15 May 2009 13:06:40 -0800</pubDate>
	<category>blog</category>
	<category>resolved</category>
	<category>rss</category>
	<category>statcounter</category>
	<category>stats</category>
	<category>tracking</category>
	<category>wordpress</category>
	<dc:creator>pseudostrabismus</dc:creator>
	</item>
	<item>
	<title>Lazy Pseudo Statistician Looking for Computer To Do Work For Him</title>
	<link>http://ask.metafilter.com/117024/Lazy%2DPseudo%2DStatistician%2DLooking%2Dfor%2DComputer%2DTo%2DDo%2DWork%2DFor%2DHim</link>	
	<description>How can I take a Myers-Briggs test entered into SPSS in a per question format and get a Myers-Briggs Type for each of 92 participants? I&apos;ve tried Google and am getting nothing.  I downloaded a trial so I can work on it at home while I&apos;m on Spring Break, and the Help files didn&apos;t install or something to where the end result is that I can&apos;t use them.  I&apos;m also kicking myself for not getting my text back after letting someone borrow it for my school&apos;s stats class.  &lt;br&gt;
&lt;br&gt;
I have 92 surveys, part of which is a Myers-Briggs Personality Test.  I&apos;ve entered the data on a question by question basis, where each of the 20 Myers-Briggs questions has an option value of either  1 or 2 (representing E or I, S or N, T or F, or J or P depending on the question).&lt;br&gt;
&lt;br&gt;
I know I can get a frequency to get the personality type of the entire sample population.  But how can get SPSS to score he Myers-Briggs for me on a survey by survey basis?&lt;br&gt;
&lt;br&gt;
If it would be easier, I also have the surveys available and can go score them all by hand.  But I&apos;d rather not to that if I don&apos;t have to.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.117024</guid>
	<pubDate>Tue, 17 Mar 2009 20:05:04 -0800</pubDate>
	<category>personality</category>
	<category>resolved</category>
	<category>score</category>
	<category>SPSS</category>
	<category>stats</category>
	<dc:creator>theichibun</dc:creator>
	</item>
	<item>
	<title>Math for the fantasy sports enthusiast</title>
	<link>http://ask.metafilter.com/116973/Math%2Dfor%2Dthe%2Dfantasy%2Dsports%2Denthusiast</link>	
	<description>Looking for resources regarding math in sports. I&apos;m an avid fantasy baseball player, and bought the Baseball Prospectus this year. The use of statistical analysis in that book is amazing. &lt;br&gt;
&lt;br&gt;
To that end, I want to learn more about new statistics or metrics being used in sports and also how I can learn more about these methods (e.g. regression analysis), so I can better understand the rationale behind each formula. I am a math amateur in every regard.&lt;br&gt;
&lt;br&gt;
Any books, websites, or other resources that can help me understand how these metrics are created, and perhaps that can give me enough of an understanding to try and do some independent research would be much appreciated.&lt;br&gt;
&lt;br&gt;
Thanks!</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.116973</guid>
	<pubDate>Tue, 17 Mar 2009 09:34:53 -0800</pubDate>
	<category>baseball</category>
	<category>basketball</category>
	<category>fantasy</category>
	<category>fantasysports</category>
	<category>football</category>
	<category>math</category>
	<category>resolved</category>
	<category>sports</category>
	<category>statistics</category>
	<category>stats</category>
	<dc:creator>reenum</dc:creator>
	</item>
	<item>
	<title>Help me find a web site that I should have signed up for years ago.</title>
	<link>http://ask.metafilter.com/115619/Help%2Dme%2Dfind%2Da%2Dweb%2Dsite%2Dthat%2DI%2Dshould%2Dhave%2Dsigned%2Dup%2Dfor%2Dyears%2Dago</link>	
	<description>[Find-that-website] So. There&apos;s this website where you would enter everything you do everyday - what you had for breakfast, what color shirt you wore, what TV shows you watched, whatever. After you had compiled enough data, it would come up with dubious statistics relating things you have done against world news and other users. For example, I might find out that every time I have eggs for breakfast there&apos;s an earthquake in SF, or that some guy in California wears a purple shirt every time I forget to floss. I might have found this on the blue or in PC World, and it was probably sometime around 2006. &lt;small&gt;I&apos;ve been looking for this site forever. Help.&lt;/small&gt;</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.115619</guid>
	<pubDate>Mon, 02 Mar 2009 19:21:17 -0800</pubDate>
	<category>statistics</category>
	<category>stats</category>
	<category>website</category>
	<dc:creator>niles</dc:creator>
	</item>
	<item>
	<title>Per VHost Apache Bandwidth Stats</title>
	<link>http://ask.metafilter.com/101824/Per%2DVHost%2DApache%2DBandwidth%2DStats</link>	
	<description>Linux/Unix Admin Filter: What is your preferred method for monitoring bandwidth use per virtual host in Apache? I could just log everything and run AWStats/Webalizer/whatever, but that seems overkill when all I really want is a MRTG or RRD Tool graph for each virtual host to indicate bandwidth used the last day, week, month and year. Do you use the impossible to find mod_watch or something else?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.101824</guid>
	<pubDate>Tue, 16 Sep 2008 06:56:14 -0800</pubDate>
	<category>apache</category>
	<category>bandwidth</category>
	<category>cacti</category>
	<category>graph</category>
	<category>graphing</category>
	<category>linux</category>
	<category>mrtd</category>
	<category>rrdtool</category>
	<category>statistics</category>
	<category>stats</category>
	<category>unix</category>
	<dc:creator>Brian Puccio</dc:creator>
	</item>
	<item>
	<title>Help with GIS</title>
	<link>http://ask.metafilter.com/97319/Help%2Dwith%2DGIS</link>	
	<description>Noob with GIS: Help me get more out of my data, craft reports, etc. I&apos;m really new to GIS in general and ArcGIS in particular.  I&apos;ve got my shapefiles together and plotted my various points without too much trouble.  I&apos;ve even overlaid census data.  (I&apos;m using ArcGIS 9.2 - almost exclusively ArcMap.)&lt;br&gt;
&lt;br&gt;
Now I&apos;m wondering what else I can get out of it.  Specifically:&lt;br&gt;
&lt;br&gt;
- I&apos;ve plotted several hundred points on a map of Chicago.  Each point is the location of a financial institution.  I&apos;d like to generate some sort of report either in ArcGIS or outside that says something like:&lt;br&gt;
&lt;br&gt;
&lt;i&gt;X percent of institutions are within X distance (feet, miles, whatever) of census tracts (or blocks) with median household income of X.&lt;/i&gt;&lt;br&gt;
&lt;br&gt;
I know methods for generating this type of statistical analysis exists, because I see it in papers all the time! ;-)&lt;br&gt;
&lt;br&gt;
I have no experience with statistics or GIS.  I&apos;m really in over my head here, but I think I can trudge along and produce something worthwhile.  I&apos;m not good with building data queries or scripts either, but I can get help with that.&lt;br&gt;
&lt;br&gt;
What I&apos;m looking for is advice, suggestions, or just general direction on where to go for answers.  Am I completely lost?  Is this beyond the scope of ArcGIS?&lt;br&gt;
&lt;br&gt;
Thanks in advance!</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.97319</guid>
	<pubDate>Wed, 23 Jul 2008 09:38:11 -0800</pubDate>
	<category>ArcGIS</category>
	<category>census</category>
	<category>statistics</category>
	<category>stats</category>
	<dc:creator>wfrgms</dc:creator>
	</item>
	<item>
	<title>Configured, Boiled Down Numbers Please</title>
	<link>http://ask.metafilter.com/91453/Configured%2DBoiled%2DDown%2DNumbers%2DPlease</link>	
	<description>A good, modern, local website traffic stats solution? &lt;i&gt;I used to know this stuff, but it&apos;s been many years and technology changes, yadda yadda, and this has some special odd requirements... so I prefer some hive mind opinions...&lt;/i&gt;&lt;br&gt;
&lt;br&gt;
I have a colleague looking for a good solution to traffic stats reporting. He is responsible for dozens of websites, with new ones being added/removed every couple of weeks -- promotions and projects from various departments. He has access to combined/extended Apache logs (regular + referrer) already, but since he controls the websites, he is also open to a javascript-triggered thingie on each page, and a custom server that logs things itself. Either is fine as long as it churns out the results.&lt;br&gt;
&lt;br&gt;
For results, he&apos;d need an assortment of the usual hits/views/referrers stuff (visitors from Chile, most popular pages this week, hits on this certain link), but boiled down nicely like so:&lt;br&gt;
&lt;br&gt;
1) takes raw logs and turns them into something human-readable.&lt;br&gt;
2) scheduled (weekly or monthly) reports automatically saved as fixed files (auditable, this not recreated each time it&apos;s run).&lt;br&gt;
3) reports showing just the info he wants* e-mailed to him as attachments (PDF, ideally, but text files would work) so he can present to management.&lt;br&gt;
4) the ability to request custom reports (the period from Sept 1 to Sept 14, users from China only) on demand, presumably from some kind of web interface -- an on-the-fly config run one-off.&lt;br&gt;
5) the ability to combine or separate data from diff sites/logs by configuration. (that is: these two sites are the same, but these four are all to be tracked separately)&lt;br&gt;
&lt;br&gt;
&lt;small&gt;* I don&apos;t have a list, but he says it will be one or two pages and NOT change over time. A template of certain reports?&lt;/small&gt;&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
He is considering hiring a human being part-time to do this with some offline program as a secretarial task every week... but that struck me as crazy since it&apos;s automatable, right? Right? Hm.&lt;br&gt;
&lt;br&gt;
It can&apos;t be Google Analytics, sadly for me (I know how to use that) and for privacy/security reasons he&apos;d prefer something running on his own Un*x server rather than a web service.... whether it&apos;s the same server as the websites or another doesn&apos;t matter, as long as he can control access and keep the data private.&lt;br&gt;
&lt;br&gt;
In the old days, I would say this all called for urchin or awstats, I think... plus some magic I don&apos;t know about to turn the results into pretty reports and mail them from a big old cron job.&lt;br&gt;
&lt;br&gt;
From my perspective (helping find/source/make/install this), I&apos;d be fine with something very complicated that I can tweak the configuration of to produce just what he&apos;s asking for -- and no more. &lt;br&gt;
&lt;br&gt;
Or if something does all the processing, will run reports on a schedule and produce some kind of manageable (parsable?) output like plain flat tables, I can probably manage turning that into pretty documents and mailing them out with additional scripty code that I can stumble through.&lt;br&gt;
&lt;br&gt;
Open source and hackable preferred, because of the configurability and tweakiness required... or guidance to existing things that can be cobbled together. If someone has built/integrated one of these before, or something similar, please mefi-mail me and maybe it&apos;s easy work, too. I may have overcommitted myself in trying to help.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.91453</guid>
	<pubDate>Wed, 14 May 2008 21:36:20 -0800</pubDate>
	<category>automation</category>
	<category>dev</category>
	<category>pdf</category>
	<category>reporting</category>
	<category>software</category>
	<category>stats</category>
	<category>traffic</category>
	<category>web</category>
	<dc:creator>rokusan</dc:creator>
	</item>
	<item>
	<title>Looking for a smooth function from noisy observations</title>
	<link>http://ask.metafilter.com/91423/Looking%2Dfor%2Da%2Dsmooth%2Dfunction%2Dfrom%2Dnoisy%2Dobservations</link>	
	<description>Statsfilter: I have a bunch of noisy measurements. Each has an (x,y) coordinate and a &quot;score&quot; for that location. Most of the scores are trustworthy, but with the occasonal outlier. I want to come up with a function f(x,y) that estimates what the score at (x, y) would be (whether or not I have an observation at exactly that location). I&apos;d like the function to be smooth and resilient to noise. Can someone point me in the right direction?  I&apos;m thinking of something like a kernel density estimator that takes the score into account, but I don&apos;t know how to make that work since it estimates density rather than some other value.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.91423</guid>
	<pubDate>Wed, 14 May 2008 15:46:24 -0800</pubDate>
	<category>stats</category>
	<category>visualization</category>
	<dc:creator>Sockpuppet The First</dc:creator>
	</item>
	<item>
	<title>Liberal-minded statistics?</title>
	<link>http://ask.metafilter.com/90353/Liberalminded%2Dstatistics</link>	
	<description>My boyfriend needs to find some statistics for a research paper.  Specifically, pro-vegan, pro-environmentalist, anti-war, or other such statistics. Preferably all gathered in one place. We need hard facts - in the vein of those presented in &quot;Diet for a New America&quot; by John Robbins (for example, the statistics involving the water and grain used to feed livestock and how that detracts from feeding third-world starving nations) - but more current.  As far as anti-war, looking for things such as the cost of war per day compared with what America as a country puts towards education and/or environmental issues yearly, etc.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.90353</guid>
	<pubDate>Fri, 02 May 2008 04:13:00 -0800</pubDate>
	<category>anti-war</category>
	<category>environmentalism</category>
	<category>stats</category>
	<dc:creator>jitterbug perfume</dc:creator>
	</item>
	<item>
	<title>Help me not track my visitors</title>
	<link>http://ask.metafilter.com/86895/Help%2Dme%2Dnot%2Dtrack%2Dmy%2Dvisitors</link>	
	<description>I&apos;m looking for a website visitor tracking service that will allow me to not track users referred from certain domains. Or, one that will allow me to not show those users when I&apos;m viewing the stats. At the moment, I have a free Statcounter account, with the detailed info from the past 500 visitors. Someone has posted my site to StumbleUpon, which is great, but I&apos;ve had nearly 300 visitors from StumbleUpon so far today, which is eating up my log really quickly.&lt;br&gt;
&lt;br&gt;
I really like being able to see where people are coming from, but with results like I&apos;m getting at the moment, it&apos;s hard to see. &lt;strong&gt;Is there a service, like Statcounter, that will either allow me to not track users referred from a given website, or one that will allow me to filter a given website from showing up in the log when I view it?&lt;/strong&gt;&lt;br&gt;
&lt;br&gt;
I don&apos;t mind paying a small amount per month/year to get this functionality, but free is even better.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.86895</guid>
	<pubDate>Mon, 24 Mar 2008 05:18:19 -0800</pubDate>
	<category>log</category>
	<category>statcounter</category>
	<category>stats</category>
	<category>traffic</category>
	<category>website</category>
	<dc:creator>Rabulah</dc:creator>
	</item>
	<item>
	<title>Past performance, future results, &amp;amp;c</title>
	<link>http://ask.metafilter.com/85468/Past%2Dperformance%2Dfuture%2Dresults%2Dampc</link>	
	<description>Statistics-filter: I need to establish to what extent student performance on a particular standardized test is predicted by each of the following: GPA, standardized test scores and a couple of other miscellaneous numerical factors.  How do I go about this? I have a fairly large database of student scores on a particular standardized test that is of primary interest to me.  Each student record also contains grade point average information and that student&apos;s scores on a couple of other standardized tests.&lt;br&gt;
&lt;br&gt;
What statistical techniques can I use to analyze how predictive each of a student&apos;s other numbers is of their ultimate performance on the standardized test of interest?&lt;br&gt;
&lt;br&gt;
I&apos;m handy with Excel and of a fairly technical bent, but I know almost nothing about statistical analysis.  Both direct assistance and pointers to relevant information on the Web would be greatly appreciated!</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.85468</guid>
	<pubDate>Thu, 06 Mar 2008 08:34:51 -0800</pubDate>
	<category>data</category>
	<category>math</category>
	<category>mathematics</category>
	<category>numerical</category>
	<category>quantitative</category>
	<category>regression</category>
	<category>statistics</category>
	<category>stats</category>
	<dc:creator>perissodactyl</dc:creator>
	</item>
	<item>
	<title>Free NHL stats?? </title>
	<link>http://ask.metafilter.com/81252/Free%2DNHL%2Dstats</link>	
	<description>I want NHL (2007/08) stats. I want as much data as I can get my hands on, in a nice, easily parsed format. But I don&apos;t really want to pay for it.  Can you help me find this data on the cheap? The downloadable stats &lt;a href=&quot;http://www.sportsinteractive.com/Stats.asp?SportID=1&quot;&gt;here&lt;/a&gt; look like they&apos;d be perfect. I&apos;m not so keen on paying $89/year for them, however.&lt;br&gt;
&lt;br&gt;
I don&apos;t suppose anyone knows where I could find this data for free? &lt;br&gt;
&lt;br&gt;
My goal is to get these stats in a MySQL database, and I want to get them in there with as little effort as possible. As the season progress, I&apos;ll be updating the database, most likely on a daily basis. Obviously, NHL stats are available from every major sports news source, but they&apos;re usually buried in individual site pages.&lt;br&gt;
&lt;br&gt;
If it comes down to it, I&apos;ll probably just either pony up the $$ to get them in .csv, or write something up that will scrape the boxscores/logs available on tsn.ca for every game. I am a software developer, so that&apos;s within my talents, but it just seems like... work. I&apos;m also lazy, and I figure someone, somewhere has already done this work, and has made it publicly available... but where? Google is failing me. Unless someone else has a better idea...&lt;br&gt;
&lt;br&gt;
On a related note, in case I&apos;m going to have to do it myself - anyone have any strategies for easy page scraping? Any specific Perl or Python libraries I should look into?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2008:site.81252</guid>
	<pubDate>Thu, 17 Jan 2008 10:49:57 -0800</pubDate>
	<category>free</category>
	<category>hockey</category>
	<category>NHL</category>
	<category>stats</category>
	<dc:creator>cgg</dc:creator>
	</item>
	<item>
	<title>Help me count the ways! How popular is mobile content?</title>
	<link>http://ask.metafilter.com/77971/Help%2Dme%2Dcount%2Dthe%2Dways%2DHow%2Dpopular%2Dis%2Dmobile%2Dcontent</link>	
	<description>[MarketResearchFilter] Where can I find stats on the popularity of content sent to mobile/cell phones and other handheld devices? This may well be terribly vague but I&apos;m happy to answer any questions. &lt;br&gt;
&lt;br&gt;
I am looking for information on how popular different types of media are on mobile gadgets - specifically, multi-media, I guess. For example, how many people are viewing video, downloading podcasts, or listening to streaming radio on their various handheld devices?&lt;br&gt;
&lt;br&gt;
For the purposes of this question, I am not interested in how many people are using iTunes. Music is way outside my market.&lt;br&gt;
&lt;br&gt;
European stats preferred but at this point I&apos;ll take anything.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.77971</guid>
	<pubDate>Thu, 06 Dec 2007 06:08:50 -0800</pubDate>
	<category>cellphone</category>
	<category>mobile</category>
	<category>mobiles</category>
	<category>statistics</category>
	<category>stats</category>
	<category>WAP</category>
	<dc:creator>DarlingBri</dc:creator>
	</item>
	<item>
	<title>Help me get a C+, so I can never touch this godforsaken subject again!</title>
	<link>http://ask.metafilter.com/75332/Help%2Dme%2Dget%2Da%2DC%2Dso%2DI%2Dcan%2Dnever%2Dtouch%2Dthis%2Dgodforsaken%2Dsubject%2Dagain</link>	
	<description>Stats Help Filter: Arrrgh! Help me pass my Intro Stats course with a C+ I made a 66% on the first half of the midterm and I think I can&apos;t have made more than a 50% on the second half of the midterm, and that&apos;s with luck and probably curve correction. Therefore the remaining 50% of my mark has to be something spectacular.  Please, share the resources that helped you make the grade? &lt;br&gt;
&lt;br&gt;
It&apos;s intro to Stats Psychology, if that helps, with a focus on things like Z scores, Regression, Correlation and Probability.&lt;br&gt;
&lt;br&gt;
If I can&apos;t pass this course it&apos;ll scupper my GPA and doom me to math free majors like History, which while I&apos;ve very good at, are not my cup of tea.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.75332</guid>
	<pubDate>Fri, 02 Nov 2007 22:49:57 -0800</pubDate>
	<category>help!</category>
	<category>ohgodpleaseno</category>
	<category>school</category>
	<category>Stats</category>
	<category>stress</category>
	<dc:creator>Phalene</dc:creator>
	</item>
	<item>
	<title>What&apos;s better than awstats?</title>
	<link>http://ask.metafilter.com/71901/Whats%2Dbetter%2Dthan%2Dawstats</link>	
	<description>I&apos;ve got a website that does a lot of traffic. I&apos;m using awStats. It&apos;s slow as hell because we&apos;re adding ten million hits a day to it&apos;s database. We don&apos;t want to use an external logging solution because we&apos;ve found that they underreport dramatically (by 40% in the case of Google Analytics). Are there any free stats processors that will be able to process lots of records AND have better drill-down capability than awstats? I have plenty of hardware to throw a tthe problem. </description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.71901</guid>
	<pubDate>Wed, 19 Sep 2007 08:03:40 -0800</pubDate>
	<category>apache</category>
	<category>httpd</category>
	<category>stats</category>
	<dc:creator>SpecialK</dc:creator>
	</item>
	<item>
	<title>Formulas for Food</title>
	<link>http://ask.metafilter.com/68529/Formulas%2Dfor%2DFood</link>	
	<description>Looking for math formulas involving food. By math I mean anything from algebra to simple arithmatic. By food I mean caloric intake, macronutrient ratios, energy expenditure etc. Bonus points for bizarro stats regarding food. Thank you.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.68529</guid>
	<pubDate>Sat, 04 Aug 2007 06:36:54 -0800</pubDate>
	<category>food</category>
	<category>formula</category>
	<category>math</category>
	<category>stats</category>
	<dc:creator>JaySunSee</dc:creator>
	</item>
	<item>
	<title>In search of free online tool for creating radial graphs!</title>
	<link>http://ask.metafilter.com/67966/In%2Dsearch%2Dof%2Dfree%2Donline%2Dtool%2Dfor%2Dcreating%2Dradial%2Dgraphs</link>	
	<description>Are there any free online tools for generating &lt;a href=&quot;http://en.wikipedia.org/wiki/Radar_chart&quot;&gt;radar&lt;/a&gt; (aka radial, spider, star) graphs/charts? The ideal tool would be easy to use dynamically and/or against large sets of data to generate multiple graphs.&lt;br&gt;
&lt;br&gt;
I know that Google Documents, for example, provides a lot of graph/chart options, but not this specific sort.  On the other end, I&apos;m solid with perl and aware of e.g. &lt;a href=&quot;http://search.cpan.org/dist/Imager-Chart-Radial/Radial.pm&quot;&gt;radial.pm&lt;/a&gt;, but I&apos;d love to find a slick, existing start-to-finish tool rather than having to roll some or all of it myself.&lt;br&gt;
&lt;br&gt;
Any ideas?</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.67966</guid>
	<pubDate>Fri, 27 Jul 2007 15:47:20 -0800</pubDate>
	<category>chart</category>
	<category>graph</category>
	<category>graphing</category>
	<category>numbers</category>
	<category>radar</category>
	<category>radial</category>
	<category>spider</category>
	<category>star</category>
	<category>statistics</category>
	<category>stats</category>
	<dc:creator>cortex</dc:creator>
	</item>
	<item>
	<title>What are the odds that I can solve this problem?</title>
	<link>http://ask.metafilter.com/65124/What%2Dare%2Dthe%2Dodds%2Dthat%2DI%2Dcan%2Dsolve%2Dthis%2Dproblem</link>	
	<description>Probability/stat question: I&apos;m looking for patterns in a protein sequence, and I&apos;ve found a few that occur quite frequently. How do I know these are actual patterns and not just an artifact of random amino acid distribution? I have a roughly 1000 amino acid sequence, and I&apos;ve used a sliding window to chop it up into overlapping 6-mers. Some of these 6-mers occur much more frequently than others and I suspect they have some sort of biological significance. Unfortunately, I don&apos;t know to test whether these are true pattern in the biological sense, or if they could just as easily have been the result of random distribution. &lt;br&gt;
&lt;br&gt;
I&apos;ve tried comparing the expected frequency of these 6-mers based on the amino acid distribution with the observed frequency; but the chance of getting any given 6-mer randomly is so low that almost anything I observe (even the ones that only show up once) seem really significant. I&apos;ll be happy to clarify things if this post is a bit messy.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.65124</guid>
	<pubDate>Tue, 19 Jun 2007 07:46:07 -0800</pubDate>
	<category>probability</category>
	<category>proteins</category>
	<category>stats</category>
	<dc:creator>reformedjerk</dc:creator>
	</item>
	
	</channel>
</rss>

