I have a research project involving about ~90 subjects in two (self-selected) groups. I have collected a number of variables about these subjects (from public sources), and now I would like to do some statistical tests to say whether these variables are significantly correlated with membership in one group or the other. How do I do this? I have a basic knowledge of R and access to Stata 14. [more inside]
This is a two-part question about commodity prices in California in the 1860s. Part 1: How is this newspaper report of "current" (1862) prices read? The @s throw me off. (My best guess is that those are price ranges.) Part two... [more inside]
I need to get hold of datasets for online purchases, more specifically those that include product returns for a (self funded) project I'm working on. [more inside]
I'd like to compile a list of businesses in New York City—starting with Manhattan—that have been around for 100 years or more. I'd also like to include places the public can visit that aren't "businesses" per se. I'm guessing this information is publicly available, but I don't know where to start. Any guidance would be extremely helpful. Thanks!
I'm currently a records manager for the Feds, but what I really love is the data. I've been thinking that I would like to change over into a more research/policy analyst type of job, but I'm unsure of exactly how to get from here to there. [more inside]
I'm looking for the easiest way to find data about "how big" various industries are, by revenue. For instance, when you hear a reporter say, "Video games are a $X billion a year business," where are they getting that information? I've poked around the census bureau's website and found their various reports, but it doesn't seem to be that granular (or easy to read, for that matter). I'm trying to create as diverse a list as possible, of various non-related industries (banking, retail, trade shows, sporting events, hotel, video games, Hollywood, pest control, you name it - virtually any business that is its own discrete thing) and how they stack up relative to one another in terms of the dollar-figure size.
I will be traveling to a remote location to interview key informants for a research project. I am looking for a convenient interface which to enter the data. [more inside]
I'm trying to tag and code about 4000+ unique paragraphs of data. These are opinion responses to two similar questions. I manually went through the first 4000 responses and it took weeks using Google Refine. I'm wondering if there's a way to operationalize this to be a bit easier and less time consuming? [more inside]
ExcelFilter: How to combine the data from three spreadsheets into one spreadsheet? (But wait, there's more!) [more inside]
Is there software capable of analyzing the note-to-note event data from a melody played on a solo acoustic instrument (live or recorded), and then generating a text file containing said data in a list form? [more inside]
I love what I do but I don't love the way my organization functions. I want to go solo. Help me figure out how to become a freelance data analyst/researcher [more inside]
For work I am making a complete database of all existing (reasonably current) data on Native American health/risk factors. But I know next to nothing about the area. What are some surveys I should look at? It would be especially helpful if the data were at the tribal level of analysis. I'm at a bit of a loss and I'd appreciate any help you could offer! Thanks in advance.
Do you have recommendations for good data visualization software for the Hans Rosling fan? [more inside]
I have a list of about 350 words that I'm putting into a database that I'm building from scratch. For each word in the list, I'd like to include its American English dictionary definition (preferably American Heritage Dictionary), UrbanDictionary entry, and Wikipedia page link (I realize that entries may not exist for some columns for many of these words). Is there some way to streamline or data-mine this process, so I don't have to look up each word individually in 3 separate sources x350+, not to mention all the copying and pasting? [more inside]
Asking for Mr. C: Where would you look for research done on how people interact with weather data online? [more inside]
Please point me to quality books, articles, etc which argue both for and against the use of quantitative research techniques in the social sciences, particularly in political science and public policy. [more inside]
I have a wonderfully large dataset that I'm working with for a long-term project. I am analyzing a small section the dataset for my masters thesis. In meeting with my thesis advisor last week, she suggested I run some statistical tests of significance on the 4 tables I'm working with. She knows that I am yet to be versed in quantitative analysis methods (I've done solely qualitative work thus far) and that I'm under a massive time crunch to get this done. She suggested I seek help from others, as she doesn't want me to get bogged down with figuring out this step, and would rather I concentrate on analyzing the other aspects of this data. To this end, I'm wondering if somebody might be able to suggest the best type of test of significance to run, the easiest way to run it, and a good, simple resource for what the resultant values mean? [more inside]
Will Weft QDA help me do what I want to do (that is, perform a content analysis of threaded discussion transcripts with multiple coders)? If not what will? An impoverished grad student needs to know! [more inside]
Does anyone know the source of this bit of Social Media data? [more inside]
Looking for airport passenger data. More specifically, monthly deplaned (inbound) passenger data for Miami, Orlando and Tampa from 1975 to 2008. [more inside]
Lots of interrelated data, little idea of how to analyze it. What method or type of software is best for collecting complex information for future analysis? [more inside]
Need a way to train my ear to recognize tones accurately so that I can improve my transcription data.
Looking for a flash exercise/game or some way to train my ear to recognizing tone so that I can be a better transcriber. I'm currently researching two tonal languages and I need a way to practice/improve on my skills so that my data is more complete. I don't want to devote a lot of time to this, maybe a few hours. Are there any tools out there that can help me? [more inside]
Google-fu masters: I need some stats, STAT. [more inside]
Surveys on attitudes toward and behaviors of recycling and environmental concern? [more inside]
Beyond being a freelance writer and a professional practitioner of public affairs, my base nature, I realize, is that of a data researcher, collector and organizer. [more inside]
collaborative tools, "web 2.0", community - evidence that the tech is really worth the price? [more inside]
Is there open, free, or inexpensive comprehensive data on the location of coal-fired power plants in the United States available anywhere? [more inside]
Best ways to visually explore a large survey data set? [more inside]
Is OLAP my answer? Someone on a stats listserve I subscribe to mentioned OLAP in passing as an approach for managing lots of smaller data tables in a sane/integrated way. I've tried googling around and reading the info on various OLAP/BI project pages. I've since looked briefly at the Pentaho suite (Mondrian, and what was Kettle et al), JasperIntelligence, SpagoBI, and Palo OLAP...briefly only because I'm not quite that technically adept to see for sure how they work and whether these types of applications do what I want them to, given that I'm not coming from what I imagine a "typical" BI user's perspective/needs might be. At this point, my head is spinning a bit. [more inside]