How can I rate abstracts for relevance and quality?
November 5, 2009 10:03 AM
Subscribe
I'm looking for ways to rate abstracts for relevance and quality so that I can filter out the best ones.
I'm going to be doing a very large literature review. In preparation I need to set up a rules-based system for filtering through, or triaging, the abstracts I retrieve, in order to select a subset (those articles I'll actually retrieve and read). Ideally this system will allow me to rate abstracts for quality and/or how well they match the topic of concern. (This is all work-related, so I can't be more specific, but the topic is generally in the social sciences.) There might have to be some qualitative or subjective aspect to this ratings system, but there are a lot of ways to rate abstracts more objectively as well (breadth of study population, number of search keywords "hit", etc.).
I'm not formally trained in information science, but I suspect such systems have already been created and used, and I don't want to reinvent the wheel. I know there are specialists in information science here that might be able to keep me from doing so. Could you point me to projects where something similar has been done, or places where such systems have been reviewed, described, or codified? Can you suggest any other resources that might be of use?
If it matters, I think I will have around one or two thousand abstracts to filter through.
posted by Herkimer to work & money (6 comments total)
1 user marked this as a favorite
Where did these abstracts come from (like, did someone hand you a pile of paper? Are they search results?) What form are your abstracts in? This is information retrieval theory in action! If these items exist in a database already, for example, then the structure of the database (keywords, thesauri, taxonomies etc) will affect your search strategies.
To start with, you may want to think about whether you want to err on the side of caution (get everything you want, maybe some stuff you don't want, or "high recall") or whether you want to cast care to the winds and only read stuff you do want (risking missing some things you might want to see, of "high precision".) Obviously you want to maximize both, but it may be a trade-off.
Let's pretend that you're doing anthropology, and you have a bunch of abstracts about, say, post-Colonial theory but you really only want stuff about Gayatri Spivak (WARNING: HERE BE HALF-REMEMBERED UNDERGRADUATE STUDIES) and the subaltern. So obviously you will want everything with "Spivak" and "subaltern", but maybe you also want things that touch on "strategic essentialism." Maybe you want things where "strategic" and "essentialism" are within a few words of each other, in order to include sentences like "Dude, that essentialism is totally strategic." The more you include, the more likely you are to get stuff that's less relevant, but you're more likely to see everything that is relevant.
You can also use these concepts to prioritize - to say "these things I KNOW will be relevant", and deal with those first, and leave the less certain stuff for later.
I'm rambling on because this is interesting to me, but I'm sure there are current social science librarians who can help you further. In any case, feel free to memail me.
posted by chesty_a_arthur at 10:59 AM on November 5