I'm looking for a guide on how to set up and use Nutch, Solr and Drupal, particularly Nutch and Solr. I have never coded before.
I am, as per the books, a complete dummy. I am competent in using Microsoft Office products, for whatever that is worth. I have a reasonable user understanding of what Nutch, Solr and Drupal do and have read up on them. I have worked on development projects in the content management space before as a project owner.
I now want to try them out for myself (before writing a spec to hand over to third-party developers) so I can at least get a sense of what is what.
Online tutorials like this assume more knowledge of the very basics than I have. For example, the instructions for setting up Nutch say:
1. Setup Nutch from binary distribution
- Download a binary package (apache-nutch-1.X-bin.zip) from here.
- Unzip your binary Nutch package. There should be a folder apache-nutch-1.X.
- cd apache-nutch-1.X/ [HUH?]
- From now on, we are going to use ${NUTCH_RUNTIME_HOME} to refer to the current directory (apache-nutch-1.X/).
I am using a Mac, and am a recent switcher from PCs.
I don't mind paying for access to content or buying books. I already own the most recent Packt guide on Solr, which makes sense to me as long as I ignore the parts that talk about coding. I don't mind paying someone to sit down and show me step by step how this works. Apparently you can set up a basic Nutch web crawl in one hour. I suspect this is not the case for me.
Is there a guide out there for complete beginners? Failing that, does anybody know a good, cheap London-based person who could sit me down and give me a basic tutorial in this stuff?
posted by kamelhoecker at 2:35 AM on January 24