Hadoop

Apache PIG

Real-world problem solving with Hadoop and Pig: http://www.slideshare.net/hadoop/practical-problem-solving-with-apache-hadoop-pig

Apache Nutch/SOLR links

Compiling Nutch 2.1

Edit nutch-site.xml to set the crawler's agent name and to make HBaseStore the storage class:

<property>
 <name>http.agent.name</name>
 <value>My Nutch Spider</value>
</property>
<property>
 <name>storage.data.store.class</name>
 <value>org.apache.gora.hbase.store.HBaseStore</value>
 <description>Default class for storing data</description>
</property>

Uncomment the following line in ivy/ivy.xml to enable the Gora/HBase backend:

 <dependency org="org.apache.gora" name="gora-hbase" rev="0.2.1" conf="*->default" />

Ensure that HBaseStore is set as the default datastore in gora.properties

gora.datastore.default=org.apache.gora.hbase.store.HBaseStore
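
The three settings above are easy to get out of sync, so a quick check before building can save a failed compile. The following is a minimal sketch, not part of Nutch itself: the function names (`nutch_site_properties`, `gora_hbase_enabled`, `properties_value`, `check_hbase_backend`) are hypothetical helpers invented for this note, and the simple `key=value` parsing assumes gora.properties uses no line continuations.

```python
import xml.etree.ElementTree as ET

# Class name taken from the settings listed above.
HBASE_STORE = "org.apache.gora.hbase.store.HBaseStore"


def nutch_site_properties(xml_text):
    """Return {name: value} for every <property> element in nutch-site.xml."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def gora_hbase_enabled(ivy_xml_text):
    """True if ivy.xml contains an uncommented gora-hbase <dependency>."""
    # ElementTree discards XML comments by default, so a dependency that is
    # still commented out never shows up in the parsed tree.
    root = ET.fromstring(ivy_xml_text)
    return any(dep.get("name") == "gora-hbase" for dep in root.iter("dependency"))


def properties_value(properties_text, key):
    """Return the value for key in simple key=value text, or None if absent."""
    for line in properties_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or line.startswith("!"):
            continue  # skip blank lines and comments
        k, sep, v = line.partition("=")
        if sep and k.strip() == key:
            return v.strip()
    return None


def check_hbase_backend(nutch_site_xml, ivy_xml, gora_properties):
    """Return a list of problems found; an empty list means all checks passed."""
    problems = []
    props = nutch_site_properties(nutch_site_xml)
    if not props.get("http.agent.name"):
        problems.append("http.agent.name is missing or empty in nutch-site.xml")
    if props.get("storage.data.store.class") != HBASE_STORE:
        problems.append("storage.data.store.class is not HBaseStore in nutch-site.xml")
    if not gora_hbase_enabled(ivy_xml):
        problems.append("gora-hbase dependency is still commented out in ivy.xml")
    if properties_value(gora_properties, "gora.datastore.default") != HBASE_STORE:
        problems.append("gora.datastore.default is not HBaseStore in gora.properties")
    return problems
```

To use it, read the three files as text and pass their contents to `check_hbase_backend`. The file locations conf/nutch-site.xml and conf/gora.properties are assumptions about the source tree layout; ivy/ivy.xml is the path given above.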

https://www.youtube.com/watch?v=_HLoH_PgrLk

SOLR VelocityResponseWriter: http://wiki.apache.org/solr/VelocityResponseWriter#Instructions_to_use.2C_Solr_4.0.2B-
