About the Nutch Users Group Group
Have something to say?
Join LinkedIn for free to participate in the conversation. When you join, you can comment and post your own discussions.
Join LinkedInMost Popular Discussions
customize crawling need help
as i am new user successively crawl all data but i want to crawl a particular field from the source code like <logodesc> </logodesc> but ...

Introduction to Nutch, Part 1: Crawling today.java.net
Nutch is an open source Java implementation of a search engine. It provides all of the tools you need to run your own search engine. But why would anyone want to run their own search engine? After all, there's always Google....

Why the nutch's search result always shows: Hits 0-0 (out of about 0 total matching pages)
I deployed .war file of nutch 1.2 on Tomcat and opened it by http://localhost:8080/nutch-1.2
It shows quite good. but the search result ...

How to get that index from nutch
I am new to nutch.
I use : bin/nutch crawl urls -solr http://localhost:8983/solr/ -depth 3 -topN 5 to crawl a website. built a folder ...

solrUrl is not set, indexing will be skipped...
Exception in thread "main" java.net.ConnectException: Call to localhost/127.0.0.1:9001 failed on connection exception: ...


Another newbie for Nutch
Hey guys,
Can any one help me out with some useful links that helps me in learning NUTCH along with java examples too.
Thanks in ...

Nutch Users Group is now an open group
I am pleased to announce that, as the owner of this group, I have just switched us to an open discussion group. All future discussions ...

Is Nutch the right choice ?
we are looking for an Open Source Web Search engine where we can search the world wide web based on keywords and use the results in our ...

Urgently looking for Java Nutch SSE
EXperience : 5-7.5 Yrs
Job Location : Mumbai
Primary Skills: Core Java 1.3 and above, ...

The Engineering behind LinkedIn Products “You May Like” blog.linkedin.com
Code Alert! This is a part of our continuing series on engineering and analytics at LinkedIn. If this isn’t your cup of Java, check back tomorrow for regular LinkedIn programming. - Ed. We are approaching the first anniversary...

Interested in Nutch
One of the things I don't see is an easy way to eliminate urls from the index if they don't meet a certain set of criteria(whether ...

Can i use Nutch for Keyword search with out a URL, i mean keep it open for world wide web like google?
we are trying to see if we can use Nutch in the same way as we use Google ? just give keywords and get results, we do not have a URL ...
