|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.nutch.microformats.reltag.RelTagIndexingFilter
public class RelTagIndexingFilter
An IndexingFilter
that adds tag
field(s) to the document.
Field Summary |
---|
Fields inherited from interface org.apache.nutch.indexer.IndexingFilter |
---|
X_POINT_ID |
Constructor Summary | |
---|---|
RelTagIndexingFilter()
|
Method Summary | |
---|---|
NutchDocument |
filter(NutchDocument doc,
String url,
WebPage page)
The RelTagIndexingFilter filter object. |
Configuration |
getConf()
Get the Configuration object |
Collection<WebPage.Field> |
getFields()
Gets all the fields for a given WebPage
Many datastores need to setup the mapreduce job by specifying the fields
needed. |
void |
setConf(Configuration conf)
Set the Configuration object |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public RelTagIndexingFilter()
Method Detail |
---|
public Collection<WebPage.Field> getFields()
WebPage
Many datastores need to setup the mapreduce job by specifying the fields
needed. All extensions that work on WebPage are able to specify what fields
they need.
getFields
in interface FieldPluggable
public void setConf(Configuration conf)
Configuration
object
setConf
in interface Configurable
public Configuration getConf()
Configuration
object
getConf
in interface Configurable
public NutchDocument filter(NutchDocument doc, String url, WebPage page) throws IndexingException
RelTagIndexingFilter
filter object.
filter
in interface IndexingFilter
doc
- The NutchDocument
objecturl
- URL to be filtered for rel-tag'spage
- WebPage
object relative to the URL
IndexingException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |