Package | Description |
---|---|
org.apache.nutch.scoring |
The
ScoringFilter interface. |
org.apache.nutch.scoring.depth |
Scoring filter to stop crawling at a configurable depth
(number of "hops" from seed URLs).
|
org.apache.nutch.scoring.link |
Scoring filter used in conjunction with
WebGraph . |
org.apache.nutch.scoring.opic |
Scoring filter implementing a variant of the Online Page Importance Computation
(OPIC) algorithm.
|
org.apache.nutch.scoring.orphan |
Scoring filter to modify score or status of orphaned pages (no inlinks found
for a configurable amount of time).
|
org.apache.nutch.scoring.similarity | |
org.apache.nutch.scoring.tld |
Top Level Domain Scoring plugin.
|
org.apache.nutch.scoring.urlmeta |
URL Meta Tag Scoring Plugin
|
Modifier and Type | Class and Description |
---|---|
class |
AbstractScoringFilter |
class |
ScoringFilters
Creates and caches
ScoringFilter implementing plugins. |
Modifier and Type | Class and Description |
---|---|
class |
DepthScoringFilter
This scoring filter limits the number of hops from the initial seed urls.
|
Modifier and Type | Class and Description |
---|---|
class |
LinkAnalysisScoringFilter |
Modifier and Type | Class and Description |
---|---|
class |
OPICScoringFilter
This plugin implements a variant of an Online Page Importance Computation
(OPIC) score, described in this paper:
Abiteboul, Serge and Preda, Mihai and Cobena, Gregory (2003), Adaptive
On-Line Page Importance Computation.
|
Modifier and Type | Class and Description |
---|---|
class |
OrphanScoringFilter
Orphan scoring filter that determines whether a page has become orphaned,
e.g.
|
Modifier and Type | Class and Description |
---|---|
class |
SimilarityScoringFilter |
Modifier and Type | Class and Description |
---|---|
class |
TLDScoringFilter
Scoring filter to boost tlds.
|
Modifier and Type | Class and Description |
---|---|
class |
URLMetaScoringFilter
For documentation:
org.apache.nutch.scoring.urlmeta |
Copyright © 2021 The Apache Software Foundation