public class OrphanScoringFilter extends AbstractScoringFilter
Modifier and Type | Field and Description |
---|---|
static Text |
ORPHAN_KEY_WRITABLE |
X_POINT_ID
Constructor and Description |
---|
OrphanScoringFilter() |
Modifier and Type | Method and Description |
---|---|
void |
orphanedScore(Text url,
CrawlDatum datum)
This method may change the score or status of CrawlDatum during CrawlDb
update, when the URL is neither fetched nor has any inlinks.
|
void |
setConf(Configuration conf) |
void |
updateDbScore(Text url,
CrawlDatum old,
CrawlDatum datum,
List<CrawlDatum> inlinks)
Used for orphan control.
|
distributeScoreToOutlinks, generatorSortValue, getConf, indexerScore, initialScore, injectedScore, passScoreAfterParsing, passScoreBeforeParsing
public static Text ORPHAN_KEY_WRITABLE
public void setConf(Configuration conf)
setConf
in interface Configurable
setConf
in class AbstractScoringFilter
public void updateDbScore(Text url, CrawlDatum old, CrawlDatum datum, List<CrawlDatum> inlinks) throws ScoringFilterException
updateDbScore
in interface ScoringFilter
updateDbScore
in class AbstractScoringFilter
url
- of the recordold
- CrawlDatumdatum
- new CrawlDatuminLinks
- list of inlinked CrawlDatumsScoringFilterException
public void orphanedScore(Text url, CrawlDatum datum)
ScoringFilter
url
- URL of the pagedatum
- CrawlDatum for pageCopyright © 2021 The Apache Software Foundation