Package | Description |
---|---|
org.apache.nutch.hostdb | |
org.apache.nutch.indexer |
Index content, configure and run indexing and cleaning jobs to
add, update, and delete documents from an index.
|
org.apache.nutch.parse |
The
Parse interface and related classes. |
Modifier and Type | Field and Description |
---|---|
protected URLNormalizers |
UpdateHostDbMapper.normalizers |
Modifier and Type | Field and Description |
---|---|
protected URLNormalizers |
IndexingFiltersChecker.normalizers |
Modifier and Type | Field and Description |
---|---|
protected URLNormalizers |
ParserChecker.normalizers |
Modifier and Type | Method and Description |
---|---|
static String |
ParseOutputFormat.filterNormalize(String fromUrl,
String toUrl,
String fromHost,
boolean ignoreInternalLinks,
boolean ignoreExternalLinks,
String ignoreExternalLinksMode,
URLFilters filters,
URLExemptionFilters exemptionFilters,
URLNormalizers normalizers) |
static String |
ParseOutputFormat.filterNormalize(String fromUrl,
String toUrl,
String origin,
boolean ignoreInternalLinks,
boolean ignoreExternalLinks,
String ignoreExternalLinksMode,
URLFilters filters,
URLExemptionFilters exemptionFilters,
URLNormalizers normalizers,
String urlNormalizerScope) |
Copyright © 2021 The Apache Software Foundation