Package | Description |
---|---|
org.apache.nutch.tools |
Miscellaneous tools.
|
Modifier and Type | Class and Description |
---|---|
class |
AbstractCommonCrawlFormat
Abstract class that implements { @see org.apache.nutch.tools.CommonCrawlFormat } interface.
|
class |
CommonCrawlFormatJackson
This class provides methods to map crawled data on JSON using Jackson Streaming APIs.
|
class |
CommonCrawlFormatJettinson
This class provides methods to map crawled data on JSON using Jettinson APIs.
|
class |
CommonCrawlFormatSimple
This class provides methods to map crawled data on JSON using a StringBuilder object.
|
class |
CommonCrawlFormatWARC |
Modifier and Type | Method and Description |
---|---|
static CommonCrawlFormat |
CommonCrawlFormatFactory.getCommonCrawlFormat(String formatType,
Configuration nutchConf,
CommonCrawlConfig config) |
static CommonCrawlFormat |
CommonCrawlFormatFactory.getCommonCrawlFormat(String formatType,
String url,
Content content,
Metadata metadata,
Configuration nutchConf,
CommonCrawlConfig config)
Deprecated.
|
Copyright © 2021 The Apache Software Foundation