Package | Description |
---|---|
org.apache.nutch.tools | Miscellaneous tools. |

Modifier and Type | Method and Description |
---|---|
static CommonCrawlFormat | CommonCrawlFormatFactory.getCommonCrawlFormat(String formatType, Configuration nutchConf, CommonCrawlConfig config) |
static CommonCrawlFormat | CommonCrawlFormatFactory.getCommonCrawlFormat(String formatType, String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config) Deprecated. |

Constructor and Description |
---|
AbstractCommonCrawlFormat(String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config) |
CommonCrawlDataDumper(CommonCrawlConfig config) Constructor |
CommonCrawlFormatJackson(Configuration nutchConf, CommonCrawlConfig config) |
CommonCrawlFormatJackson(String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config) |
CommonCrawlFormatJettinson(String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config) |
CommonCrawlFormatSimple(String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config) |
CommonCrawlFormatWARC(Configuration nutchConf, CommonCrawlConfig config) |
CommonCrawlFormatWARC(String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config, ParseData parseData) |

Copyright © 2021 The Apache Software Foundation