public class HtmlUnitWebDriver
extends HtmlUnitDriver
Constructor and Description |
---|
HtmlUnitWebDriver() |
Modifier and Type | Method and Description |
---|---|
static void |
cleanUpDriver(WebDriver driver) |
static WebDriver |
getDriverForPage(String url,
Configuration conf) |
static String |
getHTMLContent(WebDriver driver,
Configuration conf) |
static String |
getHtmlPage(String url,
Configuration conf)
Function for obtaining the HTML BODY using the selected
selenium webdriver
There are a number of configuration properties within
nutch-site.xml which determine whether to
take screenshots of the rendered pages and persist them
as timestamped .png's into HDFS. |
protected WebClient |
modifyWebClient(WebClient client) |
protected WebClient modifyWebClient(WebClient client)
public static WebDriver getDriverForPage(String url, Configuration conf)
public static String getHTMLContent(WebDriver driver, Configuration conf)
public static void cleanUpDriver(WebDriver driver)
public static String getHtmlPage(String url, Configuration conf)
nutch-site.xml
which determine whether to
take screenshots of the rendered pages and persist them
as timestamped .png's into HDFS.url
- the URL to fetch and renderconf
- the Configuration
Copyright © 2021 The Apache Software Foundation