public static class ArcSegmentCreator.ArcSegmentCreatorMapper extends Mapper<Text,BytesWritable,Text,NutchWritable>
Mapper.Context
Modifier and Type | Field and Description |
---|---|
static String |
URL_VERSION |
Constructor and Description |
---|
ArcSegmentCreatorMapper() |
Modifier and Type | Method and Description |
---|---|
void |
map(Text key,
BytesWritable bytes,
Mapper.Context context)
Runs the Map job to translate an arc record into output for Nutch segments.
|
void |
setup(Mapper.Context context)
Configures the job mapper.
|
public static final String URL_VERSION
public void setup(Mapper.Context context)
Configures the job mapper. Sets the url filters, scoring filters, url normalizers and other relevant data.
setup
in class Mapper<Text,BytesWritable,Text,NutchWritable>
context
- The task context.public void map(Text key, BytesWritable bytes, Mapper.Context context) throws IOException, InterruptedException
Runs the Map job to translate an arc record into output for Nutch segments.
map
in class Mapper<Text,BytesWritable,Text,NutchWritable>
key
- The arc record header.bytes
- The arc record raw content bytes.context
- The context of the mapreduce job.IOException
InterruptedException
Copyright © 2021 The Apache Software Foundation