Class TsvImporterMapper
java.lang.Object
org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,ImmutableBytesWritable,Put>
org.apache.hadoop.hbase.mapreduce.TsvImporterMapper
@Public
public class TsvImporterMapper
extends org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,ImmutableBytesWritable,Put>
Write table content out to files in HDFS.
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
org.apache.hadoop.mapreduce.Mapper.Context
-
Field Summary
Modifier and Type, Field, Description
private org.apache.hadoop.mapreduce.Counter badLineCount
protected String cellVisibilityExpr
protected org.apache.hadoop.conf.Configuration conf
private String hfileOutPath
protected CellCreator kvCreator
private boolean logBadLines
protected ImportTsv.TsvParser parser
private String separator: Column separator
private boolean skipBadLines: Should skip bad lines
private boolean skipEmptyColumns: Should skip empty columns
tags: List of cell tags
protected long ts: Timestamp for all inserted rows
protected long ttl
-
Constructor Summary
-
Method Summary
Modifier and Type, Method, Description
protected void doSetup(org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, Put>.org.apache.hadoop.mapreduce.Mapper.Context context): Handles common parameter initialization that a subclass might want to leverage.
org.apache.hadoop.mapreduce.Counter getBadLineCount()
boolean getSkipBadLines()
long getTs()
void incrementBadLineCount(int count)
void map(org.apache.hadoop.io.LongWritable offset, org.apache.hadoop.io.Text value, org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, Put>.org.apache.hadoop.mapreduce.Mapper.Context context): Convert a line of TSV text into an HBase table row.
protected void populatePut(byte[] lineBytes, ImportTsv.TsvParser.ParsedLine parsed, Put put, int i)
protected void setup(org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, Put>.org.apache.hadoop.mapreduce.Mapper.Context context): Handles initializing this class with objects specific to it (i.e., the parser).
Methods inherited from class org.apache.hadoop.mapreduce.Mapper
cleanup, run
-
Field Details
-
ts
Timestamp for all inserted rows
-
separator
Column separator
-
skipBadLines
Should skip bad lines
-
skipEmptyColumns
Should skip empty columns
-
badLineCount
-
logBadLines
-
parser
-
conf
-
cellVisibilityExpr
-
ttl
-
kvCreator
-
hfileOutPath
-
tags
List of cell tags
-
-
Constructor Details
-
TsvImporterMapper
public TsvImporterMapper()
-
-
Method Details
-
getTs
-
getSkipBadLines
-
getBadLineCount
-
incrementBadLineCount
-
setup
protected void setup(org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, Put>.org.apache.hadoop.mapreduce.Mapper.Context context)
Handles initializing this class with objects specific to it (i.e., the parser). Common initialization that might be leveraged by a subclass is done in doSetup. A subclass can therefore override this method and call doSetup before handling its own custom parameters.
Overrides:
setup in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, Put>
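The split between setup and doSetup described here can be illustrated without a Hadoop cluster. The following is a minimal, self-contained sketch and not HBase code: BaseMapper, CustomMapper, the Map<String, String> standing in for the job configuration, and the keys sketch.separator and sketch.custom.param are all hypothetical. In a real subclass both methods take the Mapper.Context parameter from the signature of this method.

```java
import java.util.HashMap;
import java.util.Map;

// Stand-in sketch of the setup/doSetup split; none of these types are HBase classes.
final class SetupSketch {

    /** Plays the role of TsvImporterMapper (hypothetical stand-in). */
    static class BaseMapper {
        protected String separator; // common parameter, set in doSetup
        protected String parser;    // class-specific object, set in setup

        /** Class-specific initialization; delegates the common part to doSetup. */
        protected void setup(Map<String, String> conf) {
            doSetup(conf);
            parser = "parser[" + separator + "]";
        }

        /** Common parameter initialization that subclasses can reuse. */
        protected void doSetup(Map<String, String> conf) {
            separator = conf.getOrDefault("sketch.separator", "\t");
        }
    }

    /** Overrides setup, reuses doSetup, then reads its own parameter. */
    static class CustomMapper extends BaseMapper {
        String customParam;

        @Override
        protected void setup(Map<String, String> conf) {
            doSetup(conf);                                 // shared parameters first
            customParam = conf.get("sketch.custom.param"); // then subclass-specific ones
            // The base-class parser is deliberately not created here.
        }
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        conf.put("sketch.separator", ",");
        conf.put("sketch.custom.param", "demo");

        CustomMapper mapper = new CustomMapper();
        mapper.setup(conf);
        System.out.println("separator=" + mapper.separator
            + " customParam=" + mapper.customParam
            + " parser=" + mapper.parser);
    }
}
```

Because CustomMapper calls doSetup rather than super.setup, it picks up the shared parameters while skipping the base class's own objects, which is the choice the description leaves open to subclasses.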
-
doSetup
protected void doSetup(org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, Put>.org.apache.hadoop.mapreduce.Mapper.Context context)
Handles common parameter initialization that a subclass might want to leverage.
-
map
public void map(org.apache.hadoop.io.LongWritable offset, org.apache.hadoop.io.Text value, org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, Put>.org.apache.hadoop.mapreduce.Mapper.Context context) throws IOException
Convert a line of TSV text into an HBase table row.
Overrides:
map in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable, org.apache.hadoop.io.Text, ImmutableBytesWritable, Put>
Throws:
IOException
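To make the line-to-row conversion concrete, the sketch below turns one delimited line into a row key plus column/value pairs, and shows how a skip-bad-lines flag and a bad-line counter could interact. It is a stand-alone illustration under stated assumptions, not the HBase implementation: TsvLineSketch, Row, toRow, the column names, and the convention that the first field is the row key are all hypothetical. The real mapper parses with ImportTsv.TsvParser and emits Put objects through the Context.

```java
import java.io.IOException;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Pattern;

// Stand-in sketch: converts a delimited line into a row; not HBase code.
final class TsvLineSketch {

    /** Hypothetical row holder: a key plus column -> value cells. */
    static final class Row {
        final String key;
        final Map<String, String> cells = new LinkedHashMap<>();
        Row(String key) { this.key = key; }
    }

    private final String[] columns;         // column names; index 0 is the row key (assumption)
    private final String separator;
    private final boolean skipBadLines;
    private final boolean skipEmptyColumns;
    private long badLineCount;

    TsvLineSketch(String[] columns, String separator,
                  boolean skipBadLines, boolean skipEmptyColumns) {
        this.columns = columns.clone();
        this.separator = separator;
        this.skipBadLines = skipBadLines;
        this.skipEmptyColumns = skipEmptyColumns;
    }

    long getBadLineCount() { return badLineCount; }

    /** Returns the parsed row, or null when a bad line was counted and skipped. */
    Row toRow(String line) throws IOException {
        // A limit of -1 keeps trailing empty fields so the column count stays accurate.
        String[] fields = line.split(Pattern.quote(separator), -1);
        if (fields.length != columns.length || fields[0].isEmpty()) {
            if (skipBadLines) {
                badLineCount++;
                return null;
            }
            throw new IOException("Bad line: " + line);
        }
        Row row = new Row(fields[0]);
        for (int i = 1; i < fields.length; i++) {
            if (skipEmptyColumns && fields[i].isEmpty()) {
                continue; // leave empty cells out of the row
            }
            row.cells.put(columns[i], fields[i]);
        }
        return row;
    }

    public static void main(String[] args) throws IOException {
        TsvLineSketch sketch = new TsvLineSketch(
            new String[] {"ROW_KEY", "d:name", "d:city"}, "\t", true, true);
        Row row = sketch.toRow("r1\tAda\t");
        System.out.println(row.key + " " + row.cells);
        sketch.toRow("only-one-field"); // wrong field count, counted as a bad line
        System.out.println("bad lines: " + sketch.getBadLineCount());
    }
}
```

With skipBadLines set to false, the same malformed input raises an IOException instead of being counted, which is consistent with the Throws clause of this method.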
-
populatePut
protected void populatePut(byte[] lineBytes, ImportTsv.TsvParser.ParsedLine parsed, Put put, int i) throws ImportTsv.TsvParser.BadTsvLineException, IOException
-