|
Input/Output Configuration Properties
The following table lists the job configuration properties that are used to specify, among other things, the input table, output table, and scan specification. These properties can be supplied to a streaming MapReduce job with -Dproperty=value arguments.- Input/Output Configuration Properties
- Property Description Example Value
- hypertable.mapreduce.namespace Namespace for both input and output table /test
- hypertable.mapreduce.input.namespace Namespace for input table /test/intput
- hypertable.mapreduce.input.table Input table name wikipedia
- hypertable.mapreduce.input.scan_spec.columns Comma separated list of input columns id,title
- hypertable.mapreduce.input.scan_spec.options Input WHERE clause options. These options (i.e. LIMIT, OFFSET) are evaluated for each single job MAX_VERSIONS 1 KEYS_ONLY
- hypertable.mapreduce.input.scan_spec.row_interval Input row interval Dog <= ROW < Kitchen
- hypertable.mapreduce.input.scan_spec.timestamp_interval Timestamp filter TIMESTAMP >= 2011-11-21
- hypertable.mapreduce.input.include_timestamps Emit integer timestamp as the
- 1st field (nanoseconds since epoch) true
- hypertable.mapreduce.output.namespace Namespace containing output table /test/output
- hypertable.mapreduce.output.table Output table name wikipedia
- hypertable.mapreduce.output.mutator_flags flags parameter passed to mutator constructor (1 = NO_LOG_SYNC) 1
- hypertable.mapreduce.thriftbroker.framesize sets the ThriftClient framesize (in bytes); the default is 16 MB 20971520
复制代码 |
|