org.apache.hadoop.examples.terasort
Class TeraGenWithCRC

java.lang.Object
  extended by org.apache.hadoop.conf.Configured
      extended by org.apache.hadoop.examples.terasort.TeraGenWithCRC
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public class TeraGenWithCRC
extends org.apache.hadoop.conf.Configured
implements org.apache.hadoop.util.Tool

Generates the official terasort input data set. The user specifies the number of rows and the output directory, and this class runs a map/reduce program to generate the data. The format of the data is:

To run the program: bin/hadoop jar hadoop-*-examples.jar teragen 10000000000 in-dir
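
The row count in the command above (10000000000) is larger than Integer.MAX_VALUE (2147483647), so any code handling that argument has to treat it as a long. The following is a minimal sketch of validating the two arguments under that constraint. It is not the implementation used by this class: TeraGenArgs, its fields, and the usage message are hypothetical names introduced only for illustration, and the sketch deliberately avoids any Hadoop types.

```java
/**
 * Hypothetical helper, NOT part of Hadoop: validates the two
 * command-line arguments shown in the run example (row count, output dir).
 */
final class TeraGenArgs {
    final long rowCount;     // number of rows to generate; may exceed the int range
    final String outputDir;  // output directory, kept as a plain string in this sketch

    private TeraGenArgs(long rowCount, String outputDir) {
        this.rowCount = rowCount;
        this.outputDir = outputDir;
    }

    /** Parses {rows, outputDir}; throws IllegalArgumentException on bad input. */
    static TeraGenArgs parse(String[] args) {
        if (args == null || args.length != 2) {
            // Illustrative usage text, assumed for this sketch.
            throw new IllegalArgumentException("usage: <num rows> <output dir>");
        }
        long rows;
        try {
            // Long.parseLong is required: Integer.parseInt would reject 10000000000.
            rows = Long.parseLong(args[0]);
        } catch (NumberFormatException e) {
            throw new IllegalArgumentException("row count is not a valid long: " + args[0], e);
        }
        if (rows < 0) {
            throw new IllegalArgumentException("row count must not be negative: " + rows);
        }
        if (args[1].isEmpty()) {
            throw new IllegalArgumentException("output directory must not be empty");
        }
        return new TeraGenArgs(rows, args[1]);
    }

    public static void main(String[] args) {
        TeraGenArgs parsed = parse(new String[] {"10000000000", "in-dir"});
        System.out.println(parsed.rowCount + " rows -> " + parsed.outputDir);
    }
}
```

Rejecting malformed input before a job is submitted keeps the failure local and cheap. A real Tool would typically report such errors through the int status returned by run rather than by throwing.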


Nested Class Summary
static class TeraGenWithCRC.SortGenMapper
          The Mapper class that, given a row number, generates the appropriate output line.
 
Constructor Summary
TeraGenWithCRC()
           
 
Method Summary
static void main(String[] args)
           
 int run(String[] args)
           
 
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
 

Constructor Detail

TeraGenWithCRC

public TeraGenWithCRC()
Method Detail

run

public int run(String[] args)
        throws IOException
Specified by:
run in interface org.apache.hadoop.util.Tool
Parameters:
args - the command-line arguments
Throws:
IOException

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception


Copyright © 2014 Apache Software Foundation. All Rights Reserved.