public class DistCpUtils extends Object
| Constructor and Description |
|---|
DistCpUtils() |
| Modifier and Type | Method and Description |
|---|---|
static boolean |
checksumsAreEqual(org.apache.hadoop.fs.FileSystem sourceFS,
org.apache.hadoop.fs.Path source,
org.apache.hadoop.fs.FileChecksum sourceChecksum,
org.apache.hadoop.fs.FileSystem targetFS,
org.apache.hadoop.fs.Path target)
Utility to compare checksums for the paths specified.
|
static boolean |
compareFs(org.apache.hadoop.fs.FileSystem srcFs,
org.apache.hadoop.fs.FileSystem destFs) |
static long |
getFileSize(org.apache.hadoop.fs.Path path,
org.apache.hadoop.conf.Configuration configuration)
Retrieves size of the file at the specified path.
|
static DecimalFormat |
getFormatter() |
static int |
getInt(org.apache.hadoop.conf.Configuration configuration,
String label)
Utility to retrieve a specified key from a Configuration.
|
static long |
getLong(org.apache.hadoop.conf.Configuration configuration,
String label)
Utility to retrieve a specified key from a Configuration.
|
static String |
getRelativePath(org.apache.hadoop.fs.Path sourceRootPath,
org.apache.hadoop.fs.Path childPath)
Gets relative path of child path with respect to a root path
For ex.
|
static Class<? extends org.apache.hadoop.mapreduce.InputFormat> |
getStrategy(org.apache.hadoop.conf.Configuration conf,
DistCpOptions options)
Returns the class that implements a copy strategy.
|
static String |
getStringDescriptionFor(long nBytes) |
static String |
packAttributes(EnumSet<DistCpOptions.FileAttribute> attributes)
Pack file preservation attributes into a string, containing
just the first character of each preservation attribute
|
static void |
preserve(org.apache.hadoop.fs.FileSystem targetFS,
org.apache.hadoop.fs.Path path,
org.apache.hadoop.fs.FileStatus srcFileStatus,
EnumSet<DistCpOptions.FileAttribute> attributes)
Preserve attribute on file matching that of the file status being sent
as argument.
|
static <T> void |
publish(org.apache.hadoop.conf.Configuration configuration,
String label,
T value)
Utility to publish a value to a configuration.
|
static org.apache.hadoop.fs.Path |
sortListing(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.conf.Configuration conf,
org.apache.hadoop.fs.Path sourceListing)
Sort sequence file containing FileStatus and Text as key and value respecitvely
|
static EnumSet<DistCpOptions.FileAttribute> |
unpackAttributes(String attributes)
Un packs preservation attribute string containing the first character of
each preservation attribute back to a set of attributes to preserve
|
public static long getFileSize(org.apache.hadoop.fs.Path path,
org.apache.hadoop.conf.Configuration configuration)
throws IOException
path - The path of the file whose size is sought.configuration - Configuration, to retrieve the appropriate FileSystem.IOException, - on failure.IOExceptionpublic static <T> void publish(org.apache.hadoop.conf.Configuration configuration,
String label,
T value)
T - The type of the value.configuration - The Configuration to which the value must be written.label - The label for the value being published.value - The value being published.public static int getInt(org.apache.hadoop.conf.Configuration configuration,
String label)
configuration - The Configuration in which the key is sought.label - The key being sought.public static long getLong(org.apache.hadoop.conf.Configuration configuration,
String label)
configuration - The Configuration in which the key is sought.label - The key being sought.public static Class<? extends org.apache.hadoop.mapreduce.InputFormat> getStrategy(org.apache.hadoop.conf.Configuration conf, DistCpOptions options)
conf - - Configuration objectoptions - - Handle to input optionspublic static String getRelativePath(org.apache.hadoop.fs.Path sourceRootPath, org.apache.hadoop.fs.Path childPath)
sourceRootPath - - Source root pathchildPath - - Path for which relative path is requiredpublic static String packAttributes(EnumSet<DistCpOptions.FileAttribute> attributes)
attributes - - Attribute set to preservepublic static EnumSet<DistCpOptions.FileAttribute> unpackAttributes(String attributes)
attributes - - Attribute stringpublic static void preserve(org.apache.hadoop.fs.FileSystem targetFS,
org.apache.hadoop.fs.Path path,
org.apache.hadoop.fs.FileStatus srcFileStatus,
EnumSet<DistCpOptions.FileAttribute> attributes)
throws IOException
targetFS - - File systempath - - Path that needs to preserve original file statussrcFileStatus - - Original file statusattributes - - Attribute set that need to be preservedIOException - - Exception if any (particularly relating to group/owner
change or any transient error)public static org.apache.hadoop.fs.Path sortListing(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.conf.Configuration conf,
org.apache.hadoop.fs.Path sourceListing)
throws IOException
fs - - File Systemconf - - ConfigurationsourceListing - - Source listing fileIOException - - Any exception during sort.public static DecimalFormat getFormatter()
public static String getStringDescriptionFor(long nBytes)
public static boolean checksumsAreEqual(org.apache.hadoop.fs.FileSystem sourceFS,
org.apache.hadoop.fs.Path source,
org.apache.hadoop.fs.FileChecksum sourceChecksum,
org.apache.hadoop.fs.FileSystem targetFS,
org.apache.hadoop.fs.Path target)
throws IOException
sourceFS - FileSystem for the source path.source - The source path.sourceChecksum - The checksum of the source file. If it is null we
still need to retrieve it through sourceFS.targetFS - FileSystem for the target path.target - The target path.IOException - if there's an exception while retrieving checksums.public static boolean compareFs(org.apache.hadoop.fs.FileSystem srcFs,
org.apache.hadoop.fs.FileSystem destFs)
Copyright © 2014 Apache Software Foundation. All Rights Reserved.