| 
    Reranker Framework (ReFr)
    
   Reranking framework for structure prediction and discriminative language modeling 
   | 
 
A simple class interface for running hadoop commands. More...
Public Member Functions | |
| def | __init__ | 
| def | CheckHDir | 
| Check if a directory exists on HDFS.  More... | |
| def | CheckRemoveHDir | 
| Function to check for the existence of a directory on the HDFS.  More... | |
| def | CheckHDFSFile | 
| Function to check for a file on HDFS.  More... | |
| def | CheckInputFile | 
| Check for an input file and prepare it for MR processing.  More... | |
| def | CatPipe | 
| def | CatPipeRead | 
| def | RunMR | 
| RunMR Run a MapReduce.  More... | |
Public Attributes | |
| hadoopmr_ | |
| hadooplibpath_ | |
| hadoopfs_ | |
| hadooptest_ | |
| hadoopcat_ | |
| hadoopput_ | |
| hadoopmove_ | |
| hadoopget_ | |
| hadoopmkdir_ | |
| hadooprmr_ | |
A simple class interface for running hadoop commands.
Definition at line 42 of file hadooputil.py.
| def hadooputil.HadoopInterface.__init__ | ( | self, | |
| hbasedir, | |||
| streamingloc, | |||
| minsplitsize, | |||
| tasktimeout, | |||
| libpath | |||
| ) | 
Definition at line 43 of file hadooputil.py.
| def hadooputil.HadoopInterface.CatPipe | ( | self, | |
| hdfsfiles, | |||
| pipecmd | |||
| ) | 
Definition at line 133 of file hadooputil.py.
| def hadooputil.HadoopInterface.CatPipeRead | ( | self, | |
| hdfsfiles, | |||
| pipecmd, | |||
| retval | |||
| ) | 
Definition at line 138 of file hadooputil.py.
| def hadooputil.HadoopInterface.CheckHDFSFile | ( | self, | |
| filename | |||
| ) | 
Function to check for a file on HDFS.
| [in] | filename | The file to check for. | 
Definition at line 86 of file hadooputil.py.
| def hadooputil.HadoopInterface.CheckHDir | ( | self, | |
| directory | |||
| ) | 
Check if a directory exists on HDFS.
| [in] | directory | name of directory to check | 
Definition at line 65 of file hadooputil.py.
| def hadooputil.HadoopInterface.CheckInputFile | ( | self, | |
| inputfile, | |||
| hdfsinputdir, | |||
| outputdir, | |||
| force, | |||
| uncompress | |||
| ) | 
Check for an input file and prepare it for MR processing.
| [in] | inputfile | Name of the local file to prepare. | 
| [in] | hdfsinputdir | HDFS directory for data staging. | 
| [in] | outputdir | Local file system directory for the output. | 
| [in] | force | Reprocess data even if files already exist. | 
| [in] | uncompress | Uncompress data, if compressed, before running. | 
Definition at line 102 of file hadooputil.py.
| def hadooputil.HadoopInterface.CheckRemoveHDir | ( | self, | |
| directory, | |||
| remove | |||
| ) | 
Function to check for the existence of a directory on the HDFS.
| [in] | directory | Direcotry to check. | 
| [in] | remove | Remove the directory if it exists. | 
Definition at line 73 of file hadooputil.py.
| def hadooputil.HadoopInterface.RunMR | ( | self, | |
| input_files, | |||
| outputdir, | |||
| reduce_tasks, | |||
| reducer, | |||
| mapper, | |||
| mroptions | |||
| ) | 
RunMR Run a MapReduce.
| [in] | input_files | HDFS location of input files. | 
| [in] | outputdir | HDFS location of output. | 
| [in] | reduce_tasks | Number of reducer tasks (0 = use default). | 
| [in] | reducer | Full string of streaming reducer command. | 
| [in] | mapper | Full string of streaming mapper command. | 
| [in] | mroptions | Addition streaming MR options (usually specified with -D). | 
Definition at line 151 of file hadooputil.py.
| hadooputil.HadoopInterface.hadoopcat_ | 
Definition at line 55 of file hadooputil.py.
| hadooputil.HadoopInterface.hadoopfs_ | 
Definition at line 53 of file hadooputil.py.
| hadooputil.HadoopInterface.hadoopget_ | 
Definition at line 58 of file hadooputil.py.
| hadooputil.HadoopInterface.hadooplibpath_ | 
Definition at line 50 of file hadooputil.py.
| hadooputil.HadoopInterface.hadoopmkdir_ | 
Definition at line 59 of file hadooputil.py.
| hadooputil.HadoopInterface.hadoopmove_ | 
Definition at line 57 of file hadooputil.py.
| hadooputil.HadoopInterface.hadoopmr_ | 
Definition at line 44 of file hadooputil.py.
| hadooputil.HadoopInterface.hadoopput_ | 
Definition at line 56 of file hadooputil.py.
| hadooputil.HadoopInterface.hadooprmr_ | 
Definition at line 60 of file hadooputil.py.
| hadooputil.HadoopInterface.hadooptest_ | 
Definition at line 54 of file hadooputil.py.
 1.8.6