我正在使用CDH5.3,我试图编写一个mapreduce程序来扫描一个表并进行一些处理。我已经创建了一个扩展TableMapper的映射程序,我得到的例外是:
java.io.FileNotFoundException: File does not exist: hdfs://localhost:54310/usr/local/hadoop-2.5-cdh-3.0/share/hadoop/common/lib/protobuf-java-2.5.0.jar
at org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(Distribute
使用hadoop流:
hadoop jar hadoop/contrib/streaming/hadoop-streaming-1.0.3.jar -file mapper.rb -mapper mapper.rb -file reducer.rb -reducer reducer.rb -input textfile.txt -output output
假设我所在的目录是“/home/user/sei/ textfile.txt”和
1) is in the same folder as the directory I am currently in
2) I did not use -