How do I use the dict file generated via arff.vector in mahout?
I tried to do this.
mahout arff.vector -d iris -o iris/data -t iris/dict
mahout trainnb -i iris/data -o iris/model -ow -c -li iris/dict
I get this exception/ Note**I did have to upload the dict file generated
using
hadoop fs -put iris/dict iris
Do I have to convert it to a sequence file? I checked the source code and
it seems to be reading a sequence file. I am lost as to how to give a
label file index
00000_2, Status : FAILED java.lang.IllegalStateException: iris/dict at
org.apache.mahout.common.iterator.sequencefile.SequenceFileIterable.iterator(SequenceFileIterable.java:63)
at
org.apache.mahout.classifier.naivebayes.BayesUtils.readIndexFromCache(BayesUtils.java:142)
at
org.apache.mahout.classifier.naivebayes.training.IndexInstancesMapper.setup(IndexInstancesMapper.java:38)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:138) at
org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:672) at
org.apache.hadoop.mapred.MapTask.run(MapTask.java:330) at
org.apache.hadoop.mapred.Child$4.run(Child.java:268)
No comments:
Post a Comment