Do you know of any python mapreduce out of the box clustering libraries?
Do you know of any off-the-shelf python map cluster libraries?
I found some good libraries in Java ( http://lucene.apache.org/mahout/ ), although I would rather use Python.
https://github.com/klbostee/dumbo/wiki (Python Mapreduce API)
Edit --- I'm looking for a ready-made mapreduce: Canopy, K-means, Means-shift, etc ..
+2
a source to share
1 answer
You can use Python in combination with Hadoop if you like:
http://www.michael-noll.com/wiki/Writing_An_Hadoop_MapReduce_Program_In_Python
+3
a source to share