Do you know of any python mapreduce out of the box clustering libraries?

Do you know of any off-the-shelf python map cluster libraries?

I found some good libraries in Java ( http://lucene.apache.org/mahout/ ), although I would rather use Python.

https://github.com/klbostee/dumbo/wiki  (Python Mapreduce API)

Edit --- I'm looking for a ready-made mapreduce: Canopy, K-means, Means-shift, etc ..

+2


a source to share


1 answer


You can use Python in combination with Hadoop if you like:



http://www.michael-noll.com/wiki/Writing_An_Hadoop_MapReduce_Program_In_Python

+3


a source







All Articles