Wednesday, May 11, 2016

Mahout - An algorithm library for scalable machine learning on Hadoop

An algorithm library for scalable machine learning on Hadoop

Apache™ Mahout is a library of scalable machine-learning algorithms, implemented on top of Apache Hadoop®  and using the MapReduce paradigm.

Mahout provides the data science tools to automatically find meaningful patterns in big data sets stored on the Hadoop Distributed File System (HDFS)

Mahout supports four main data mining use cases:

Collaborative filtering – Based on user behavior, makes product recommendations (e.g. YouTube recommended movies)
Clustering – takes items in a particular class (such as web pages or newspaper articles) and organizes them into groups or clusters, such that items belonging to the same group are similar to each other
Classification – learns from existing categorizations and then assigns unclassified items to the best category
Frequent itemset mining – analyzes items in a group (e.g. items in a shopping cart) and then identifies which items typically appear together

Machine Learning with Mahout -Tutorial


Apache Mahout Tutorial-1 | Apache Mahout Tutorial for Beginners-1


Stock Price Movement Prediction Using Mahout and Pydoop’s Website for Big Data Analytics course

No comments:

Post a Comment