Category Archive for: Mahout

Closing the feedback loop

As businesses grow increasingly sensitive to customer opinion of their brand, monitoring consumer feedback matters more than ever. The recognition of social media as an important and valid source of customer opinion has also created a need for new systems and a new approach. Traditional approaches of reactive response to any press coverage…

Bloom Filter Vs Feature Hashing

Bloom Filter: A Bloom filter is a space-efficient probabilistic data structure that encodes a set and supports set membership tests, that is, checks of whether an element is a member of the set. False positives are possible, but false negatives are not. In other words, a query returns either “inside set (may be wrong)” or “definitely…

Clustering with Mahout

Clustering Introduction: Clustering is one of the most popular techniques in the machine learning field. It allows a system to group numerous entities into separate clusters based on certain characteristics or features of the entities. Clustering is widely used in grouping problems such as grouping similar news articles, blogs, emails, malware, etc. based on their…
