Clustering → Big Data Partnership → Unlock Value from Complex Data

Posts Tagged:Clustering

datascience-e1343898868603

Big Data and Real Time Analytics

The advances in big data technology are opening up new ways to collect and transport large amounts of data more efficiently. This revolution has boosted research and development of real-time algorithms and methods. Traditionally, machine learning algorithms were not designed for real-time processing. In fact, data science competitions (e.g the Netflix prize, Kaggle) were criticised…

Read More →

Bloom Filter Vs Feature Hashing

Bloom Filter A Bloom filter is a space-efficient probabilistic data structure that is used to efficiently encode sets and perform set membership tests, whether an element is a member of a set. False positives are possible, but false negatives are strictly not possible. i.e. a query returns either “inside set (may be wrong)” or “definitely…

Read More →

Clustering with Mahout

Clustering Introduction:- Clustering is one of the most popular techniques available in Machine learning field. This allows the system to group numurous entities into separate clusters/groups based on certain characteristics/features of the entities. Clustering is a widely used technique in many grouping problems like grouping similar news articles, blogs, emails, malwares etc based on their…

Read More →

Back to Top