BLOG - Page 3 of 3 - Big Data Partnership

How to analyse your customers social profile in 24 hours (Part II – analysis)

Blog, Business, General, Technology, 01 Jun, 2012 0

(This is the second part of the post – How to analyse your customer social profile in 24 hours – Data and Collection) Community Level After collecting the data as described in the previous post, we can look into the data and visualize some aspects of it. There are many questions we can ask of this data,…

How to analyse your customers social profile in 24 hours (Part I – assumptions and data collection)

Apache Hadoop, Blog, Business, Technology, 01 Jun, 2012 0

Social profiles tell us a lot about the interest of its owner and also about people/organisations they follow and people who are following them. This blog post is a summary of what information you can get by collecting and analysing your customer profiles in 24 hours. In fact, after unlocking of the data, this process…

Hadoop becomes Mainstream

Apache Hadoop, Blog, Business, General, 08 Mar, 2012 0

Hadoop is a grassroots phenomenon that emerged in the social networking and consumer Internet world. As always, there are early adopters who take risks on the cutting edge, and there are more conservative organizations watching the pioneers from the sidelines. This played out in 2011 as early customer experiences with Hadoop were shared via conferences,…

“Introducing YARN�? – Hadoop No More a Baby Elephant

Apache Hadoop, Blog, Hadoop Common, Hadoop Ecosystem, MapReduce, Science, Technology, Training, 02 Mar, 2012 1

With the increasing popularity and the addiction of companies towards Hadoop, also Hadoop being an unanimous solution for Big data platforms makes the Hadoop development team to focus on the current architectural deficiencies and make Hadoop free from such underlying architectural issues. In that path a new Hadoop MapReduce version has taken birth MapReduce 2.0…

Map Side and Reduce Side Joins

Apache Hadoop, Blog, Hadoop Common, Hadoop Ecosystem, MapReduce, Science, Technology, Training, 29 Feb, 2012 1

Joins:- ======= Joins is one of the interesting features available in MapReduce. Joins performed by Mapper are called as Map-side Joins. Joins performed by Reducer can be treated as Reduce-side joins. Frameworks like Pig, Hive, or Cascading has support for performing joins. Before diving into the implementation let us understand the problem throughly. If we…

Bloom Filter Vs Feature Hashing

Blog, Mahout, Science, Technology, 29 Feb, 2012 0

Bloom Filter A Bloom filter is a space-efficient probabilistic data structure that is used to efficiently encode sets and perform set membership tests, whether an element is a member of a set. False positives are possible, but false negatives are strictly not possible. i.e. a query returns either “inside set (may be wrong)�? or “definitely…

Clustering with Mahout

Blog, Mahout, Science, 08 Feb, 2012 3

Clustering Introduction:- Clustering is one of the most popular techniques available in Machine learning field. This allows the system to group numurous entities into separate clusters/groups based on certain characteristics/features of the entities. Clustering is a widely used technique in many grouping problems like grouping similar news articles, blogs, emails, malwares etc based on their…

Recommending from big data

Blog, Science, 07 Feb, 2012 2

As the research on core recommender systems progresses and matures, it becomes clear that a fundamental issue for these algorithms is to determine how to embed the core techniques in real operational systems and how to deal with massive and dynamic sets of data. Recommender system algorithms are very effective in identifying and predicting user preferences based on explicit or implicit indication of preference that…

Let the discussions begin……

Inspiration, 26 Jan, 2012 1

At Big Data Partnership we love to tell you about what we are up to and what interesting things we’ve seen or heard. But we prefer to hear about the things you have have heard and what you think about Big Data and even Big Data Partnership. Over the coming weeks we will be coming…

Page 3 of 31 23

Our Thoughts

How to analyse your customers social profile in 24 hours (Part II – analysis)

Hadoop becomes Mainstream

“Introducing YARN�? – Hadoop No More a Baby Elephant

Map Side and Reduce Side Joins

Bloom Filter Vs Feature Hashing

Clustering with Mahout

Recommending from big data

Let the discussions begin……

Recent Posts

Archives

Categories

Newsletter

Stay up to date!