29 Jun 2012

From Digital Analyst to Big Data Expert

No Comments BI, Dashboards, QlikView, Web Analytics

This post is a reply to the post called “Big Data – What is Means for The Digital Analyst” by Stephane Hamel which provides an excellent overview of big data as a trend or hype and the key technologies involved. Stephane starts his post by mentioning that when he “Googled “Big Data” he got 19,600,000 results… Where there was virtually nothing about big data two years ago there is now unprecedented hype”. The digital marketing community, formerly called web analytics community, seems to be struggling with the key concepts behind big data and implications for digital marketing analytics. At the end of his post Stephane poses the question:

what should a digital marketing analyst do to bridge the knowledge gap between “digital marketing” and “big data”?

Having made the switch from web analytics to business intelligence I have some thoughts on this area.

source online-behavior.com

Data scientist VS digital marketing analyst

First I would like to clarify some confusion regarding the different roles involved. A person working with big data in my opinion should not be confused with a “digital marketing” or “web analyst”. The proper role should be “data scientist”. What differentiates a data scientist from a web analytics or digital marketing expert can be illustrated through the following quote:

“A data scientist possesses a combination of analytic, machine learning, data mining and statistical skills as well as experience with algorithms and coding.”

One of the conclusions of the post of Stephan is that the current base of “web analytics” is generally not sophisticated enough to really leverage Big Data”. He also mentions the primary focus of digital analysts should not be on the lower-layers of infrastructure and tools development. I fully agree with this. To understand why, I think it is useful to look at a concrete example of how big data technology has successfully been applied within an organisation. This helps to get a better understand of the role of big data technology and concepts in the ecosystem as described in the earlier part of the article from Stephane Hamel.

Big Data Technology and LinkedIn

A great example of a successful company that uses big data technology is LinkedIn. If you log into your LinkedIn account you will see a lot of personalized content that is relevant for you, or people from your industry. LinkedIn provides recommendations of people you might know, relevant trending news from internet, jobs that are potentially interesting and more. To implement these services LinkedIn makes heavy use of big data technology. The latest LinkedIn IPad application as shown in the screenshot below is an impressive example of how “big data” technologies are used to create an innovative and superior user experience. Using the app almost feels like a fully personalized digital newspaper with relevant information.

screenshot: new LinkedIn Ipad App

To understand the role of big data in the applications above we find a lot of clues on the LinkedIn site. LinkedIn is an active contributor to the open source community and has a lot of interesting articles and webinars. To provide a brief summary:

  • LinkedIn collects, stores and analyses massive amounts of data “from breaking news to breaking servers, real-world events requiring petabytes of data to be collected, analysed and acted on as those events happen.”
  • To calculate something like “People you might know” LinkedIn uses Hadoop to “perform graph processing on a massive scale, such as scoring over a hundred billion relationships a day to compute.”
  • To make all this “offline” data available to their live site, LinkedIn has developed a multi-terabyte scale data pipeline from Hadoop to an online serving layer called “Project Voldemort”, an in house developed NoSQL distributed database which they have open sourced.

Now looking at the examples above it becomes clear that there are big differences between the technologies being used by big data scientists and digital marketing analysts. In the case of LinkedIn they use big data technology for massive distributed and parallel computing for storing, analysing and finding the most relevant information for each user, before making the information available in a user friendly way for system developers to work with. Technologies as Apache Hadoop and Map Reduce, provide the basic framework that allows LinkedIn to distribute processing of large data sets across clusters of computers, where a single server would instantly run out of memory and disk space. Natural language processing, data mining and machine learning are techniques used to algorithmically determine which content is most relevant for each user.

Areas of opportunity for digital marketing analysts

Now let’s go back to the original question:

“How can we bridge the gap between big data and digital marketing analytics?”

At the end of his post Stephane identifies three areas of opportunity interesting to explore. All three opportunities can in my opinion be categorised as sub-areas of a new field called “data visualisation” within the area of business intelligence, which will be discussed in the next paragraph. Taken from the original article the identified opportunities are:

  1. Processing: Mastering the proper tools for efficient analysis under different conditions (different data sets, varied business environments, etc.)
  2. Natural Language Processing: Developing expertise in unstructured data analysis such as social media, call center logs and emails.
  3. Visualisation: There is a clear opportunity for digital analysts to develop an expertise in areas of dashboarding and more broadly, data visualization techniques (not to be confused with the marketing frenzy of “infographics”).

Data Visualisation Tools: QlikView and Tableau

Based on my professional experience the areas above are excellently covered by a relatively new and innovative area within the business intelligence field called “data visualisation”. Data visualisation products are easier to learn, much more user-friendly and have a higher focus on the end-user then traditional BI products. Within the business intelligence field there are two data visualisation providers mentioned in the “Gartner – Magic Business Intelligence Quadrant 2012″, with QlikView (QlikTech) as market leader and Tableau as challenger (see image below). Data visualisation products provide people the possibility to load large data sets from different sources in memory. Once loaded you can analyse, slice and dice the data, create beautiful visualisations and build live interactive dashboards which you can easily publish and share with other people in the organisation through an online portal. QlikView and Tableau require much less technical competency then traditional BI tools. The most important requirements are having an analytical mind-set and good understanding of your business.

source: Gartner Research Report 2012

To give a full description of how these tools work would go beyond the purpose of this article. However, to see the final results I have added two screenshots below taken from random dashboard examples from the QlikView and Tableau websites. To see real live dashboard examples I recommend going to the QlikView demo portal or Tableau public community section. Here you can view and download many different dashboards examples and click around to experience the products from an end-user perspective. The demo portals are on-demand implementations of their server products which work in a similar fashion behind a company firewall.

Link to live Tableau dashboard

Link to live QlikView dashboard

True power of data visualisation tools

Some final words on data visualisation products. Based on my experience the true power of these product do not come from their powerful analytical capabilities, but from the opportunity to help organisations become more data driven by:

  • providing near real time data to monitor business processes
  • completely automate manual reporting
  • provide a single version of the truth (no more Excel data marts)
  • promote data democracy by allowing people to develop and share insights, reports and dashboards with other people in the organisation through the server portal
  • provide superior analytical possibilities through in memory technology and drill down possibilities

Conclusions

To summarize this post I believe the differences between the roles of big data scientist and digital marketing analyst are too big to gap. They should be treated as separate domains each with a unique place in the organisation. However, if you are a digital marketing analyst aspiring to work with big data analytics, or you would like to take your analytical career to the next level, I would recommend learning to work with BI/data visualisation tools QlikView and/or Tableau. The products are still relatively unknown within the field of digital marketing analytics and they have the potential to provide you or your organisation with a competitive advantage.

I hope the post was useful and I am looking forward to hear other people’s thoughts on the areas above…

25 Aug 2011

Twitter Analytics in QlikView

No Comments BI, Dashboards, QlikView, Social Media, Twitter Analytics

Social Media BI and Social Media Analytics are hot trends in the BI industry. As Microstrategy, IBM, SAP and other BI Vendors are investing in social media analytics it is a good time to show the possibilities with QlikView. It has been pretty quiet from QlikTech’s site in the social media analytics sphere so this is a great opportunity to showcase the possibilities developed by QlikTech partners. In the first blog post of this series “Twitter Analytics in QlikView” I will introduce you to Twitter and Twitter Analytics. I will start from the beginning by helping you set-up a Twitter account and learn some of the Twitter basics, before moving to the second part of this series called “Twitter Analytics – Finding Top Influencers”. Here we will introduce you to Twitter Analytics by showing how to find the top influencers in an industry and measure two important social media KPI’s: the number of brand “Promoters” and “Detractors”. If you would like to run the examples in this post with your own company or client’s data, simply download the Twitter Analytics Dashboard and follow the instructions on the QVSource Wiki. Make sure to download and run the latest version of QVSource before reloading the Twitter Analytics Dashboard.

Twitter is a great way to promote your brand, share thoughts and keep updated about the latest trends and insights in the BI industry. It can be a valuable source of traffic for your site and a good way of getting in touch with other Qlikview Professionals. The goal of this series is to show how easy it is to get started with Twitter analytics in QlikView, by providing easy to follow steps. After reading this series you should be well on your way of becoming a true Twitter BI expert.

Part 1 - Introduction to Twitter for QlikView
Part 2 - Twitter Analytics in QlikView – Finding Top Influencers

25 Aug 2011

Part 1: Introduction to Twitter for QlikView

2 Comments BI, Dashboards, QlikView, Social Media, Twitter Analytics

In this first blogposts in the series “Twitter Analytics in QlikView” I will introduce you to some of the Twitter basics. A basic working knowledge of Twitter will be required before starting with the more advanced part of this series called “Twitter Analytics in QlikView - Finding Top Influencers“. In the follow up post I will show you how to analyze Twitter data, find top influencers in an industry and measure two important Twitter KPI’s: brand “Promoters” and “Detractors”. But first let’s dive into some Twitter fundamentals for QlikView professionals.

Setting up a Twitter account

The first thing you will need to do is create a Twitter account though this link. Simply fill in your name, e-mail address and choose a nice Twitter name. The name can be changed afterwards so don’t worry too much about this at this stage. In the second part of the signup process you will be asked to fill in some additional information and create a profile description. Make sure to include some words you would like to be found on, like “QlikView”, “Business Intelligence” or “Business Discovery”. Adding these words will make it easier for like minded people to find you when searching for new profiles.
Read more

25 Aug 2011

Part 2: Twitter analytics – Finding Top Influencers

3 Comments BI, Dashboards, QlikView, Social Media, Twitter Analytics

In this second blogpost in this series called “Twitter Analytics in QlikView” I will show how to use QlikView to find the top influencers in your industry and measure two important social media KPI’s: the number of brand “Promoters” and “Detractors”. For people new to Twitter I recommend reading the first blog post in this series called Introduction to Twitter for QlikView, where I introduced some of the Twitter basics and helped setting up a Twitter account. In this second part we will start analysing Twitter data using the “QlikView Twitter Analytics dashboard” developed by QVSource. If you would like to run the examples in this post with your own company or client’s data, simply download the Twitter Analytics Dashboard and follow the instructions on the QVSource Wiki. Make sure to download and run the latest version of QVSource before reloading the Twitter Analytics Dashboard.

The main focus areas of this post will be:

• Finding top influencers
• Measuring a person’s Influence with “Klout” scores
• Measuring brand “Promoters” and “Detractors” trends
• A list of influential people to add on Twitter

To analyse influence we will use the influence tab on the QlikView Twitter Analytics dashboard as shown in the screenshot below. If you click on the dashboard you can see the top brand “Promoters” and “Detractors”, plus influencer’s trends for the BI industry based on two weeks of data collected from Twitter. We will discuss the details of the graphs in the following sections.

Read more

18 Jul 2011

Top 9 Best QlikView Examples

1 Comment BI, Dashboards, QlikView, Web Analytics

In this blog post I would like to share my favourite QlikView Dashboard examples. QlikView is an awesome Dashboard and/or data visualization tool that has been able to revolutionize the BI market. It is fun, intuitive and really easy to learn. All the examples below are available for download from QlikView or QVSource and can be tested at home, using the QlikView Personal Edition. I have been working with QlikView for a little over a year now and thought it would be nice to share my personal favourite Dashboards. Some are innovative, some useful and others are great for learning purposes. Enjoy the read!

Facebook Friends Analyser

QlikView Example Facebook Friend Analyser Dashboard

The Facebook Friend Analyser Dashboard can be downloaded from QVSource. This is a really great dashboard allowing you to analyse Facebook friends in ways you didn’t knew was possible. You can analyse “Friends”, “Groups”, “Likes”, “Sentiment”, “Check-ins” and cross segment the data using dimensions as “Relationship”, “Gender” and more. What makes this dashboard truly unique is that it offers some really good learning opportunities for advanced techniques such as integrating Google MapsSocial Media MeasurementSentiment Analysis and Geocoding in QlikView Dashboards. The code behind the file is easy to understand and with some minor modifications you can start applying these techniques to your own dashboards. At the time of writing the latest version includes a Google map which uses friend “locations” that are converted to latitude/longitude with the Yahoo Placemaker API. To run these examples make sure to run the latest version of QVSource and follow the instructions on their Wiki.

Read more

26 Jun 2011

Merging Google Analytics with QlikView

6 Comments Dashboards, Google Analytics, QlikView, Web Analytics

Recently I read an interesting blog-post about how to merge Google Analytics data with a Data Warehouse. Integrating data from your website with Data Warehouse data opens up a whole new range of optimization possibilities. It allows you to add customer data, purchase data or demographic data  from your organisation to your website data. Another example could be measuring offline sales generated through your website. In my previous post I showed how easy it is to pull Google Analytics data in Qlikview. This post I will dedicate on showing how to create a key between website and website data, allowing you to associate Google Analytics with offline data in QlikView.

Setting up the primary key.

In the example above you see a Qlikview Dashboard pulling data through two different sources, Google Analytics and CRM data. In order to produce this type of holistic view of our channels we first need to create a (primary) key between the different data sets. Once the key exists Qlikview will automatically associate the data.

Read more

12 Jun 2011

Google Analytics – Online & Offline Data Integration with QlikView

1 Comment Dashboards, Google Analytics, QlikView, Web Analytics

This is my first video in a series called Google Analytics offline online data integration. In this post we will be building a dashboard where we pull in Google Analytics data and plot it on a Google Maps. The video above will show you how to build this application!

The goal of the first video is to introduce you to QlikView as a business intelligence and data visualisation tool. In the next video we will go one step further and start adding sales data from a back-office application. This will allow us to create a holistic view of our data, for example measuring sales from a keyword to an offline purchase.

Read more