The Value of Partial Data

Felix Rios

As I’m typing this blog post, CES 2015 is taking place in Las Vegas, Nevada. If you don’t know it, the Consumer Electronics Show is the biggest and most important consumer-centric technology event in the world. This event is well known for showcasing everything from the most absurd gadgets that will never make it to production, like the iPotty, to truly revolutionary gadgets that shaped the way we consume media like the CD player in 1981.

The CES is the Monaco Grand Prix of the gadget world. This is where products and trends are made or come to die. In the 2015 edition there is an overwhelming number of manufacturers showcasing IoT technology, smartwatches and wearables. This could only mean one thing: these are not just a trend; these devices will stick around for a while. They are the logical progression of the gadgets we own. As we become more dependent on being online, it only makes sense that all the devices we interact with on a daily basis are also connected for our comfort and ease of use.

In a previous blog post (Data: The new gold fever), I explained the endless possibilities that will come for researchers with so many devices collecting our data. The implications for the market research industry are huge. To get the information we need, until now, our main tool for data collection has been asking participants questions directly. But what if asking questions is no longer the only way to collect measurable and actionable data? What if asking questions and waiting for participants to answer them becomes the slowest and least cost-effective source of data?

As IoT devices and wearables become more affordable, they will be widely adopted. Most of our devices will inevitably be connected. They put within reach the kind of data that cannot be collected by asking questions. These new technologies could show us that the data we get from passive monitoring is richer and more contextual than anything we have seen in the past, maybe much more than any answer from any participant. In no way are we saying that this will kill the existing data collection techniques (computer-assisted telephone, online, face-to-face, etc.). Each will keep its own place and its own uses.

The amount of data generated is already larger than ever before, and it comes in a different form than we are used to. It is mostly unstructured, but highly contextual if seen through the right lens. The advertising industry has found that using large amounts of unstructured data, in the right way, helps create very accurate user profiles that allow them to target online advertising to very specific segments of the population. There is a lot we can learn from such experiences:

Facebook: A Data Giant!
A few weeks ago, Facebook announced that it had started to index more than 3 trillion posts. This will allow users to search their own posts and their friends’ posts. Many times I have found myself trying to dig out something that I posted a long time ago, painfully scrolling through pages and pages of posts until I get there. Now, by simply remembering a few keywords, I am going to be able to find it.

What this also means for Facebook is that they are going to be able to build links across all the data that I have been feeding into their platform. They are also going to be able to compare it and blend it with all of the data from my contacts to extend these links and create patterns. This will give them ways to understand how I behave, think and even feel, all in the spirit of selling more accurately targeted advertising and products.

It sends shivers down my spine when I think of how powerful this is. Not only will they know what I like and what I think, they will know the same for every single one of the 1.35 billion users that are currently active on their social network.

You may think, “Well, I don’t share much on Facebook, so I’m safe,” but what little you are sharing right now is enough to match you to someone else’s similar profile and decode pretty well what your interests are. 1.35 billion users provide a lot of data. I like to think that people are quite unique, but the sad truth is that we are very predictable. This is the foundation of the market research industry: we try to understand how a group of the population behaves by studying a statistically representative subset of that population. Think of the largest panel that you have ever worked with; it is merely a drop in the ocean compared to everyone on Facebook.

Facebook is one of the most important human data repositories to ever exist. This is not something that will happen in the future — it is already happening. Until recently, Facebook knew a lot about us, but now they understand it.

Some Important Lessons

First, the online advertising world is moving fast. Google, Baidu, and the most important online advertising delivery organizations all have similar initiatives that involve artificial intelligence and machine learning. They are embracing the changes and riding the wave.

Second, partial data is valuable. Facebook doesn’t need to know everything about you to be able to accurately target advertising that is relevant to you. They just need to know enough to find your lookalike: someone out there who shares the same kind of information as you, only in larger quantities, and whose profile matches yours. Now apply the same approach to your surveys: how many of you are using partial data from dropouts, terminates, etc.? Or are you discarding this data?

Jon Puleston from GMI (@jonpuleston) is well known for his “bonsai surveys” approach and his campaign to “eradicate boring surveys.” This method allows you to set question-level quotas, so you can stop asking a section of your survey once you have enough data, or use the partial data in your results when the entire survey is not completed. As a result, you end up with shorter surveys that are more engaging and quicker to complete.
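To make the idea of question-level quotas concrete, here is a minimal sketch in Python. It is only an illustration of the general principle, not the actual “bonsai surveys” implementation: the section names, the quota targets and the QuotaTracker class are all hypothetical. Every answered section counts toward its quota, whether or not the respondent finished the survey, and a section stops being asked once its quota is met.

```python
from collections import Counter

# Hypothetical targets: how many answers each section needs before we stop asking it.
SECTION_QUOTAS = {"brand_awareness": 3, "pricing": 3}


class QuotaTracker:
    """Tracks how many answers each survey section has collected so far."""

    def __init__(self, quotas):
        self.quotas = dict(quotas)
        self.counts = Counter()

    def sections_to_ask(self):
        """Return the sections whose quota has not been reached yet."""
        return [s for s, target in self.quotas.items() if self.counts[s] < target]

    def record(self, answered_sections):
        """Count every answered section, even if the respondent dropped out later."""
        for section in answered_sections:
            if section in self.quotas:
                self.counts[section] += 1


tracker = QuotaTracker(SECTION_QUOTAS)
tracker.record(["brand_awareness", "pricing"])  # a complete
tracker.record(["brand_awareness"])             # a dropout after the first section
tracker.record(["brand_awareness", "pricing"])  # a complete

# Only the sections still short of their quota are shown to the next respondent.
remaining = tracker.sections_to_ask()
```

In this toy run the dropout still contributes to the first section, so that section closes after three respondents and the next participant is only asked about pricing.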

Have you thought of how much money you could save if you used partial data from dropouts in your surveys? Have you thought of using all the information from your completed surveys to create profiles that help you validate that partial data? This radical approach changes the conversation in our industry: the most important unit is no longer the “complete” but the data itself.
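One simple way to picture this is to compare what a dropout did answer with the same questions in your completes and find the closest match. The sketch below is an assumption-laden illustration, not a recommended methodology: the respondent IDs, question names and answers are invented, and the similarity measure (the share of shared questions with identical answers) is just one naive choice among many.

```python
def agreement(partial, complete):
    """Share of the questions both respondents answered where the answers are identical."""
    shared = [q for q in partial if q in complete]
    if not shared:
        return 0.0
    matches = sum(1 for q in shared if partial[q] == complete[q])
    return matches / len(shared)


def closest_complete(partial, completes):
    """Return (respondent_id, score) for the complete that best matches the partial."""
    return max(
        ((rid, agreement(partial, answers)) for rid, answers in completes.items()),
        key=lambda pair: pair[1],
    )


# Hypothetical completed interviews.
completes = {
    "r1": {"age_band": "25-34", "owns_wearable": "yes",
           "uses_facebook": "daily", "would_buy": "yes"},
    "r2": {"age_band": "55-64", "owns_wearable": "no",
           "uses_facebook": "weekly", "would_buy": "no"},
}

# Hypothetical dropout who left before the "would_buy" question.
dropout = {"age_band": "25-34", "owns_wearable": "yes", "uses_facebook": "weekly"}

match_id, score = closest_complete(dropout, completes)
```

Here the dropout agrees with r1 on two of the three shared questions and with r2 on only one, so r1 is the closest profile. A low best score would be a signal to treat that partial record with more caution.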

We need to have the right mindset and embrace the changes coming our way. There are many companies out there already thinking like this. Companies like Ugam are willing to work with you to rethink the future of research, to look at your data differently, and to find new ways to understand it and create insights. Without a doubt, technology is disrupting every aspect of our lives, and it will continue to rock the boat of the market research industry. The most important thing is to have a disruptive mindset. We shouldn’t be afraid to try new things. Our clients will thank us for it!

Felix Rios, Market Research Technology Manager at Ugam

