The danger of claimed data. How do we know the truth when everyone lies?

After a busy morning, you deserve a break. You log into the Guardian Soulmates dating site to check your messages. Just the two. Neither seems eye-catching so you don’t reply.

You’re not getting as many enquiries as you expected. Perhaps you should update your profile? The first thing you change is your photo to a more flattering one, although it is a few years old.

What next? The problem is that everyone else exaggerates, so anyone looking at your profile will assume you do too. Perhaps, by being scrupulously honest you’re not conveying the truth.

You tweak your personal information by adding a couple of extra inches onto your height. Maybe that’s overkill – you readjust your height to add just one inch. Then you change your job title. You’re already doing the work of a director, probably best to call yourself one.

You’re not alone, by any means, in stretching the truth to breaking point on your dating profile. Christian Rudder, founder of dating site OK Cupid, analysed the profiles of 1.51 million active users and uncovered evidence of systematic lying.

Men on the site are four times more likely than others of the same age and postcode to claim they earn more than $100,000. Suspicious, but not as fishy as the fact that the average height of users was supposedly two inches taller than the population average.

Most innovatively, Rudder examined the age of uploaded profile photos. When digital cameras take photos they attach text tags, called EXIF metadata, to the jpeg. These tags capture the date and time. Rudder found that while the average photo was 92 days old, the photos rated as “hottest” were much older.

Sex, lies and survey data

If Rudder’s study hinted at lying, the National Survey of Sexual Attitudes and Lifestyle (NATSAL) categorically confirms it. The survey, conducted among 15,000 respondents by UCL and the London School of Hygiene and Tropical Medicine, is the gold standard of research. In 2010 it found that British heterosexual women admit to a mean of eight sexual partners, compared to twelve for men. The difference is logically impossible. If everyone is telling the truth the mean for each gender must be the same.

All of this goes to show that advertisers trying to understand their customers have a problem: if they listen uncritically to consumers, they’ll be misled.

Even honest answers can be misleading

To complicate matters, people often don’t know their genuine motivations. This is demonstrated by an experiment led by Adrian North, then a psychologist at Leicester University.

To complicate matters, people often don’t know their genuine motivations.

Over a fortnight he alternated the background music played in a supermarket wine aisle, between traditional German oompah music and French accordion music. He surveyed customers who had bought either French or German wine. When accordion music was played, French wine accounted for 77% of wine sales; when the soundtrack was oompah music, German wine represented 73% of sales.

The scale of the variation shows that music was the prime determinant of the type of wine bought. However, only 2% of buyers spontaneously attributed their choice to the music. Even when prompted, 86% of people stated that it had no impact at all.

It’s not that they were lying; more that they were unaware of their motivations. The reasons proffered were mere post-rationalisations or, in the psychological term, confabulations. In the memorable words of Jonathan Haidt, author of The Righteous Mind, the rational mind “thinks of itself as the Oval Office when actually it’s the press office”.

How to apply this effect

1. From lies, learnings

Lies can be illuminating.

Take the NATSAL study that investigated how many sexual partners men and women admitted to. The claims are untrue but they reveal gender expectations about promiscuity. Men exaggerate their promiscuity, while women downplay it. The changing ratio between men and women tells a tale too. In 1990 men claimed to have two and a half times more sexual partners than women, by 2010 the gap had dropped to 50%. Gender expectations are levelling out.

Data are not transparent. They need to be teased, analysed and probed. Taking them at face value will mislead you. But if you dig further, then insights can be uncovered.

2. Adapt your surveys

Louis Heren, foreign correspondent at The Times famously advised, “When a politician tells you something in confidence, always ask yourself, “Why is this lying bastard lying to me?” His scepticism ensured he probed and pushed politicians until he uncovered the truth.

When a politician tells you something in confidence, always ask yourself, “Why is this lying bastard lying to me?

Similarly, you need to be prepared for deceit, and design surveys accordingly.

There are a couple of useful techniques. First, consider asking respondents how they think others might behave.

I used this approach when investigating whether people feel dissatisfied by the idealised images projected on social media. Of the 300 consumers I surveyed, 26% said they had pretended to be happier, or more successful, than they actually were. Furthermore, just over a third claimed to have felt unhappy when they saw others’ success on social media.

While this was a sizeable proportion of the sample I had a hunch that it was an under-estimate. After all, there’s a pressure to present a positive image in surveys. Psychologists call this the “social desirability bias”.

With that in mind, I asked another two questions. Did they believe other people crafted an overly positive social media image? And did other people feel sad when they saw these idealised images?

In this variant of the questions, people were far more likely to admit “social airbrushing” happened. In fact, 60% claimed that their friends portray themselves as happier than they actually were on social media. And nearly two-thirds agreed that other people sometimes feel sad when they see their friends’ success on social media.

My belief is that these questions encouraged respondents to answer more honestly and therefore the results are more accurate.

3. Don’t ask, observe

Direct questioning is unsatisfactory because of lying and confabulation. A more accurate alternative is to observe behaviour.

This could still involve surveys. The twist is to mask the objective of the question from the participant by adopting a cell methodology. This technique involves randomly allocating your sample into different cells, or groups, and then asking each group a slight variant on the question.

Even better, avoid surveys and monitor behaviour in a realistic situation. One example of this was on a New Look brief, when they were planning to launch a menswear range. The initial plans were for a modest budget to make a simple announcement.

I suspected a small campaign would be insufficient to overcome men’s reluctance to buy clothes from what was perceived as a women’s clothes shop. However, that was a hunch and we had no budget to fund a survey.

As an alternate methodology Dylan Griffiths and I recruited half a dozen agency volunteers and photographed them twice: first holding a New Look plastic bag emblazoned with their logo, then one while holding a Topman bag. We uploaded the images to Badoo, a dating site where people rate the looks of other users’ photos. The pictures were left up on the site for a fortnight while we waited for them to be rated.

We found that when our volunteers were holding a New Look bag they were seen as 20%-25% less good-looking than when they were clutching the Topman bag. This demonstrated that the brand had a more significant job than they had initially suspected and that they needed to make bigger efforts to persuade men that they were a unisex brand.

The final approach is to use found data. That is the data that consumers unwittingly create when they’re going about their daily tasks. The data is particularly useful as it’s not muddied by the social desirability bias. People aren’t aware they’re being monitored so they behave naturally.

Search is the most accessible found data source. Analysing search data provides insights that consumers might be loath to admit in a survey. Consider sexism. Most people would claim that they’re equally interested in their children’s intelligence, regardless of gender. However, Seth Stephens-Davidowitz, the New York Times journalist and data scientist, has analysed US search data and found that parents are two and half times more likely to google “Is my son gifted?” than “Is my daughter gifted?” Google acts as a modern confessional in which all our darkest thoughts are captured.

However, this rich seam of data is too rarely mined by advertisers. One of my favourite freely available, but untapped, search tools is answerthepublic.co.uk. This looks at the most common search strings that include the term you give it and a question word, such as who, what, how or when. It is a very simple but quick way to understand what consumers genuinely think about your category.

For example, if you input the term ‘vitamin’ you find that consumers rarely search for vitamins by their letter symbol. Instead they search for vitamins by what they do, such as helping with muscle growth or shiny hair. That’s a useful insight for a vitamin brand as it suggests they should label and package their vitamins according to the problem they solve, not the particular vitamin they contain.

4. Observed data is not perfect

Observed data are a significant improvement on surveys. However, they are far from perfect and still need to be interpreted with caution.

Consider social media data. Brands regularly analyse their Facebook fan data to understand their customer profile. But this data does not always accurately reflect reality. An example from Stephens- Davidowitz, illustrates this discrepancy. He looked at the gender of Katy Perry Facebook fans and found that they were overwhelmingly female. However, Spotify listening data revealed the gender split was more balanced: Perry was in the top ten artists for both genders. If the music label used the Facebook data to target their advertising they’d be way out.

Does that mean the new data streams are junk and best ignored?

Not at all. Observed data is an improvement on claimed data, but it’s still flawed. To understand customers we need a balanced approach, using multiple techniques. If each technique tells us the same story then we can give it greater credence. If they jar, then we need to generate a hypothesis to explain the contradiction.

To understand customers we need a balanced approach, using multiple techniques.

Let’s go back to the Katy Perry example. A simple explanation would be that while both genders enjoy listening to her, far more women are comfortable expressing that publicly. If a record label wants to sell Katy Perry songs or encourage streaming then then Spotify data would be ideal. However, if they want to promote her concerts better to use the Facebook numbers. Neither data set is right in any absolutist sense – they are right in certain circumstances.

This is an excerpt from Choice Factory: 25 behavioural biases that influence what we buy by Richard Shotton

Cookie	Type	Duration	Description
cli_user_preference	persistent	1 year	Keeps track of the cookie consents for on the current domain.
cookielawinfo-checkbox-marketing	persistent	1 year	Keeps track of the cookie consent for a specific category on the current domain.
cookielawinfo-checkbox-measurement	persistent	1 year	Keeps track of the cookie consent for a specific category on the current domain.
cookielawinfo-checkbox-necessary	persistent	1 year	Keeps track of the cookie consent for a specific category on the current domain.
cookielawinfo-checkbox-non-necessary	0	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary".
cookielawinfo-checkbox-preferences	persistent	1 year	Keeps track of the cookie consent for a specific category on the current domain.
hustle_module_show_count-	persistent	1 day	This cookie is used to determine when the internal slide-in/pop-up/embed module for newsletter opt-ins is displayed to the user.
inc_optin_	persistent	1 hour	This cookie is used to determine when the internal slide-in/pop-up/embed module for newsletter opt-ins is displayed or hidden to the user.
PHPSESSID	session	0 minute	Preserves user session state across page requests. The PHPSESSID cookie is native to PHP and enables websites to store serialised state data. On the website it is used to establish a user session and to pass state data via a temporary cookie, which is commonly referred to as a session cookie. Stores unique session ID.
viewed_cookie_policy	persistent	1 hour	Stores the user's cookie consent state for the current domain.
viewed_cookie_policy	0	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
wordpress_	session	session	WordPress cookie for a logged in user.
wordpress_logged_in_	session	session	WordPress cookie for a logged in user.
wordpress_test_	session	session	WordPress cookie for a logged in user.
wordpress_test_cookie	session	session	WordPress test cookie.
wp-settings-	session	session	Wordpress also sets a few wp-settings-[UID] cookies. The number on the end is your individual user ID from the users database table. This is used to customize your view of admin interface, and possibly also the main site interface.
wp-settings-time-	session	session	Wordpress also sets a few wp-settings-{time}-[UID] cookies. The number on the end is your individual user ID from the users database table. This is used to customize your view of admin interface, and possibly also the main site interface.

Cookie	Type	Duration	Description
AMP_TOKEN	persistent	1 year	This cookie name is associated with Google Universal Analytics - which is a significant update to Google's more commonly used analytics service. It contains a token that can be used to retrieve a Client ID from AMP Client ID service. Other possible values indicate opt-out, inflight request or an error retrieving a Client ID from AMP Client ID service.
collect	third party	session	Used to send data to Google Analytics about the visitor's device and behaviour. Tracks the visitor across devices and marketing channels.
_ga	persistent	2 year	Registers a unique ID that is used to generate statistical data on how the visitor uses the website.
_gid	persistent	1 day	Registers a unique ID that is used to generate statistical data on how the visitor uses the website.
__gads	third party	2 years	Associated with the DoubleClick for Publishers service from Google. It serves purposes such as measuring interactions with the ads on our domain and preventing the same ads from being shown to you too many times.
__utma	persistent	2 years	This cookie is typically written to the browser upon the first visit. If the cookie has been deleted by the browser operator, and the browser subsequently visits strategy-business.com, a new __utma cookie is written with a different unique ID. In most cases, this cookie is used to determine unique visitors to strategy-business.com, and it is updated with each page view. Additionally, this cookie is provided with a unique ID that Google Analytics uses to ensure both the validity and the accessibility of the cookie as an extra security measure.
__utmb	persistent	30 minutes	This cookie is typically written to the browser upon the first visit. If the cookie has been deleted by the browser operator, and the browser subsequently visits strategy-business.com, a new __utma cookie is written with a different unique ID. In most cases, this cookie is used to determine unique visitors to strategy-business.com, and it is updated with each page view. Additionally, this cookie is provided with a unique ID that Google Analytics uses to ensure both the validity and the accessibility of the cookie as an extra security measure.
__utmc	persistent	30 minutes	Historically, this cookie operated in conjunction with the __utmb cookie to determine whether or not to establish a new session for the user. For backward compatibility purposes with sites still using the urchin.js tracking code, this cookie will continue to be written and will expire when the user exits the browser. However, if you are debugging your site tracking and you use the ga.js tracking code, you should not interpret the existence of this cookie in relation to a new or expired session.
__utmv	persistent	2 years	This cookie is not normally present in a default configuration of the tracking code. The __utmvcookie passes the information provided via the _setVar() method, which you use to create a custom user segment. This string is then passed to the Analytics servers in the GIF request URL via the utmcc parameter. This cookie is written only if you have added the¬_setVar() method for the tracking code on your website page.
__utmz	persistent	6 months	This cookie stores the type of referral used by the visitor to reach strategy-business.com, whether via a direct method, a referring link, a website search, or a campaign such as an ad or an email link. It is used to calculate search engine traffic, ad campaigns, and page navigation within strategy-business.com. The cookie is updated with each page view to strategy-business.com.

Cookie	Type	Duration	Description
GoogleAdServingTest	persistent	session	Used to register what ads have been displayed to the user.
IDE	persistent	1 year	Used by Google DoubleClick to register and report the website user's actions after viewing or clicking one of the advertiser's ads with the purpose of measuring the efficacy of an ad and to present targeted ads to the user.
test_cookie	third party	1 day	Used to check if the user's browser supports cookies.
__ab12#	persistent	2 years	Pending

Top 10 Global Consumer Trends 2020

Top 10 Global Consumer Trends 2021

Understanding the Why? Projective Techniques in Qualitative…

African consumers resistance to e-commerce and what is…

The fascinating dynamism of the African Insights industry

Christmas 2020: Opportunities to close the year on…

Make your customer experience meaningful, not only frictionless

There Is a Way Out of This Mess

Nail Biting in Georgia US Senate Races –…

Media polling and the way forward

U.S. election pollsters: watch Florida for key indicators!

Post-pandemic marketing & advertising trends among marketers

Cross-Media Measurement, XMM: no viewing – no outcomes!

XMM Disconnect? As Alice went into Wonderland, things…

Innovations in media measurement, accelerated by COVID, establish…

Insight from the Insight250 winners: Data-driven leadership

Insights from the Insight250 winners: Evolutions and innovations…

Customer advocacy: How to turn customers into friends,…

Brands as provocations: How to connect at scale…

Predictive qual: How to turn the art of…

What It truly means to be tech-enabled in…

Insights on insights: Which survey data analysis solution…

Eating in, is the new testing out –…

Behavioural tech-heads: What technology needs to learn from…

SHOBSERVATORY Research Chronicles: The heart of the brand…

ESOMAR announces the 2021 award winners

SHOBSERVATORY Research Chronicles: How presentations are created

The danger of claimed data. How do we know the truth when everyone lies?

Sex, lies and survey data

How to apply this effect

1. From lies, learnings

2. Adapt your surveys

3. Don’t ask, observe

4. Observed data is not perfect

Leave a Comment Cancel Reply

Predictive qual: How to turn the art of qual into a science...

The danger of claimed data. How do we know the truth when everyone lies?

Sex, lies and survey data

How to apply this effect

1. From lies, learnings

2. Adapt your surveys

3. Don’t ask, observe

4. Observed data is not perfect

Leave a Comment Cancel Reply

Related Articles

We value your privacy!