MYOD Dataset: Building a DAM

Kshira Saagar
07 July, 2020 09:24
email
Bookmark
Share this post
- facebook facebook
- slashdot slashdot
- digg digg
- Reddit Reddit
- stumbleupon stumbleupon
- linkedin linkedin
- twitter twitter

Kshira Saagar

Chief data officer, Latitude Financial Services

Kshira Saagar (the K is silent like in a Knight) is currently the Chief Data Officer of Latitude Financial Services and has spent 22.3% of his life helping key decision-makers and CxOs make smarter decisions using data, and strongly believes that every organisation can become truly data-driven. Outside work, Kshira spends a lot of time on advancing data literacy initiatives for high school and undergrad students.

In my first article in this MYOD [Make Your Organisation Data-Driven] series, I articulated a one-line approach to successfully injecting data into your organisation’s DNA: Using a Dataset -> Skillset -> Mindset framework. This will take your people and processes on a journey to data actualisation.

The first and most critical stage of the framework is the Dataset stage. The term Dataset used here is a synecdoche or figure of speech standing for all aspects and processes for making data available, reliable and credible.

And it is Data Availability Maps (DAMs) that will you help better understand the lay of the data land and resolve information conflicts to supercharge data-driven decisions.

Why Dataset is the first stage

Have you ever been in a meeting where the simplest of questions seem to be the toughest to answer? A good one is, “What’s the number of new customers we have acquired in the last three months?”

If you ask the tech team that managed the shop/point-of-sale database, you’d get one number – say 1000. When you ask the marketing team using a different set of tools, you’d get a slightly different number – say 980. Finally, when you ask the team responsible for customer experience, you’d get 920. Which one is correct and have you spent a lot of time debating these in crucial meetings?

This ‘which-metric-to-trust’ debate is a key component of all big and small meetings alike and is better known as data confusion. This confusion happens in organisations of all sizes, due to many diverse systems capturing the same information in many different ways. It also leads to the famous “guess we can’t ever know the truth and can’t trust the data” resignation, which is the biggest and most dangerous bottleneck to data-driven decision making in an organisation.

A second big hurdle is GIGO (Garbage in, garbage out). This implies if the data is unreliable, then any super-smart insight or algorithm built on top of it will be unreliable by extension. Despite the most expensive artificial intelligence (AI) tools or software on the market doing what they do best, if the internal data landscape isn’t better mapped out and governed, it makes the whole process nullable.

GIGO along with data confusion, makes it extremely important to first understand the data landscape before going any further on the data journey.

Exploring the data seas

Anyone who has ever used Marco and Polo in Age of Empires 2 will have seen the fog being unveiled and the entire lay of the land showing up. For those who haven’t, imagine the old explorers mapping out our oceans and producing the first Atlas to get a sense of the world. Either way, these actions shine a light on the unknown unknowns and give us the possibility of the unseen reality.

The same exercise needs to be first done on the wider data and system landscape to get a sense for the data reality. In comes, a quick win solution, Data Availability Maps (DAMs).

Good news is DAMs don’t need expensive tools or software. Just like any other sensible exercise, a DAM starts from a super high-level and iteratively dives deeper into finer aspects of each data source.

While there are a lot of really good tools to do the finer details, no ‘one tool’ can claim to do a super high-level overview of all your systems. The good fortune to do that still lies with the organisation.

To build a DAM, all you need is a digital spreadsheet that can address the following questions:

What is the ‘name’ of the data source?
Where does the source store this data in the end? (Hint: Cloud is not an answer)
What aspects and features does it track? Is it manual entries or automatic?
If the accuracy could be benchmarked, how reliable are these tracked metrics?
What business questions can these metrics and data sources answer?

The first pass should yield a result to answer the first two questions: What is the data source ‘called’ and where does the data source store the data. It is important that apart from noting what the data source does, it is imperative to also get its pet name like ‘Alpha’ or typically an animal or a mythical hero’s name. So many confusions and arguments at cross-purposes would be resolved if we just called the data sources by one agreed name first.

First draft of a DAM

A fully-formed first draft of the DAM should look something like this:

A few things become apparent from this exercise:

Which aspects are tracked more than a few times across various systems – it’s good to know, for instance, if customer details are tracked in three different systems already

Which system is the most reliable for what kind of metrics. From the table, we can see customer details is more reliable on the ‘Shop Transaction Database’, whereas email subscriptions is more reliable in ‘Marketing DB #2’

Which aspects are trackable but NOT tracked fully or not tracked at all in these systems - basically the whitespaces in the systems that can be exploited.

Benefits of DAM

Aspects tracked in a DAM are still high-level features and not exact metrics in themselves. Think of these aspects as fundamental blocks that when put together, provide a complete picture of what’s going on with the organisation.

Apart from the obvious visibility of data sources available, this exercise also serves to highlight the glaring gaps in the data landscape based on the kind of questions the business wants answered and the data is currently unable to sufficiently answer.

The DAM exercise doesn’t have to be a technical team exercise. Anybody and everybody who cares to make data-driven decisions can build one on their own and merge all their findings together with another interested data soul. The technical aspect of where the data is stored only plays a very crucial role when conversations are had with vendors and third-parties, as it greatly minimises time spent going back and forth with IT/tech teams and gives everyone involved a quick picture of what is doable.

A fully formed DAM

The DAM above is only a first draft that can be done by anyone in the organisation, irrespective of technical skills. However, to put this into action, more specific tools are needed – known in the market as ‘Data Catalog’ tools – which come in both open-source and commercial versions. These Data Catalog tools can then take the DAMs into a totally different level by providing visibility on what is available and how accurate a specific metric is within a feature.

For example, a Data Catalog can say which system captures the age and gender of a customer most accurately and comprehensively, while at the same time ensuring the right privacy and protection of this sensitive data is in place.

A good first step towards having a better hold on your data governance and privacy framework is to ensure a quick DAM is built, as this helps answer business questions and also can morph into a sophisticated data catalog when the time is right.

A DAM is still just the starting point of sorting out the data confusion and GIGO issues. Once a DAM is built, the next big issues with your dataset are on ways to sort out the right availability for the right people, a plan to fill in data gaps and more importantly, a smarter way to ask tougher questions of your data. We’ll cover more about these three aspects in the upcoming series of MYOD.

Until then, for those who’d like to create a DAM of your own - here’s a template to get you started on your MYOD journey.

Tags: data analytics, big data analytics, data-driven marketing

Show Comments

Tweets by @CMOAustralia

Latest Whitepapers

CMO50 special report: Practising personalisation in a new era ...

As our customers exhibit growing desire to be recognised for their preferences and passion points, ...

More whitepapers

Latest Videos

Modern marketing and why it’s a matter of trust: CMO50 2022 series - Episode 3

In the third and final episode of our 3-part CMO50 video series exploring modern marketing and why it’s become a matter of trust, we’re delighted to be joined by Telstra’s former CMO and now digital services and sales executive, Jeremy Nicholas, and Adobe VP Marketing Asia-Pacific and Japan, Duncan Egan.

28 September 2022
Play video

Partner Zone

Are you ready to break out of digital marketing autopilot?

More Partner Zone

Web Events

CMO | Optimizely webinar: State of CX Leadership 2022

State of the CMO 2022: Marketing’s evolving role

Tackling modern marketing's creative gap

Blog Posts

Marketing prowess versus the enigma of the metaverse

Flash back to the classic film, Willy Wonka and the Chocolate Factory. Television-obsessed Mike insists on becoming the first person to be ‘sent by Wonkavision’, dematerialising on one end, pixel by pixel, and materialising in another space. His cinematic dreams are realised thanks to rash decisions as he is shrunken down to fit the digital universe, followed by a trip to the taffy puller to return to normal size.

Liz Miller

VP, Constellation Research

Why Excellent Leadership Begins with Vertical Growth

Why is it there is no shortage of leadership development materials, yet outstanding leadership is so rare? Despite having access to so many leadership principles, tools, systems and processes, why is it so hard to develop and improve as a leader?

Michael Bunting

Author, leadership expert

More than money talks in sports sponsorship

As a nation united by sport, brands are beginning to learn money alone won’t talk without aligned values and action. If recent events with major leagues and their players have shown us anything, it’s the next generation of athletes are standing by what they believe in – and they won’t let their values be superseded by money.

Simone Waugh

Managing Director, Publicis Queensland