Decision Science

From question to decision

Analytics is not about charts. It is about changing what a leader does next. This page walks through one complete analytical story — from the business question a leader asked, through exploration, pattern identification, driver analysis, and the insight that changed how the organization acted.

Integrity & Confidentiality: All data, percentages, region names, and identifiers shown are entirely synthetic. No real employees, locations, or proprietary data from any organization are represented. Analytical frameworks and methodologies reflect large-scale workforce analytics and transformation work.

"Why are some locations underperforming? We have the same product, the same training. What is different?"

The question

A leader needed an answer

Not a dashboard. A specific answer to a specific business problem — with enough evidence to act on it confidently.

The method

Data before conclusions

Explore first. Find patterns before building models. Let the data reveal what is worth investigating.

The outcome

The intervention changed

Leaders stopped watching turnover. They started watching absence — 90 days earlier in the cycle.

Techniques on this page

PythonSeaborn pairplot Correlation heatmapHierarchical clustering Time seriesEngagement heatmap Geo-risk mappingDriver decomposition D3.jsPower BIExecutive narrative

Chapter 01

Decision Science — the method

Every analysis follows the same six-step discipline. The question determines the analysis — not the other way around. The most common mistake is jumping from step two to step six. Everything in between is where the insight lives.

Decision Science Workflow

A clearer, portfolio-ready version of the analytical flow — from business question to executive decision.

1

Frame the Business Question

Translate an ambiguous request into a well-defined problem. Clarify priorities, decision points, and what action the analysis must influence.

2

Identify the Available Data

Map internal, external, survey, operational, and web-based datasets. Confirm what is reliable, missing, or unavailable.

3

Clean and Prepare the Data

Examine fields at a detailed level, treat missing values, standardize definitions, and create analysis-ready data.

4

Explore Patterns and Features

Segment the data, compare groups, test relationships, and validate observations with business stakeholders.

5

Model, Evaluate, and Refine

Build statistical or machine-learning models only when they help answer the business question. Evaluate, refine, and pressure-test results.

6

Communicate the Decision Story

Visualize findings, explain trade-offs, and translate the insight into a clear recommendation leaders can act on.

Cleaned up visual: from business question to executive decision

This replaces the blurry image with responsive HTML/CSS, so the text stays sharp and fits the page on desktop and mobile.

Frame before collecting

The question determines which data matters. Collecting data without a question produces noise, not insight.

Explore before modelling

A scatter matrix and correlation heatmap before any model. What the data is actually saying — before assumptions take over.

Story before slides

The visual is the last step. Designed around the decision a leader needs to make — not around what looks impressive.

Chapter 02

Data exploration — before any model

Before building anything, the data needs to speak. A scatter matrix reveals distributions and pairwise relationships. A correlation heatmap shows which variables move together. A dendrogram identifies natural clusters. This is Python work — and it determines whether everything that follows will be meaningful.

Synthetic data only. Variable names, values, and patterns shown are generated for demonstration. No real organizational data represented.

LOA rate, tenure, and internal hire rate cluster together. Engagement moves with them. This is not coincidence — it is a system.

Scatter matrix — pairwise relationships (Python / seaborn)

Diagonal: distribution of each variable. Off-diagonal: pairwise scatter. Built in Python before any model is considered.

Scatter matrix showing pairwise workforce variable relationships

What relationships exist between workforce variables?

Years in role, years in organization, age, and shift count. The distributions and scatter patterns determine which variables are worth including in any model.

Correlation heatmap — variable clustering (Python)

Triangular heatmap showing similarity scores. Green intensity = stronger correlation. Three clusters emerged.

Triangular correlation heatmap and dendrogram

Which variables move together — and how do they cluster?

Left: triangular correlation matrix. Top right: dendrogram from hierarchical clustering. Bottom right: actual vs predicted time series across three workforce metrics.

What we found

Tenure and IHR have the strongest positive co-movement. LOA and turnover cluster together. Engagement is downstream — a lagging indicator, not a driver.

Why it changed the approach

Had we skipped exploration and modelled directly, we would have included engagement as a feature. The heatmap showed it moves after LOA — not before.

The Python proof

Seaborn pairplot, scipy hierarchical clustering, matplotlib time series overlay. Built in Google Colab before a single Power BI visual was designed.

Chapter 03

Where is the problem — and who is most affected?

Two questions answered simultaneously. The geo-risk map shows where risk concentrates across the retail network. The engagement heatmap shows who is most affected — by role level and by tenure band. Together they make the problem undeniable.

Synthetic data only. All coordinates, risk scores, percentages, and identifiers are generated for demonstration. No real locations or employee data represented.

Risk concentrates in specific geographies and specific populations — manager-level employees in their first three years are the critical cohort.

Workforce Risk Landscape

Each dot represents a synthetic location. Colour indicates risk tier. No identifiable location-level data is shown.

Low risk Moderate risk High risk Critical risk

Engagement by role level — seven pillars

Single blue ramp. Darker = stronger. Manager level is the consistent weak point.

Engagement by tenure band — the same seven pillars

Scores strong in first three months. Drop sharply in year one through three. Partially recover after five years. Year one is the critical window.

Chapter 04

Why it happens — three pillars, one system

The heatmap showed where and who. The driver analysis showed why. Performance is shaped by three interconnected pillars — each requiring a different intervention. No single factor dominates everywhere.

Synthetic data only. All percentages shown are generated for demonstration.

Roughly 1 in 5 locations underperform. Each is primarily driven by one of three pillars — but the mix varies by region. One intervention strategy does not fit all.

Pillar 1

Employee Experience

Engagement, belonging, development, and management quality. Widespread opportunity — not the primary driver in most locations.

~32%

of underperforming locations

Pillar 2

Workforce Stability

LOA rate, turnover, and staffing availability. Driven by both absence pressure and elevated exits occurring simultaneously.

~35%

of underperforming locations

Pillar 3

Leadership Continuity

Experience in role, organizational tenure, in-location consistency. The upstream driver that shapes everything below it.

~33%

of underperforming locations

Driver distribution by region — stacked view (synthetic)

Power BI — workforce intelligence at enterprise scale

Four dashboards built at Walmart Canada. Each answers one specific question. Top left: store performance with RAG scoring. Top right: workforce composition trends. Bottom left: ER incident geo-mapping across Canada. Bottom right: compliance and investigation tracking.

Power BI dashboards — four workforce intelligence views

From pattern detection to executive dashboard — the full pipeline in production

Built at Walmart Canada. The geo-risk map (bottom left) directly applies the same geographic risk plotting methodology shown in Chapter 03 — real enterprise output of the same analytical approach.

Chapter 05

The full picture

Three pillars. One causal chain. And when that chain breaks — a predictable cycle that repeats until someone intervenes at the right point. This is what the analysis produced: a model simple enough to act on.

Synthetic data only. All metrics and percentages are generated for demonstration.

How the drivers connect

Experience drives workforce outcomes. Workforce stability drives performance. Leadership sustains both. Fix leadership first.

Leadership Continuity

Tenure · experience · consistency

Employee Experience

Engagement · belonging · development

Workforce Stability

LOA · turnover · staffing

Business Performance

Sales · customer experience · outcomes

Experience drives workforce outcomes · workforce drives performance · leadership sustains both

The vicious cycle — when the system breaks

Absence is the leading indicator. By the time turnover spikes, you are already 90 days behind.

Signal 1 · Leading

↑ LOA

Absence climbs 60–90 days before turnover spikes.

Signal 2 · The Cost

↑ TO

Turnover peaks. Maximum organizational stress.

Signal 3 · Pressure

↓ IHR

Internal movement slows. Career opportunity disappears.

Signal 4 · Consequence

↓ ENG

Engagement falls. Disengagement follows instability.

Signal 5 · Recovery

↑ IHR

Internal hiring rebounds. Turnover begins to fall.

Signal 6 · Reset

↑ ENG

Engagement lifts. Cycle closes — until LOA rises again.

Watch LOA — not turnover. Intervene 90 days earlier. That is the insight that changed the strategy.

52-week rolling trend — the cycle made visible (synthetic)

LOA spikes first. IHR drops next. Turnover follows. Once you see it you cannot make the same decisions.

LOA Rate Turnover Internal Hire Rate Engagement

What changed

The organization shifted from monitoring turnover as the primary attrition signal to monitoring LOA rate. Intervention triggers were set at LOA thresholds — before turnover had a chance to spike.

The lesson

The most valuable insight is rarely in the metric everyone is already watching. It is in the upstream signal nobody thought to track. Finding that signal is what decision science is actually for.