Data

Databricks vs Azure Synapse Analytics: Understanding the Differences for Smarter Data Platform Choices

As a data engineer navigating cloud platforms and analytics ecosystems, I’ve worked extensively with both Databricks and Azure Synapse Analytics. While they often appear side-by-side in Azure environments, they come from different design philosophies and cater to slightly different needs—even when they seem to do many of the same things. Both platforms allow you to […]

Databricks vs Azure Synapse Analytics: Understanding the Differences for Smarter Data Platform Choices Read More »

recipe, tab, index, cards, dividers, print, food, book, pages, cookies, candy, seafood, beverage, bread, soup, salad, gray food, gray book, gray books, gray bread, gray candy, tab, index, index, index, index, index

The Hidden Cost of Overloaded Data Fields — And How Data Governance Saves the Day

When One Field Tries to Do Too Much Across many organizations, the same pattern repeats. A single data field—perhaps introduced during a system rollout or legacy data migration—is intended to serve a focused purpose. Over time, however, teams start using it to represent different ideas. Marketing redefines the field to support campaigns. Sales uses it

The Hidden Cost of Overloaded Data Fields — And How Data Governance Saves the Day Read More »

Copy Activity vs Data flows in Azure Data Factory: A Practical Look at Two Competing Features

When building cloud data pipelines with Azure Data Factory (ADF), you’re often presented with two paths: use the tried-and-true Copy Activity, or lean into the newer, powerful Mapping Data flows. Both are native ADF features and technically serve the same purpose—moving and transforming data—but the similarities end there. In our case, we started with Data

Copy Activity vs Data flows in Azure Data Factory: A Practical Look at Two Competing Features Read More »

Using SQL UNPIVOT to Unlock Healthcare Provider Insights with Unpivoted Taxonomy Data

I have worked many years with the NPI dataset and fed a few of my experiences to AI and asked it to write this blog post for me. I think it did a good job. Imagine a conversation between an enterprise IT strategist, a healthcare IT consultant, and a healthcare administrator. The three sit around

Using SQL UNPIVOT to Unlock Healthcare Provider Insights with Unpivoted Taxonomy Data Read More »

Cardboard boxes labeled keep, donate, and trash for effective home organization.

Dealing with Duplicates: A Practical SQL-Based Deduplication System

Many organizations struggle with duplicate customer sites and accounts scattered across operational systems. These duplicates introduce confusion in invoicing, shipping, reporting, and analytics. The immediate need to resolve this data fragmentation often outpaces the availability of enterprise-wide tooling. This post outlines how we built a pragmatic, SQL-based deduplication framework to identify, cluster, and prepare duplicate

Dealing with Duplicates: A Practical SQL-Based Deduplication System Read More »