Category: data-analysis
-
The Gap Between Junior and Senior Data Scientists Isn’t Code
The Gap Between Junior and Senior Data Scientists Isn’t Code Why my obsession with complex algorithms was actually holding my career back. The post The Gap Between Junior and Senior Data Scientists Isn’t Code appeared first on Towards Data Science. Benjamin Nweke Go to original source
-
Take a Deep Dive into Filtering in DAX
Take a Deep Dive into Filtering in DAX Have you ever wondered what happens when you apply a filter in a DAX expression? Well, Today I will take you on a deep dive into this fascinating topic, with examples to help you learn something new and surprising. The post Take a Deep Dive into Filtering…
-
How to Define the Modeling Scope of an Internal Credit Risk Model
How to Define the Modeling Scope of an Internal Credit Risk Model Dataset construction for Internal Ratings-Based (IRB) Probability of Default (PD) models The post How to Define the Modeling Scope of an Internal Credit Risk Model appeared first on Towards Data Science. JUNIOR JUMBONG Go to original source
-
Decisioning at the Edge: Policy Matching at Scale
Decisioning at the Edge: Policy Matching at Scale Policy-to-Agency Optimization with PuLP The post Decisioning at the Edge: Policy Matching at Scale appeared first on Towards Data Science. Erika Gomes-Gonçalves Go to original source
-
Is the AI and Data Job Market Dead?
Is the AI and Data Job Market Dead? What you should be doing in the current job market The post Is the AI and Data Job Market Dead? appeared first on Towards Data Science. Egor Howell Go to original source
-
Why Every Analytics Engineer Needs to Understand Data Architecture
Why Every Analytics Engineer Needs to Understand Data Architecture Get the data architecture right, and everything else becomes easier. I know it sounds simple, but in reality, little nuances in designing your data architecture may have costly implications. This article provides a crash course on the architectures that shape your daily decisions – from relational…
-
What I Am Doing to Stay Relevant as a Senior Analytics Consultant in 2026
What I Am Doing to Stay Relevant as a Senior Analytics Consultant in 2026 Learn how to work with AI, while strengthening your unique human skills that technology cannot replace The post What I Am Doing to Stay Relevant as a Senior Analytics Consultant in 2026 appeared first on Towards Data Science. Rashi Desai Go…
-
From Transactions to Trends: Predict When a Customer Is About to Stop Buying
From Transactions to Trends: Predict When a Customer Is About to Stop Buying Customer churn is usually a gradual process, not a sudden event. In this post, we analyze monthly transaction trends and convert regression slopes into degrees to clearly identify declining purchase behavior. A small negative slope today can prevent a big revenue loss…
-
From ‘Dataslows’ to Dataflows: The Gen2 Performance Revolution in Microsoft Fabric
From ‘Dataslows’ to Dataflows: The Gen2 Performance Revolution in Microsoft Fabric Dataflows were (rightly?) considered “the slowest and least performant option” for ingesting data into Power BI/Microsoft Fabric. However, things are changing rapidly and the latest Dataflow enhancements changes how we play the game The post From ‘Dataslows’ to Dataflows: The Gen2 Performance Revolution in…
-
Under the Uzès Sun: When Historical Data Reveals the Climate Change
Under the Uzès Sun: When Historical Data Reveals the Climate Change Longer summers, milder winters: analysis of temperature trends in Uzès, France, year after year. The post Under the Uzès Sun: When Historical Data Reveals the Climate Change appeared first on Towards Data Science. Marc Polizzi Go to original source
-
How to Filter for Dates, Including or Excluding Future Dates, in Semantic Models
How to Filter for Dates, Including or Excluding Future Dates, in Semantic Models It is common to have either planning data or the previous year’s data displayed beyond today’s date. But future data can be confusing. How can I add a Slicer to show or hide future data? Let’s see how to do it. The…
-
The Real Challenge in Data Storytelling: Getting Buy-In for Simplicity
The Real Challenge in Data Storytelling: Getting Buy-In for Simplicity What happens when your clear dashboard meets stakeholders who want everything on one screen The post The Real Challenge in Data Storytelling: Getting Buy-In for Simplicity appeared first on Towards Data Science. Benjamin Nweke Go to original source
-
Separate Numbers and Text in One Column Using Power Query
Separate Numbers and Text in One Column Using Power Query An Excel sheet with a column containing numbers and text? What a mess! The post Separate Numbers and Text in One Column Using Power Query appeared first on Towards Data Science. Salvatore Cagliari Go to original source
-
I Cleaned a Messy CSV File Using Pandas . Here’s the Exact Process I Follow Every Time.
I Cleaned a Messy CSV File Using Pandas . Here’s the Exact Process I Follow Every Time. Stop guessing at data cleaning. Use this repeatable 5-step Python workflow to diagnose and fix the most common data flaws. The post I Cleaned a Messy CSV File Using Pandas . Here’s the Exact Process I Follow Every Time. appeared first on Towards…
-
How to Implement Three Use Cases for the New Calendar-Based Time Intelligence
How to Implement Three Use Cases for the New Calendar-Based Time Intelligence Starting with the September 2025 Release of Power BI, Microsoft introduced the new Calendar-based Time Intelligence feature. Let’s see what can be done by implementing three use cases. The future looks very interesting with this new feature. The post How to Implement Three…
-
Modern DataFrames in Python: A Hands-On Tutorial with Polars and DuckDB
Modern DataFrames in Python: A Hands-On Tutorial with Polars and DuckDB How I learned to handle growing datasets without slowing down my entire workflow The post Modern DataFrames in Python: A Hands-On Tutorial with Polars and DuckDB appeared first on Towards Data Science. Benjamin Nweke Go to original source
-
Data Visualization Explained: What It Is and Why It Matters
Data Visualization Explained: What It Is and Why It Matters A brief introduction to data visualization and its importance in today’s technological landscape. The post Data Visualization Explained: What It Is and Why It Matters appeared first on Towards Data Science. Murtaza Ali Go to original source
-
A Focused Approach to Learning SQL
A Focused Approach to Learning SQL Data is everywhere, but how do you draw insights from it? Often, structured data is stored in relational databases, meaning collections of related tables of data. For instance, a company might store customer purchases in one table, customer demographics in another, and suppliers in a third table. These tables…
-
The Crucial Role of Color Theory in Data Analysis and Visualization
The Crucial Role of Color Theory in Data Analysis and Visualization How research-backed color principles improved clarity and storytelling in my dashboards The post The Crucial Role of Color Theory in Data Analysis and Visualization appeared first on Towards Data Science. Benjamin Nweke Go to original source
-
When A Difference Actually Makes A Difference
When A Difference Actually Makes A Difference Bite-Sized Analytics for Business Decision-Makers (1) The post When A Difference Actually Makes A Difference appeared first on Towards Data Science. Mena Wang Go to original source
-
Plato’s Cave and the Shadows of Data
Plato’s Cave and the Shadows of Data On truth, illusion, and the limits of what data can reveal The post Plato’s Cave and the Shadows of Data appeared first on Towards Data Science. Pol Marin Go to original source
-
Everything You Need to Know About the New Power BI Storage Mode
Everything You Need to Know About the New Power BI Storage Mode 50 Shades of Direct Lake The post Everything You Need to Know About the New Power BI Storage Mode appeared first on Towards Data Science. Nikola Ilic Go to original source
-
My Most Valuable Lesson as an Aspiring Data Analyst
My Most Valuable Lesson as an Aspiring Data Analyst What my internship taught me about the power of collaboration in data analysis. The post My Most Valuable Lesson as an Aspiring Data Analyst appeared first on Towards Data Science. Benjamin Nweke Go to original source
-
How to Correctly Apply Limits on the Result in DAX (and SQL)
How to Correctly Apply Limits on the Result in DAX (and SQL) What if the output of a measure mustn’t be above a specific limit? How can we ensure that the total is calculated correctly? This piece is about correctly calculating and summarizing such output. The post How to Correctly Apply Limits on the Result…
-
On Adding a Start Value to a Waterfall Chart in Power BI
On Adding a Start Value to a Waterfall Chart in Power BI A waterfall chart can be a powerful tool for conveying information. But it has some limitations. The post On Adding a Start Value to a Waterfall Chart in Power BI appeared first on Towards Data Science. Salvatore Cagliari Go to original source
-
Confusion Matrix Made Simple: Accuracy, Precision, Recall & F1-Score
Confusion Matrix Made Simple: Accuracy, Precision, Recall & F1-Score How to evaluate classification models and understand which metric matters the most. The post Confusion Matrix Made Simple: Accuracy, Precision, Recall & F1-Score appeared first on Towards Data Science. Nikhil Dasari Go to original source
-
What Is Data Literacy in 2025? It’s Not What You Think
What Is Data Literacy in 2025? It’s Not What You Think In today’s fast-paced, distraction-heavy world, data literacy isn’t just about understanding charts or analyzing numbers—it’s about context, clarity, and human connection. With attention spans shrinking and AI-generated insights flooding our screens, even highly skilled professionals can behave like data novices. The real challenge isn’t…
-
What Is a Query Folding in Power BI and Why should You Care?
What Is a Query Folding in Power BI and Why should You Care? “Will that break a query folding?” “Does your query fold?”… Maybe someone asked you those questions, but you were like: “Query…Whaaaat?! In this article, we demistify the query folding and its importance for efficient data refresh in Power BI The post What…
-
I Analysed 25,000 Hotel Names and Found Four Surprising Truths
I Analysed 25,000 Hotel Names and Found Four Surprising Truths Why are there so many hotels named after cities they are not in? Follow along for a data analysis on hotel names. The post I Analysed 25,000 Hotel Names and Found Four Surprising Truths appeared first on Towards Data Science. Anna Gordun Peiro Go to…
-
The Hidden Trap of Fixed and Random Effects
The Hidden Trap of Fixed and Random Effects My lesson of how blindly over-controlling for noise can erase the effects you are measuring The post The Hidden Trap of Fixed and Random Effects appeared first on Towards Data Science. Ngoc Doan Go to original source
-
POSET Representations in Python Can Have a Huge Impact on Business
POSET Representations in Python Can Have a Huge Impact on Business Discover how POSET indicators transform data into coherent scoring systems, enabling meaningful comparisons while preserving the data’s multi-dimensional semantic structure. The post POSET Representations in Python Can Have a Huge Impact on Business appeared first on Towards Data Science. Andrea D’Agostino Go to original…
-
How to Access NASA’s Climate Data — And How It’s Powering the Fight Against Climate Change Pt. 1
How to Access NASA’s Climate Data — And How It’s Powering the Fight Against Climate Change Pt. 1 From architectural design to food security. The post How to Access NASA’s Climate Data — And How It’s Powering the Fight Against Climate Change Pt. 1 appeared first on Towards Data Science. Marco Hening Tallarico Go to…
-
Implementing IBCS rules in Power BI
Implementing IBCS rules in Power BI Is there a way to use the out-of-the-box features of Power BI to be IBCS compliant? The post Implementing IBCS rules in Power BI appeared first on Towards Data Science. Salvatore Cagliari Go to original source
-
Lessons Learned After 6.5 Years Of Machine Learning
Lessons Learned After 6.5 Years Of Machine Learning Deep work, trends, data, and research The post Lessons Learned After 6.5 Years Of Machine Learning appeared first on Towards Data Science. Pascal Janetzky Go to original source
-
Building A Modern Dashboard with Python and Taipy
Building A Modern Dashboard with Python and Taipy A guide to building a front-end data application. The post Building A Modern Dashboard with Python and Taipy appeared first on Towards Data Science. Thomas Reid Go to original source
-
Why You Should Not Replace Blanks with 0 in Power BI
Why You Should Not Replace Blanks with 0 in Power BI Did someone ask you to replace blank values with 0 in your reports? Maybe you should think twice before you do it! The post Why You Should Not Replace Blanks with 0 in Power BI appeared first on Towards Data Science. Nikola Ilic Go…
-
Mastering SQL Window Functions
Mastering SQL Window Functions Understand how to use Window Functions to perform calculations without losing details The post Mastering SQL Window Functions appeared first on Towards Data Science. Eugenia Anello Go to original source
-
How to Transition From Data Analyst to Data Scientist
How to Transition From Data Analyst to Data Scientist Playbook on how data analysts can become data scientists The post How to Transition From Data Analyst to Data Scientist appeared first on Towards Data Science. Egor Howell Go to original source
-
A Practical Introduction to Google Analytics
A Practical Introduction to Google Analytics Learn the key concepts and reports of Google Analytics while practising with the platform The post A Practical Introduction to Google Analytics appeared first on Towards Data Science. Eugenia Anello Go to original source
-
How Microsoft Power BI Elevated My Data Analysis and Visualization Workflow
How Microsoft Power BI Elevated My Data Analysis and Visualization Workflow Explaining useful features every data analyst needs The post How Microsoft Power BI Elevated My Data Analysis and Visualization Workflow appeared first on Towards Data Science. Benjamin Nweke Go to original source
-
How to Reduce Your Power BI Model Size by 90%
How to Reduce Your Power BI Model Size by 90% Have you ever wondered what makes Power BI so fast and powerful when it comes to performance? Learn on a real-life example about data model optimization and general rules for reducing data model The post How to Reduce Your Power BI Model Size by 90%…
-
About Calculating Date Ranges in DAX
About Calculating Date Ranges in DAX When performing date calculations, creating date ranges can be helpful. But how can we do this, and which DAX function can help us in which case? Now you can learn more about this topic. The post About Calculating Date Ranges in DAX appeared first on Towards Data Science. Salvatore…
-
Log Link vs Log Transformation in R — The Difference that Misleads Your Entire Data Analysis
Log Link vs Log Transformation in R — The Difference that Misleads Your Entire Data Analysis Although normal distributions are the most commonly used, a lot of real-world data unfortunately is not normal. When faced with extremely skewed data, it’s tempting for us to utilize log transformations to normalize the distribution and stabilize the variance. I…
-
Time Series Forecasting Made Simple (Part 2): Customizing Baseline Models
Time Series Forecasting Made Simple (Part 2): Customizing Baseline Models Thank you for the kind response to Part 1, it’s been encouraging to see so many readers interested in time series forecasting. In Part 1 of this series, we broke down time series data into trend, seasonality, and noise, discussed when to use additive versus…
-
The Dangers of Deceptive Data Part 2–Base Proportions and Bad Statistics
The Dangers of Deceptive Data Part 2–Base Proportions and Bad Statistics This is a follow-up to my earlier article: The Dangers of Deceptive Data–Confusing Charts and Misleading Headlines. My first article focused on how visualizations can be used to mislead, diving into a form of data presentation widely used in public matters. In this article,…
-
How to Get Performance Data from Power BI with DAX Studio
How to Get Performance Data from Power BI with DAX Studio Introduction To put things straight: I will not discuss how to optimize DAX Code today. More articles will follow, concentrating on common mistakes and how to avoid them. But, before we can understand the performance metrics, we need to understand the architecture of the…
-
Beyond the Code: Unconventional Lessons from Empathetic Interviewing
Beyond the Code: Unconventional Lessons from Empathetic Interviewing Recently, I’ve been interviewing Computer Science students applying for data science and engineering internships with a 4-day turnaround from CV vetting to final decisions. With a small local office of 10 and no in-house HR, hiring managers handle the entire process. This article reflects on the lessons…
-
How to Write Queries for Tabular Models with DAX
How to Write Queries for Tabular Models with DAX Introduction EVALUATE is the statement to query tabular models. Unfortunately, knowing SQL or any other query language doesn’t help as EVALUATE follows a different concept. EVALUATE has only two “Parameters”: A table to show A sort order (ORDER BY) You can pass a third parameter (START…
-
Fourier Transform Applications in Literary Analysis
Fourier Transform Applications in Literary Analysis Poetry is often seen as a pure art form, ranging from the rigid structure of a haiku to the fluid, unconstrained nature of free-verse poetry. In analysing these works, though, to what extent can mathematics and Data Analysis be used to glean meaning from this free-flowing literature? Of course,…
-
Experiments Illustrated: How Random Assignment Saved Us $1M in Marketing Spend
Experiments Illustrated: How Random Assignment Saved Us $1M in Marketing Spend Running cool experiments is easily one of my favorite parts of working in data science. Most experiments don’t deliver big wins, so the winners make for fun stories. We’ve had a few of these at IntelyCare, and I’m sharing each story in a way…
-
One-Tailed Vs. Two-Tailed Tests
One-Tailed Vs. Two-Tailed Tests Introduction If you’ve ever analyzed data using built-in t-test functions, such as those in R or SciPy, here’s a question for you: have you ever adjusted the default setting for the alternative hypothesis? If your answer is no—or if you’re not even sure what this means—then this blog post is for…
-
Do European M&Ms Actually Taste Better than American M&Ms?
Do European M&Ms Actually Taste Better than American M&Ms? (Oh, I am the only one who’s been asking this question…? Hm. Well, if you have a minute, please enjoy this exploratory Data Analysis — featuring experimental design, statistics, and interactive visualization — applied a bit too earnestly to resolve an international debate.) 1. Introduction 1.1…
-
Triangle Forecasting: Why Traditional Impact Estimates Are Inflated (And How to Fix Them)
Triangle Forecasting: Why Traditional Impact Estimates Are Inflated (And How to Fix Them) Accurate impact estimations can make or break your business case. Yet, despite its importance, most teams use oversimplified calculations that can lead to inflated projections. These shot-in-the-dark numbers not only destroy credibility with stakeholders but can also result in misallocation of resources and…
-
Myths vs. Data: Does an Apple a Day Keep the Doctor Away?
Myths vs. Data: Does an Apple a Day Keep the Doctor Away? Introduction “Money can’t buy happiness.” “You can’t judge a book by its cover.” “An apple a day keeps the doctor away.” You’ve probably heard these sayings several times, but do they actually hold up when we look at the data? In this article series,…
-
How to do Date calculations in DAX
How to do Date calculations in DAX Moving back and forth in time is a common task for Time Intelligence in DAX. Let’s take a deeper look on how DATEADD() works. Continue reading on Towards Data Science » Salvatore Cagliari Go to original source
-
Water Cooler Small Talk, Ep 7: Anscombe’s Quartet and the Datasaurus
Water Cooler Small Talk, Ep 7: Anscombe’s Quartet and the Datasaurus Why descriptive statistics aren’t enough and plotting your data is always essential Continue reading on Towards Data Science » Maria Mouschoutzi, PhD Go to original source
-
Does It Matter That Online Experiments Interact?
Does It Matter That Online Experiments Interact? What interactions do, why they are just like any other change in the environment post-experiment, and some reassurance Photo by Uriel Soberanes on Unsplash Experiments do not run one at a time. At any moment, hundreds to thousands of experiments run on a mature website. The question comes up:…
-
The Basics you Must Master Before Diving into Marketing & Product Analytics
The Basics you Must Master Before Diving into Marketing & Product Analytics Things that still confuse many Data Analysts Recently, I gave a presentation on a specific topic: how to investigate drop-offs in conversion funnels within the context of marketing and product analysis. What surprised me? The incredible engagement from the audience. The questions were varied…
-
Data-Driven Decision Making with Sentiment Analysis in R
Data-Driven Decision Making with Sentiment Analysis in R Leveraging the Quanteda, Textstem and Sentimentr Packages to Extract Customer Insights and Enhance Business Strategy Continue reading on Towards Data Science » Devashree Madhugiri Go to original source
-
My Experience Switching From Power BI to Looker (as a Senior Data Analyst)
My Experience Switching From Power BI to Looker (as a Senior Data Analyst) What you need to know before you switch from Power BI to Looker. Continue reading on Towards Data Science » Tomas Jancovic (It’s AI Thomas) Go to original source
-
The Data Analyst Every CEO Wants
The Data Analyst Every CEO Wants Data Analyst is probably the most underrated job in the data industry Continue reading on Towards Data Science » Benoit Pimpaud Go to original source
-
Scale Experiment Decision-Making with Programmatic Decision Rules
Scale Experiment Decision-Making with Programmatic Decision Rules Decide what to do with experiment results in code Photo by Cytonn Photography on Unsplash The experiment lifecycle is like the human lifecycle. First, a person or idea is born, then it develops, then it is tested, then its test ends, and then the Gods (or Product Managers) decide its worth.…
-
How to Build an AI Agent for Data Analytics Without Writing SQL
How to Build an AI Agent for Data Analytics Without Writing SQL Create a comprehensive AI agent from the ground up utilizing LangChain and DuckDB Continue reading on Towards Data Science » Chengzhi Zhao Go to original source
-
Data behind the Luck, Ambition, and a Billion-Dollar Dream: Lottery
Data behind the Luck, Ambition, and a Billion-Dollar Dream: Lottery Using Seattle’s local retail store data for consumer patterns of the lottery (SQL, Python) Continue reading on Towards Data Science » Sunghyun Ahn Go to original source
-
Jingle Bells and Statistical Tests
Jingle Bells and Statistical Tests Data Types, Hypotheses and Statistical Tests That Fit Them with Festive Christmas Market Examples🎄🎅🎡 Continue reading on Towards Data Science » Gizem Kaya Go to original source
-
5 Essential Tips to Build Business Dashboards Stakeholders Love
5 Essential Tips to Build Business Dashboards Stakeholders Love A practical guide to designing clear, effective, and actionable dashboards for decision-making Continue reading on Towards Data Science » Yu Dong Go to original source
-
Bridging the Data Literacy Gap
Bridging the Data Literacy Gap The Advent, Evolution, and Current state of “Data Translators” Introduction With Data being constantly glorified as the most valuable asset organizations can own, leaders and decision-makers are always looking for effective ways to put their data insights to use. Every time customers interact with digital products, millions of data points…
-
Think you Know Excel? Take Your Analytics Skills to the Next Level with Power Query!
Think you Know Excel? Take Your Analytics Skills to the Next Level with Power Query! 5 practical use cases that prove Power Query is worth exploring. I have a confession to make: I’ve been living under a rock 🪨. Not literally, but how else can I explain not discovering Power Query in Excel until now? Imagine…
-
Five Reasons You Cannot Afford Not Knowing Probability Proportional to Size (PPS) Sampling
Five Reasons You Cannot Afford Not Knowing Probability Proportional to Size (PPS) Sampling Data Science Simple Random Sampling (SRS) works, but if you do not know Probability Proportional to Size Sampling (PPS), you are risking yourself some critical statistical mistakes. Learn why, when, and how you can use PPS Sampling here! Photo by Justin Morgan on Unsplash…