Data Observability
Stale Data Explained: Why It Kills Data-Driven Organizations
Dashboards that don’t refresh, machine learning applications that don’t learn, and other consequences of stale data.
How Best Egg Implemented a Reliable Data Mesh with Data Observability
See how the fast-growing fintech marketplace has matured their data stack and driven increased levels of data quality, trust,…
How BlaBlaCar Built a Practical Data Mesh to Support Self-Service Analytics at Scale
See how BlaBlaCar reduced incidents and time to insights by enabling self-service analytics and implementing data mesh.
Rise of the MLOps Engineer And 4 Critical ML Model Monitoring Techniques
MLOps engineers are automating ML model monitoring to quickly detect problems like pipeline issues, model drift, feature drift and more.
How Data Enablement Drives Sustainable Value at Upside
Upside leverages a model that emphasizes upfront investments in data enablement to create self-sustaining “data gardens.” Here’s how.
How Mercari Operationalizes Data Reliability Engineering at Scale
6 best practices from Mercari’s data reliability engineering team for ensuring high-quality data.
5 Ways to Use Column Level Data Lineage
Dive deep into use cases and explore the connections between column and table-level lineage. Read on to master the art…
Introducing Table Health Dashboard, a Better Way to Track Data Quality Coverage at Scale
Monte Carlo’s Table Health Dashboard gives data teams visibility into the reliability and monitoring coverage of their most critical data…
Modern Data Quality Management: A Proven 6 Step Guide
This 6 step data quality management framework has helped hundreds of organizations achieve higher quality data across their modern data…
How Checkout.com Achieves Data Reliability at Scale with Monte Carlo
Learn how Checkout.com gained visibility into data across domains, scaled data quality checks, and achieved reliability at scale.
IMPACT 2022: The Data Observability Summit Videos Are Now Available On Demand
Missed IMPACT? Don't worry! All our 2022 sessions—including keynotes with Nate Silver, Jay Kreps, and more—are now available on demand.
The 31 Flavors of Data Lineage And Why Vanilla Doesn’t Cut It
4 critical reasons why your data observability solution needs to have data lineage.
How PepsiCo Achieved Data Quality at Scale with Monte Carlo
Learn how the data team at PepsiCo uses data observability through Monte Carlo to discover data incidents faster.
How Blend Scales the Impact of Reliable Data with dbt Cloud and Monte Carlo
Discover how Blend’s data team leverages Monte Carlo and dbt Cloud to reduce compute costs and deliver more reliable data…
Freshly’s Journey to Building Their 5-Layer Data Platform Architecture
How Freshly, a leading meal delivery service, built a more reliable data platform architecture with Snowflake, Fivetran, dbt, Looker, and…
Find and Solve Databricks Data Quality Issues with Monte Carlo
Monte Carlo “Sample Rows” and “Reproduce Anomalies” functionality gives the ability to sample impacted rows of an incident and reproduce…
How Collaborative Imaging Delivers Healthier Data Products with Monte Carlo
In healthcare, bad data can have severe implications. Here's how Collaborative Imaging uses Monte Carlo to drive data health at…
Our Top 5 Most Popular Data Engineering Articles In 2022
Data mesh, data observability, data contracts, data platforms and our other most popular data engineering articles.
Barr Moses: My Top 5 Articles of 2022
Covering 2023 predictions, data self-service, KPIs, big data egos, underestimating data issues and other issues that are top of mind…
Using Data Observability For Third-Party Data Validation
Third-party data validation and ingestion at scale is not easy. Here is one way to solve this challenge.
From Concept to Reality: Migrating to Data Mesh at BairesDev with Databricks and Monte Carlo
Migrating to data mesh? Learn how BairesDev, a leading Brazilian software development company, got started on this epic data journey.
How ELT Schedules Can Improve Root Cause Analysis For Data Engineers
Why Bayesian networks hold more promise than segmentation analysis.
How BlaBlaCar Reduced Data Incident Time to Resolution by 100+ Hours Per Quarter with Monte Carlo
As part of their data mesh migration, the carpooling company’s data engineering team unlocked unprecedented levels of productivity through decentralization,…
How To Implement Data Mesh: Top Tips From 4 Data Leaders
Four data leaders from leading organizations give their practical advice on how to implement data mesh.
How SeatGeek Reduced Data Incidents to Zero with Data Observability
In this video, SeatGeek's Brian London and Kyle Shannon share how data observability helped their data team reduce data incidents…
Announcing Monte Carlo’s Data Reliability Dashboard, a Better Way to Understand the Health of Your Data
Data Reliability Dashboard gives data engineers the tools necessary to measure data uptime, drive operational improvements, and scale reliability.
5 Steps To A Successful Data Warehouse Migration
Real lessons from recent data warehouse migrations like Qubole to AWS EMR and MySQL to AWS Redshift.
Monitoring for the dbt Semantic Layer and Beyond
Let’s talk about the dbt Semantic Layer as well as anomaly detection, resolution, and prevention across the data most important…
Why Data Cleaning is Failing Your ML Models – And What To Do About It
When it comes to achieving model accuracy, data cleaning alone is insufficient. Here’s why.
The Significance of O’Reilly’s Data Quality Fundamentals
O'Reilly's Data Quality Fundamentals is the publishing house’s first-ever book on data observability.
How Dr. Squatch Keeps Data Clean & Fresh with Monte Carlo
Data observability helps the groundbreaking men’s personal care product company maintain excellent data hygiene.
Big Data (Quality), Small Data Team: How Prefect Saved 20 Hours Per Week with Data Observability
Learn how Dylan Hughes and Prefect’s lean data team kept data reliability high and costs low with Monte Carlo.
How to Make Data Anomaly Resolution Less Cartoonish
Fixing broken data doesn’t have to be a game of whack-a-mole. Here’s how to speed up your data incident resolution…
New Feature Recap: Data Lakehouse Support, Anomalous Row Distribution Monitors, and More!
Highlighting Monte Carlo's latest product releases, including data lakehouse support, and anomalous row distribution monitors.
You Can’t Out-Architect Bad Data
Even with the most well-designed data platforms, systems will break. Without some measure of observability, you’re playing with fire.
Data Quality Monitoring – You’re Doing It Wrong
Monitoring just your “important” data only gets you so far. Here’s a better approach.
5 Steps to Operationalizing Data Observability with Monte Carlo
Driving early value with your new data observability platform doesn't have to be difficult. We share 5 tips for driving…
A Data Engineer’s Guide to Building Reliable Systems
Over the years, I’ve helped companies of all sizes build and maintain data systems—from my days as a data engineer…
The Future of Big Data Analytics & Data Science: 5 Trends of Tomorrow
What does the future of big data analytics hold? Will our analytical tools scale fast enough to provide real business…
Data Observability First, Data Catalog Second. Here’s Why.
You can’t realize the full value of a data catalog without observability. Here’s why.
How To Create Data Trust Within Your Organization
How to build data trust by preventing data incidents before they happen with the data uptime metric.
Data Engineers Spend Two Days Per Week Firefighting Bad Data, Data Quality Survey Says
Check out the results from our 2022 data quality survey and benchmark your data quality practices against 300 of your…
Data Contracts and 4 Other Ways to Overcome Schema Changes
There is a virtually unlimited number of ways data can break. It could be a bad JOIN statement, an untriggered…
Snowflake Data Mesh: Ensure Reliable Data with Data Observability
Here’s how Snowflake and Monte Carlo are working together to help data teams realize the potential of the data mesh…
Monte Carlo Achieves Snowflake Premier Partner Status to Help Companies Accelerate the Adoption of Reliable Data
With over 70 mutual customers, Monte Carlo becomes the first data observability provider to achieve Snowflake Premier Partner status.
Snowflake Observability and 4 Reasons Data Teams Should Invest In It
Snowflake is a gamechanger for your data strategy. With the right approach to Snowflake observability, you can unlock its full…
Building An External Data Product Is Different. Trust Me. (but read this anyway)
Developing an external data product is different, and let's face it, harder than serving internal customers. We dive into 5…
Building Spark Lineage For Data Lakes
Spark lineage has been a blind spot for the data engineering industry, so we set off to engineer a solution. Here's…
How Monte Carlo and Snowflake Gave Vimeo a “Get Out Of Jail Free” Card For Data Fire Drills
See how Snowflake and Monte Carlo helped Vimeo achieve world-class data reliability on a massive scale.
The Ultimate Guide To Data Lineage
Data lineage is a must-have feature of the modern data stack, yet we're struggling to derive value from it. Here's…
DataOps Explained: How To Not Screw It Up
DataOps merges data engineering and data science teams to support an organization’s data needs, in a similar way to how…
5 Ways to Improve Data Quality with the New Monte Carlo Data Quality Trends Dashboard
The new Monte Carlo Dashboard incorporates data and visualization to provide actionable insights to users across data teams.
You Have More Data Quality Issues Than You Think
On average, companies experience one data issue for every 15 tables in their warehouse. Here are 8 reasons why and…
The Cost of Bad Data Has Gone Up. Here Are 8 Reasons Why.
The rising cost of bad data and poor data quality has nothing to do with inflation and everything to do…
Data Observability Doesn’t Just Create Savings – It Drives Revenue, Too
If you think the benefits of data observability stop at cost cutting or avoiding bad outcomes, you’re only looking at…
Treat Your Data Like An Engineering Problem: An Interview with Snowflake Director of Product Management Chris Child
Snowflake Director of Product Management Chris Child talks about the role of data observability solutions in the modern data stack…
What is Data Observability? 5 Key Pillars To Know
Data observability improves data quality with features like data monitoring, lineage, automated root cause analysis, and data health insights to…
Data Observability for Developers: Announcing Monte Carlo’s Python SDK
Our Python SDK gives data engineers programmatic access to Monte Carlo to augment our platform’s lineage, cataloging, and monitoring functionalities.
10 Quick Tips for Getting Started with Monte Carlo
Getting started with Monte Carlo and data observability? Here's how to use 10 of our most popular features.
Data Observability vs. Data Testing: Everything You Need to Know
You already test your data. Do you need observability, too?…
Stop Treating Your Data Engineer Like A Data Catalog
How to build a data certification program so everyone knows what to expect and what data to trust.
The Non-Engineer’s Guide to Bad Data
According to a recent study by HFS, 75 percent of executives don’t trust their data. Here’s why and what data-reliant…
Now Available: O’Reilly Data Quality Fundamentals, Chapter 3
Available today, Chapter 3 of O'Reilly's Data Quality Fundamentals outlines the tools and techniques necessary to build more resilient data…
Monte Carlo Announces dbt Core Integration to Help Companies Ship Reliable Data Faster
When it comes to achieving reliable data, Monte Carlo, the leading data observability platform and dbt, the data build tool,…
Data Observability: How Clearcover Increased Quality Coverage for ELT by 70 Percent
Learn how the data engineering team at Clearcover increased data quality coverage across their stack by 70 percent with Monte…
Reflections on tech, trust, and data adoption
A few quick thoughts on why trust, and not technology, is stopping today's leaders from driving adoption and impact with…
How to Achieve More Trustworthy Data Pipelines with the Prefect Integration for Monte Carlo
With Monte Carlo and Prefect’s strategic partnership and integration, data engineering teams can seamlessly manage the reliability of their data…
Monte Carlo Named to First-Ever Intelligent Apps Top 40 List
Monte Carlo was recognized as one of the first-ever companies named to the Intelligent Applications 40 list.
IMPACT 2021: The Data Observability Summit Videos Are Now Available On Demand
Missed IMPACT? Have no fear! Full recordings of our keynotes, panels, and fireside chats with Bob Muglia, DJ Patil, Zhamak…
How The Farmer’s Dog Achieves Self-Serve Data Observability with Monte Carlo
How the data team at The Farmer's Dog, a fresh dog food company, achieves reliable data pipelines with automated, end-to-end…
Monte Carlo Launches Insights to Help Data Teams Understand What Data Matters Most to Your Business
Monte Carlo Insights is the first solution on the market to offer customers operational analytics about their data environment.
Unicorns, data mesh, category creation, and more reasons to attend IMPACT: The Data Observability Summit
Five reasons why you should attend IMPACT, the world's first Data Observability summit on Wednesday, November 3, 2021.
Data Observability 101: Everything You Need to Know to Get Started
What is data observability and does it make sense for your stack? Here’s your go-to guide to starting on the…
The Future of Data Engineering as a Data Engineer
Is the data engineer still the "worst seat at the table?" Maxime Beauchemin, creator of Apache Airflow, weighs in on…
Announcing O’Reilly’s Data Quality Fundamentals
Available today, Data Quality Fundamentals' press release chapters dive into how some of the best teams are architecting for data…
Monitors as Code: A New Way to Deploy Custom Data Quality Monitors From Your CI/CD Workflow
Monte Carlo releases Monitors as Code, allowing data engineers to easily configure new data quality monitors as part of their…
Data Observability: Five Quick Ways to Improve the Reliability of Your Data
Five common data observability use cases and how they can help your team improve data quality at scale and trust…
Bob Muglia, former Snowflake CEO, to Speak at IMPACT, the World’s First Data Observability Summit
Muglia will join the first Chief Data Scientist of the U.S., the founder of the data mesh, and the creator…
Data Anomaly Detection: Why Your Data Team Is Just Not That Into It
Delivering reliable data products doesn't have to be so painful. Introducing a more proactive approach to detecting data anomalies: the…
Reverse ETL and Data Observability: Solving Data’s “Last Mile” Problem
How Reverse ETL and Data Observability can help teams go the extra mile when it comes to trusting your data…
What is a Data Incident Commander?
How data teams can build more resilient incident workflows with DevOps best practices.
How Vimeo Achieved End-to-End Visibility in Snowflake and Looker with Monte Carlo
Learn why the data engineering team at Vimeo chose to partner with Monte Carlo for data observability.
The Ultimate Data Observability Checklist
Here are the 5 things every data observability strategy needs to help companies achieve end-to-end data trust.
Getting Started: Automatic Detection and Alerting for Data Incidents with Monte Carlo
Here’s how data teams get up and running with Monte Carlo to automatically detect and alert on data incidents with…
Data Quality Solutions: Build or Buy? 4 Things To Know
Investing in a data quality solution? Here's everything you need to know.
Announcing Monte Carlo’s Incident IQ, a Root Cause Analysis Workflow for Data Teams
How to get started with Incident IQ, Monte Carlo's all-in-one solution for troubleshooting and preventing broken data pipelines.
The Ultimate Guide to Data Quality
What is data quality and why does it matter?…
Monte Carlo and PagerDuty Integration Brings DevOps to Data Pipelines with End-to-End Data Observability
Monte Carlo's PagerDuty integration helps data engineering teams achieve greater visibility into the end-to-end health of their data pipelines.
Beyond Monitoring: The Rise of ML Observability
Modern data and machine learning systems need both monitoring and observability. Here’s why.
How to Extract Snowflake Data Observability Metrics Using SQL in 5 Steps
Monitor the health of your Snowflake data pipelines with these 7 queries to extract Snowflake data observability metrics.
How to Conduct Data Incident Management for Data Teams
Conduct data incident management with 4 simple steps to identify, root cause, and fix data quality issues at scale.
The Right Way to Measure ROI on Data Quality
Introducing a better approach for measuring the cost of bad data to your business.
The Data Engineer & Scientist’s Guide To Root Cause Analysis for Data Quality Issues
Introducing a five-step engineering root cause analysis approach used by some of the best data engineering and data science teams…
5 Reasons Data Discovery Platforms Are Best For Data Lakes
Here are 5 reasons why using a data discovery platform is a better alternative to data catalogs to ensure your…
5 Things Every Data Engineer Needs to Know About Data Observability
With data observability, data engineers can think more strategically about tackling the "good pipelines, bad data" problem.
Data Observability in Practice: Data Monitoring at Scale with SQL and Machine Learning
How to make your own data observability monitors from scratch and leverage basic principles of machine learning to apply them…
The Ultimate Data Observability Checklist
For most teams, data observability is more than just setting up a bunch of pipeline tests and hoping for the…
The New Rules of Data Quality
Unit testing your data only gets you so far. Here’s a better way to manage data quality at scale.
Data Observability: How to Build Your Own Data Anomaly Detectors Using SQL
How to use metadata to understand the root cause of data anomalies and take your data quality testing to the…
Why You Need to Set SLAs for Your Data Pipelines
How to set expectations around data quality and reliability for your company…
Data Observability in Practice Using SQL
A step-by-step tutorial for creating your own data quality monitors to catch freshness and distribution anomalies in your data pipelines.
Data Pipeline Monitoring: 5 Strategies To Stop Bad Data
Your data broke. Now what? Here's how some of the best data teams prevent data downtime and, in the process,…
3 Reasons You Can’t Rely on Testing Data Pipelines to Find Quality Issues
Why aren't we treating data as the dynamic, ever-evolving entity it is? Here's why a hybrid approach to testing data…
How to Improve Data Engineering Workflows with End-to-End Data Observability
With data observability, data teams can now identify and prevent inaccurate, missing, or erroneous data from breaking your analytics dashboards,…
Incident Prevention for Data Teams: Introducing the 5 Pillars of Data Observability
The five pillars of data observability are: Freshness, Distribution, Volume, Schema, Lineage…
Metadata is Useless — Unless You Have a Use Case
Here's why having metadata and lineage without a clear business application is worse than having no metadata at all.
The Data Downtime Before Christmas
What happens when a freshness anomaly threatens to ruin Christmas? Turns out, even Santa Claus and his elves aren’t…
Data Catalogs Are Dead; Long Live Data Discovery
Data catalogs aren't cutting it any more when it comes to metadata management and data governance. Here's how data discovery…
A Summer at Monte Carlo: Improving Data Pipeline Observability at Scale
How I spent my summer internship on Monte Carlo’s software engineering team…
Bringing Reliable Data and AI to the Cloud: A Q&A with Databricks’ Matei Zaharia
An interview with Apache Spark creator Matei Zaharia on all things AI, the cloud, and data reliability…
Demystifying Data Observability
3 practical examples on how to get started with data observability…
Data Observability: How to Fix Your Broken Data Pipelines
In 2020, data is the new software. While software needs to be highly available, data needs to be highly reliable.
How to Solve the “You’re Using THAT Table?!” Problem
How to keep track of your data warehouse's most critical tables and reports…
Data Observability Tools: Data Engineering’s Next Frontier
To keep pace with data’s clock speed of innovation, data engineers need to invest in data observability, the next frontier…
[VIDEO] Introducing Data Downtime: From Firefighting to Winning
During a 2019 Data Council meetup, Monte Carlo Co-founder & CEO Barr Moses discusses why data downtime matters to the…
How to Calculate the Cost of Data Downtime
Introducing a better way to measure the financial impact of bad data on your company…
How to Fix Your Data Quality Problem
Introducing a better way to prevent bad data.
What is Data Reliability?
And how to use it to start trusting your data.
How to Migrate to Snowflake Like a Boss
3 things you need to know for a smooth migration.
What We Got Wrong About Data Governance
And how we can make it right.
12 Data Quality Metrics That ACTUALLY Matter
How to improve your Data Quality Metrics and why it matters to your business.
Good Pipelines, Bad Data
How to start trusting data in your company.
Closing the Data Downtime Gap
How to get ahead of bad data.
What is Data Downtime?
Data downtime refers to periods of time when your data is partial, erroneous, missing or otherwise inaccurate.
[Video] What is Data Observability?
What is Data Observability?…