Ensuring Quality in Analytics

There are multiple angles of quality that we pay attention to:

  • Quality of the inputs (source data quality)
  • Quality of the deliverables (KPIs, benchmarks, forecasts, models, recommendations, etc.)
  • Quality of the development pipeline (change control and testing before release)
  • Quality of the value or operations pipeline (detecting changes in data, monitoring models and watching for drift, etc.)

Multiple roles may be involved in ensuring quality:

  • The developers (e.g. data scientist, data engineer, ML engineer, data analyst, or other)
  • The development team (e.g. peers reviews, manager reviews, etc.)
  • The product owner (deserves special mention)
  • The user or a representative of the user (e.g. UAT)

I’d like to borrow from an excellent QA methodology created by FedEx named: Quality Driven Management

The key principles in QDM are:

  • Quality is defined by the customer
  • Be scientific
  • Measure measure measure
  • Optimize business performance
  • Quality involves teamwork
  • View failures as opportunities

In the weeks and months to come I will be writing more about how to apply these principles to ensure adequate quality in BI and AI projects while also remaining fast. More to come!