att_abstract={{While there has been significant focus on collecting and managing data feeds, it is only now that attention is turning to their quality. In this paper, we propose a principled approach to online data quality monitoring in a dynamic feed
environment. Our goal is to alert quickly when feed behavior deviates from expectations.

We make contributions in two distinct directions. First, we propose novel enhancements to the DFMS architecture to permit a publish-subscribe approach to incorporate data quality modules into the DFMS architecture. Second, we propose novel temporal extensions to standard statistical techniques to adapt them to online feed monitoring for outlier detection and alert generation at multiple scales along three dimensions: aggregation at multiple time intervals to detect at varying levels of sensitivity; multiple lengths of data history for varying the speed at which models adapt to change; and multiple levels of monitoring delay to address lagged data arrival.

FIT, or Feed Inspection Tool, is the result of a successful implementation of our approach. We present several case studies outlining the effective deployment of FIT in real applications along with user testimonials.}},
	att_authors={ds8961, td3863, vs9593, ds1693},
	att_categories={C_BB.1, C_IIS.6},
	att_copyright={{VLDB Foundation}},
	att_copyright_notice={{The definitive version was published in Very Large Databases, 2015. {{, Volume 8}}{{, Issue 12}}{{, 2015-05-31}}{{, http://www.vldb.org/pvldb/vol8/p1728-dasu.pdf}}
	author={Divesh Srivastava and Tamraparni Dasu and Vladislav Shkapenyuk and Deborah Swayne},
	institution={{Proceedings of the VLDB Endowment}},
	title={{FIT to monitor feed quality}},