[1]胡诗苑,高金良,钟丹,等.供水管网流量监测数据异常值检测方法对比分析[J].中国给水排水,2024,40(3):53-59.
HUShi-yuan,GAOJin-liang,ZHONGDan,et al.Comparison of Methods for Flow Monitoring Data Outlier Detection in Water Distribution Network[J].China Water & Wastewater,2024,40(3):53-59.
点击复制
HUShi-yuan,GAOJin-liang,ZHONGDan,et al.Comparison of Methods for Flow Monitoring Data Outlier Detection in Water Distribution Network[J].China Water & Wastewater,2024,40(3):53-59.
供水管网流量监测数据异常值检测方法对比分析
中国给水排水[ISSN:1000-4062/CN:12-1073/TU]
卷:
第40卷
期数:
2024年第3期
页码:
53-59
栏目:
出版日期:
2024-02-01
- Title:
- Comparison of Methods for Flow Monitoring Data Outlier Detection in Water Distribution Network
- Keywords:
- flow monitoring data; outlier detection; Boxplot; LOF; Prophet
- 摘要:
- 随着信息化技术的发展,水务企业迎来了智慧化转型升级。数据采集与预处理作为水务企业实现智慧管理的重要前序步骤,为后续数据挖掘、运营管理、调度决策提供了基础。由于环境的影响、管网中的随机扰动、管网事故等原因,监测数据的质量问题广泛存在,因此寻求有效的供水管网流量监测数据的异常值检测方法至关重要。基于此,首先根据供水管网流量监测数据的基本特征和时间维度的相关性,将常见异常归纳为3 个类型;其次,以东南沿海某市的真实小区流量监测数据为例,分别探究基于统计、密度和预测的Boxplot、LOF与Prophet异常值检测模型在不同类型异常数据检测中的性能。结果表明,Boxplot与LOF模型能够较准确地识别出异常数据,但Boxplot对异常的判断标准较宽泛,容易将部分非异常数据识别为异常点,Prophet对于不稳定性较高的流量数据识别效果有限。
- Abstract:
- With the development of information technology, water enterprises are undergoing intelligent transformation and upgrading. Data collection and preprocessing is an important pre-step for water enterprises to realize intelligent management, and provides a foundation for subsequent data mining, operation management and scheduling decision. Due to the reasons such as environmental factors, random disturbance in the pipe network and pipe network accident, monitoring data quality issues exist widely, making it is very important to find an effective method for flow monitoring data outlier detection in water distribution network. The common anomalies were firstly classified into three categories according to the basic characteristics and temporal correlation of flow monitoring data in water distribution network. Then, the performance of Boxplot, LOF and Prophet outlier detection models based on statistics, density and prediction in the detection of different types of real flow monitoring data outliers was explored in a southeast coastal city of China. Boxplot and LOF models identified outliers more accurately. However, Boxplot had broad criteria for outlier identification, and it was easy to identify some non-abnormal data as outliers. Prophet had limited effectiveness in identifying unstable flow data.
更新日期/Last Update:
2024-02-01