石油学报 ›› 2020, Vol. 41 ›› Issue (12): 1657-1664.DOI: 10.7623/syxb202012018

• 石油工程 • 上一篇    下一篇

典型聚类算法在区块抽油机井系统效率分析中的应用

刘合1, 卢秋羽2, 朱世佳1, 蒋薇2, 王素玲2   

  1. 1. 中国石油勘探开发研究院 北京 100083;
    2. 东北石油大学机械科学与工程学院 黑龙江大庆 163318
  • 收稿日期:2020-04-23 修回日期:2020-06-18 出版日期:2020-12-25 发布日期:2021-01-06
  • 通讯作者: 王素玲,女,1975年10月生,1999年获大庆石油学院机械设计及制造专业学士学位,2009年获东北石油大学油气田地面工程专业博士学位,现为东北石油大学教授、博士生导师,主要从事油气田地面工程方面的研究工作。Email:wsl19751028@163.com
  • 作者简介:刘合,男,1961年3月生,1982年获大庆石油学院石油矿场机械专业学士学位,2002年获哈尔滨工程大学控制理论及控制工程专业博士学位,现为中国工程院院士、中国石油勘探开发研究院副总工程师,主要从事机采系统提高效率、低渗透油气藏增产改造、分层注采、井筒工程控制技术方面的科研工作。Email:liuhe@petrochina.com.cn
  • 基金资助:

    科技部创新方法工作专项项目(2018IM040100)资助。

Application of typical clustering algorithms in analysis of system efficiency of pumping wells in blocks

Liu He1, Lu Qiuyu2, Zhu Shijia1, Jiang Wei2, Wang Suling2   

  1. 1. PetroChina Research Institute of Petroleum Exploration and Development, Beijing 100083, China;
    2. School of Mechanical Science and Engineering, Northeast Petroleum University, Heilongjiang Daqing 163318, China
  • Received:2020-04-23 Revised:2020-06-18 Online:2020-12-25 Published:2021-01-06

摘要:

因采油中—后期抽油机系统效率的影响因素多且数据庞杂无特征,造成系统效率调控效果差。基于大数据挖掘技术,以大庆油田中区的采油区块为研究对象,将地面数据和井下数据相结合,采用典型聚类算法揭示了系统效率的变化规律,针对区块数据,采用了k-means和DBSCAN聚类算法应用于油田数据分析,首先利用组间误差平方和、组内误差平方和与总误差平方和之比确定了最佳k值,用k-means算法对数据集进行聚类。然后通过设定不同Mε值用DBSCAN算法对数据集进行聚类,通过将聚类结果可视化并对比两种聚类方法的不同之处发现,k-means算法更符合大庆油田中区数据的聚类分析结果,并以k=4时k-means聚类结果给出了区块低效率井的表现特征,为区块井系统效率调控提供方向指导。

关键词: 抽油机, 系统效率, 大数据, 聚类分析, 影响因素特征

Abstract:

There are many factors affecting the system efficiency of pumping units in the middle and late stages of oil recovery and the data is complex and featureless, resulting in the poor regulation effect on system efficiency. Based on the big data mining technology, taking an oil production block in the central Daqing oilfield as the research object, the paper reveals the changing law of system efficiency by combining surface data and downhole data, and using typical clustering algorithms for block data, the clustering algorithms of k-means and DBSCAN are used to analyze oilfield data. First, the optimal k value is determined by the ratios of the inter-cluster and intra-cluster error square sum to the total error square sum, and the data set is clustered using the k-means algorithm. Then, the data set is further clustered using the DBSCAN algorithm by setting different Minpts and ε values. By visualizing the clustering results, this paper compares the differences between the two clustering methods, and finds that the k-means algorithm is more consistent with the cluster analysis results of the central Daqing oilfield, and provides the representation characteristics of low-efficiency wells in the block according to the k-means clustering results when k=4, thus providing guidance for the regulation of system efficiency for wells in the block.

Key words: pumping unit, system efficiency, big data, cluster analysis, characteristics of influencing factors

中图分类号: