石油学报 ›› 2009, Vol. 30 ›› Issue (4): 542-549.DOI: 10.7623/syxb200904011

• 地质勘探 • 上一篇    下一篇

复杂储层测井评价数据挖掘方法研究

李洪奇1,2, 郭海峰1,2, 郭海敏3, 孟照旭1,2,4, 谭锋奇1,2, 张军1,2   

  1. 1. 中国石油大学资源与信息学院, 北京, 102249;
    2. 中国石油大学油气资源与探测国家重点实验室, 北京, 102249;
    3. 长江大学地球物理与石油资源学院, 湖北荆州, 434023;
    4. 新疆油田公司勘探开发研究院, 新疆克拉玛依, 834000
  • 收稿日期:2008-09-10 修回日期:2008-12-15 出版日期:2009-07-25 发布日期:2010-05-21
  • 作者简介:李洪奇,男,1960年1月生,1998年获中国科学院固体地球物理专业博士学位,现为中国石油大学(北京)教授,博士生导师,主要从事地球探测与信息技术、测井地质学以及智能信息处理研究.E-mail:hq.li@cup.edu.cn
  • 基金资助:
    国家"十五"科技攻关项目(2001BA605A09)资助.

An approach of data mining for evaluation of complex formation using well logs

LI Hongqi1,2, GUO Haifeng1,2, GUO Haimin3, MENG Zhaoxu1,2,4, TAN Fengqi1,2, ZHANG Jun1,2   

  1. 1. School of Resources and Information Technology, China University of Petroleum, Beijing 102249, China;
    2. State Key Laboratory of Petroleum Resource and Prospecting, China University of Petroleum, Beijing 102249, China;
    3. College of Geophysics and Oil Resources, Yangtze University, Jingzhou 434023, China;
    4. Research Institute of Petroleum Exploration and Development, PetroChina Xinjiang Oilfield Company, Karamay 834000, China
  • Received:2008-09-10 Revised:2008-12-15 Online:2009-07-25 Published:2010-05-21

摘要: 数据挖掘是应对石油勘探开发未来挑战的10项关键技术之一。提出了一种将预测性数据挖掘技术应用于复杂储层测井评价的方法。将遗传算法应用于特征子集选择和建模参数优化,利用重复交叉验证得到泛化误差的无偏估计,并从选定的多学习算法建模结果中优选出最终预测模型。以克拉玛依油田六中区克下组油藏水淹级别划分为例,在该方法框架内对比研究了8种特征子集方案和决策树、神经网络、支持向量机、贝叶斯网络、组合学习等5种分类方法12种预测模型。结果表明支持向量机预测准确率最高,达91.47%,选择其作为最终预测模型,而决策树模型容易理解,作为辅助参考模型。利用该数据挖掘方法解决油、气、水层识别和岩性划分等问题时,能够获得高性能的分类模型,从而将有效地提高解释精度和符合率。

关键词: 储层评价, 数据挖掘, 预测建模, 参数优化, 特征选择, 支持向量机, 水淹层, 决策树

Abstract: Data mining is regarded as one of the ten key techniques for challenging problem of oil exploration and development.A practical approach for evaluation of the complex formation was presented using the predictive data mining techniques.Both feature selection and parameter optimization were performed using the genetic algorithm.The unbiased estimation of generalization error was calculated with the repeated cross-validation.The final optimal model was selected from the results obtained by using the multiple learning algorithms.The water-flooded interval in the Lower Kelamayi Reservoir of Liuzhong area in Karamay Oilfield was evaluated by using eight feature subsets and twelve models obtained from five distinct kinds of classification methods,including Decision Tree (DT),Artificial Neural Network,Support Vector Machines (SVM),Bayesian Network and Ensemble Learning method.The results show that the SVM is superior to others in the prediction accuracy (91.47 % ) and can be used as the final classification model.The DT can be used as the assistant model for discovering knowledge because of its easy understandability.It is suggested that the high-level classification models can be obtained using the data mining approach,and the precision of well log interpretation can be effectively improved in solving the problems such as identification of oil-bearing formation and lithologic discrimination.

Key words: formation evaluation, data mining approach, prediction model, parameter optimization, feature selection, support vector machine, water-flooded interval, decision tree

中图分类号: