• 中国出版政府奖提名奖

    中国百强科技报刊

    湖北出版政府奖

    中国高校百佳科技期刊

    中国最美期刊

    Volume 35 Issue 2
    Mar.  2010
    Turn off MathJax
    Article Contents
    GU Qiong, CAI Zhi-hua, ZHU Li, WANG Xian-ming, 2010. A Novel Hybrid Re-Sampling Algorithm and Its Application in Predicting Rockburst. Earth Science, 35(2): 311-316. doi: 10.3799/dqkx.2010.032
    Citation: GU Qiong, CAI Zhi-hua, ZHU Li, WANG Xian-ming, 2010. A Novel Hybrid Re-Sampling Algorithm and Its Application in Predicting Rockburst. Earth Science, 35(2): 311-316. doi: 10.3799/dqkx.2010.032

    A Novel Hybrid Re-Sampling Algorithm and Its Application in Predicting Rockburst

    doi: 10.3799/dqkx.2010.032
    • Received Date: 2009-04-26
    • Publish Date: 2010-03-01
    • Because of poor understanding about the mechanism of rockbust and about the effect factors, the statistic data of large amounts of rockburst are typical imbalanced data sets (IDS). On the basis of analyzing re-sampling technology, a novel hybrid re-sampling technique based on Automated Adaptive Selection of the Number of Nearest Neighbors (ADSNN-Hybrid RS) is proposed and applied to study the prediction of rockburst. This method takes advantage of both technology of improved Synthetic Minority Over-sampling Technique (SMOTE) method and Neighborhood Cleaning Rule (NCR) data cleaning method. In the procedure of over-sampling with the SMOTE method, blindfold new synthetic minority class examples by randomly interpolating pairs of closest neighbors were added into the minority class; and data sets with nominal features can not be dealt with. These two problems were solved by the automated adaptive selection of nearest neighbors and adjusting the neighbor selective strategy. As a consequence, the quality of the new samples can be well controlled. In the procedure of under-sampling, by using the improved under-sampling technique of neighborhood cleaning rule, borderline majority class examples and the noisy or redundant data were removed. The main motivation behind these methods is not only to balance the training data, but also to remove noisy examples lying on the wrong side of the decision border. The removal of noisy examples might aid in finding better-defined class clusters, therefore, allow the creation of simpler models with better generalization capabilities. In turn, it promises effective processing of IDS and a considerably enhanced classifier performance. The VCR rockburst data sets were employed as a sample IDS for classification and prediction. By adding extra artificial minority class samples as the expanded training set, experiment was conducted, which yields exactly consistent prediction results with the actual situation. The ADSNN-Hybrid RS and classification scheme we developed is feasible and reasonable for applications of IDS from engineering. Thus this method can be readily implemented to determine the controlling factors of engineering. Such a prediction can provide reasonable and sufficient guidance to design a safe construction scheme in deep mining engineering.

       

    • loading
    • Chawla, N.V., Bowyer, K.W., Hall, L.O., et al., 2002. SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16(3): 321-357.
      Chawla, N.V., Lazarevic, A., Hall, L.O., et al., 2003. SMOTEboost: improving prediction of the minority class in boosting. Lecture Notes in Computer Science, 2838: 107-119. doi. 10.1007/b13634 http://nd.edu/~dial/papers/ECML03.pdf
      Chen, H.J., Li, N.H., Nie, D.X., et al., 2002. A model for prediction of rockburst by artificial neural network. Chinese Journal of Geotechnical Engineering, 24(2): 229-232 (in Chinese with English abstract). http://en.cnki.com.cn/Article_en/CJFDTOTAL-YTGC200202023.htm
      Estabrooks, A., 2000. A combination scheme for inductive learning from imbalanced data sets. Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada. http://www.researchgate.net/publication/244448006_A_Combination_Scheme_for_Inductive_Learning_from_Imbalanced_Data_Sets
      Feng, X.T., 2000. Introduction to intelligent rock mechanics. Science Press, Beijing (in Chinese).
      Ge, Q.F., Feng, X.T., 2008. Classification and prediction of rockburst using AdaBoost combination learning method. Rock and Soil Mechanics, 29(4): 943-948 (in Chinese with English abstract). http://www.researchgate.net/publication/292298061_Classification_and_prediction_of_rockburst_using_AdaBoost_combination_learning_method
      Han, H., Wang, W.Y., Mao, B.H., 2005. Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. Lecture Notes in Computer Science, 3644(1): 878-887. doi. 10.1007/11538059_91 http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=09AE1A9AE1C42CFB8FAD9D86B2547B3B?doi=10.1.1.308.9315&rep=rep1&type=pdf
      Hart, P., 1968. The condensed nearest neighbor rule(Corresp. ). IEEE Transactions on Information Theory, 14(3): 515-516. doi: 10.1109/TIT.1968.1054155
      Jiang, T., Huang, Z.Q., Zhao, Y.Y., et al., 2003. Application of grey system optimal theory model in forecasting rockburst. Journal of North China Institute of Water Conservancy and Hydroelectric Power, 24(2): 37-40 (in Chinese with English abstract). http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=hbslsdxyxb200302012
      Kubat, M., Matwin, S., 1997. Addressing the curse of imbalanced training sets: one-sided selection. Proceedings of the Fourteenth International Conference on Machine Learning. Morgan Kaufmann Publishers, Inc., 179-186. http://ci.nii.ac.jp/naid/10012743635
      Laurikkala, J., 2001. Improving identification of difficult small classes by balancing class distribution. Lecture Notes in Computer Science, 2101: 63-66. doi. 10.1007/3-540-48229-6_9 http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=0E369DACF967C4C1A9853A603C6AB1E7?doi=10.1.1.309.2519&rep=rep1&type=pdf
      Quinlan, J.R., 1993. C4.5: programs for machine learning. Morgan Kaufmann. doi. 10.1007/BF00993309
      Stanfill, C., Waltz, D., 1986. Toward memory-based reasoning. Communications of the ACM, 29(12): 1213-1228. doi: 10.1145/7902.7906
      Tomek, I., 1976. Two modifications of CNN. IEEE Transactions on Systems, Man and Cybernetics, 6(6): 769-772. http://ieeexplore.ieee.org/document/4309452/references
      Yang, Y.C., Zhu, J., 2001. An matter-elements model and its application to classified prediction of rockburst. Systems Engineering—Theory & Practice, 21(8): 125-129 (in Chinese with English abstract). http://en.cnki.com.cn/Article_en/CJFDTOTAL-XTLL200108023.htm
      Yang, Z.M., Qiao, L.Y., Peng, X.Y., 2007. Research on detamining method for imbalanced dataset based on improved SMOTE. Acta Electronica Sinica, 35(12A): 22-26 (in Chinese with English abstract). http://www.cqvip.com/QK/71135X/201107/27251556.html
      Zhao, H.B., 2005. Classification of rockburst using support vector machine. Rock and Soil Mechanics, 26(4): 642-644(in Chinese with English abstract). http://en.cnki.com.cn/Article_en/CJFDTOTAL-YTLX200504029.htm
      陈海军, 郦能惠, 聂德新, 等. 2002. 岩爆预测的人工神经网络模型. 岩土工程学报, 24(2): 229-232. doi: 10.3321/j.issn:1000-4548.2002.02.023
      冯夏庭. 2000. 智能岩石力学导论. 北京: 科学出版社. https://www.cnki.com.cn/Article/CJFDTOTAL-YSLX902.024.htm
      葛启发, 冯夏庭. 2008. 基于AdaBoost组合学习方法的岩爆分类预测研究. 岩土力学, 29(4): 943-948. doi: 10.3969/j.issn.1000-7598.2008.04.017
      姜彤, 黄志全, 赵彦彦, 等. 2003. 灰色系统最优归类模型在岩爆预测中的应用. 华北水利水电学院学报, 24(2): 37-40. https://www.cnki.com.cn/Article/CJFDTOTAL-HBSL200302012.htm
      杨莹春, 诸静. 2001. 物元模型及其在岩爆分级预报中的应用. 系统工程理论与实践, 21(8): 125-129. https://www.cnki.com.cn/Article/CJFDTOTAL-XTLL200108023.htm
      杨智明, 乔立岩, 彭喜元. 2007. 基于改进SMOTE的不平衡数据挖掘方法研究. 电子学报, 35(12A): 22-26.
      赵洪波. 2005. 岩爆分类的支持向量机方法. 岩土力学, 26(4): 642-644. https://www.cnki.com.cn/Article/CJFDTOTAL-YTLX200504029.htm
    • 加载中

    Catalog

      通讯作者: 陈斌, bchen63@163.com
      • 1. 

        沈阳化工大学材料科学与工程学院 沈阳 110142

      1. 本站搜索
      2. 百度学术搜索
      3. 万方数据库搜索
      4. CNKI搜索

      Figures(2)  / Tables(2)

      Article views (3158) PDF downloads(71) Cited by()
      Proportional views

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return