A Novel Approach on Towards Effective Bug Triage and Improve the Quality of Bug Data

G Praveen, V. Sridhar Reddy, Shaik Abdul Nabi


In order to diminution the time cost in physical work, text classification practices are pragmatic to conduct automatic bug triage. Presently, software companies spend over 45 percent of cost in dealing with software bugs. A foreseeable step of fixing bugs is bug triage, which ambitions to decorously consign a developer to a new bug. In this scheme, it is addressed the problem of data reduction for bug triage, to reduce the scale and progress the reputation of bug data. It is here combined with instance selection with feature selection to simultaneously reduce data scale on the bug dimension and the word dimension. To define the order of applying instance selection and feature selection, it is extract attributes from historical bug data sets and build a predictive model for a new bug data set. It is practically scrutinized the enactment of data reduction on totally 600,000 bug reports of two large open source projects, namely Eclipse and Mozilla. The work provides an approach to leveraging techniques on data processing to form reduced and high-quality bug data in software development and maintenance.

Full Text:



J. Anvik, L. Hiew, and G. C. Murphy, “Who should fix this bug?” in Proc. 28th Int. Conf. Softw. Eng., May 2006, pp. 361–370.

S. Artzi, A. Kie_zun, J. Dolby, F. Tip, D. Dig, A. Paradkar, and M. D.Ernst, “Finding bugs in It isb applications using dynamic test generation and explicit-state model checking,” IEEE Softw., vol. 36, no. 4, pp. 474–494, Jul./Aug. 2010.

J. Anvik and G. C. Murphy, “Reducing the effort of bug report triage:Recommenders for development-oriented decisions,” ACM Trans. Soft. Eng. Methodol., vol. 20, no. 3, article 10, Aug. 2011.

C. C. Aggarwal and P. Zhao, “Towards graphical models for text processing,” Knowl. Inform. Syst., vol. 36, no. 1, pp. 1–21, 2013.

Bugzilla, (2014). [Online]. Avaialble: http://bugzilla.org/

K. Balog, L. Azzopardi, and M. de Rijke, “Formal models for expert finding in enterprise corpora,” in Proc. 29th Annu. Int. ACM SIGIR Conf. Res. Develop. Inform. Retrieval, Aug. 2006, pp. 43–50.

P. S. Bishnu and V. Bhattacherjee, “Software fault prediction using quad tree-based k-means clustering algorithm,” IEEE Trans. Knowl. Data Eng., vol. 24, no. 6, pp. 1146–1150, Jun. 2012.

H. Brighton and C. Mellish, “Advances in instance selection for instance-based learning algorithms,” Data Mining Knowl. Discovery, vol. 6, no. 2, pp. 153–172, Apr. 2002.

S. Breu, R. Premraj, J. Sillito, and T. Zimmermann, “Information needs in bug reports: Improving cooperation betIt isen developers and users,” in Proc. ACM Conf. Comput. Supported Cooperative Work, Feb. 2010, pp. 301–310.

V. Bol_on-Canedo, N. S_anchez-Maro~no, and A. Alonso-Betanzos, “A review of feature selection methods on synthetic data,” Knowl. Inform. Syst., vol. 34, no. 3, pp. 483–519, 2013.

V. Cerver_on and F. J. Ferri, “Another move toward the minimum consistent subset: A tabu search approach to the condensed nearest neighbor rule,” IEEE Trans. Syst., Man, Cybern., Part B, Cybern., vol. 31, no. 3, pp. 408–413, Jun. 2001.

D. _Cubrani_c and G. C. Murphy, “Automatic bug triage using text categorization,” in Proc. 16th Int. Conf. Softw. Eng. Knowl. Eng., Jun. 2004, pp. 92–97.

Eclipse. (2014). [Online]. Available: http://eclipse.org/

B. Fitzgerald, “The transformation of open sthece software,” MISQuart., vol. 30, no. 3,pp. 587–598, Sep. 2006. [32] D. Lo, J. Li, L. Wong, and S. C. Khoo, “Mining iterative generators and representative rules for software specification discovery,”IEEE Trans. Knowl. Data Eng., vol. 23, no. 2, pp. 282–296, Feb. 2011.

Mozilla. (2014). [Online]. Available: http://mozilla.org/

D. Matter, A. Kuhn, and O. Nierstrasz, “Assigning bug reports using a vocabulary-based expertise model of developers,” in Proc. 6th Int. Working Conf. Mining Softw. Repositories, May 2009, pp. 131–140.

G. Miao, L. E. Moser, X. Yan, S. Tao, Y. Chen, and N. Anerousis, “Generative models for ticket resolution in expert networks,” in Proc. 16th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining, 2010, pp. 733–742.

E. Murphy-Hill, T. Zimmermann, C. Bird, and N. Nagappan, “The design of bug fixes,” in Proc. Int. Conf. Softw. Eng., 2013, pp. 332–341.

J. A. Olvera-L_opez, J. A.Carrasco-Ochoa, J. F. Mart_ınez-Trinidad, and J. Kittler, “A review of instance selection methods,” Artif. Intell. Rev., vol. 34, no. 2, pp. 133–143, 2010.

J. A. Olvera-L_opez, J. F. Mart_ınez-Trinidad, and J. A. Carrasco- nOchoa, “Restricted sequential floating search applied to object selection,” in Proc. Int. Conf. Mach. Learn. Data Mining Pattern Recognit., 2007, pp. 694–702.

R. S. Pressman, Software Engineering: A Practitioner’s Approach, 7th ed. New York, NY, USA: McGraw-Hill, 2010.

J. W. Park, M. W. Lee, J. Kim, S. W. Hwang, and S. Kim, “Costriage: A cost-aware triage algorithm for bug reporting systems,” in Proc. 25th Conf. Artif. Intell., Aug. 2011, pp. 139–144.

J. C. Riquelme, J. S. Aguilar-Ru_ız, and M. Toro, “Finding representative patterns with ordered projections,” Pattern Recognit., vol. 36,pp. 1009–1018, 2003.

DOI: https://doi.org/10.23956/ijarcsse.v7i9.402


  • There are currently no refbacks.

© International Journals of Advanced Research in Computer Science and Software Engineering (IJARCSSE)| All Rights Reserved | Powered by Advance Academic Publisher.