Software Defect Trend Forecasting In Open Source Projects using A Univariate ARIMA Model and FBProphet
Michael Thomas Shrove, Emil Jovanov
Pages - 1 - 15     |    Revised - 31-03-2020     |    Published - 30-04-2020
Volume - 8   Issue - 1    |    Publication Date - April 2020
KEYWORDS
Software Engineering, Software Defects, Time Series Forecasting, ARIMA, FBProphet.
ABSTRACT
Our objective in this research is to provide a framework that gives project managers, business owners, and developers an effective way to forecast the trend in software defects within a software project in real time. With a mechanism for forecasting defects, these stakeholders can allocate the necessary resources at the right time to remove defects before they accumulate and ultimately lead to software failure. We show not only general trends in several open-source projects but also trends in daily, monthly, and yearly activity. Our results show that this forecasting method can be used up to 6 months out with an MSE of only 0.019. In this paper, we present our technique and methodologies for developing the inputs for the proposed model and the results of testing on seven open-source projects. Further, we discuss the prediction models, their performance, and their implementation using the FBProphet framework and the ARIMA model.
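As an illustration of the evaluation idea in the abstract (forecast a defect series, then score the forecast against held-out observations with MSE), the following is a minimal sketch in pure Python. It is not the authors' implementation: the function names, the ARIMA(1,1,0)-style simplification (one differencing step plus a least-squares AR(1) fit), and the sample defect counts are all assumptions made for illustration. The paper itself uses a full ARIMA model and FBProphet.

```python
from statistics import mean


def mse(actual, predicted):
    """Mean squared error between two equal-length sequences."""
    if len(actual) != len(predicted):
        raise ValueError("sequences must have equal length")
    return mean((a - p) ** 2 for a, p in zip(actual, predicted))


def fit_ar1(xs):
    """Least-squares fit of x[t] = c + phi * x[t-1]; returns (c, phi)."""
    prev, curr = xs[:-1], xs[1:]
    mean_prev, mean_curr = mean(prev), mean(curr)
    var = sum((p - mean_prev) ** 2 for p in prev)
    if var == 0:
        # Constant input: no autoregressive signal, fall back to the mean.
        return mean_curr, 0.0
    cov = sum((p - mean_prev) * (c - mean_curr) for p, c in zip(prev, curr))
    phi = cov / var
    return mean_curr - phi * mean_prev, phi


def forecast_arima_110(series, steps):
    """Hypothetical helper: difference once, fit AR(1), forecast, integrate."""
    diffs = [b - a for a, b in zip(series, series[1:])]
    c, phi = fit_ar1(diffs)
    level, last_diff = series[-1], diffs[-1]
    forecast = []
    for _ in range(steps):
        last_diff = c + phi * last_diff   # predict the next difference
        level += last_diff                # undo the differencing
        forecast.append(level)
    return forecast


# Hypothetical cumulative open-defect counts per month (not data from the paper).
history = [10, 12, 13, 15, 18, 20, 21, 23, 26, 28, 29, 31]
train, test = history[:-3], history[-3:]

predicted = forecast_arima_110(train, len(test))
error = mse(test, predicted)
print(f"forecast={predicted}, MSE={error:.3f}")
```

The hold-out split mirrors the usual way a forecast horizon is scored: the model sees only `train`, and the MSE is computed on the final observations it never saw. An MSE is only comparable across projects when the series are on the same scale, so any normalization applied before scoring needs to be stated alongside the number.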
Mr. Michael Thomas Shrove
Millennium Corporation, Huntsville - United States of America
tshrove@gmail.com
Dr. Emil Jovanov
ECE Department, University of Alabama, Huntsville - United States of America