Reproduction of C. Pornprasit; C. Kla Tantithamthavorn, DeepLineDP: Towards a Deep Learning Approach for Line-Level Defect Prediction (2023).
LineDP: Predicting Defective Lines Using a Model-Agnostic Technique
- Jakub Tkaczyk
- Karol Waliszewski
- Mikołaj Macioszczyk
Steps to replicate the study:
- Access the shared file 🛠️ Main - with our research - DeepLineDp.
- Create a copy of that file:
File -> Save a copy in Drive
. - On your copy of
Main - with our research - DeepLineDP
, execute all the tasks. They include data preprocessing, training DeepLineDP, RandomForest, XGBoost, LighGBM models, predicting results, top-k tokens investigation and comparison of the classifiers based on MCC, BA metrics. Predictions are stored as .csv files (one file per release of the library) located in./comparisonModelsExperiment/output
, split by model folders:prediction/DeepLineDP
,RF-line-level-result
,XGB-line-level-result
,LGBM-line-level-result
. The comparison results of the classifiers with metrics are located in./comparisonModelsExperiment/output/figure
. - The workflow asks you to connect a google drive. This is optional but will provide better user experience - it's possible to download all of the results and figures.
- The next step is the evaluation results: Recall@Top20LOC, Effort@Top20Recall and IFA. After that the top-k investigation will be performed and at the end MCC, BA metrics will be calculated.
- The
./comparisonModelsExperiment/output
folder will be copied to your google drive ascomparisonModelsExperimentFigure.zip
. You can then download the file from google drive and unzip on your local machine. - The evaluation results are located in
./figure
folder.