Malicious Web Sites Detection using C4.5 Decision Tree

Zerina Mašetic, Abdulhamit Subasi, Jasmin Azemovic

Abstract


Advances in technology give cybercriminals new opportunities to commit online crimes such as identity theft, extortion, or simply spreading viruses and worms. A common aim of online criminals is to attract visitors to a Web site, which can be reached simply by clicking on a URL. Blacklisting does not appear to be an effective way of marking Web sites with “bad” content, since many malicious Web sites are never blacklisted. The aim of this paper is to evaluate the ability of the C4.5 decision tree classifier to detect malicious Web sites based on features that characterize their URLs. The classifier is evaluated using four performance criteria: accuracy, sensitivity, specificity and area under the ROC curve. The C4.5 decision tree classifier performed well in detecting malicious Web sites on all four criteria (accuracy 96.5%, sensitivity 96.4%, specificity 96.5% and area under the curve 0.958).
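The four evaluation criteria named in the abstract can be illustrated with a minimal sketch. The abstract does not give formulas or code, so the definitions below are the standard textbook ones, and several details are assumptions made for illustration only: the label encoding (1 = malicious, 0 = benign), the function names, and the toy labels and scores, which are not data from the paper.

```python
from typing import Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, tn, fn), assuming 1 = malicious and 0 = benign."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of all sites classified correctly."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + fp + tn + fn)


def sensitivity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of malicious sites that were flagged (true positive rate)."""
    tp, _, _, fn = confusion_counts(y_true, y_pred)
    return tp / (tp + fn)


def specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of benign sites that were left unflagged (true negative rate)."""
    _, fp, tn, _ = confusion_counts(y_true, y_pred)
    return tn / (tn + fp)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise ranking.

    Equals the probability that a randomly chosen malicious site receives
    a higher score than a randomly chosen benign one (ties count as 0.5).
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data, invented for illustration only (not from the paper).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.2, 0.7]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # hypothetical 0.5 threshold

print(accuracy(y_true, y_pred), sensitivity(y_true, y_pred),
      specificity(y_true, y_pred), roc_auc(y_true, scores))
```

Reporting sensitivity and specificity alongside accuracy matters when the classes are imbalanced, because accuracy alone can look high for a classifier that misses most malicious sites.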

Keywords


Malicious Web Sites; Blacklisting; URL; C4.5 Decision Tree


DOI: http://dx.doi.org/10.21533/scjournal.v5i1.109


Copyright (c) 2016 Zerina Mašetic, Abdulhamit Subasi, Jasmin Azemovic

ISSN 2233-1859

Digital Object Identifier (DOI): 10.21533/scjournal

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License