Detailed Information

Cited 3 time in webofscience Cited 3 time in scopus
Metadata Downloads

Phishing Webpage Detection via Multi-Modal Integration of HTML DOM Graphs and URL Features Based on Graph Convolutional and Transformer Networks

Full metadata record
DC Field Value Language
dc.contributor.authorYoon, Jun-Ho-
dc.contributor.authorBuu, Seok-Jun-
dc.contributor.authorKim, Hae-Jung-
dc.date.accessioned2024-12-03T04:30:53Z-
dc.date.available2024-12-03T04:30:53Z-
dc.date.issued2024-08-
dc.identifier.issn2079-9292-
dc.identifier.issn2079-9292-
dc.identifier.urihttps://scholarworks.gnu.ac.kr/handle/sw.gnu/73988-
dc.description.abstractDetecting phishing webpages is a critical task in the field of cybersecurity, with significant implications for online safety and data protection. Traditional methods have primarily relied on analyzing URL features, which can be limited in capturing the full context of phishing attacks. In this study, we propose an innovative approach that integrates HTML DOM graph modeling with URL feature analysis using advanced deep learning techniques. The proposed method leverages Graph Convolutional Networks (GCNs) to model the structure of HTML DOM graphs, combined with Convolutional Neural Networks (CNNs) and Transformer Networks to capture the character and word sequence features of URLs, respectively. These multi-modal features are then integrated using a Transformer network, which is adept at selectively capturing the interdependencies and complementary relationships between different feature sets. We evaluated our approach on a real-world dataset comprising URL and HTML DOM graph data collected from 2012 to 2024. This dataset includes over 80 million nodes and edges, providing a robust foundation for testing. Our method demonstrated a significant improvement in performance, achieving a 7.03 percentage point increase in classification accuracy compared to state-of-the-art techniques. Additionally, we conducted ablation tests to further validate the effectiveness of individual features in our model. The results validate the efficacy of integrating HTML DOM structure and URL features using deep learning. Our framework significantly enhances phishing detection capabilities, providing a more accurate and comprehensive solution to identifying malicious webpages. © 2024 by the authors.-
dc.language영어-
dc.language.isoENG-
dc.publisherMDPI AG-
dc.titlePhishing Webpage Detection via Multi-Modal Integration of HTML DOM Graphs and URL Features Based on Graph Convolutional and Transformer Networks-
dc.typeArticle-
dc.publisher.location스위스-
dc.identifier.doi10.3390/electronics13163344-
dc.identifier.scopusid2-s2.0-85202694271-
dc.identifier.wosid001305724000001-
dc.identifier.bibliographicCitationElectronics (Basel), v.13, no.16-
dc.citation.titleElectronics (Basel)-
dc.citation.volume13-
dc.citation.number16-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaPhysics-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryPhysics, Applied-
dc.subject.keywordAuthorcyberspace security-
dc.subject.keywordAuthorgraph convolutional network-
dc.subject.keywordAuthormulti-modal integration-
dc.subject.keywordAuthorphishing webpage detection-
dc.subject.keywordAuthortransformer network-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Seok-Jun, Buu photo

Seok-Jun, Buu
IT공과대학 (컴퓨터공학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE