Detailed Information

Cited 1 time in webofscience Cited 0 time in scopus
Metadata Downloads

Reinforced Disentangled HTML Representation Learning with Hard-Sample Mining for Phishing Webpage Detection

Authors
Yoon, Jun-HoBuu, Seok-JunKim, Hae-Jung
Issue Date
Mar-2025
Publisher
MDPI AG
Keywords
phishing detection; reinforcement learning-based sampling; disentangled representation learning; multimodal feature integration; cybersecurity applications
Citation
Electronics (Basel), v.14, no.6
Indexed
SCIE
SCOPUS
Journal Title
Electronics (Basel)
Volume
14
Number
6
URI
https://scholarworks.gnu.ac.kr/handle/sw.gnu/77709
DOI
10.3390/electronics14061080
ISSN
2079-9292
2079-9292
Abstract
Phishing webpage detection is critical in combating cyber threats, yet distinguishing between benign and phishing webpages remains challenging due to significant feature overlap in the representation space. This study introduces a reinforced Triplet Network to optimize disentangled representation learning tailored for phishing detection. By employing reinforcement learning, the method enhances the sampling of anchor, positive, and negative examples, addressing a core limitation of traditional Triplet Networks. The disentangled representations generated through this approach provide a clear separation between benign and phishing webpages, substantially improving detection accuracy. To achieve comprehensive modeling, the method integrates multimodal features from both URLs and HTML DOM Graph structures. The evaluation leverages a real-world dataset comprising over one million webpages, meticulously collected for diverse and representative phishing scenarios. Experimental results demonstrate a notable improvement, with the proposed method achieving a 6.7% gain in the F1 score over state-of-the-art approaches, highlighting its superior capability and the dataset's critical role in robust performance.
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Seok-Jun, Buu photo

Seok-Jun, Buu
IT공과대학 (컴퓨터공학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE