본문 바로가기
  • Home

An Analysis of Machine-Learning Feature-Extraction Techniques using Syntactic Tagging for Cross-site Scripting Detection

  • Journal of Software Assessment and Valuation
  • Abbr : JSAV
  • 2022, 18(1), pp.107-118
  • DOI : 10.29056/jsav.2022.06.13
  • Publisher : Korea Software Assessment and Valuation Society
  • Research Area : Engineering > Computer Science
  • Received : May 6, 2022
  • Accepted : June 20, 2022
  • Published : June 30, 2022

Talib Nurul Atiqah Abu 1 Doh, Kyung-Goo 2

1한양대학교 ERICA
2한양대학교

Accredited

ABSTRACT

Working for a strategy to ensure web application security has become more complex as it is not only to protect against the more challenging cross-site scripting (XSS) attacks but also to assist the open expressiveness of web applications to provide users with interactive services. The feature extractions used in supervised machine learning to detect XSS as an approach to the strategy are now in question of their effective classification. Their lack of preserving structural information may not uphold the property of structured data in input payloads. We apply the concept of syntactic n-grams to a payload text representation. The study of different feature extractions on the representation is to see if syntactic information is maintained. Our purpose is to determine the more effective approach to correctly classify benign and malicious payloads on a real-world dataset. The use of sn-grams that produces the most favourable results of accuracy and precision would only indicate that the extraction is reasonably able to minimize false reports during classification.

Citation status

* References for papers published after 2023 are currently being built.