- Title
DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION.
- Authors
SÎRBU, ALEXANDRU-GABRIEL
- Abstract
Deployed JavaScript code goes through minification, in which variables are renamed to single-character names and whitespace is removed so that the files are smaller and load faster. As a result, the code becomes unintelligible and harder to analyze manually. Since JavaScript experts can still understand it, machine learning approaches to deobfuscating minified files are possible. We therefore propose a technique that finds a fitting name for each obfuscated variable, one that is both intuitive and meaningful given how the variable is used. The technique is based on a Sequence-to-Sequence model that generates the name character by character, so that it can cover all possible variable names. The proposed approach achieves an average exact name generation accuracy of 70.53%, outperforming the state of the art by 12%.
- Subjects
MACHINE learning; JAVASCRIPT programming language; RECURRENT neural networks; DEEP learning
- Publication
Studia Universitatis Babeș-Bolyai, Informatica, 2023, Vol 68, Issue 2, p5
- ISSN
1224-869X
- Publication type
Article
- DOI
10.24193/subbi.2023.2.01
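
The abstract describes generating each variable name character by character. The record does not say how names are encoded, so the following is only a minimal sketch of what a character-level target encoding for such a model might look like. The special tokens (`<pad>`, `<sos>`, `<eos>`), the ASCII-only alphabet, and the helper names `encode_name` and `decode_ids` are assumptions made for illustration and are not taken from the article.

```python
import string

# Hypothetical special tokens; the article may use different ones or none.
PAD, SOS, EOS = "<pad>", "<sos>", "<eos>"

# Assumed alphabet: the ASCII characters allowed in a JavaScript identifier.
ALPHABET = string.ascii_letters + string.digits + "_$"

VOCAB = [PAD, SOS, EOS] + list(ALPHABET)
CHAR_TO_ID = {token: index for index, token in enumerate(VOCAB)}


def encode_name(name, max_len=None):
    """Turn a variable name into a list of character ids.

    The sequence is wrapped in start/end markers and, if max_len is given,
    right-padded so that every target sequence has the same length.
    """
    ids = [CHAR_TO_ID[SOS]] + [CHAR_TO_ID[ch] for ch in name] + [CHAR_TO_ID[EOS]]
    if max_len is not None:
        if len(ids) > max_len:
            raise ValueError("name does not fit into max_len")
        ids += [CHAR_TO_ID[PAD]] * (max_len - len(ids))
    return ids


def decode_ids(ids):
    """Turn character ids back into a name, stopping at the end marker."""
    chars = []
    for index in ids:
        token = VOCAB[index]
        if token == EOS:
            break
        if token in (PAD, SOS):
            continue
        chars.append(token)
    return "".join(chars)


if __name__ == "__main__":
    target = encode_name("userCount", max_len=12)
    print(target)
    print(decode_ids(target))
```

Because the output vocabulary consists of single characters rather than whole names, a decoder trained on such targets can in principle emit any identifier over the alphabet, which is the coverage property the abstract refers to.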