Deobfuscating Name Scrambling as a Natural Language Generation Task

Duboue, Pablo Ariel

dc.creator	Duboue, Pablo Ariel
dc.date	2018-09
dc.date	2018-11-14T14:57:48Z
dc.identifier	http://sedici.unlp.edu.ar/handle/10915/70714
dc.identifier	http://47jaiio.sadio.org.ar/sites/default/files/ASAI-12.pdf
dc.identifier	issn:2451-7585
dc.description	We are interested in data-driven approaches to Natural Language Generation, but semantic representations for human text are difficult and expensive to construct. By considering a methods implementation as weak semantics for the English terms extracted from the method’s name we can collect massive datasets, akin to have words and sensor data aligned at a scale never seen before. We applied our learned model to name scrambling, a common technique used to protect intellectual property and increase the effort necessary to reverse engineer Java binary code: replacing all the method and class names by a random identifier. Using 5.6M bytecode-compiled Java methods obtained from the Debian archive, we trained a Random Forest model to predict the first term in the method name. As features, we use primarily the opcodes of the bytecodes (that is, bytecodes without any parameters). Our results indicate that we can distinguish the 15 most popular terms from the others at 78% recall, helping a programmer performing reverse engineering to reduce half of the methods in a program they should further investigate.
dc.description	Sociedad Argentina de Informática e Investigación Operativa
dc.format	application/pdf
dc.format	77-90
dc.language	en
dc.rights	http://creativecommons.org/licenses/by-sa/3.0/
dc.rights	Creative Commons Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
dc.subject	Ciencias Informáticas
dc.subject	random forest model
dc.subject	bytecodes
dc.subject	Natural language
dc.title	Deobfuscating Name Scrambling as a Natural Language Generation Task
dc.type	Objeto de conferencia
dc.type	Objeto de conferencia

Este ítem pertenece a la siguiente institución

Universidad Nacional de La Plata (Argentina)