Malware homology determination using visualized images and feature fusion.

Citation metadata

Date: Apr. 15, 2021
From: PeerJ Computer Science (Vol. 7)
Publisher: PeerJ. Ltd.
Document Type: Article
Length: 7,779 words
Lexile Measure: 1470L

Abstract:

Determining the family homology of malware has become a research hotspot as the number of malware variants is on the rise. However, existing studies on malware visualization determine homology based only on the global structure features of the executable, which allows creators of some malware variants to intentionally give them the same structure so that they are misclassified as the same family. We sought to develop a homology determination method that fuses global structure features and local fine-grained features based on malware visualization. Specifically, the global structural information of the malware executable file was converted into a bytecode image, and the opcode semantic information of the code segment was extracted by the n-gram feature model to generate an opcode image. We also propose a dual-branch convolutional neural network, which uses features of the opcode image and the bytecode image as the final basis for family classification. Our results demonstrate that the accuracy and F-measure of family homology classification based on the proposed scheme are 99.05% and 98.52%, respectively, which is better than the results from a single image feature or other major schemes.
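
The two image types named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the image width, and the grid size are hypothetical, and placing n-gram counts into pixels by hashing is an assumption made only for illustration, since the abstract does not say how the opcode image is laid out.

```python
import math
import zlib
from collections import Counter


def bytes_to_image(data: bytes, width: int = 16) -> list[list[int]]:
    """Lay raw bytes out row by row as a grayscale grid (one byte per pixel, 0-255)."""
    if width <= 0:
        raise ValueError("width must be positive")
    height = math.ceil(len(data) / width)
    # Zero-pad so the last row is complete.
    padded = data + bytes(height * width - len(data))
    return [list(padded[r * width:(r + 1) * width]) for r in range(height)]


def opcode_ngrams(opcodes: list[str], n: int = 3) -> Counter:
    """Count sliding-window n-grams over an opcode sequence."""
    return Counter(tuple(opcodes[i:i + n]) for i in range(len(opcodes) - n + 1))


def ngrams_to_image(counts: Counter, side: int = 8) -> list[list[int]]:
    """Map n-gram counts onto a side x side grid and scale them to 0-255.

    Assumption: each n-gram is assigned a pixel by a CRC32 hash of its text.
    The paper may use a different layout.
    """
    grid = [[0] * side for _ in range(side)]
    for gram, count in counts.items():
        idx = zlib.crc32(" ".join(gram).encode()) % (side * side)
        grid[idx // side][idx % side] += count
    peak = max(max(row) for row in grid)
    if peak == 0:
        return grid
    return [[value * 255 // peak for value in row] for row in grid]


# Toy inputs; real inputs would be an executable's bytes and its disassembled opcodes.
byte_img = bytes_to_image(bytes(range(20)), width=16)
counts = opcode_ngrams(["push", "mov", "call", "push", "mov", "call"], n=3)
op_img = ngrams_to_image(counts, side=4)
```

In a pipeline of the kind the abstract describes, the two grids would be fed to separate branches of a convolutional network whose features are fused before the family classifier. That network is omitted here.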

Source Citation

Gale Document Number: GALE|A658519231