Malware detection based on mining API calls

Research Output

Financial loss due to malware nearly doubles every two years. For instance, in 2006 malware caused nearly 33.5 million GBP in direct financial losses to member organizations of banks in the UK alone. Recent malware cannot be detected by traditional signature-based anti-malware tools because of its polymorphic and/or metamorphic nature. Detecting malware based on its immutable characteristics has become a recent industrial practice, but the datasets involved are not public. As a result, the results are not reproducible and conducting research in an academic setting is difficult. In this work, we have not only significantly improved a recent method of malware detection based on mining Application Programming Interface (API) calls, but also created the first public dataset to promote malware research.

Our technique first reads the API call sets used in a collection of Portable Executable (PE) files, then generates a set of discriminative and domain-interpretable features. These features are then used to train a classifier to detect unseen malware. We achieved a detection rate of 99.7% while keeping accuracy as high as 98.3%. Our method improves on the state of the art in several respects: accuracy increased by 5.24%, detection rate increased by 2.51%, and the false alarm rate decreased from 19.86% to 1.51%. This project's data and source code can be found at http://home.shirazu.ac.ir/~sami/malware.
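The pipeline described above (API call sets, then discriminative features, then evaluation) can be illustrated with a minimal sketch. The abstract does not state how the features are generated, so the frequency-difference scoring below is a simple stand-in, not the authors' method. The function names, the toy samples, and the API names in them are all hypothetical. The metric formulas are the usual confusion-matrix definitions, which are assumed to match the paper's usage.

```python
from collections import Counter

def select_api_features(samples, labels, k):
    """Keep the k API names whose usage frequency differs most between classes.

    Assumption: a stand-in scoring rule, not the paper's feature generation.
    Requires at least one malware (1) and one benign (0) sample.
    """
    n_mal = sum(labels)
    n_ben = len(labels) - n_mal
    mal_counts, ben_counts = Counter(), Counter()
    for apis, is_malware in zip(samples, labels):
        (mal_counts if is_malware else ben_counts).update(apis)

    def score(api):
        # Absolute gap between the fraction of malware and benign files using the API.
        return abs(mal_counts[api] / n_mal - ben_counts[api] / n_ben)

    all_apis = set(mal_counts) | set(ben_counts)
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(all_apis, key=lambda a: (-score(a), a))[:k]

def to_vector(apis, features):
    """Binary feature vector: 1 if the file uses the API, else 0."""
    return [1 if f in apis else 0 for f in features]

def evaluate(y_true, y_pred):
    """Detection rate, false alarm rate, and accuracy from binary labels."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    return {
        "detection_rate": tp / (tp + fn) if tp + fn else 0.0,
        "false_alarm_rate": fp / (fp + tn) if fp + tn else 0.0,
        "accuracy": (tp + tn) / len(y_true),
    }

# Hypothetical toy data: each sample is the set of API names a PE file imports.
samples = [
    {"CreateRemoteThread", "WriteProcessMemory", "CreateFileW"},  # malware
    {"WriteProcessMemory", "RegSetValueExW", "CreateFileW"},      # malware
    {"CreateFileW", "ReadFile", "CloseHandle"},                   # benign
    {"CreateFileW", "GetMessageW", "CloseHandle"},                # benign
]
labels = [1, 1, 0, 0]

features = select_api_features(samples, labels, k=2)
vectors = [to_vector(s, features) for s in samples]
```

The resulting binary vectors could be passed to any standard classifier. `evaluate` then compares that classifier's predictions on held-out files against their true labels, giving the three kinds of figures quoted above.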

Date:

22 March 2010
Publication Status:

Published
Publisher

ACM Press
DOI:

10.1145/1774088.1774303
Cross Ref:

10.1145/1774088.1774303
Funders:

Historic Funder (pre-Worktribe)

http://researchrepository.napier.ac.uk/output/2925498

Citation

Sami, A., Yadegari, B., Rahimi, H., Peiravian, N., Hashemi, S., & Hamze, A. (2010). Malware detection based on mining API calls. In SAC '10: Proceedings of the 2010 ACM Symposium on Applied Computing (1020-1025). https://doi.org/10.1145/1774088.1774303

Authors

Prof Ashkan Sami

School of Computing Engineering and the Built Environment

Available Documents

Files are currently unavailable for download; please contact A.Sami@napier.ac.uk to request a copy.