Downloads: 112 | Views: 125
Research Paper | Computer Science & Engineering | India | Volume 3 Issue 9, September 2014
N-Gram Analysis in SVM Training Phase Reduction Using Dataset Feature Filtering for Malware Detection
Pagidimarri Venu | Dasu Vaman Ravi Prasad 
Abstract: An n-gram is a sub-sequence of n items from a given sequence. Various areas of statistical natural language processing and genetic sequence analysis are using N-gram Analysis. In which sequence analysis is the process of comparing the sequence or series of attributes in order to find the similarity. Malicious software that is designed by attackers for disturbing computers is called as malware. The principal belong to the same family of malware eventhough Malware variants will have distinct byte level representations. The byte level content is different because small changes to the malware source code can result in significantly different compiled object code. In which programs are used as operational code (opcode) density histograms obtained through dynamic analysis. The process of testing and evaluation of application or a program during running time is called as dynamic analysis. A SVM is used for classification or regression problems. Kernel trickis a technique by SVM to transform your data and then based on these transformations it finds an optimal boundary between the possible outputs. We employ static analysis to classify malware which is identified a prefilter stage using hex values of files, that can reduce the feature set and therefore reduce the training effort. The result shows that the relationships between features are complex and simple statistics filtering approaches do not provide a Practical approach. One of the approach, hex decimal based produces a suitable filter. The entire system will be implemented in WEKA tool.
Keywords: n-gram analysis, malware variants, kernel trick, SVM, WEKA tool
Edition: Volume 3 Issue 9, September 2014,
Pages: 550 - 554
Similar Articles with Keyword 'SVM'
Survey Paper, Computer Science & Engineering, India, Volume 11 Issue 8, August 2022Pages: 947 - 949
COVID-19 Prediction using Machine Learning Algorithms
Saily Suresh Patil
Research Paper, Computer Science & Engineering, India, Volume 11 Issue 11, November 2022Pages: 629 - 634
A Face Spoof Detection using Feature Extraction and SVM
Lovely Pal  | Renuka Singh