Consumer electronics, particularly Android, has become the leading mobile ecosystem due to its accessibility and adaptability. However, the constant connectivity of Android apps makes them a target for malicious network attacks, leading to the potential theft of sensitive data and disruptions across various sectors, including the economy and healthcare. Developing a reliable, distributed malware detection system using training data from multiple sources is challenging. This is mostly due to privacy issues and a lack of consistent data. This paper proposes a semantic-driven Federated Learning (FL) approach using transformer-based transfer learning to defend against distributed malicious attacks. The semantic features of malicious scripts are examined using the Bidirectional Encoder Representations from the Transformers (BERT) model. Following that, Deep Neural Network (DNN) uses these semantic features for local training, resulting in local model updates for each client. After merging the local model updates from each client, the global server generates global weights and sends them to distant clients. The proposed approach is evaluated on two standard datasets, including CIC-AndMal2017 and CICMalDroid2020, and it obtains high detection accuracy of 99.38% and 99.14%, respectively. These findings encourage cybersecurity organizations to collaborate and develop a powerful distributed security system using private data. IEEE

Semantic-based Federated Defense for Distributed Malicious Attacks

Ullah Farhan;Cacciagrano Diletta;
2024-01-01

Abstract

Consumer electronics, particularly Android, has become the leading mobile ecosystem due to its accessibility and adaptability. However, the constant connectivity of Android apps makes them a target for malicious network attacks, leading to the potential theft of sensitive data and disruptions across various sectors, including the economy and healthcare. Developing a reliable, distributed malware detection system using training data from multiple sources is challenging. This is mostly due to privacy issues and a lack of consistent data. This paper proposes a semantic-driven Federated Learning (FL) approach using transformer-based transfer learning to defend against distributed malicious attacks. The semantic features of malicious scripts are examined using the Bidirectional Encoder Representations from the Transformers (BERT) model. Following that, Deep Neural Network (DNN) uses these semantic features for local training, resulting in local model updates for each client. After merging the local model updates from each client, the global server generates global weights and sends them to distant clients. The proposed approach is evaluated on two standard datasets, including CIC-AndMal2017 and CICMalDroid2020, and it obtains high detection accuracy of 99.38% and 99.14%, respectively. These findings encourage cybersecurity organizations to collaborate and develop a powerful distributed security system using private data. IEEE
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11581/484763
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact