An Upgraded Blended Model for Human Protein Classification Using Fast Spec CNN with Multi-Head Attention and GAN Augmentation

Authors

DOI:

https://doi.org/10.56294/sctconf2024804

Keywords:

Human Protein Classification, Fast Spectral CNN, Multi-Head Attention, GAN Augmentation, Biomedical Image Analysis

Abstract

Introduction: proteins play a critical role in cellular functions, and the evaluation of protein patterns in microscope images is vital for biomedical research. This study introduces a pioneering hybrid framework for human protein classification, leveraging a combination of Fast Spectral Convolutional Neural Network (CNN) with Multi-Head Attention and GAN Augmentation. This innovative approach aims to mechanize the examination of microscope images containing mixed protein patterns, thereby accelerating biomedical research insights into human cells and diseases.

Method: the framework integrates spectral processing layers and attention mechanisms into the Fast Spec CNN architecture to enhance classification accuracy and interpretability. Through GAN augmentation, synthetic protein images are generated to complement the real dataset, bolstering model generalization and robustness. The Fast Spec CNN model, coupled with Multi-Head Attention, adeptly captures spectral features and discerns discriminative representations.

Results: the study achieved an impressive accuracy rate of 98,79 % on the Image segmentation of the Human Protein Atlas dataset, outperforming prior methodologies. The results underscore the efficacy of the suggested model in accurately classifying proteins across various hierarchical levels simultaneously. GAN augmentation enriches dataset variability and fortifies model resilience.

Conclusion: this study makes significant additions to automated biomedical image analysis, providing a valuable tool for the expedited exploration of human cells and diseases. The architectural flexibility of the emulate enables end-to-end processing of protein images, offering interpretable representations and profound insights into cellular structures and functions. Compared to earlier studies, such as UNet, DeepHiFam with ProtCNN, ProPythia, Protein Bert, ELM, and CNN, this framework performs better than others in terms of accuracy, achieving 98,79 %, the highest among the compared methodologies

References

1. Le T, Winsnes CF, Axelsson U, et al. Analysis of the Human Protein Atlas Weakly Supervised Single-Cell Classification competition. Nat Methods. 2022;19:1221–1229. doi: 10.1038/s41592-022-01606-z.

2. Hanhart D, Gossi F, Rapsomaniki MA, et al. ScLinear predicts protein abundance at single-cell resolution. Commun Biol. 2024;7:267. doi: 10.1038/s42003-024-05958-4.

3. Sandaruwan PD, Wannige CT. An improved deep learning model for hierarchical classification of protein families. PLoS ONE. 2021;16(10) . doi: 10.1371/journal.pone.0258625.

4. Sequeira AM, Lousa D, Rocha M. ProPythia: A Python package for protein classification based on machine and deep learning. Neurocomputing. 2022;484:172-182. doi: 10.1016/j.neucom.2021.07.102.

5. Brandes N, Ofer D, Peleg Y, Rappoport N, Linial M. ProteinBERT: a universal deep-learning model of protein sequence and function. Bioinformatics. 2022;38(8):2102–2110. doi: 10.1093/bioinformatics/btac020.

6. Mostafa FA, Mohamed Afify Y, Ismail RM, Lotfy Badr N. Protein Deep Learning Classification Using 3D Features. In: 2021 Tenth International Conference on Intelligent Computing and Information Systems (ICICIS); 2021 Dec 5-7; Cairo, Egypt. New York: IEEE; 2021. p. 462-466. doi: 10.1109/ICICIS52592.2021.9694247.

7. Zou C. Automatic Protein Sequences Classification Using Machine Learning Methods based on N-Gram Model. In: Proceedings of the 2023 4th International Conference on Machine Learning and Computer Application (ICMLCA '23); 2023 Dec 1-3; New York, NY, USA. New York: ACM; 2023. p. 936–940. doi: 10.1145/3650215.3650382.

8. Alquran H, Al Fahoum A, Zyout A, Abu Qasmieh I. A comprehensive framework for advanced protein classification and function prediction using synergistic approaches: Integrating bispectral analysis, machine learning, and deep learning. PLoS ONE. 2023;18(12) . doi: 10.1371/journal.pone.0295805.

9. Sandaruwan PD, Wannige CT. An improved deep learning model for hierarchical classification of protein families. PLoS ONE. 2021;16(10) . doi: 10.1371/journal.pone.0258625.

10. Cao J, Xiong L. Protein Sequence Classification with Improved Extreme Learning Machine Algorithms. BioMed Res Int. 2014;2014:103054. doi: 10.1155/2014/103054.

11. Zhang D, Kabuka M. Protein Family Classification from Scratch: A CNN Based Deep Learning Approach. IEEE/ACM Trans Comput Biol Bioinform. 2021;18(5):1996-2007. doi: 10.1109/TCBB.2020.2966633.

12. Chandra A, Tünnermann L, Löfstedt T, Gratz R. Transformer-based deep learning for predicting protein properties in the life sciences. eLife. 2023;12 . doi: 10.7554/eLife.82819.

13. Afify HM, Zanaty MS. A Comparative Study of Protein Sequences Classification-Based Machine Learning Methods for COVID-19 Virus against HIV-1. Appl Artif Intell. 2021;35(15):1733-1745. doi: 10.1080/08839514.2021.1991136.

14. Gelman S, Fahlberg SA, et al. Neural networks to learn protein sequence–function relationships from deep mutational scanning data. Proc Natl Acad Sci U S A. 2021;118(48) . doi: 10.1073/pnas.2104878118.

15. Satpute BS, Yadav R. An Efficient Machine Learning Technique for Protein Classification Using Probabilistic Approach. In: Kulkarni A, Satapathy S, Kang T, Kashan A, editors. Proceedings of the 2nd International Conference on Data Engineering and Communication Technology. Advances in Intelligent Systems and Computing, vol 828. Singapore: Springer; 2019. p. 399-411. doi: 10.1007/978-981-13-1610-4_41.

Downloads

Published

2024-01-01

How to Cite

1.
Savitha S, Kalai Vani Y, Najma U, Komala KV, Deepa VP, Jyothi NM. An Upgraded Blended Model for Human Protein Classification Using Fast Spec CNN with Multi-Head Attention and GAN Augmentation. Salud, Ciencia y Tecnología - Serie de Conferencias [Internet]. 2024 Jan. 1 [cited 2024 Nov. 21];3:804. Available from: https://conferencias.ageditor.ar/index.php/sctconf/article/view/965