Журнал «Современная Наука»

Russian (CIS)English (United Kingdom)
MOSCOW +7(495)-142-86-81

WEBSITE ANALYSIS SOFTWARE DEVELOPMENT FOR THE LEAKAGE OF PERSONAL DATA

Purtov Daniil Vladimirovich  (student, MIREA — Russian Technological University )

Purtov Vladimir Sergeevich  (art director LLC "Elora" )

Shmitko Kirill Andreevich  (student, MIREA — Russian Technological University )

Rusakov Aleksej Mixajlovich  (senior lecturer, MIREA — Russian Technological University )

Melnikov Aleksey Olegovich  (docent, MIREA — Russian Technological University)

Filatov Vyacheslav Valerievich  (docent, MIREA — Russian Technological University )

This article presents a study on developing a software tool, Web-PD-Scanner, which aims to analyze web pages in HTML format to detect potential personal data leakage. The article provides an overview of modern software tools for parsing web resources, as well as a review of HTML-page parsing technologies and their limitations. The relevance of the proposed study is substantiated, and the object, subject of research, scope, and limitations of the software are defined. The main tasks to be performed by the software are formulated, and various mathematical methods, algorithms, and software tools that can be used to develop the Web-PD-Scanner software are identified. The article concludes that a hybrid approach that combines rule-based algorithms and machine learning is the most effective solution for detecting leaks of personal data on websites. The next stage of the research involves defining a model for storing aggregated personal data and selecting specific methods and algorithms for developing the Web-PD-Scanner software. This study provides valuable insights for researchers and practitioners interested in developing software tools for analyzing web pages for personal data leakage.

Keywords:web scraping, data mining, HTML parsing, personal data protection, software development

 

Read the full article …



Citation link:
Purtov D. V., Purtov V. S., Shmitko K. A., Rusakov A. M., Melnikov A. O., Filatov V. V. WEBSITE ANALYSIS SOFTWARE DEVELOPMENT FOR THE LEAKAGE OF PERSONAL DATA // Современная наука: актуальные проблемы теории и практики. Серия: Естественные и Технические Науки. -2023. -№05. -С. 97-104 DOI 10.37882/2223-2966.2023.05.29
LEGAL INFORMATION:
Reproduction of materials is permitted only for non-commercial purposes with reference to the original publication. Protected by the laws of the Russian Federation. Any violations of the law are prosecuted.
© ООО "Научные технологии"