Document Type : Research Paper

Authors

1 Department of Information Technology Management, Faculty of Management, Central Tehran Branch, Islamic Azad University, Tehran, Iran

2 Department of Industrial Management, Faculty of Management, Central Tehran Branch, Islamic Azad University, Tehran, Iran

Abstract

Today, data is one of the most valuable assets of organizations and industries and plays an important role in the development and progress of businesses. Organizations collect data from many sources, one of which is the web, where users and automated robots around the world produce and publish large volumes of data every day. Examining and analyzing such data can provide useful information and knowledge for an organization. To this end, various tools have been developed over the past decades that greatly help in extracting information from the web, including Python libraries such as Requests, Selenium, Scrapy, and Beautiful Soup. However, each of these libraries faces challenges. In this article, we study the Selenium library and, among its many challenges, propose a solution for time management that addresses its asynchrony challenge. Our experiments show that the proposed solution increases the accuracy of the information retrieved from the web, thereby mitigating the asynchrony challenge, and also reduces the time needed to retrieve that information.

Keywords

Main Subjects