Document Type : Research Paper

Authors

1 Department of Knowledge and Information Science, Kharazmi University, Tehran, Iran

2 Department of Knowledge and Information Science, Shahed University, Tehran, Iran

3 Department of Information Systems Research, Iranian Research Institute for Information Science and Technology (IranDoc), Tehran, Iran

Abstract

Introduction

Regarding the increase in digital images and easy access to digital cameras, image processing and retrieval have become an important research field. Image retrieval is one of the important subfields of information retrieval that usually uses different techniques and models than text retrieval. The main expectation of users of information retrieval systems is to find relevant resources among thousands of resources available in these systems. Creating a scientific map of articles in the field of image retrieval using ontology can provide awareness of the status of published research; it can also help to show thematic relationships, identify influential topics, mature, emerging, and undeveloped topics, thematic gaps, and create appropriate scientific policies in this field.

Literature Review

A small number of studies have been conducted in the field of image retrieval using ontology, none of which have examined the hierarchical diagram. In the field of scientometric research, such as Daniali, Naghshineh, and Fadaei (2017); Daniali and Naghshineh (2018); Azimi and Jozi (2014); Ghanbari et al. (2014); Liu et al. (2021); Jo (2024); Khan et al. (2024). Scientometric studies often focus on assessing publication patterns, citation networks, co-authorship, and research productivity. These studies help researchers understand the structure and evolution of the scientific literature in a given field.

Methodology

In terms of type, the present study is in the category of applied research in which scientometric techniques and social network analysis have been used. The research community consists of those studies in the field of information retrieval with ontology that have been indexed in the Scopus database from the beginning to the end of 2024.
Based on the formula in the search, 716 indexed articles were found in Scopus in the desired field. To be more precise, the statistical population of this study consists of all 716 published articles in the field of image retrieval using ontology.
After retrieving relevant records and integrating data, based on the research objectives and questions, data analysis was carried out using BibExcel, Gephi, Excel, and SPSS, and the maps were created by VOSviewer software. In order to draw thematic maps and analyze them correctly, keywords were controlled and standardized by creating a Thesaurus in the software. In such a way that identical and similar keywords and plural and singular forms were merged and non-specialized and searched keywords were removed. In order to classify words in published documents based on semantic similarity using algorithms such as Euclidean distance and..., hierarchical clustering is usually used. Hierarchical clustering was performed using SPSS software. In order to implement and achieve the analysis, requirements such as a co-occurrence matrix must first be prepared, and then the co-occurrence matrix must be converted into a correlation matrix. The statistical population of the present study was the entire population. In order to perform a more accurate synonym analysis and final synonym analysis, the matrix was called through SPSS software, and the regular matrix was converted into a correlation matrix by SPSS software. The correlation matrix that was based on the obtained cognates frequency matrix, clusters and hierarchical were drawn using hierarchical clustering using the Ward method and squared Euclidean distance. Among the 716 retrieved articles, keywords with a frequency of 11 and more were selected for the research to prepare the matrix, and finally a square matrix of 142 by 142 was formed for the research.
The diagonal cells of the matrices were considered zero and then these ordinary matrices were converted into a correlation matrix. Finally, the clustering of concepts was drawn based on statistical software (SPSS version 26). Finally, to draw a scatter diagram and identify the development and maturity status of the topics, the frequency matrix of each cluster was drawn separately, then their correlation matrix was drawn, and with the help of the strategic diagram, using the density and centrality of each cluster, their coherence and maturity were calculated. In the next step, a strategic diagram of thematic clusters was drawn; the strategic diagram describes the internal relationship and correlation between the different thematic clusters. Excel, Gephi, Babel Excel, and SPSS software were used to analyze the data, and Word Viewer, Excel, and SPSS software were used to draw the diagram.

Results

The results showed that among the keywords, the keyword image retrieval is in the first place with a frequency of 572. Among them, the keywords ontology and semantics are in the second and third places, respectively. Also, the analysis of the findings related to the synonym of image retrieval using ontology took the form of six thematic areas. Infrastructures and fundamental technologies of image retrieval, techniques of concepts and image analysis, intelligent machines and applications in image retrieval, web concepts and search based on ontology, managerial and human aspects in information retrieval, quality improvement and query processing in image retrieval.
The results obtained from the hierarchical diagram formed four topic clusters. Also, the findings from the strategic map of image retrieval topics using ontology indicate that cluster 1 was placed in the first part due to its high centrality and density. These clusters are higher in centrality and density. Clusters 2 and 4 are placed in the second part. The clusters that are located in the second part of the strategic picture are regional clusters. But they are developed. Cluster 3 is located in the fourth part. The clusters in the fourth part are the main ones, but are undeveloped and immature.

Discussion

With the help of scientometrics, a macro picture of the state of research and how different domains are related, can be presented. Co-occurrence analysis led to the formation of six clusters. Fundamental infrastructures and technologies of image retrieval, semantic techniques and image content analysis, machine learning and intelligent applications in image retrieval, semantic web and ontology-based search, managerial and human aspects of information retrieval, quality improvement and query processing in image retrieval. Among the existing concepts, some of them have received the highest number of citations: such as image retrieval, machine learning, computer vision, deep learning, database systems, digital libraries, Internet, WordNet, artificial intelligence, and information retrieval. A hierarchical diagram of four thematic clusters was formed: semantic-based image retrieval, intelligent image retrieval with learning algorithms, semantic image retrieval with annotations, and intelligent image retrieval.

Conclusion

Studies in the field of image retrieval using ontology as a useful tool for effective retrieval can play an effective role. Most studies are in the field of ontology and ontology construction, but no specific research has specifically addressed this area. In this study, the emphasis is on image retrieval, but the high importance of this area, the recognition of the components of this area, and the impact of ontology on semantic retrieval require researchers to focus on these issues, and the results of this study indicate that very little attention has been paid to this area, especially in Iran. Scientific maps are a suitable method for displaying the increasing growth of scientific activities and organizing the intellectual and scientific structure that constitutes a thematic domain. Researchers, science policymakers, and other interested parties can advance their own goals and advance with greater awareness in this field by being aware of this structure.

Keywords

Main Subjects

Alipour, O., Soheili, F., Ziaei, S., & Khasseh, A. A. (2021). Co-authorship network analysis of knowledge organization articles in Iran. Knowledge Retrieval and Semantic Systems, 13(49), 1–24. https://doi.org/10.22054/jks.2022.69907.1532 [In Persian]
Azimi, M. H., & Jozi, Z. (2023). Scientometrics and subject cluster analysis of ontology research in information retrieval. Caspian Journal of Scientometrics, 10(2), 54–66. http://dx.doi.org/10.22088/cjs.4.2.53 [In Persian]
Berners-Lee, T., Hendler, J., & Lassila, O. (2023). The Semantic Web: A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities. In Linking the world's information: Essays on Tim Berners-Lee's invention of the World Wide Web (pp. 91–103). Association for Computing Machinery.
Bitters, B. (2005). A geographical ontology of objects in the visible domain [Doctoral dissertation, The Florida State University]. ProQuest Dissertations & Theses Global.
Cure, O. (2003). Mapping databases to ontologies to design and maintain data in a semantic web environment. Journal of Systemics, Cybernetics and Informatics, 1(5), 1–6. http://www.iiisci.org/journal/cvs/sci/pdfs/p704935.pdf
Daniali, S., & Naqshineh, N. (2018). Studying the research trend and drawing a knowledge map of active research domains in the image retrieval field based on articles indexed in the Web of Science from 2001-2012. Scientometrics Journal, 4(1), 119–142. https://doi.org/10.22070/rsci.2018.612 [In Persian]
Daniali, S., Naqshineh, N., & Fadaei, G. (2017). Drawing a co-occurrence map of image retrieval terms based on articles indexed in the Web of Science database. Caspian Journal of Scientometrics, 4(2), 53–61. https://doi.org/10.22088/cjs.4.2.53 [In Persian]
Azimi, M. H. & Esmaeili, S. (2023). Drawing the intellectual structure of knowledge in the field of RDF. Knowledge Retrieval and Semantic Systems, 15(57), 1–26. https://doi.org/10.22054/jks.2022.69907.1532 [In Persian]
Farhadi, M., & Jamzadeh, M. (2017). Content-based image retrieval using segmentation similarity criterion. Yaranshi Sciences, 3(1), 3–12. [In Persian]
Farshid, R., Soheili, F., Gholami, H., & Geraee, E. (2020). Analyzing the study areas of gastric cancer using hierarchical clustering method. Health Information Management, 17(3), 133–139. https://doi.org/10.22122/him.v17i3.4117 [In Persian]
Ghanbari, M., Voghofi, O., & Hajiani, E. (2024). Scientometric analysis of research on future images in the world's most prestigious scientific publications. Strategic Studies of Culture, 3(4), 89–108. https://doi.org/10.22083/scsj.2024.429841.1133 [In Persian]
Ghasemian, A., Nojavan, F., Asnafi, A., & Zohorifar, F. (2025). Analysis of research outputs in the field of psychology and their presence in scientific social media. Knowledge Retrieval and Semantic Systems, 12(42), 61–90. https://doi.org/10.22054/jks.2022.67962.1504 [In Persian]
Haghani, M. (2023). What makes an informative and publication-worthy scientometric analysis of literature: A guide for authors, reviewers and editors. Transportation Research Interdisciplinary Perspectives, 22, Article 100956. https://doi.org/10.1016/j.trip.2023.100956
Homavandi, H., Fahimnia, F., Nakhoda, M., & Hosseini Beheshti, M. (2020). A study on ontology building methods: Understanding of the features and requirements. Journal of Academic Librarianship and Information Research, 54(1), 13–39. https://doi.org/10.22059/jlib.2020.303455.1493 [In Persian]
Jafari Baghiabadi, S., & Farshid, R. (2021). Studying of research related to COVID-19 vaccine in Iran and the world: A thematic analysis and scientific collaborations. Iranian Journal of Medical Microbiology, 15(4), 414–457. https://doi.org/10.30699/ijmm.15.4.414 [In Persian]
Ju, F. (2024). Mapping the knowledge structure of image recognition in cultural heritage: A scientometric analysis using CiteSpace, VOSviewer, and Bibliometrix. Journal of Imaging, 10(11), Article 272. https://doi.org/10.3390/jimaging10110272
Khan, U., Khan, H. U., Iqbal, S., & Munir, H. (2024). Four decades of image processing: A bibliometric analysis. Library Hi Tech, 42(1), 180–202. https://doi.org/10.1108/LHT-10-2021-0351
Khasseh, A. A., Mokhtari, H., & Asheghi Moaf, M. (2022). Information retrieval in Iran: A scientometric study and scientific visualization. Knowledge Retrieval and Semantic Systems, 9(33), 1–36. https://doi.org/10.22054/jks.2022.64246.1476 [In Persian]
Li, J., Goerlandt, F., & Reniers, G. (2021). An overview of scientometric mapping for the safety science community: Methods, tools, and framework. Safety Science, 134(1), Article 105093. https://doi.org/10.1016/j.ssci.2020.105093
Liu, W., Wu, H., Hu, K., Luo, Q., & Cheng, X. (2021). A scientometric visualization analysis of image captioning research from 2010 to 2020. IEEE Access, 9, 156799–156817. https://doi.org/10.1109/ACCESS.2021.3129782
Liu, Y., Zhang, D., Lu, G., & Ma, W. Y. (2007). A survey of content-based image retrieval with high-level semantics. Pattern Recognition, 40(1), 262–282. https://doi.org/10.1016/j.patcog.2006.04.045
Manzoor, U., Balubaid, M. A., Zafar, B., Umar, H., & Khan, M. S. (2015). Semantic image retrieval: An ontology based approach. International Journal of Advanced Research in Artificial Intelligence, 4(4), 1–8. https://doi.org/10.14569/IJARAI.2015.040401
Mezaris, V., Kompatsiaris, I., & Strintzis, M. G. (2004). Region-based image retrieval using an object ontology and relevance feedback. EURASIP Journal on Advances in Signal Processing, 2004(6), Article 870494. https://doi.org/10.1155/S1110865704401188
Moosavi, S. S., Farshid, R., & Jafari Baghi Abadi, S. (2021). The role of medical and health archives in scientific research from a scientometrics perspective. Iranian Journal of Medical Microbiology, 15(5), 508–536. https://doi.org/10.30699/ijmm.15.5.508 [In Persian]
Sanatjoo, A. (2012). The performance of ontologies in information retrieval systems. Book of Generalities, 16(2), 43–47. [In Persian]
Sharif, A. (2008). Application of ontologies in knowledge management system. Library and Information Sciences, 11(3), 97–116. [In Persian]
Smith, J. R. (2001). Quantitative assessment of image retrieval effectiveness. Journal of the American Society for Information Science and Technology, 52(11), 969–979. https://doi.org/10.1002/asi.1162
Taheri, S. M., & Shokrzadeh, N. (2025). Mapping the scientific outputs of Allameh Tabataba'i University from 1974 to 2024. Knowledge Retrieval and Semantic Systems, 12(42), 23–60. https://doi.org/10.22054/jks.2024.81904.1671 [In Persian]
Yu, D., Xu, Z., Kao, Y., & Lin, C. T. (2017). The structure and citation landscape of IEEE Transactions on Fuzzy Systems (1994–2015). IEEE Transactions on Fuzzy Systems, 26(2), 430–442. https://doi.org/10.1109/TFUZZ.2017.2672732