نوع مقاله : مقاله پژوهشی

نویسندگان

1 دانشجوی کارشناسی‌ارشد، علم اطلاعات و دانش‌شناسی، دانشگاه تربیت مدرس، تهران، ایران.

2 استادیار، گروه علم اطلاعات و دانش‌شناسی، دانشگاه تربیت مدرس، تهران، ایران.

3 دانشیار، گروه علم اطلاعات و دانش‌شناسی، دانشگاه تربیت مدرس، تهران، ایران.

چکیده

ظهور وب‌ معنایی در جهت تحقق بازیابی معنایی اطلاعات است و در حال حاضر در داده‌های پیوندی تجلی یافته‌است. کتابخانه‌ها در تولید و مدیریت داده‌های مستند و معتبر فراوانی نقش دارند و می‌توانند نقشی مؤثر در نظام‌های اطلاعاتی پیش‌رو ایفا کنند و می‌توانند با اجرایی کردن داده‌های پیوندی گامی در این مسیر بردارند. هدف از انجام این پژوهش ارائه چارچوبی برای انتشار و تبدیل سرعنوان‌های موضوعی فارسی مورداستفاده کتابخانه ملی ایران به‌صورت داده‌های پیوندی و ایجاد پیوند با مجموعه داده‌ای مشابه است. پژوهش حاضر از نوع کاربردی است؛ با استفاده‌ از روش کتابخانه‌ای به طراحی چارچوبی برای انتشار سرعنوان‌های موضوعی پرداخته و برای اطمینان از امکان انتشار داده‌ها، روش موردنظر مورد پیاده‌سازی قرار گرفته‌است. بدین ترتیب ابتدا داده‌های موضوعی فارسی مورد پاک‌سازی و ویرایش قرار گرفتند، سپس با نرم‌افزار اپن‌ریفاین به آر.دی.اف تبدیل شدند و با سرعنوان‌های موضوعی کتابخانه کنگره پیوند دریافت کردند. داده‌های موردمطالعه پس از نگاشت به اسکاس به یک فایل آر.دی.اف در قالب ترتل تبدیل شدند. فایل تبدیل‌شده ابتدا وارد مخزن آر.دی.اف جینا فوسکی شد و سپس در رابط کاربری اسکاسموس در محیط وب نمایش داده شد. به‌طورکلی این چارچوب می‌تواند در فرایند انتشار داده‌های مستند کتابخانه ملی در قالب داده‌های پیوندی مورداستفاده قرار گیرد. در این چارچوب امکان برقراری پیوند با مجموعه داده‌های مشابه نیز در نظر گرفته شده‌است و پیاده‌سازی نمونه‌ای از داده‌ها با موفقیت انجام پذیرفت.

کلیدواژه‌ها

موضوعات

عنوان مقاله [English]

A Framework for Transforming the Persian Subject Headings into Linked Data

نویسندگان [English]

  • Zeynab Sabbaghi Bidgoli 1
  • Atefeh Sharif 2
  • Fatemeh Zandian 3

1 M.Sc. Student in Knowledge and Information Science, Tarbiat Modares University, Tehran, Iran.

2 Assistant Professor, Knowledge and Information Science, Tarbiat Modares University, Tehran, Iran.

3 Associate Professor, Knowledge and Information Science, Tarbiat Modares University, Tehran, Iran.

چکیده [English]

Introduction
The emergence of the web facilitated the retrieval of information. This made libraries as one of the most important centers of information considering the web for the information retrieval process. However, the fast change of the web leads to the transformation of library functions. The semantic web is an opportunity for libraries to change their functions. Linked data as a method in the semantic web can make a major change in library functions. It can improve the discoverability, visibility, and interoperability of the resources. For example, all libraries use authority controls for organizing their information. But using authority controls in a traditional way can be challenging. Therefore, using the web can help libraries tackle these potential challenges and problems. Transforming authority data into linked data which seems an innovative and faster way for finding the resources can be a step forward for libraries and users. This paper aims to design a framework for transforming the National Library of Iran Subject Headings into linked data and publish them on the web.

Literature Review
Designing and proposing a framework for linking the data was the topic of some research papers. Linking the university data (Behkamal et al., 2011) linking and visualizing medicine information (Sekhavati, Farahi, & Jalali, 2011) web objects (Hosseini, 2020), table data (Mulwad et al., 2010), Industrial Data (Graube et al.,2012), and government data (Villazón-Terraza, Vilches-Blázquez, Corcho, & Gómez-Pérez, 2011; Mulwad, Finin, & Joshi, 2011) were the topics for some reviewed studies. The results of their studies indicated that in general, linked data could improve information retrieval. Implementing a linked data method in library data was discussed in some papers. Kar & Das (2020) designed a methodology for linking bibliographic information in a digital repository. Similarly, Ryan et al. (2015) examined the linking of place names in a dataset, transferring them into RDF and linking them with other similar datasets. Summers, et al (2008) provide a methodology for transferring subject headings into linked data. their results showed that transferring LCSH into SKOS affects information retrieval. The linking and publishing National Library of Iran data were also investigated by Eslami & Vaghefzadeh (2013). Fathian Dastgerdi et al (2020) tried to make a pattern for linking data in library systems. They examined the components which are needed for implementing the linked data method in library systems. Their result showed that using linked data in library systems affects the visibility of bibliographic metadata. Based on the reviewed studies, many international papers discussed publishing library linked data in theoretical and practical ways. Whereas studies done in Iran focusing on linked data mostly developed patterns and models for linking data (e.g., Fathian Dastgerrdi; 2020). Few Persian studies were done for publishing bibliographic data (e.g., Eslami & Vaghefzadeh, 2013; Sekhavati, 2011). Although there is a significant number of papers discussing linked data, the technical aspect for publishing and linking library data was rarely examined. To fill this gap, this study aims to develop a framework for publishing National Library of Iran subject headings which is unlike Fathian Dastgerdi et al., (2020) paper considers the technical tools and aspects and unlike Sekhavati’s (2011) paper examines the Persian subject headings.

Methodology
This research is an applied study that utilizes a library method for designing a publishing framework. Linked data was implemented to ensure the possibility of publishing the research data. First, Persian subject headings which are represented in Iran MARC format were obtained in Marc XML files From the National Library of Iran. Then the method for transferring and publishing the data was applied.

Results
The framework developed in this research collected National Library of Iran subject headings randomly. The selected data were first cleaned by Microsoft Excel and MarcEdit. In the next step, cleaned data were converted into RDF Using OpenRefine. The study’s project was imported to Open Refine software, linked with external datasets, and saved in a triple store. Finally, the linked subject headings were displayed through the Skosmos interface.

Discussion
Publishing library data as linked data is an example of utilizing Web 3 in library systems. National libraries worldwide have tried linking their data including subject headings with other datasets. However, there remains a gap in publishing linked Persian subject headings and to the best of the authors' knowledge it seems that no paper has pointed to technical aspects of implementing Persian subject headings.

Conclusion
The current paper has transformed the Persian subject headings into a linked dataset in an RDF turtle format. Then, it visualized the linked data in the Skosmos interface. But there can be some limitations to this study. Using OpenRefine was reported successfully in this paper, but it seems that there may be a problem in data with larger sizes. In conclusion, since this framework improve the retrieval of authority data in this research, it can be used for publishing National library of Iran subject headings.

کلیدواژه‌ها [English]

  • Linked Data
  • the National Library of Iran Subject Heading
  • Open Refine
  • Data Publication
بهکمال، بهشید، کاهانی، محسن، دادخواه، محبوبه، زرین کلام، فتانه و پایدار، صمد. (1390). ارائه چارچوبی برای انتشار مجموعه داده‌های فارسی به‌صورت داده‌های پیوندی روی وب. فنی مهندسی-دانشگاه آزاد مشهد، 4(1)، 1–19.
حسینی، الهه. (1399). چهارچوب معنایی برای یکپارچه‌سازی و بازیابی معنایی اشیای محتوایی وب: رویکرد داده پیوندی در بافت سرطان. پایان‌نامه دکتری، دانشگاه الزهرا، تهران.
سخاوتی، الهه. (1390). ارائه چارچوبی جهت انتشار اطلاعات کتابخانه‌ای بر پایه اصول داده‌های پیوندی. پایان‌نامه کارشناسی ارشد، دانشگاه پیام نور.
سخاوتی، الهه، فراهی، احمد و جلالی، مهرداد. (1390). ارائه چارچوبی جهت انتشار اطلاعات دارو بر پایه Linked data.. دومین همایش فناوری اطلاعات، حال، آینده. بازیابی از https://civilica.com/doc/130594
شریف، عاطفه. (1393). پیوندهای کور، چالشی در ایده داده‌های پیوندی: واکاوی سرعنوان‌های موضوعی فارسی. پردازش و مدیریت اطلاعات، 30 (1)، 223–244.
فتحیان دستگردی، اکرم، طاهری، سید مهدی، صنعت جو، اعظم و کاهانی، محسن. (1399). پیاده‌سازی روش داده‌های پیوندی در نظام‌های کتابخانه‌ای: بررسی مؤلفه‌های موردنیاز و ارائه یک الگو. کتابخانه‌های دیجیتالی: پردازش و سازماندهی اطلاعات و دانش، 7 (25)، 67-94.
Akbaridaryan, S., Khosravi, F., Ebrahimi, M. & Mahabadi, H. B. (2017). SKOSification of Trilingual Cultural Thesaurus (TCH) of National Library of Iran (NLI): A step in line with NLI’s Linked Data strategy. Presented at the IFLA WLIC 2016 – Columbus, OH – Connections. Collaboration. Community –Retrieved from https://library.ifla.org/id/eprint/2091
Apenīte, M. & Bojārs, U. (2021). National Library of Latvia Subject Headings as Linked Open Data. In R. Verborgh, A. Dimou, A. Hogan, C. d’Amato, I. Tiddi, A. Bröring, … M. Alam (Eds.). The Semantic Web: ESWC 2021 Satellite Events (pp. 33–37). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-80418-3_6
Bushman, B., Anderson, D. & Fu, G. (2015). Transforming the Medical Subject Headings into Linked Data: Creating the Authorized Version of MeSH in RDF. Journal of Library Metadata15(3–4), 157–176. https://doi.org/10.1080/19386389.2015.1099967
Candela, G., Escobar, P., Carrasco, R. C. & Marco-Such, M. (2020). Evaluating the quality of linked open data in digital libraries. Journal of Information Science, 016555152093095. https://doi.org/10.1177/0165551520930951
Eslami, S. & Vaghefzadeh, M. H. (2013). Publishing Persian linked data of National library and Archive of Iran. Retrieved 9 November 2022 from http://library.ifla.org/id/eprint/193/
Goswami, S. & Biswas, P. (2011). THE CONCEPT OF SEMANTIC WEB IN LIBRARY SERVICES. International Journal of Information Dissemination and Technology, 1(3), 165–170.
Graube, M., Pfeffer, J., Ziegler, J. & Urbas, L. (2012). Linked Data as Integrating Technology for Industrial Data: International Journal of Distributed Systems and Technologies3(3), 40–52. https://doi.org/10.4018/jdst.2012070104
Halla, M. (2013). Linked Data in Libraries: Library of Congress’ Bibliographic Framework Transition Initiative. Library Philosophy and Practice (e-Journal). 1015.
Hallo, M., Luján-Mora, S. & Trujillo, J. (2014). Transforming Library Catalogs into Linked Data. In ICERI2014 (pp. 1845–1853). Seville, Spain.
Hannemann, J. & Kett, J. (2010). Linked Data for Libraries. Presented at the World Library and Information Congress: 76th Ifla General Conference and Assembly, Gothenburg, Sweden: Ifla.
Hanson, E. M. (2014). A Beginner’s Guide to Creating Library Linked Data: Lessons from NCSU’s Organization Name Linked Data Project. Serials Review, 40(4), 251–258. https://doi.org/10.1080/00987913.2014.975887
Kamila, K. (2008). Application of Semantic Web in Modern Library and Information Services. GNIMS - International E-Journal on Library Science, 2. Retrieved 9 November 2022 from https://gnimswebsite.s3.ap-south-1.amazonaws.com/files/2022/02/20122439/December-2014.pdf#page=7
Kar, S. & Das, R. (2020). Publishing E-resources of Digital Institutional Repository as Linked Open Data: an experimental study. Library Philosophy and Practice (e-Journal), (4699). Retrieved 1 December 2022 from https://digitalcommons.unl.edu/libphilprac/4699?utm_source=digitalcommons.unl.edu%2Flibphilprac%2F4699&utm_medium=PDF&utm_campaign=PDFCoverPages
Kar, S. & Das, R. (2021). A Methodology for Transforming MARC21 Personal Name Authority Metadata into Linked Open Data with Integration of VIAF and LCNAF Datasets: An experimental study. Library Philosophy and Practice (e-journal), (5458).
Lampert, C. K. & Southwick, S. B. (2013). Leading to Linking: Introducing Linked Data to Academic Library Digital Collections. Journal of Library Metadata، 13(2–3)، 230–253. https://doi.org/‌10.1080/‌19386389.2013.826095.
Mulwad, V., Finin, T., Syed, Z. & Joshi, A. (2010). Using linked data to interpret tables. Proceedings of the the First International Workshop on Consuming Linked Data. co-located at ISWC 2010
Park, H. & Kipp, M. (2019). Library Linked Data Models: Library Data in the Semantic Web. Cataloging & Classification Quarterly, 57(5), 261–277. https://doi.org/10.1080/01639374.2019.1641171
Ryan, C., Grant, R., Carragáin, E. Ó., Collins, S., Decker, S. & Lopes, N. (2015). Linked data authority records for Irish place names. International Journal on Digital Libraries15(2–4), 73–85. https://doi.org/10.1007/s00799-014-0129-8
Southwick, S. B. (2015). A Guide for Transforming Digital Collections Metadata into Linked Data Using Open Source Technologies. Journal of LibraryMetadata, 15(1),1–35. https://doi.org/10.1080/19386389.2015.1007009
Summers, E., Isaac, A., Redding, C. & Krech, D. (2008, January). LCSH, SKOS and Linked Data.
Suominen, O., Ylikotila, H., Sini, P., Mikko, L. & Frosterus, M. (2015). Publishing SKOS vocabularies with Skosmos. Manuscript Submitted for Review. Retrieved from https://skosmos.org/publishing-skos-vocabularies-with-skosmos.pdf
Tian, C. T., Cole, T. W. & Yu, K. (2021). Name and Subject Heading Reconciliation to Linked Open Data Authorities using Virtual International Authority File and Library of Congress Linked Data Service APIs: A Case Study featuring Emblematica Online. Library Resources & Technical Services (ALA LRTS). Retrieved from https://scholarship.law.nd.edu/law_faculty_scholarship/1467
Zengenene, D., Casarosa, V. & Meghini, C. (2014). Towards a Methodology for Publishing Library Linked Data. In T. Catarci, N. Ferro, & A. Poggi (Eds.). Bridging Between Cultural Heritage Institutions. IRCDL 2013. Communications in Computer and Information Science, vol 385. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54347-0_10
 
References [In Persian]
Behkamal, B., Kahani, M., Dadkhah, M., Zarin Kalam, F. & Paydar, S. (2011). Providing a framework for publishing Persian datasets as linked data on the web. Technical and Engineering Mashhad Azad University, 4(1), 1-19. [In Persian]
Fathian Dastgerdi, A., Taheri, S. M., Sanatjo, A. & Kahani, M. (2019). implementing the linked data method in library systems: examining the required components and presenting a model. Digital libraries: processing and organizing information and knowledge. [In Persian]
Hosseini, E. (2019). A Semantic Framework for Semantic Integration and Retrieval of Web Content Objects: A Linked Data Approach in Cancer Tissue. Ph.D Dissertation, Al-Zahra University, Tehran. [In Persian]
Sekhavati, E. (2011). Providing a framework for publishing library information based on the principles of linked data (Master's thesis, Payam-e Noor university). [In Persian]
Sekhavati, E., Farahi, A. & Jalali, M. (2011). Implementing a Framework for Publishing Medciene Information Based on Linked Data. 2nd Information Technology, Present, Future. Retrieved from https://civilica.com/doc/130594 [In Persian]
Sharif, A. (2013). Blind links, a challenge in the idea of linked data: analysis of Persian subject headings. Information processing and management, 30(1), 223-244. [In Persian]eferences