نوع مقاله : مقاله پژوهشی
نویسندگان
1 دانشجوی کارشناسیارشد، علم اطلاعات و دانششناسی، دانشگاه تربیت مدرس، تهران، ایران.
2 استادیار، گروه علم اطلاعات و دانششناسی، دانشگاه تربیت مدرس، تهران، ایران.
3 دانشیار، گروه علم اطلاعات و دانششناسی، دانشگاه تربیت مدرس، تهران، ایران.
چکیده
ظهور وب معنایی در جهت تحقق بازیابی معنایی اطلاعات است و در حال حاضر در دادههای پیوندی تجلی یافتهاست. کتابخانهها در تولید و مدیریت دادههای مستند و معتبر فراوانی نقش دارند و میتوانند نقشی مؤثر در نظامهای اطلاعاتی پیشرو ایفا کنند و میتوانند با اجرایی کردن دادههای پیوندی گامی در این مسیر بردارند. هدف از انجام این پژوهش ارائه چارچوبی برای انتشار و تبدیل سرعنوانهای موضوعی فارسی مورداستفاده کتابخانه ملی ایران بهصورت دادههای پیوندی و ایجاد پیوند با مجموعه دادهای مشابه است. پژوهش حاضر از نوع کاربردی است؛ با استفاده از روش کتابخانهای به طراحی چارچوبی برای انتشار سرعنوانهای موضوعی پرداخته و برای اطمینان از امکان انتشار دادهها، روش موردنظر مورد پیادهسازی قرار گرفتهاست. بدین ترتیب ابتدا دادههای موضوعی فارسی مورد پاکسازی و ویرایش قرار گرفتند، سپس با نرمافزار اپنریفاین به آر.دی.اف تبدیل شدند و با سرعنوانهای موضوعی کتابخانه کنگره پیوند دریافت کردند. دادههای موردمطالعه پس از نگاشت به اسکاس به یک فایل آر.دی.اف در قالب ترتل تبدیل شدند. فایل تبدیلشده ابتدا وارد مخزن آر.دی.اف جینا فوسکی شد و سپس در رابط کاربری اسکاسموس در محیط وب نمایش داده شد. بهطورکلی این چارچوب میتواند در فرایند انتشار دادههای مستند کتابخانه ملی در قالب دادههای پیوندی مورداستفاده قرار گیرد. در این چارچوب امکان برقراری پیوند با مجموعه دادههای مشابه نیز در نظر گرفته شدهاست و پیادهسازی نمونهای از دادهها با موفقیت انجام پذیرفت.
کلیدواژهها
موضوعات
عنوان مقاله [English]
A Framework for Transforming the Persian Subject Headings into Linked Data
نویسندگان [English]
- Zeynab Sabbaghi Bidgoli 1
- Atefeh Sharif 2
- Fatemeh Zandian 3
1 M.Sc. Student in Knowledge and Information Science, Tarbiat Modares University, Tehran, Iran.
2 Assistant Professor, Knowledge and Information Science, Tarbiat Modares University, Tehran, Iran.
3 Associate Professor, Knowledge and Information Science, Tarbiat Modares University, Tehran, Iran.
چکیده [English]
Introduction
The emergence of the web facilitated the retrieval of information. This made libraries as one of the most important centers of information considering the web for the information retrieval process. However, the fast change of the web leads to the transformation of library functions. The semantic web is an opportunity for libraries to change their functions. Linked data as a method in the semantic web can make a major change in library functions. It can improve the discoverability, visibility, and interoperability of the resources. For example, all libraries use authority controls for organizing their information. But using authority controls in a traditional way can be challenging. Therefore, using the web can help libraries tackle these potential challenges and problems. Transforming authority data into linked data which seems an innovative and faster way for finding the resources can be a step forward for libraries and users. This paper aims to design a framework for transforming the National Library of Iran Subject Headings into linked data and publish them on the web.
Literature Review
Designing and proposing a framework for linking the data was the topic of some research papers. Linking the university data (Behkamal et al., 2011) linking and visualizing medicine information (Sekhavati, Farahi, & Jalali, 2011) web objects (Hosseini, 2020), table data (Mulwad et al., 2010), Industrial Data (Graube et al.,2012), and government data (Villazón-Terraza, Vilches-Blázquez, Corcho, & Gómez-Pérez, 2011; Mulwad, Finin, & Joshi, 2011) were the topics for some reviewed studies. The results of their studies indicated that in general, linked data could improve information retrieval. Implementing a linked data method in library data was discussed in some papers. Kar & Das (2020) designed a methodology for linking bibliographic information in a digital repository. Similarly, Ryan et al. (2015) examined the linking of place names in a dataset, transferring them into RDF and linking them with other similar datasets. Summers, et al (2008) provide a methodology for transferring subject headings into linked data. their results showed that transferring LCSH into SKOS affects information retrieval. The linking and publishing National Library of Iran data were also investigated by Eslami & Vaghefzadeh (2013). Fathian Dastgerdi et al (2020) tried to make a pattern for linking data in library systems. They examined the components which are needed for implementing the linked data method in library systems. Their result showed that using linked data in library systems affects the visibility of bibliographic metadata. Based on the reviewed studies, many international papers discussed publishing library linked data in theoretical and practical ways. Whereas studies done in Iran focusing on linked data mostly developed patterns and models for linking data (e.g., Fathian Dastgerrdi; 2020). Few Persian studies were done for publishing bibliographic data (e.g., Eslami & Vaghefzadeh, 2013; Sekhavati, 2011). Although there is a significant number of papers discussing linked data, the technical aspect for publishing and linking library data was rarely examined. To fill this gap, this study aims to develop a framework for publishing National Library of Iran subject headings which is unlike Fathian Dastgerdi et al., (2020) paper considers the technical tools and aspects and unlike Sekhavati’s (2011) paper examines the Persian subject headings.
Methodology
This research is an applied study that utilizes a library method for designing a publishing framework. Linked data was implemented to ensure the possibility of publishing the research data. First, Persian subject headings which are represented in Iran MARC format were obtained in Marc XML files From the National Library of Iran. Then the method for transferring and publishing the data was applied.
Results
The framework developed in this research collected National Library of Iran subject headings randomly. The selected data were first cleaned by Microsoft Excel and MarcEdit. In the next step, cleaned data were converted into RDF Using OpenRefine. The study’s project was imported to Open Refine software, linked with external datasets, and saved in a triple store. Finally, the linked subject headings were displayed through the Skosmos interface.
Discussion
Publishing library data as linked data is an example of utilizing Web 3 in library systems. National libraries worldwide have tried linking their data including subject headings with other datasets. However, there remains a gap in publishing linked Persian subject headings and to the best of the authors' knowledge it seems that no paper has pointed to technical aspects of implementing Persian subject headings.
Conclusion
The current paper has transformed the Persian subject headings into a linked dataset in an RDF turtle format. Then, it visualized the linked data in the Skosmos interface. But there can be some limitations to this study. Using OpenRefine was reported successfully in this paper, but it seems that there may be a problem in data with larger sizes. In conclusion, since this framework improve the retrieval of authority data in this research, it can be used for publishing National library of Iran subject headings.
کلیدواژهها [English]
- Linked Data
- the National Library of Iran Subject Heading
- Open Refine
- Data Publication