Vector Space Model-based Information Retrieval Systems at South Sumatera Regional Libraries

Authors

  • M. Akbar As Shiddiqi University of Indo Global Mandiri
  • A Sanmarino University of Indo Global Mandiri

Keywords:

Vector Space Model (VSM), Information Retrieval Systems, Retrieval Precision, South Sumatera Regional Libraries, Document Representation

Abstract

This study presents an overview of the research aimed at optimizing library information retrieval through the utilization of the Vector Space Model (VSM) method in a computer science context. Libraries, as publicly financed collections, provide extensive knowledge resources, eliminating the need for individual book purchases. However, the challenge lies in efficiently navigating the expanding library collections. To tackle this issue, the study employs information retrieval techniques, particularly the VSM method, which assesses term similarity by assigning weights to terms, enabling document and query representation as vectors. The relevance between documents and queries is measured through vector similarity. This approach, integrated with indexing, streamlines collection retrieval in libraries. Employing the Waterfall model for system development, the research outlines phases like analysis, design, coding, testing, and implementation. While effective, the model's rigidity in accommodating evolving requirements poses limitations. The VSM method's numerical representation of text documents facilitates precise similarity calculations, supported by TF-IDF values indicating term importance in documents relative to the corpus. The study further extends to system design using UML diagrams and a visitor interface, integrating VSM for efficient search functionality. Black-box testing confirms the robustness of the system components and interfaces. Overall, this research presents a systematic approach to enhance information retrieval in libraries, emphasizing the VSM's pivotal role in optimizing document searches within expansive collections.

Downloads

Download data is not yet available.

References

D. Xu and T. Miller, “A simple neural vector space model for medical concept normalization using concept embeddings,” J. Biomed. Inform., vol. 130, no. January, p. 104080, 2022, doi: 10.1016/j.jbi.2022.104080.

H. Du and Y. Bin Kang, “An open-source framework for ExpFinder integrating N-gram vector space model and μCO-HITS[Formula presented],” Softw. Impacts, vol. 8, no. March, p. 100069, 2021, doi: 10.1016/j.simpa.2021.100069.

K. D. Prasetya, Suharjito, and D. Pratama, “Effectiveness Analysis of Distributed Scrum Model Compared to Waterfall approach in Third-Party Application Development,” Procedia Comput. Sci., vol. 179, no. 2019, pp. 103–111, 2021, doi: 10.1016/j.procs.2020.12.014.

T. Thesing, C. Feldmann, and M. Burchardt, “Agile versus Waterfall Project Management: Decision model for selecting the appropriate approach to a project,” Procedia Comput. Sci., vol. 181, pp. 746–756, 2021, doi: 10.1016/j.procs.2021.01.227.

A. A. S. Gunawan, B. Clemons, I. F. Halim, K. Anderson, and M. P. Adianti, “Development of e-butler: Introduction of robot system in hospitality with mobile application,” Procedia Comput. Sci., vol. 216, no. 2019, pp. 67–76, 2022, doi: 10.1016/j.procs.2022.12.112.

G. Bergström et al., “Evaluating the layout quality of UML class diagrams using machine learning,” J. Syst. Softw., vol. 192, p. 111413, 2022, doi: 10.1016/j.jss.2022.111413.

H. Wu, “QMaxUSE: A new tool for verifying UML class diagrams and OCL invariants,” Sci. Comput. Program., vol. 228, p. 102955, 2023, doi: 10.1016/j.scico.2023.102955.

P. Danenas, T. Skersys, and R. Butleris, “Natural language processing-enhanced extraction of SBVR business vocabularies and business rules from UML use case diagrams,” Data Knowl. Eng., vol. 128, no. February, p. 101822, 2020, doi: 10.1016/j.datak.2020.101822.

Meiliana, I. Septian, R. S. Alianto, Daniel, and F. L. Gaol, “Automated Test Case Generation from UML Activity Diagram and Sequence Diagram using Depth First Search Algorithm,” Procedia Comput. Sci., vol. 116, pp. 629–637, 2017, doi: 10.1016/j.procs.2017.10.029.

Z. Daw and R. Cleaveland, “Comparing model checkers for timed UML activity diagrams,” Sci. Comput. Program., vol. 111, no. P2, pp. 277–299, 2015, doi: 10.1016/j.scico.2015.05.008.

F. Chen, L. Zhang, X. Lian, and N. Niu, “Automatically recognizing the semantic elements from UML class diagram images,” J. Syst. Softw., vol. 193, p. 111431, 2022, doi: 10.1016/j.jss.2022.111431.

D. Felicio, J. Simao, and N. Datia, “Rapitest: Continuous black-box testing of restful web apis,” Procedia Comput. Sci., vol. 219, no. 2022, pp. 537–545, 2023, doi: 10.1016/j.procs.2023.01.322.

H. Bostani and V. Moonsamy, “EvadeDroid: A Practical Evasion Attack on Machine Learning for Black-box Android Malware Detection,” Comput. Secur., p. 103676, 2021, doi: 10.1016/j.cose.2023.103676.

F. Pagano, A. Romdhana, D. Caputo, L. Verderame, and A. Merlo, “SEBASTiAn: A static and extensible black-box application security testing tool for iOS and Android applications,” SoftwareX, vol. 23, p. 101448, 2023, doi: 10.1016/j.softx.2023.101448.

C. Cronley et al., “Designing and evaluating a smartphone app to increase underserved communities’ data representation in transportation policy and planning,” Transp. Res. Interdiscip. Perspect., vol. 18, no. January, p. 100763, 2023, doi: 10.1016/j.trip.2023.100763.

Published

2023-07-31

How to Cite

Akbar As Shiddiqi, M., & Sanmarino, A. (2023). Vector Space Model-based Information Retrieval Systems at South Sumatera Regional Libraries. Journal of Computer Science Application and Engineering (JOSAPEN), 1(2), 49–53. Retrieved from https://journal.lenterailmu.com/index.php/josapen/article/view/15

Similar Articles

1 2 > >> 

You may also start an advanced similarity search for this article.