As of 2022, the number of Internet sites for public institutions registered on the ‘Government 24’ website (www.gov.kr) of the Ministry of the Interior and Safety is 17,000. The direct transfer takes a lot of human and material resources and time between the records-producing institution and the records-management institution that manages websites as records. In addition, it is practically difficult for records management institutions to migrate and operate various software and application technologies required to run each website. A method of automatically collecting websites from a remote location using web crawler software is used domestically and abroad to overcome these practical limitations. This study compared the performance of the web crawler required to collect and manage public Internet websites as records remotely. The most suitable web crawler was selected through a step-by-step review of several web crawlers from previous studies and other literature. Several public agency websites were applied to compare the actual performance of the crawlers in the evaluation process. The study provides empirical and specific performance comparison information for organizations that need to choose a web crawler.
@article{ART002892954}, author={장진호 and 권혁상 and 이규모 and CHOI,DONG-JOON}, title={Comparison of Web Crawler Performance for Web Record Management}, journal={The Korean Journal of Archival Studies}, issn={1229-7941}, year={2022}, number={74}, pages={155-186}, doi={10.20923/kjas.2022.74.155}
TY - JOUR AU - 장진호 AU - 권혁상 AU - 이규모 AU - CHOI,DONG-JOON TI - Comparison of Web Crawler Performance for Web Record Management JO - The Korean Journal of Archival Studies PY - 2022 VL - null IS - 74 PB - Korean Society Of Archival Studies SP - 155 EP - 186 SN - 1229-7941 AB - As of 2022, the number of Internet sites for public institutions registered on the ‘Government 24’ website (www.gov.kr) of the Ministry of the Interior and Safety is 17,000. The direct transfer takes a lot of human and material resources and time between the records-producing institution and the records-management institution that manages websites as records. In addition, it is practically difficult for records management institutions to migrate and operate various software and application technologies required to run each website. A method of automatically collecting websites from a remote location using web crawler software is used domestically and abroad to overcome these practical limitations. This study compared the performance of the web crawler required to collect and manage public Internet websites as records remotely. The most suitable web crawler was selected through a step-by-step review of several web crawlers from previous studies and other literature. Several public agency websites were applied to compare the actual performance of the crawlers in the evaluation process. The study provides empirical and specific performance comparison information for organizations that need to choose a web crawler. KW - Web Record Management;Remote Collection Method;Web Crawler Performance Comparison DO - 10.20923/kjas.2022.74.155 ER -
장진호, 권혁상, 이규모 and CHOI,DONG-JOON. (2022). Comparison of Web Crawler Performance for Web Record Management. The Korean Journal of Archival Studies, 74, 155-186.
장진호, 권혁상, 이규모 and CHOI,DONG-JOON. 2022, "Comparison of Web Crawler Performance for Web Record Management", The Korean Journal of Archival Studies, no.74, pp.155-186. Available from: doi:10.20923/kjas.2022.74.155
장진호, 권혁상, 이규모, CHOI,DONG-JOON "Comparison of Web Crawler Performance for Web Record Management" The Korean Journal of Archival Studies 74 pp.155-186 (2022) : 155.
장진호, 권혁상, 이규모, CHOI,DONG-JOON. Comparison of Web Crawler Performance for Web Record Management. 2022; 74 : 155-186. Available from: doi:10.20923/kjas.2022.74.155
장진호, 권혁상, 이규모 and CHOI,DONG-JOON. "Comparison of Web Crawler Performance for Web Record Management" The Korean Journal of Archival Studies no.74(2022) : 155-186.doi: 10.20923/kjas.2022.74.155
장진호; 권혁상; 이규모; CHOI,DONG-JOON. Comparison of Web Crawler Performance for Web Record Management. The Korean Journal of Archival Studies, 74, 155-186. doi: 10.20923/kjas.2022.74.155
장진호; 권혁상; 이규모; CHOI,DONG-JOON. Comparison of Web Crawler Performance for Web Record Management. The Korean Journal of Archival Studies. 2022; 74 155-186. doi: 10.20923/kjas.2022.74.155
장진호, 권혁상, 이규모, CHOI,DONG-JOON. Comparison of Web Crawler Performance for Web Record Management. 2022; 74 : 155-186. Available from: doi:10.20923/kjas.2022.74.155
장진호, 권혁상, 이규모 and CHOI,DONG-JOON. "Comparison of Web Crawler Performance for Web Record Management" The Korean Journal of Archival Studies no.74(2022) : 155-186.doi: 10.20923/kjas.2022.74.155