In this paper, we propose a web document ranking method that uses topic modeling for effective information collection and classification. The proposed method is applied to a technique that avoids duplicated crawling when crawling at high speed. Through this technique, it is feasible to remove redundant documents, classify documents efficiently, and confirm that the crawler service is running. It enables rapid collection of many documents; the user can searc...