Journal of Information Science and Engineering, Vol. 29 No. 1, pp. 35-48 (January 2013)


Visualization of Document Retrieval using External Cluster Relationship


TAECHANG JEE¹, HYUNJIN LEE² AND YILLBYUNG LEE¹
¹School of Computer Science
Yonsei University
Seoul, 120-749 Korea
²Department of Computers, Information and Communication
Korea Soongsil Cyber University
Seoul, 110-340 Korea

Existing document retrieval systems are constrained by their visualization schemes and typically show only document titles, short summaries, and keywords. This makes it difficult to examine multiple results at once or to find meaningful relationships among them. This study proposes a new method for the real-time visualization of document retrieval results via clustering. The method groups similar documents into clusters, making the relationships among the retrieved documents easier to understand. This study also proposes a two-level visualization algorithm. At the first level, cluster centers are projected onto a two-dimensional space using multidimensional scaling to illustrate the relationships among clusters. At the second level, individual documents are displayed at locations determined by the external cluster relationship in the low-dimensional space, so that individual documents can be compared. The method was tested on benchmark data and real-world data, and the results show that search results can be visualized in real time.
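
The two-level idea can be illustrated with a minimal sketch. It is not the authors' implementation, because the abstract does not give the exact formulas. The sketch assumes that cluster centers and labels are already available from some clustering step, uses classical (Torgerson) multidimensional scaling computed by power iteration, and interprets the "external cluster relationship" as a pull toward the other clusters' 2-D positions, proportional to the document's cosine similarity to their centers. The function names, the `spread` parameter, and this placement rule are all hypothetical.

```python
import math


def classical_mds_2d(dist, iters=200):
    """Embed n points in 2-D from a symmetric n x n distance matrix
    using classical MDS (assumed variant; the paper may use another)."""
    n = len(dist)
    sq = [[dist[i][j] ** 2 for j in range(n)] for i in range(n)]
    row_mean = [sum(r) / n for r in sq]
    grand_mean = sum(row_mean) / n
    # Double-centred Gram matrix B = -1/2 * J * D^2 * J.
    b = [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean)
          for j in range(n)] for i in range(n)]
    coords = [[0.0, 0.0] for _ in range(n)]
    for axis in range(2):
        # Deterministic start vector, different per axis so the second pass
        # does not start inside the direction that was already removed.
        v = [float(i + 1) ** (axis + 1) for i in range(n)]
        scale = math.sqrt(sum(x * x for x in v))
        v = [x / scale for x in v]
        for _ in range(iters):  # power iteration for the top eigenvector
            w = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            v = [x / norm for x in w]
        bv = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
        lam = sum(v[i] * bv[i] for i in range(n))  # Rayleigh quotient
        if lam <= 1e-12:
            continue  # no variance left for this axis
        for i in range(n):
            coords[i][axis] = math.sqrt(lam) * v[i]
        # Deflate so the next pass finds the next eigenvector.
        for i in range(n):
            for j in range(n):
                b[i][j] -= lam * v[i] * v[j]
    return coords


def cosine(a, b):
    """Cosine similarity, returning 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0


def place_documents(doc_vecs, labels, centers, center_xy, spread=0.3):
    """Place each document at its own cluster's 2-D position, shifted toward
    the other clusters in proportion to its similarity to their centers.
    This rule is an assumption made for illustration only."""
    k = len(centers)
    positions = []
    for vec, own in zip(doc_vecs, labels):
        x, y = center_xy[own]
        if k > 1:
            for other in range(k):
                if other == own:
                    continue
                sim = max(0.0, cosine(vec, centers[other]))
                x += spread * sim * (center_xy[other][0] - center_xy[own][0]) / (k - 1)
                y += spread * sim * (center_xy[other][1] - center_xy[own][1]) / (k - 1)
        positions.append([x, y])
    return positions


def visualize(doc_vecs, labels, centers):
    """Level 1: MDS on cluster centers. Level 2: place documents around them."""
    dist = [[math.dist(p, q) for q in centers] for p in centers]
    center_xy = classical_mds_2d(dist)
    return center_xy, place_documents(doc_vecs, labels, centers, center_xy)
```

Because multidimensional scaling runs only on the cluster centers, whose number is typically far smaller than the number of documents, the expensive projection step stays small. Each document is then placed with one similarity computation per cluster. This split is one plausible reason a scheme of this kind can run in real time.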

Keywords: clustering, information visualization, on-line computation, text mining, document visualization


Received May 31, 2011; accepted March 31, 2012.
Communicated by Francisco J. Garcia-Penalvo, Ricardo Colomo-Palacios and Jane Yung-Jen Hsu.