Title: Latent Semantic Analysis for Text Segmentation
Authors: Freddy Y.Y. Choi ; Peter Wiemer-Hastings ; Johanna Moore
Date: Jun 2001
Publication Title: Proceedings of Empirical Methods in Natural Language Processing (EMNLP)
Publication Type: Conference Paper
Publication Status: Published
This paper describes a method for linear text segmentation that matches or exceeds the accuracy of state-of-the-art methods (Utiyama and Isahara, 2001; Choi, 2000a). Inter-sentence similarity is estimated by latent semantic analysis (LSA). Boundary locations are discovered by divisive clustering. Test results show that LSA is a more accurate similarity measure than the cosine metric (van Rijsbergen, 1979).
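To make the pipeline in the abstract concrete, here is a minimal sketch of the two stages: an inter-sentence similarity matrix, then one divisive split. It is an illustration under stated assumptions, not the authors' implementation. The similarity used is the plain cosine metric over raw term counts, which is the baseline the abstract compares against. An LSA variant would, presumably, compute the same cosine over sentence vectors projected into a reduced latent space. The function names (`cosine`, `similarity_matrix`, `best_split`), the whitespace tokenizer, and the split criterion (mean within-segment similarity) are all assumptions made for this example.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_matrix(sentences):
    """Pairwise cosine similarity; whitespace tokenization is an assumption."""
    vecs = [Counter(s.lower().split()) for s in sentences]
    return [[cosine(u, v) for v in vecs] for u in vecs]


def best_split(sim):
    """One divisive step: pick the boundary b (1 <= b < n) that maximizes
    the mean similarity inside the segments [0, b) and [b, n).
    This criterion is an illustrative assumption."""
    n = len(sim)
    best_b, best_density = None, -1.0
    for b in range(1, n):
        inside = sum(sim[i][j] for i in range(b) for j in range(b))
        inside += sum(sim[i][j] for i in range(b, n) for j in range(b, n))
        area = b * b + (n - b) * (n - b)
        density = inside / area
        if density > best_density:
            best_b, best_density = b, density
    return best_b


# Toy input: two sentences about one topic, then two about another.
sentences = [
    "the cat sat on the mat",
    "the cat chased the mouse",
    "stock prices fell sharply today",
    "investors sold stock as prices fell",
]
sim = similarity_matrix(sentences)
boundary = best_split(sim)
print(boundary)  # index of the first sentence of the second segment
```

A full divisive clustering would apply `best_split` recursively to the resulting segments until a stopping criterion is met; that criterion is not given in the abstract, so it is left out here.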
