NOTE: this is not a complete bibliography. In particular, descriptions of stemmers for languages other than English are mostly not included. 数据挖掘研究院
Adamson, G.W. & Boreham, J., 1974: "The use of an association measure based on character structure to identify semantically related pairs of words and document titles," Information Processing & Management 10(7/8), 253-260.
Church, K.W., 1995: "One term or two?" in E.A. Fox, P. Ingwersen & R. Fidel (eds.), Proceedings of the 18th ACM SIGIR conference held at Seattle, WA, July 9-13, 1995; pp.310-318.
Dawson, J.L., 1974: "Suffix removal for word conflation," Bulletin of the Association for Literary & Linguistic Computing, 2(3), 33-46. 数据挖掘研究院
Frakes, W.B. & Baeza-Yates, R., 1992: Information Retrieval: Data Structures & Algorithms. Englewood Cliffs, NJ: Prentice-Hall. Chapter 8.
Hafer, M.A. & Weiss, S.F., 1974: "Word segmentation by letter successor varieties", Information Processing & Management 10(11/12), 371-386.
Harman, D., 1991: "How effective is suffixing?" Journal of the American Society for Information Science 42 (1), 7-15.
Hull, D.A., 1996: "Stemming algorithms: a case study for detailed evaluation", Journal of the American Society for Information Science, 47(1), 70-84. 数据挖掘实验室
Kraaij, W. & Pohlmann, R., 1996: "Viewing stemming as recall enhancement," in H-P. Frei, D. Harman, P. Schauble & R. Wilkinson (eds.), Proceedings of the 17th ACM SIGIR conference held at Zurich, August 18-22, 1996; pp.40-48.
Krovetz, R., 1993: "Viewing morphology as an inference process", in R. Korfhage, E. Rasmussen & P. Willett (eds.), Proceedings of the 16th ACM SIGIR conference held at Pittsburgh, PA, June 27-July 1, 1993; pp.191-202. 数据挖掘研究院
Lennon, M., Pierce, D.S., Tarry, B.D. and Willett, P., 1981: "An evaluation of some conflation algorithms for information retrieval". Journal of Information Science 3, 177-183. 数据挖掘研究院
Lovins, J.B., 1968: "Development of a stemming algorithm". Mechanical Translation and Computational Linguistics 11, 22-31.
Lovins, J.B., 1971: "Error evaluation for stemming algorithms as clustering algorithms," Journal of the American Society for Information Science, 22(1), 28-40.
Paice, C.D., 1990: "Another stemmer", SIGIR Forum, 24(3), 56-61 (Fall 1990). 数据挖掘研究院
Paice, C.D., 1994: "An evaluation method for stemming algorithms", in Croft, W.B. & van Rijsbergen, C.J. (eds.), Proceedings of the 17th ACM SIGIR conference held at Dublin, July 3-6, 1994; pp. 42-50.
Paice, C.D., 1996: "A method for the evaluation of stemming algorithms based on error counting," Journal of the American Society for Information Science, 47(8), 632-649. 数据挖掘研究院
Popovic, M. and Willett, P., 1992: "The effectiveness of stemmng for natural language access to Slovene textual data," Journal of the American Society for Information Science, 43(5), 384-390. 数据挖掘研究院
Porter, M.F., 1980: "An algorithm for suffix stripping". Program 14,130-137.
Savoy, J., 1993: "Stemming of French words based on grammatical categories" Journal of the American Society for Information Science, 44(1), 1-9. 数据挖掘研究院
Ulmschneider, J.E. & Doszkocs, 1983: "A practical stemming algorithm for online search assistance," Online Review, 7(4), 301-318.
Xu, J. & Croft, W.B., 1998: "Corpus-based stemming using coocurrence of word variants," ACM Transactions on Information Systems, 16(1), 61-81.
数据挖掘研究院

