Courses in Summer term 2006/7 / Seminar on Text Mining / readings:
readings
List of readings (ee = link to electronic edition; ask me for the other references):
- -- Introduction --
-
-- Named Entity Recognition.
Speaker: ---
- [ee] Bernardo Magnini, Matteo Negri, Roberto Prevete , Hristo Tanev (2002): A WordNet-based approach to Named Entities recognition Association for Computational Linguistics, pp. 1-7.
Further Reading (optionally): -
Wed. 06.06 Google-PageRank.
Speaker: Dennis Holzberg
- [ee] Sergey Brin, Lawrence Page (1998): The anatomy of a large-scale hypertextual Web search engine WWW7: Proceedings of the seventh international conference on World Wide Web 7, pp. 107--117.
Further Reading (optionally): -
Wed. 13.06 Word Sense Disambiguation.
Speaker: Carsten Witzke
- [ee] Ted Pedersen, Satanjeev Banerjee, Siddharth Patwardhan (2005): Maximizing Semantic Relatedness to Perform Word Sense Disambiguation Research Report UMSI 2005/25, University of Minnesota Supercomputing Institute.
Further Reading (optionally): -
Wed. 20.06 Text Summarization.
Speaker: Dominik Lubian
- [ee] Regina Barzilay, Kathleen R. McKeown (2005): Sentence Fusion for Multidocument News Summarization Comput. Linguist., pp. 297--328.
Further Reading (optionally): -
Wed. 27.06 Sentiment Analysis.
Speaker: Benedikt Nienhaus
- [ee] Bo Pang, Lillian Lee (2004): A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts ACL '04: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, pp. 271-279.
Further Reading (optionally): -
Wed. 27.06 Overview of Text Mining
Speaker: Ben Thomas
- [ee] Andreas Hotho, Andreas Nürnberger, Gerhard Paaß (2005): A Brief Survey of Text Mining LDV Forum - GLDV Journal for Computational Linguistics and Language Technology, pp. 19-62.
Further Reading (optionally): -
--- Text analytics.
Speaker: ---
- [ee] Mark Dredze, Tessa Lau , Nicholas Kushmerick (2006): Automatically classifying emails into activities IUI '06: Proceedings of the 11th international conference on Intelligent user interfaces, pp. 70--77.
Further Reading (optionally): -
--- Text Clustering.
Speaker: ---
- [ee] Florian Beil, Martin Ester, Xiaowei Xu (2002): Frequent term-based text clustering KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 436--442.
Further Reading (optionally): -
--- Information Extraction by Rule Induction.
Speaker: ---
- [ee] Fabio Ciravegna (2001): Adaptive Information Extraction from Text by Rule Induction and Generalisation Proceedings of the 17th IJCAI, Seattle.
Further Reading (optionally): -
--- Technology.
Speaker: ---
- [ee] Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan (2001): GATE: an architecture for development of robust HLT applications ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 168--175.
Further Reading (optionally):