Courses in summer term 2006 / Seminar on Text Mining and Ontology Learning / readings:

List of readings (ee = link to electronic edition; ask me for the other references):

Tue. 25.04 (0) -- Introduction --
I. Text Classification
Tue. 16.05 (1) A Survey of Text Classification Methods, especially Support Vector Machines
Speaker: Ling Chen
  • ee Dumais S., Platt J., Heckerman D. (1998): Inductive Learning algorithms and representations for text categorization. CIKM 1998.
  • ee Joachims T. (2001): A Statistical Learning Model of Text Classification for Support Vector Machines. SIGIR 2001.

Further Reading (optional):
  • ee Hotho Andreas, Nürnberger Andreas, Paaß Gerhard (2005): A brief Survey of Text Mining. LDV Forum - GLDV Journal for Computational Linguistics and Language Technology.
Tue. 23.05 (2) Text Classification considering Background Knowledge
Speaker: Mohamad Rabbath
  • ee Bloehdorn S., Hotho A. (2004): Text classification by boosting weak learners based on terms and concepts. ICDM 2004.
  • ee Ifrim G., Theobald M., Weikum G. (2005): Learning word-to-concept mappings for automatic text classification. ICML Workshop on Learning in Web Search 2005.

Further Reading (optional):
  • ee Bloehdorn S., Hotho A. (2005): Boosting for text classification with semantic features. SIGKDD Workshop on mining for and from semantic web 2005.
  • ee Cai L., Hofmann T. (2003): Text categorization by boosting automatically extracted concepts. SIGIR 2003.
Tue. 30.05 No Seminar
Tue. 06.06 Pentecost Holidays
Tue. 13.06 (3) Automatic Classification based on semantic hierarchies
Speaker: Tobias Lang
  • ee Peng and Choi (2005): Document classifications based on word semantic hierarchies. (IASTED on AI, 2005).

II. Some Basic Problems
Tue. 20.06 (4) Named Entity Recognition
Speaker: Nick Sutterer
  • ee Jason D. M. Rennie, Tommi Jaakkola (2005): Using term informativeness for named entity detection. SIGIR 2005.
  • ee GD Zhou, J Su (2002): Named Entity Recognition using an HMM-based Chunk Tagger. ACL 2002.
Tue. 27.06 (5) Word Sense Disambiguation
Speaker: Markus Wößner
  • ee Gliozzo A. and Giuliano C. and Strapparava C. (2005): Domain Kernels for Word Sense Disambiguation. ACL 2005.
  • ee Upali Sathyajith Kohomban and Wee Sun Lee (2005): Learning Semantic Classes for Word Sense Disambiguation. ACL 2005.

Further Reading (optional):
  • ee Zheng-Yu Niu, Dong-Hong Ji, and Chew-Lim Tan (2005): Word Sense Disambiguation Using Label Propagation Based Semi-Supervised Learning. ACL 2005.
Tue. 04.07 (6) Coreference Resolution
Speaker: Johannes Wendeberg
  • ee Ng V., Cardie C. (2002): Improving Machine Learning Approaches to Coreference Resolution. ACL 2002.
  • ee Yang X., Zhou G., Su J., Tan C. (2003): Coreference Resolution Using Competition Learning Approach. ACL, 2003.

Further Reading (optional):
  • ee Popescu-Belis A. (2003): Evaluation-driven design of a robust coreference resolution system. Natural Language Engineering, 2003, Cambridge University Press.
  • ee Ng V. (2005): Machine Learning for Coreference Resolution: From local classification to global ranking. ACL 2005.
III. Learning Concept Taxonomies
Tue. 11.07 (7) Ontology Semantic Similarity
Speaker: Gerald Lippert
  • ee Ganesan P., Garcia-Molina H., Widom J. (2003): Exploiting Hierarchical Domain Structure to Compute Similarity. ACM Stanford University Technical report.
  • ee Maedche A., Staab S. (2002): Measuring similarity between ontologies. EKAW 2002.
-- (8) Evaluation of information extraction tasks and ontologies
  • ee Neil Ireson, Fabio Ciravegna, Mary Elaine Califf, Dayne Freitag, Nicholas Kushmerick, Alberto Lavelli: Evaluating Machine Learning for Information Extraction. ICML 2005.
  • ee Porzel R., Malaka R. (2004): A task-based approach for ontology evaluation. ECAI 2004.

Further Reading (optional):
  • ee Brewster C., Alani H., Dasmahapatra S., Wilks Y.: Data driven Ontology Evaluation. LREC 2004.
  • ee Alberto Lavelli, Mary Elaine Califf, Fabio Ciravegna, Dayne Freitag, Claudio Giuliano, Nicholas Kushmerick, Lorenza Romano (2004): A Critical Survey of the Methodology for IE Evaluation. Proceedings of the 4th International Conference on Language Resources and Evaluation 2004.
Tue. 18.07 (9) Learning Concept Taxonomies
Speaker: André Borgeat
  • ee Maedche A., Staab S. (2005): Ontology Learning. In Handbook of ontologies in Information Systems 2005.
  • ee Cimiano P., Hotho A., Staab S. (2004): Comparing conceptual, divisive and agglomerative clustering for learning taxonomies from text. ECAI 2004.
IV. Learning General Relations
Tue. 25.07 (10) Learning Relations using Association Rules
Speaker: Tomoko Kitamura
  • ee Maedche A., Staab S. (2000): Discovering conceptual relations from text. ECAI 2000.
-- (11) Learning Relations using Kernel Methods
  • ee Shubin Zhao and Ralph Grishman (2005): Extracting Relations with Integrated Information Using Kernel Methods. ACL 2005.
  • ee Ryan McDonald, Fernando Pereira, Seth Kulick, Scott Winters, Yang Jin and Pete White (2005): Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE. ACL 2005.

Further Reading (optional):
  • ee Bunescu, R. and Mooney, R. (2005): A Shortest Path Dependency Kernel for Relation Extraction. HLTC 2005.
-- (12) Adaptive Information Extraction (LP2 algorithm)
  • ee Ciravegna F. (2001): Adaptive Information Extraction from Text by Rule Induction and Generalisation. IJCAI 2001.
  • ee Ciravegna F. (2003): (LP)2, Rule Induction for Information Extraction using Linguistic Constraints. Technical Report, University of Sheffield, September 2003.
V. Applications
-- (13) Text Summarization
  • ee Nomoto, Tadashi (2005): Bayesian Learning in Text Summarization. HLTC and EMNLP 2005.