Short Course: Statistical Methods in NLP
Short Course: Statistical Methods in NLP
July 8-10, 1998
Philip Resnik
Prerequisites
Natural Language Processing: Problems and Approaches in Building
Large-Scale Systems (taught by Profs. Dorr and Weinberg), or
equivalent introduction including:
- Finite state grammars, automata, and transducers
- Context-free grammars and chart parsers (CKY, Earley)
- Computational complexity (big-O notation)
Main Sources for Readings:
Day 1
- Practical issues
- Probability and probability estimation
- Probability, independence, conditional probability (KS 1.1-1.2.4)
- Bayes Theorem (KS 1.2.5)
- Maximum likelihood probability estimates
- Smoothing methods: add-k, Good-Turing, interpolated (JM 6.2, 6.3)
- Measures of lexical association
- Mutual information (Church and Hanks, ACL-1989)
- Hypothesis testing
- Chi-squared (Georgetown
chi-square tutorial)
- Likelihood ratio (Dunning, CL 19(1):61-74, 1993)
Lab for Day 1:Bigram Counts and
Association Statistics.
Day 2
- Language models
- Brief intro to information-theoretic concepts
Lab for Day 2:Exercise Using
an HMM. (You can either download the
code now or follow the instructions given in the Lab.)
Day 3
- Performance Evaluation (KS Ch. 3)
- Lower and upper bounds
- Agreement and chance-corrected agreement
(Carletta (1996), Assessing
agreement on classification tasks: the kappa statistic, CL
22(2):249-254.)
- Evaluation of part-of-speech tagging
- Evaluation of sense disambiguation
(Resnik and Yarowsky (1997),
A
perspective on word sense disambiguation methods and their
evaluation, Proc. ACL
SIGLEX Workshop at ANLP 1997.)
- Evaluation of parsers
- Evaluation of IR systems
- Evaluation of MT systems
- Decision-based evaluation (Resnik (1997),
Evaluating Multilingual Gisting of Web Pages,
Technical Report UMIACS-TR-97-39, presented at
AAAI Symposium on
Natural Language Processing for the World Wide Web, Stanford,
CA, March 1997.)
- Class-based probabilistic methods (Resnik,
in revision [to appear in JAIR])
- WordNet and class-based probabilities
- Measures of semantic similarity
- Methods for word sense disambiguation
Last modified June 23, 1998.