NLP Reading Group

The reading group attempts to keep abreast of current trends in natural language processing research. We typically read one or two recent NLP conference papers each week, and occasionally look at material from the machine learning, statistics, and linguistics communities as well.

Starting in 2008, we will be posting the weekly readings here. Past readings since 2001 are being filled in presently.

Spring 2008

First meeting of the term will be on Thursday, Jan. 31, at noon in NEB 317. Feel free to bring lunch.

Fall 2007

Topics:

Domain adaptation
Recent parsing work
Text compression
Semisupervised learning

Date/Time	Presenter	Paper(s)
Sep.26	Omar F Zaidan	J. Blitzer, R. McDonald, F. Pereira Domain Adaptation with Structural Correspondence Learning EMNLP 2006
Oct.3	David Smith	Shai Ben-David, John Blitzer, Koby Crammer, Fernando Pereira. Analysis of Representations for Domain Adaptation.
Oct. 10	Nathaniel W Filardo	Mahoney, Matthew Adaptive Weighing of Context Models for Lossless Data Compression. Florida Institue of Technology, CS Department, Technical report CS-2005-16 EMNLP-CoNLL 2007
Oct. 17	Markus Dreyer	Nakagawa, Tetsuji Multilingual Dependency Parsing Using Global Features EMNLP-CoNLL 2007
Oct. 26	Christo Kirov	Seginer, Yoav Fast Unsupervised Incremental Parsing (syntax induction) Proceedings ACL 2007
Nov. 3	Christo Kirov	I. Titov, J. Henderson Constituent Parsing with Incremental Sigmoid Belief Networks ACL 2007
Nov. 17	David Smith	X. Zhu Semi-Supervised Learning Literature Survey
Dec. 12	Delip Rao	M. Belkin, P. Niyogi Laplacian Eigenmaps for Dimensionality Reduction and Data Representation ACM 2002 Mikhail Belkin, Partha Niyogi, Vikas Sindhwani On Manifold Regularization

Summer 2007

Topics:

Good recent papers (mainly from 2007)

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
May 10	David Smith	M. Johnson, T. Griffiths, and S. Goldwater Bayesian Inference for PCFGs via Markov Chain Monte Carlo HLT/NAACL 2007
May 17	Markus Dreyer	M. Galley, K. McKeown Lexicalized Markov Grammars for Sentence Compression HLT/NAACL 2007
June 2	Erin Fitzgerald	J. Jiang, C. Zhai A Systematic Exploration of the Feature Space for Relation Extraction HLT/NAACL 2007
June 6	Nikesh Garera	A. Alexandrescu, K. Kirchhoff Data-Driven Graph Construction for Semi-Supervised Graph-Based Learning in NLP HLT/NAACL 2007
June 14	David Smith	X. Zhu, Z. Ghahramani,J. Lafferty Semi-supervised learning using Gaussian fields and harmonic functions. ICML 2003
June 21	Christopher White	K. Murphy, Y. Weiss, M. Jordan Propagation for approximate inference: An empirical study. 15th UAI, pages 467-?75, 1999	... discussing (loopy) belief propagation as background for survey propagation, a topic which has been getting more attention lately for its ability to "solve very large hard combinatorial problems, such as determining the satisfiability of Boolean formulas. Chapter 8 of Chris Bishop's textbook is supposed to be a good treatment of graphical models overall. It is available free here [1]. He covers BP in section 8.4.4 after first presenting factor graphs in 8.4.3. David MacKay's treatment of BP, also in terms of factor graphs, is in chapter 26 of his book [2]. It's worth reading this chapter in full, perhaps first reading chapter 16. ... the update equations are given as (26.11) and (26.12) ... [substantial further discussion by jason was here] Some people may prefer Bishop's style, others MacKay's.
July 6	Christopher White	A. Braunstein, M. Mezard, R. Zecchina. Survey propagation: an algorithm for satisfiability. Random Structures and Algorithms, 2005.	We sent some questions to Zecchina. Lukas Kroc, Ashish Sabharwal and Bart Selman. Survey Propagation Revisited: An Empirical Study. 23rd UAI, 2007.
July 18	David Smith	P. Liang, S. Petrov, M. Jordan, D. Klein The Infinite PCFG Using Hierarchical Dirichlet Processes. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning,
Aug. 3	Yi Su	M. Galley, K. McKeown Lexicalized Markov Grammars for Sentence Compression. NAACL-HLT 2007
Aug. 11	Nikesh Garera	L. Shen, G. Satta, A. Joshi. Guided learning for bidirectional sequence classification ACL 2007
Aug. 18	Markus Dreyer	D. Talbot, M. Osborne Randomised Language Modelling for Statistical Machine Translation ACL 2007	They use a space-efficient randomized data structure (Bloom Filter) to store very large n-gram models. There is a companion paper that people might want to have a quick look at as well, for comparison: D. Talbot, M. Osborne Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap ACL 2007
Aug. 30	Delip Rao	Gideon S. Mann Simple, Robust, Scalable Semi-supervised Learning via Expectation Regularization Proceedings of the 24 th International Conference on Machine Learning 2007

Spring 2007

Topics:

Morphology (unsupervised learning)
Recent IR/QA papers (with an NLP or multilingual focus)
Integrating search and learning

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
Apr. 19	John Blatz	A. Prieditis Machine discovery of Effective Admissible Heuristics Machine Learning Journal, 1993
Apr. 12	Markus Dreyer	A. Haghighi, J. DeNero and D. Klein Approximate Factoring for A* Search NAACL-HLT 2007
Mar. 29 & Apr. 5	Zhifei Li	H. Daume III, J. Langford, and D. Marcu Search-based structured prediction. Machine Learning Journal, forthcoming
Mar. 8	David Smith	H. Daume III & D. Marcu Learning as search optimization: approximate large margin methods for structured prediction. ICML 2005
Mar. 1	Wei Chen	M. Kaisser, S. Scheible, and B. Webber Experiments at the University of Edinburgh for the TREC 2006 QA track. TREC-15	They do some fairly deep interpretation of sentences, extracting their predicate-argument structure.
Feb. 22	Eric Harley	K. Kan Lo & W. Lam Using Semantic Relations with World Knowledge for Question Answering TREC-15
Feb. 15	Nikhil Bojja	C. Monson et. al. Unsupervised Induction of Natural Language Morphology Inflection Classes ACL Student Workshop '04
Feb. 8	Delip Rao	P. Schone and D. Jurafsky Knowledge-free induction of morphology using latent semantic analysis CoNLL 2000	However, there was an extension of this work reported in NAACL-2001 that looks at circumfixes and prefix/affix combinations. [3]
Feb. 1	Nikesh Garera	D. Yarowsky and R. Wicentowski Minimally supervised morphological analysis by multimodal alignment ACL 2000	For more details refer to Chapter 4 of Wicentowski's thesis.

Fall 2006

Topics:

Machine learning: Margin methods and structured classification
Linguistics: Syntactic formalisms
Syntax-based MT

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
Dec. 13	Delip Rao	J. Carbonell et. al. Context-based machine translation AMTA 2006
Dec. 6	Jason Smith	M. Galley et. al. Scalable Inference and Training of Context-Rich Syntactic Translation Models ACL 2006	It may also be helpful to look at: M. Galley et. al. What's in a translation rule? HLT/NAACL 2004
Nov. 29	Balakrishnan V	D. Marcu et. al. SPMT: Statistical Machine Translation with Syntactified Target Language Phrases EMNLP 2006
Nov. 15	Eric Harley	D. Chiang An introduction to synchronous grammars ACL 2006 Tutorial	Slides from the talk are also available. [4]
Nov. 8	Elliott Drabek	K.Shklovsky A Grammatical Sketch of Petalcingo Tzeltal Undergraduate Thesis, Reed College, 2005	It is 77 pages long, but not dense, and I will be skipping the following sections: Pages 01-14 Phonetics and phonology 18-18 Polyvalence 21-21 Inherent possession and ... 46-55 Tense and aspect and other sections
Nov. 1	Yi Su	M. Steedman Gapping as Constituent Coordination Linguistics and Philosophy, Vol. 13, 1990, pp.207-264.	See Yi for photocopies.
Oct. 25	Markus Dreyer	S. Reizler et. al. Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques ACL 2002
Oct. 18	Erin Fitzgerald	J. Bresnan & R.M. Kaplan Lexical-Functional Grammar: A Formal System for Grammatical Representation The Mental Representation of Grammatical Relations, MIT Press, 1982	BTW, the edited collection that this appears in is generally interesting. Bresnan defends and develops lexicalized grammars in general; the idea of separate surface and semantic roles; and Bresnan & Kaplan's LFG in particular. You should know that she originated (in 1978) the extremely influential idea of lexicalized syntax -- the idea that a grammar is simply a collection of lexical entries to be assembled in standard language-independent ways, but that there are also "lexical redundancy rules" that relate, e.g., active and passive entries for the same verb. Some chapters address morphological and cognitive issues pertaining to lexicalization, including an essay by Pinker on lexicalist learning. Slides from Erin's presentation can be found here.
Oct. 11	John Blatz	L.Xu, D. Wilkinson, F. Southey, & D. Schuurmans Discriminative Unsupervised Learning of Structured Predictors ICML 2006
Oct. 4	Nikesh Garera	A. Culotta & J. Sorensen Dependency Tree Kernels for Relation Extraction ACL 2004 D. Zelenko, C. Aone, & A. Richardella Kernel Methods for Relation Extraction JMLR, Volume 3, 2003
Sept. 27	David Smith	C. Cortes, P. Haffner, & M. Mohri Rational Kernels NIPS 2003	Papers extending rational kernels, including results on positive semidefinite cases, are at:[5] For the record, and not to be read, is an interesting parallel line of research in Fisher Kernels over strings, e.g. this paper by Saunders, Shawe-Taylor and Vinokourov: [6]
Sept. 20	Elliot Drabek	K.Q. Weinberger, F. Sha, & L.K. Saul Learning a kernel matrix for nonlinear dimensionality reduction ICML 2004	S.T. Roweis & L.K. Saul Nonlinear Dimensionality Reduction by Locally Linear Embedding Science, 22 December 2000 J.B. Tenenbaum, V. De Silva, & J.C. Langford A global geometric framework for nonlinear dimensionality reduction Science, 22 December 2000
Sept. 13	Roy Tromble	L. Xu, J. Neufeld, B. Larson, & D. Schuurmans Maximum Margin Clustering NIPS 2004

Summer 2006

Spring 2006

Fall 2005

Date/Time	Presenter	Paper(s)
Sept. 14	Nikesh Garera	M. Jordan Statistical Learning Theory Chapter 8 (Exponential family and Generalized linear models)
Sept. 21	Arnab Ghoshal	M. Jordan Statistical Learning Theory Chapter 2&3
Oct. 20	Roy Tromble	Sheila M. Reynolds, Jeff A. Bilmes Part-of-Speech Tagging using Virtual Evidence and Negative Training. Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing. 2005. pp 459--466.
Oct. 27	Markus Dreyer	D. Roth and W. Yih Integer Linear Programming Inference for Conditional Random Fields. ICML '2005
Nov. 4	Jason Riesa	Luke S. Zettlemoyer, Michael Collins. Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Proceedings of UAI 2005
Nov. 16	Safiullah Shareef	Hassan Sawaf, Jörg Zaplo, Hermann Ney Statistical Classification Methods for Arabic News Articles
Nov. 23	Roy Tromble	Sutton, Charles and McCallum, Andrew Composition of Conditional Random Fields for Transfer Learning Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing 2005

Summer 2005

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
ddd

Spring 2005

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
ddd

Fall 2004

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
ddd

Summer 2004

Topics:

Recent papers from ACL/EMNLP 2004

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
Aug. 20	Damianos Karakos, Charles Schafer	P. Pantel and D. Lin Discovering word senses from text Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, 2002 Diana McCarthy, Rob Koeling, Julie Weeds, John Carroll [ftp://ftp.informatics.susx.ac.uk/pub/users/dianam/senseranks.pdf Finding Predominant Word Senses in Untagged Text] 2004
Aug. 27	David Smith	I. Dan Melamed Statistical Machine Translation by Parsing ACL 2004 Daniel Gildea [http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Gildea.pdf Dependencies vs. Constituents for Tree-Based Alignment] ACL 2004
Sep. 2	Gideon Mann	Xin Li, Paul Morie, and Dan Roth [http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/139_Paper.pdf Robust Reading: Identification and Tracing of Ambiguous Names] ACL 2004 Cheng Niu, Wei Li, Rohini K. Srihari [http://acl.ldc.upenn.edu/acl2004/main/pdf/372_pdf_2-col.pdf Weakly Supervised Learning for Cross-Document Person-Name Disambiguation Supported by Information Extraction] ACL 2004
Sep. 9	John Blatz	Pascale Fung and Percy Cheung [http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Fung.pdf Mining Very-Non-Parallel Corpora: Parallel Sentence and Lexicon Extraction via Bootstrapping and EM] ACL 2004 Dragos Stefan Munteanu, Alexander Fraser and Daniel Marcu [http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/93_Paper.pdf Improved Machine Translation Performance via Parallel Sentence Extraction from Comparable Corpora] ACL 2004
Sep. 16	Roy Tromble	B. Taskar, C. Guestrin and D. Koller Max-Margin Markov Networks Neural Information Processing Systems Conference (NIPS03), 2003 B. Taskar, D. Klein, M. Collins, D. Koller and C. Manning Max-Margin Parsing EMNLP 2004	Background knowledge on SVM and Graphical Models [www.cse.msu.edu/~lawhiu/intro_SVM.ppt Intro SVM] [www.ai.mit.edu/~murphyk/Bayes/bnintro.html Intro Graphical Models]

Spring 2004

Topics:

combinatorial optimization (software)
optimality theory
information extraction

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
Feb. 5	Brock	Jessica A. Barlow and Judith A. Gierut Optimality theory in phonological acquisition Journal of Speech, Language and Hearing 42, 1999 ---- Paul Boersma, Joost Dekkers and Jeroen van de Weijer Introduction. In Optimality Theory: Phonology, Syntax and Acquisition Oxford University Press 2000
Feb. 12	Brock	Bob Frank, Giorgio Satta Optimality theory and the Generative Complexity of Constraint Violability MIT Press	A glimpse (from MIT Press): It has been argued that rule-based phonological descriptions can uniformly be expressed as mappings carried out by finite-state transducers, and therefore fall within the class of rational relations. If this property of generative capacity is an empirically correct characterization of phonological mappings, it should hold of any sufficiently restrictive theory of phonology, whether it utilizes constraints or rewrite rules. In this paper, we investigate the conditions under which the phonological descriptions that are possible within the view of constraint interaction embodied in Optimality Theory (Prince and Smolensky 1993) remain within the class of rational relations. We show that this is true when GEN is itself a rational relation, and each of the constraints distinguishes among finitely many regular sets of candidates.
Feb. 19	David Smith	Barzilay and Lee Learning to Paraphrase: An Unsupervise Approach Using Multiple-Sequen7:12 PM 2/4/2008ce Alignment HTL 2003
Mar. 5	Charles Schafer	Daniel Marcu Theory and Practice of Discourse Parsing and Summarization, Chapters 2 & 3 The MIT Press, 2000
Mar. 18	Markus Dreyer	Eugene Charniak, Niyu Ge, John Hale A Statistical Approach to Anaphora Resolution Proceedings of the Sixth Workshop on Very Large Corpora, 1998
Mar. 25	Eric Goldlust	Boyan and Moore Learning Evaluation Functions to Improve Optimization by Local Search Journal of Machine Learning Research, 2000
Apr. 3	Roy Tromble	Roman Bartak Constraint Programming: In Pursuit of the Holy Grail 1999
Apr. 10	Noah Ashton Smith	Denys Duchier Axiomatizing Dependency Parsing Using Set Constraints Sixth Meeting on Mathematics of Language, 2000
Apr. 10	Noah Ashton Smith	Denys Duchier Axiomatizing Dependency Parsing Using Set Constraints Sixth Meeting on Mathematics of Language, 2000
Apr. 17	Elliott Franco Drabek	Rina Dechter Mini-Buckets: A General Scheme for Generating Approximations in Automated Reasoning 2001
Apr. 24	David Smith	McCallum and Jensen Extraction and Data Mining using Conditional-Probability, Relational Models IJCAI'03 Workshop on Learning Statistical Models from Relational Data, 2003	The paper is a survey of recent trends in IE and data mining (biased of course towards the authors' work) and a proposal to unify them with conditional random fields.
May. 1	Izhak Shafran	Eric J. Friedman Strong Monotonicity in Surplus Sharing 1999	Used Tom Dietterich has a web page on probabilistic relational models: [7]
May. 15	Roy Tromble	Fuchun Peng, Andrew McCallum Accurate Information Extraction from Research Papers using Conditional Random Fields 2004

Fall 2003

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
Sep.11	Elliott Franco Drabek	Bernard Comrie Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology: Syntax and Morphology, Chapter 1 Blackwell Pub (1989)
Sep.18	David Smith	Bernard Comrie Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology: Syntax and Morphology, Chapter 2-3 Blackwell Pub (1989)
Oct. 3	Michelle Vanni	Bernard Comrie Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology: Syntax and Morphology, Chapter 4-6 Blackwell Pub (1989)
Oct. 10	David Smith	Bernard Comrie Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology: Syntax and Morphology, Chapter 6-7 Blackwell Pub (1989)
Oct. 24	Markus Dreyer	Stuart M. Shieber, Yves Schabes Synchronous Tree-Adjoining Grammars Coling 1990	An additional closely related paper Stuart M. Shieber, Yves Schabes Generation and Synchronous Tree-Adjoining Grammars Fifth International Workshop on Natural Language Generation.
Oct. 31	Roy Tromble	Dekai Wu An algorithm for simultaneously bracketing parallel texts by aligning words ACL 1995
Nov. 6	Brock Pytlik	Stuart M. Shieber Transducers as a Substrate for Natural Language Processing
Nov. 13	Markus Dreyer	Goldman and Zhou Enhancing Supervised Learning with Unlabeled Data 27th Int. Conf. on Mach. Learn. 2000	An additional paper with some experiments Clark, Curran and Osborne Bootstrapping POS taggers using Unlabelled Data CoNLL 2003
Nov. 20	Noah A. Smith	Rebecca Hwa, Miles Osborne, Anoop Sarkar, Mark Steedman Corrected Co-training for Statistical Parsers ICML 2003
Dec. 12	Paola Virga	Kamal Nigam and Rayid Ghani Analyzing the Effectiveness and Applicability of Co-training Ninth International Conference on Information and Knowledge Management 2000

Spring 2003

Date/Time	Presenter	Paper(s)
Feb. 13	David Smith	K. Church Empirical Estimates of Adaptation: The chance of Two Noriega's is closer to p/2 than p^2 Coling 2000, pp. 173-179
Feb. 19	Elliott Drabek	A. Lopez􀀀, M. Nossal􀀀, R. Hwa, P. Resnik Word-level Alignment for Multilingual Resource Acquisition Proceedings of the 2002 LREC Workshop on Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Language Data
Feb. 26	Elliott Drabek	Steven Abney Bootstrapping ACL'02
Mar.6	Paola Virga	Carl M. Kadie, Christopher Meek, David Heckerman A Collaborative Filtering System Using Posteriors Over Weights of Evidence Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence, 2002.
Mar.20	Roy Tromble	Nikita Schmid, Ahmed Patel [ttp://arXiv.org/abs/cs/0201008 Using Tree Automata and Regular Expressions to Manipulate Hierarchically Structured Data]
Apr.10		V. N. Vapnik The Nature of Statistical Learning Theory, Intro and Chapters 1, 2A
Apr.17	Roy Tromble	V. N. Vapnik The Nature of Statistical Learning Theory,Chapters 2B - 4A
Apr. 24	Paola	V. N. Vapnik The Nature of Statistical Learning Theory, Chapters 4B - 5A
May 1	Noah	V. N. Vapnik The Nature of Statistical Learning Theory, Chapters 5B - 6A
May 8	Noah	V. N. Vapnik The Nature of Statistical Learning Theory, Chapters 6B - 7A
May 15	Chal	V. N. Vapnik The Nature of Statistical Learning Theory, Chapters 7B -

Fall 2002

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
Sep. 10	Noah A. Smith	Collins, Duffy. New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron. ACL '2002
Sep. 19	Paola Virga	Yamada, Knight A decoder for Syntax-based Statistical MT ACL '2002
Sep. 26	Paul Ruhlen	Hwa, Resnik, Weinberg, Kolak Evaluating Translational Correspondence using Annotation Projection ACL '2002
Oct. 2	Gideon Mann	Gildea, Jurafsky Automatic Labeling of Semantics Roles ACL '2001
Oct. 8	Elliott Franco Drabek	Ravichandran, Hovy Learning Surface Text Patterns for a Question Answering System. ACL '2001	A similar paper Lin, Pantel Discovery of Inference Rules for Question Answwering
Oct. 17	David Smith	Cotton, Bird An Integrated Framework for Treebanks and Multilayer Annotations LREC '2002
Oct. 24	Roy Tromble	Han, Benjamin Building a Bilingual Dictionary with Scarce Resources: A Genetic Algorithm Approach.
Nov. 1	Chalaporn Hathaidharm	J.Gao, J.Goodman, M.Li, K.Lee Toward A Unified Approach To Statistical Language Modeling For Chinese ACM Transactions on Asian Language Information Processing, Vol. 1, No. 1, pp 3-33. 2002.
Nov. 7	Neda Khalili	Yamamoto, Church Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus Computational Linguistics '2001	A relative paper: Kageura Bigram Statistics Revisited A Comparative Examination of Some Statistical Measures in Morphological Analysis of Japanese Kanji Sequences
Nov. 14	Michelle Vanni	Hearst Untangling Text Data Mining. ACL '1999
Nov. 21	Silviu Cucerzan	Ueda, Nakano, Ghahramani, Hinton SMEM Algorithm for Mixture Models Neural Information Processing Systems '1998
Dec.5	Silviu Cucerzan	Pearce A Comparative Evaluation of Collocation Extraction Techniques. Darren Pearce. Third International Conference on Language Resources and Evaluation. May. 2002 D. Lin Automatic identification of non-compositional phrases. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 317--324.

Summer 2002

Date/Time

Presenter

Paper(s)

Supporting Papers/Notes

July. 24

Michelle Vanni

Merlo

A Multilingual Paradigm for Automatic Verb Classification

ACL '2002

July. 31

Paola Virga

Yamada, Knight

A decoder for Syntax-based Statistical MT

ACL '2002

Spring 2002

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
Feb. 7	Paola Virga	Knight, Graehl Machine Transliteration Proceedings of the Thirty-Fifth Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Feb. 14	Charles Schafer	Yaser, Germann Translating with Scarce Resources American Association for Arti�cial Intelligence 2000
Feb. 21	Jia Cui	Barzilay, McKeown Extracting Paraphrases from a Parallel Corpus Computer Science Department Columbia.Univ.
Feb. 28	Silviu Cucerzan	Marcu Towards a Unified Approach to Memory- and Statistical-Based Machine Translation. Annual Meeting of the ACL, Proceedings of the 39th Annual Meeting on Association for Computational Linguistics '2001
Mar. 14	Noah A. Smith	Ratnaparkhi A Simple Introduction to Maximum Entropy Models for NLP Institute for Research in Cognitive Science, Univ. of Penn.
Mar. 28	Swapna Somasundaran	Crestan, El-Beze Improving supervised WSD by including rough semantic features in a Multilevel view of the Context SEMPRO Workshop, Edinburgh, 2001.
Apr. 11	Paola Virga	Neal, Hinton A view of the EM algorithm that justifies incremental, sparse, and other variants Learning in Graphical Models, 1999
Apr. 18	Paul Ruhlen	NA. Rao, K. Rose Deterministically annealed design of hidden Markov model speech recognizers IEEE Trans. on Speech and Audio Processing, vol. 9, (no. 2), Feb. 2001	following article builds on the Neal & Hinton paper that we read last week. It tests an incremental version of EM (carefully choosing how incremental it will be), as well as a "lazy EM" version that visits "significant" cases more often. [8]
Apr. 25	Paul Ruhlen	H. Al-Adhaileh, Kong, Melamed Malay-English Bitext Mapping and Alignment Using SIMR/GSA Algorithms Malaysian National Conference on Research and Development on Lingustics '2001

Fall 2001

Date/Time	Presenter	Paper(s)	Supporting Papers/Notes
Dec. 14	Jia Cui	Bellegarda Exploiting latent semantic information in statistical language models Proceedings of the IEEE , Volume: 88 Issue: 8 , Aug. 2000
Nov. 29	Silviu Cucerzan	Mike Collins, Yoram Singer Unsupervised Models for Named Entity Classification EMNLP/VLC'99
Nov. 20	Radu Florian	Blum, Mitchell Combining Labeled and Unlabeled Data with Co-Training Proceedings of 1998 Conference on Computational Learning Theory
Nov. 16	Richard Wicentowski	Eisner, Satta Efficient parsing for bilexical context-free grammars and head automaton grammars ACL '99	plagiarism detection systems might be relevant to bitext alignment. A message to the Corpora list yesterday announced the following review paper:[9]
Nov. 2	Paul Ruhlen	Manning, Schuetze Foundations of Statistical Natural Language Processing, Section 14 on clustering, pp. 495-527. MIT Press
Oct. 26	Gideon Mann	Tishby, Pereira, Bialek The information bottleneck method	The paper describes a clustering method which is a generalization of their earlier work on "Distributional Clustering of English Words" (pereira,tishby and lee '93).

NLP Reading Group

Contents

Spring 2008

Fall 2007

Summer 2007

Spring 2007

Fall 2006

Summer 2006

Spring 2006

Fall 2005

Summer 2005

Spring 2005

Fall 2004

Summer 2004

Spring 2004

Fall 2003

Spring 2003

Fall 2002

Summer 2002

Spring 2002

Fall 2001

Navigation menu

NLP Reading Group

Spring 2008

Fall 2007

Summer 2007

Spring 2007

Fall 2006

Summer 2006

Spring 2006

Fall 2005

Summer 2005

Spring 2005

Fall 2004

Summer 2004

Spring 2004

Fall 2003

Spring 2003

Fall 2002

Summer 2002

Spring 2002

Fall 2001

Navigation menu

Search