Difference between revisions of "NLP Reading Group"

From CLSP Wiki
Jump to navigation Jump to search
Line 23: Line 23:
  
 
;Sep.26 (Omar F Zaidan)  
 
;Sep.26 (Omar F Zaidan)  
 
 
: J. Blitzer, R. McDonald, F. Pereira ,
 
: J. Blitzer, R. McDonald, F. Pereira ,
  [http://www.cis.upenn.edu/~blitzer/papers/emnlp06.pdf Domain Adaptation with Structural Correspondence Learning] ,EMNLP 2006
+
  [http://www.cis.upenn.edu/~blitzer/papers/emnlp06.pdf Domain Adaptation with Structural Correspondence Learning] ,  
 +
EMNLP 2006
  
 
;Oct.3 (David Smith)  
 
;Oct.3 (David Smith)  
 
 
: Shai Ben-David, John Blitzer, Koby Crammer, Fernando Pereira. ,
 
: Shai Ben-David, John Blitzer, Koby Crammer, Fernando Pereira. ,
  [http://www.cis.upenn.edu/~blitzer/papers/nips06.pdf Analysis of Representations for Domain Adaptation.] ,|-
+
  [http://www.cis.upenn.edu/~blitzer/papers/nips06.pdf Analysis of Representations for Domain Adaptation.] ,  
|Oct. 10
+
;Oct. 10 (Nathaniel W Filardo)
|Nathaniel W Filardo
 
 
 
 
: Mahoney, Matthew ,
 
: Mahoney, Matthew ,
  [http://www.cs.fit.edu/~mmahoney/compression/cs200516.pdf  Adaptive Weighing of Context Models for Lossless Data Compression.] ,Florida Institue of Technology, CS Department, Technical report CS-2005-16
+
  [http://www.cs.fit.edu/~mmahoney/compression/cs200516.pdf  Adaptive Weighing of Context Models for Lossless Data Compression.] ,  
 +
Florida Institue of Technology, CS Department, Technical report CS-2005-16
  
 
EMNLP-CoNLL 2007
 
EMNLP-CoNLL 2007
  
 
;Oct. 17 (Markus Dreyer)  
 
;Oct. 17 (Markus Dreyer)  
 
 
: Nakagawa, Tetsuji ,
 
: Nakagawa, Tetsuji ,
  [http://www.aclweb.org/anthology/D/D07/D07-1100  Multilingual Dependency Parsing Using Global Features] ,EMNLP-CoNLL 2007
+
  [http://www.aclweb.org/anthology/D/D07/D07-1100  Multilingual Dependency Parsing Using Global Features] ,  
 +
EMNLP-CoNLL 2007
  
 
;Oct. 26 (Christo Kirov)  
 
;Oct. 26 (Christo Kirov)  
 
 
: Seginer, Yoav ,
 
: Seginer, Yoav ,
  [http://acl.ldc.upenn.edu/P/P07/P07-1049.pdf  Fast Unsupervised Incremental Parsing (syntax induction)] ,Proceedings ACL 2007
+
  [http://acl.ldc.upenn.edu/P/P07/P07-1049.pdf  Fast Unsupervised Incremental Parsing (syntax induction)] ,  
 +
Proceedings ACL 2007
  
  
 
;Nov. 3 (Christo Kirov)  
 
;Nov. 3 (Christo Kirov)  
 
 
: I. Titov, J. Henderson ,
 
: I. Titov, J. Henderson ,
  [http://www.aclweb.org/anthology-new/P/P07/P07-1080.pdf  Constituent Parsing with Incremental Sigmoid Belief Networks] ,ACL 2007
+
  [http://www.aclweb.org/anthology-new/P/P07/P07-1080.pdf  Constituent Parsing with Incremental Sigmoid Belief Networks] ,  
 +
ACL 2007
  
 
;Nov. 17 (David Smith)  
 
;Nov. 17 (David Smith)  
 
 
: X. Zhu ,
 
: X. Zhu ,
  [http://pages.cs.wisc.edu/~jerryzhu/pub/ssl_survey.pdf  Semi-Supervised Learning Literature Survey] ,|-
+
  [http://pages.cs.wisc.edu/~jerryzhu/pub/ssl_survey.pdf  Semi-Supervised Learning Literature Survey] ,  
|Dec. 12
+
;Dec. 12 (Delip Rao)
|Delip Rao
 
 
 
 
: M. Belkin, P. Niyogi ,
 
: M. Belkin, P. Niyogi ,
  [http://citeseer.ist.psu.edu/632472.html  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation] ,ACM 2002
+
  [http://citeseer.ist.psu.edu/632472.html  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation] ,  
 +
ACM 2002
  
 
----
 
----
Line 70: Line 66:
  
 
[http://people.cs.uchicago.edu/~vikass/aistats.pdf On Manifold Regularization]
 
[http://people.cs.uchicago.edu/~vikass/aistats.pdf On Manifold Regularization]
 
  
 
: } ,
 
: } ,
  ==  Summer 2007 == ,Topics:
+
  ==  Summer 2007 == ,  
 +
Topics:
 
* Good recent papers (mainly from 2007)
 
* Good recent papers (mainly from 2007)
  
Line 83: Line 79:
 
!  Supporting Papers/Notes
 
!  Supporting Papers/Notes
 
;May 10 (David Smith )  
 
;May 10 (David Smith )  
 
 
: M. Johnson, T. Griffiths, and S. Goldwater ,
 
: M. Johnson, T. Griffiths, and S. Goldwater ,
  [http://acl.ldc.upenn.edu/N/N07/N07-1018.pdf Bayesian Inference for PCFGs via Markov Chain Monte Carlo] ,HLT/NAACL 2007
+
  [http://acl.ldc.upenn.edu/N/N07/N07-1018.pdf Bayesian Inference for PCFGs via Markov Chain Monte Carlo] ,  
 +
HLT/NAACL 2007
  
 
;May 17 (Markus Dreyer)  
 
;May 17 (Markus Dreyer)  
 
 
: M. Galley, K. McKeown ,
 
: M. Galley, K. McKeown ,
  [http://acl.ldc.upenn.edu/N/N07/N07-1023.pdf Lexicalized Markov Grammars for Sentence Compression] ,HLT/NAACL 2007
+
  [http://acl.ldc.upenn.edu/N/N07/N07-1023.pdf Lexicalized Markov Grammars for Sentence Compression] ,  
 +
HLT/NAACL 2007
  
  
 
;June 2 (Erin Fitzgerald)  
 
;June 2 (Erin Fitzgerald)  
 
 
: J. Jiang, C. Zhai ,
 
: J. Jiang, C. Zhai ,
  [http://acl.ldc.upenn.edu/N/N07/N07-1015.pdf A Systematic Exploration of the Feature Space for Relation Extraction] ,HLT/NAACL 2007
+
  [http://acl.ldc.upenn.edu/N/N07/N07-1015.pdf A Systematic Exploration of the Feature Space for Relation Extraction] ,  
 +
HLT/NAACL 2007
  
 
;June 6 (Nikesh Garera)  
 
;June 6 (Nikesh Garera)  
 
 
: A. Alexandrescu, K. Kirchhoff ,
 
: A. Alexandrescu, K. Kirchhoff ,
  [http://acl.ldc.upenn.edu/N/N07/N07-1026.pdf Data-Driven Graph Construction for Semi-Supervised Graph-Based Learning in NLP] ,HLT/NAACL 2007
+
  [http://acl.ldc.upenn.edu/N/N07/N07-1026.pdf Data-Driven Graph Construction for Semi-Supervised Graph-Based Learning in NLP] ,  
 +
HLT/NAACL 2007
  
 
;June 14 (David Smith)  
 
;June 14 (David Smith)  
 
 
: X. Zhu, Z. Ghahramani,J. Lafferty ,
 
: X. Zhu, Z. Ghahramani,J. Lafferty ,
  [http://acl.ldc.upenn.edu/N/N07/N07-1026.pdf Semi-supervised learning using Gaussian fields and harmonic functions.] ,ICML 2003
+
  [http://acl.ldc.upenn.edu/N/N07/N07-1026.pdf Semi-supervised learning using Gaussian fields and harmonic functions.] ,  
 +
ICML 2003
  
 
;June 21 (Christopher White)  
 
;June 21 (Christopher White)  
 
 
: K. Murphy, Y. Weiss, M. Jordan ,
 
: K. Murphy, Y. Weiss, M. Jordan ,
  Propagation for approximate inference: An empirical study. ,15th UAI, pages 467-?75, 1999
+
  Propagation for approximate inference: An empirical study. ,  
 
+
15th UAI, pages 467-?75, 1999
 
: ... discussing (loopy) belief propagation as background for survey propagation, a topic which has been getting more attention lately for its ability to "solve very large hard combinatorial problems, such as determining the satisfiability of Boolean formulas. ,
 
: ... discussing (loopy) belief propagation as background for survey propagation, a topic which has been getting more attention lately for its ability to "solve very large hard combinatorial problems, such as determining the satisfiability of Boolean formulas. ,
  Chapter 8 of Chris Bishop's textbook is supposed to be a good treatment of graphical models overall.  It is available free here [http://research.microsoft.com/%7Ecmbishop/PRML/Bishop-PRML-sample.pdf].  He covers BP in section 8.4.4 after first presenting factor graphs in 8.4.3. ,David MacKay's treatment of BP, also in terms of factor graphs, is in chapter 26 of his book [http://www.inference.phy.cam.ac.uk/mackay/itprnn/book.html].  It's worth reading this chapter in full, perhaps first reading chapter 16.  ... the update equations are given as (26.11) and (26.12) ... [substantial further discussion by jason was here]
+
  Chapter 8 of Chris Bishop's textbook is supposed to be a good treatment of graphical models overall.  It is available free here [http://research.microsoft.com/%7Ecmbishop/PRML/Bishop-PRML-sample.pdf].  He covers BP in section 8.4.4 after first presenting factor graphs in 8.4.3. ,  
 +
David MacKay's treatment of BP, also in terms of factor graphs, is in chapter 26 of his book [http://www.inference.phy.cam.ac.uk/mackay/itprnn/book.html].  It's worth reading this chapter in full, perhaps first reading chapter 16.  ... the update equations are given as (26.11) and (26.12) ... [substantial further discussion by jason was here]
  
 
Some people may prefer Bishop's style, others MacKay's.
 
Some people may prefer Bishop's style, others MacKay's.
 
;July 6 (Christopher White)  
 
;July 6 (Christopher White)  
 
 
: A. Braunstein, M. Mezard, R. Zecchina. ,
 
: A. Braunstein, M. Mezard, R. Zecchina. ,
  [http://users.ictp.it/~zecchina/rsa.pdf Survey propagation: an algorithm for satisfiability.] ,Random Structures and Algorithms, 2005.
+
  [http://users.ictp.it/~zecchina/rsa.pdf Survey propagation: an algorithm for satisfiability.] ,  
 
+
Random Structures and Algorithms, 2005.
  
 
: We sent some questions to Zecchina. ,
 
: We sent some questions to Zecchina. ,
  Lukas Kroc, Ashish Sabharwal and Bart Selman. ,Survey Propagation Revisited: An Empirical Study.
+
  Lukas Kroc, Ashish Sabharwal and Bart Selman. ,  
 +
Survey Propagation Revisited: An Empirical Study.
  
 
23rd UAI, 2007.
 
23rd UAI, 2007.
  
 
;July 18 (David Smith)  
 
;July 18 (David Smith)  
 
 
: P. Liang, S. Petrov, M. Jordan, D. Klein ,
 
: P. Liang, S. Petrov, M. Jordan, D. Klein ,
  [http://acl.ldc.upenn.edu/D/D07/D07-1072.pdf The Infinite PCFG Using Hierarchical Dirichlet Processes.] ,Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning,
+
  [http://acl.ldc.upenn.edu/D/D07/D07-1072.pdf The Infinite PCFG Using Hierarchical Dirichlet Processes.] ,  
 +
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning,
  
 
;Aug. 3 (Yi Su)  
 
;Aug. 3 (Yi Su)  
 
 
: M. Galley, K. McKeown ,
 
: M. Galley, K. McKeown ,
  [http://acl.ldc.upenn.edu/N/N07/N07-1023.pdf  Lexicalized Markov Grammars for Sentence Compression.] ,NAACL-HLT 2007
+
  [http://acl.ldc.upenn.edu/N/N07/N07-1023.pdf  Lexicalized Markov Grammars for Sentence Compression.] ,  
 +
NAACL-HLT 2007
  
 
;Aug. 11 (Nikesh Garera)  
 
;Aug. 11 (Nikesh Garera)  
 
 
: L. Shen, G. Satta, A. Joshi. ,
 
: L. Shen, G. Satta, A. Joshi. ,
  [http://acl.ldc.upenn.edu/P/P07/P07-1096.pdf  Guided learning for bidirectional sequence classification] ,ACL 2007
+
  [http://acl.ldc.upenn.edu/P/P07/P07-1096.pdf  Guided learning for bidirectional sequence classification] ,  
 +
ACL 2007
  
 
;Aug. 18 (Markus Dreyer)  
 
;Aug. 18 (Markus Dreyer)  
 
 
: D. Talbot, M. Osborne ,
 
: D. Talbot, M. Osborne ,
  [http://acl.ldc.upenn.edu/P/P07/P07-1065.pdf  Randomised Language Modelling for Statistical Machine Translation] ,ACL 2007
+
  [http://acl.ldc.upenn.edu/P/P07/P07-1065.pdf  Randomised Language Modelling for Statistical Machine Translation] ,  
 
+
ACL 2007
  
 
: They use a space-efficient randomized data structure (Bloom Filter) to store very large n-gram models. ,
 
: They use a space-efficient randomized data structure (Bloom Filter) to store very large n-gram models. ,
  There is a companion paper that people might want to have a quick look at as well, for comparison: ,D. Talbot, M. Osborne
+
  There is a companion paper that people might want to have a quick look at as well, for comparison: ,  
 +
D. Talbot, M. Osborne
  
 
[http://acl.ldc.upenn.edu/D/D07/D07-1049.pdf Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap]
 
[http://acl.ldc.upenn.edu/D/D07/D07-1049.pdf Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap]
Line 157: Line 153:
  
 
;Aug. 30 (Delip Rao)  
 
;Aug. 30 (Delip Rao)  
 
 
: Gideon S. Mann ,
 
: Gideon S. Mann ,
  [http://imls.engr.oregonstate.edu/www/htdocs/proceedings/icml2007/papers/441.pdf  Simple, Robust, Scalable Semi-supervised Learning via Expectation Regularization] ,Proceedings of the 24 th International Conference on Machine Learning 2007
+
  [http://imls.engr.oregonstate.edu/www/htdocs/proceedings/icml2007/papers/441.pdf  Simple, Robust, Scalable Semi-supervised Learning via Expectation Regularization] ,  
 
+
Proceedings of the 24 th International Conference on Machine Learning 2007
  
  
  
 
: } ,
 
: } ,
  ==  Spring 2007 == ,Topics:
+
  ==  Spring 2007 == ,  
 +
Topics:
 
* Morphology (unsupervised learning)
 
* Morphology (unsupervised learning)
 
* Recent IR/QA papers (with an NLP or multilingual focus)
 
* Recent IR/QA papers (with an NLP or multilingual focus)
Line 177: Line 173:
 
!  Supporting Papers/Notes
 
!  Supporting Papers/Notes
 
;Apr. 19 (John Blatz)  
 
;Apr. 19 (John Blatz)  
 
 
: A. Prieditis ,
 
: A. Prieditis ,
  [http://www.cs.jhu.edu/~jblatz/nlp-reading-group/prieditis93.pdf  Machine discovery of Effective Admissible Heuristics ] , Machine Learning Journal, 1993
+
  [http://www.cs.jhu.edu/~jblatz/nlp-reading-group/prieditis93.pdf  Machine discovery of Effective Admissible Heuristics ] ,  
 +
Machine Learning Journal, 1993
  
 
;Apr. 12 (Markus Dreyer)  
 
;Apr. 12 (Markus Dreyer)  
 
 
: A. Haghighi, J. DeNero and D. Klein ,
 
: A. Haghighi, J. DeNero and D. Klein ,
  [http://www.eecs.berkeley.edu/~aria42/pubs/factor-astar-naacl07.pdf  Approximate Factoring for A* Search] , NAACL-HLT 2007
+
  [http://www.eecs.berkeley.edu/~aria42/pubs/factor-astar-naacl07.pdf  Approximate Factoring for A* Search] ,  
 +
NAACL-HLT 2007
  
 
;Mar. 29 & Apr. 5 (Zhifei Li)  
 
;Mar. 29 & Apr. 5 (Zhifei Li)  
 
 
: H. Daume III, J. Langford, and D. Marcu ,
 
: H. Daume III, J. Langford, and D. Marcu ,
  [http://pub.hal3.name/daume06searn.pdf    Search-based structured prediction.] , Machine Learning Journal, forthcoming
+
  [http://pub.hal3.name/daume06searn.pdf    Search-based structured prediction.] ,  
 +
Machine Learning Journal, forthcoming
  
 
;Mar. 8 (David Smith)  
 
;Mar. 8 (David Smith)  
 
 
: H. Daume III & D. Marcu ,
 
: H. Daume III & D. Marcu ,
  [http://pub.hal3.name/daume05laso.pdf    Learning as search optimization: approximate large margin methods for structured prediction.] , ICML 2005
+
  [http://pub.hal3.name/daume05laso.pdf    Learning as search optimization: approximate large margin methods for structured prediction.] ,  
 +
ICML 2005
  
 
;Mar. 1 (Wei Chen)  
 
;Mar. 1 (Wei Chen)  
 
 
: M. Kaisser, S. Scheible, and B. Webber ,
 
: M. Kaisser, S. Scheible, and B. Webber ,
  [http://trec.nist.gov/pubs/trec15/papers/udeinburgh.qa.final.pdf    Experiments at the University of Edinburgh for the TREC 2006 QA track.] , TREC-15
+
  [http://trec.nist.gov/pubs/trec15/papers/udeinburgh.qa.final.pdf    Experiments at the University of Edinburgh for the TREC 2006 QA track.] ,  
 
+
TREC-15
  
 
: They do some fairly deep interpretation of sentences, extracting their predicate-argument structure. ,
 
: They do some fairly deep interpretation of sentences, extracting their predicate-argument structure. ,
  |- ,|Feb. 22
+
  |- ,  
 +
|Feb. 22
 
|Eric Harley
 
|Eric Harley
 
 
: K. Kan Lo & W. Lam ,
 
: K. Kan Lo & W. Lam ,
  [http://trec.nist.gov/pubs/trec15/papers/cuhk.qa.final.pdf    Using Semantic Relations with World Knowledge for Question Answering] , TREC-15
+
  [http://trec.nist.gov/pubs/trec15/papers/cuhk.qa.final.pdf    Using Semantic Relations with World Knowledge for Question Answering] ,  
 +
TREC-15
  
 
;Feb. 15 (Nikhil Bojja)  
 
;Feb. 15 (Nikhil Bojja)  
 
 
: C. Monson et. al.  ,
 
: C. Monson et. al.  ,
  [http://acl.ldc.upenn.edu/acl2004/studentws/pdf/monson.pdf      Unsupervised Induction of Natural Language Morphology Inflection Classes] , ACL Student Workshop '04
+
  [http://acl.ldc.upenn.edu/acl2004/studentws/pdf/monson.pdf      Unsupervised Induction of Natural Language Morphology Inflection Classes] ,  
 +
ACL Student Workshop '04
  
 
;Feb. 8 (Delip Rao)  
 
;Feb. 8 (Delip Rao)  
 
 
: P. Schone and D. Jurafsky  ,
 
: P. Schone and D. Jurafsky  ,
  [http://acl.ldc.upenn.edu/W/W00/W00-0712.pdf      Knowledge-free induction of morphology using latent semantic analysis ] , CoNLL 2000
+
  [http://acl.ldc.upenn.edu/W/W00/W00-0712.pdf      Knowledge-free induction of morphology using latent semantic analysis ] ,  
 
+
CoNLL 2000
 
: However, there was an extension of this work reported in NAACL-2001 that looks at circumfixes and prefix/affix combinations. [http://www.stanford.edu/people/jurafsky/NAACL2001_Morphology_final.pdf] ,
 
: However, there was an extension of this work reported in NAACL-2001 that looks at circumfixes and prefix/affix combinations. [http://www.stanford.edu/people/jurafsky/NAACL2001_Morphology_final.pdf] ,
  |- ,|Feb. 1
+
  |- ,  
 +
|Feb. 1
 
|Nikesh Garera
 
|Nikesh Garera
 
 
: D. Yarowsky and R. Wicentowski  ,
 
: D. Yarowsky and R. Wicentowski  ,
  [http://www.cs.swarthmore.edu/~richardw/pubs/acl2000.ps      Minimally supervised morphological analysis by multimodal alignment ] , ACL 2000
+
  [http://www.cs.swarthmore.edu/~richardw/pubs/acl2000.ps      Minimally supervised morphological analysis by multimodal alignment ] ,  
 
+
ACL 2000
  
 
: For more details refer to  [http://www.cs.swarthmore.edu/~richardw/pubs/thesis.pdf Chapter 4]  of Wicentowski's thesis. ,
 
: For more details refer to  [http://www.cs.swarthmore.edu/~richardw/pubs/thesis.pdf Chapter 4]  of Wicentowski's thesis. ,
  |} ,==  Fall 2006 ==
+
  |} ,  
 +
==  Fall 2006 ==
  
 
Topics:
 
Topics:
Line 242: Line 238:
 
!  Supporting Papers/Notes
 
!  Supporting Papers/Notes
 
;Dec. 13 (Delip Rao)  
 
;Dec. 13 (Delip Rao)  
 
 
: J. Carbonell et. al. ,
 
: J. Carbonell et. al. ,
  [http://www.mt-archive.info/AMTA-2006-Carbonell.pdf  Context-based machine translation] , AMTA 2006
+
  [http://www.mt-archive.info/AMTA-2006-Carbonell.pdf  Context-based machine translation] ,  
 +
AMTA 2006
  
 
;Dec. 6 (Jason Smith)  
 
;Dec. 6 (Jason Smith)  
 
 
: M. Galley et. al.  ,
 
: M. Galley et. al.  ,
  [http://www.cs.columbia.edu/nlp/papers/2006/galley_al_06.pdf    Scalable Inference and Training of Context-Rich Syntactic Translation Models] , ACL 2006
+
  [http://www.cs.columbia.edu/nlp/papers/2006/galley_al_06.pdf    Scalable Inference and Training of Context-Rich Syntactic Translation Models] ,  
 
+
ACL 2006
 
: It may also be helpful to look at: ,
 
: It may also be helpful to look at: ,
  M. Galley et. al. ,
+
  M. Galley et. al. ,  
 +
 
[http://www.isi.edu/natural-language/projects/rewrite/whatsin.pdf What's in a translation rule?]
 
[http://www.isi.edu/natural-language/projects/rewrite/whatsin.pdf What's in a translation rule?]
 
 
Line 259: Line 255:
  
 
;Nov. 29 (Balakrishnan V)  
 
;Nov. 29 (Balakrishnan V)  
 
 
: D. Marcu et. al. ,
 
: D. Marcu et. al. ,
  [http://www.isi.edu/~marcu/papers/spmt-emnlp06.pdf    SPMT: Statistical Machine Translation with Syntactified Target Language Phrases ] , EMNLP 2006
+
  [http://www.isi.edu/~marcu/papers/spmt-emnlp06.pdf    SPMT: Statistical Machine Translation with Syntactified Target Language Phrases ] ,  
 +
EMNLP 2006
  
 
;Nov. 15 (Eric Harley)  
 
;Nov. 15 (Eric Harley)  
 
 
: D. Chiang  ,
 
: D. Chiang  ,
  [http://www.isi.edu/~chiang/papers/synchtut.pdf    An introduction to synchronous grammars] , ACL 2006 Tutorial
+
  [http://www.isi.edu/~chiang/papers/synchtut.pdf    An introduction to synchronous grammars] ,  
 
+
ACL 2006 Tutorial
 
: Slides from the talk are also available. [http://www.isi.edi/~chiang/papers/synchtut-slides.pdf] ,
 
: Slides from the talk are also available. [http://www.isi.edi/~chiang/papers/synchtut-slides.pdf] ,
  |- ,|Nov. 8
+
  |- ,  
 +
|Nov. 8
 
|Elliott Drabek
 
|Elliott Drabek
 
 
: K.Shklovsky  ,
 
: K.Shklovsky  ,
  [http://nlp.cs.jhu.edu/~edrabek/grammatical-sketch/tzeltal.pdf    A Grammatical Sketch of Petalcingo Tzeltal] , Undergraduate Thesis, Reed College, 2005
+
  [http://nlp.cs.jhu.edu/~edrabek/grammatical-sketch/tzeltal.pdf    A Grammatical Sketch of Petalcingo Tzeltal] ,  
 
+
Undergraduate Thesis, Reed College, 2005
  
 
: It is 77 pages long, but not dense, and I will be skipping the following sections: ,
 
: It is 77 pages long, but not dense, and I will be skipping the following sections: ,
  Pages , 01-14 Phonetics and phonology
+
  Pages ,  
 +
01-14 Phonetics and phonology
  
 
18-18 Polyvalence
 
18-18 Polyvalence
Line 286: Line 282:
  
 
;Nov. 1 (Yi Su)  
 
;Nov. 1 (Yi Su)  
 
 
: M. Steedman ,
 
: M. Steedman ,
  Gapping as Constituent Coordination , Linguistics and Philosophy, Vol. 13, 1990, pp.207-264.
+
  Gapping as Constituent Coordination ,  
 
+
Linguistics and Philosophy, Vol. 13, 1990, pp.207-264.
  
 
: See Yi for photocopies. ,
 
: See Yi for photocopies. ,
  |- ,|Oct. 25
+
  |- ,  
 +
|Oct. 25
 
|Markus Dreyer
 
|Markus Dreyer
 
 
: S. Reizler et. al.  ,
 
: S. Reizler et. al.  ,
  [http://acl.ldc.upenn.edu/P/P02/P02-1035.pdf      Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques] , ACL 2002
+
  [http://acl.ldc.upenn.edu/P/P02/P02-1035.pdf      Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques] ,  
 +
ACL 2002
  
  
  
 
;Oct. 18 (Erin Fitzgerald)  
 
;Oct. 18 (Erin Fitzgerald)  
 
 
: J. Bresnan & R.M. Kaplan  ,
 
: J. Bresnan & R.M. Kaplan  ,
  [http://www.cs.jhu.edu/~jblatz/nlp-reading-group/bresnan-kaplan-1982.pdf      Lexical-Functional Grammar: A Formal System for Grammatical Representation ] , The Mental Representation of Grammatical Relations, MIT Press, 1982
+
  [http://www.cs.jhu.edu/~jblatz/nlp-reading-group/bresnan-kaplan-1982.pdf      Lexical-Functional Grammar: A Formal System for Grammatical Representation ] ,  
 
+
The Mental Representation of Grammatical Relations, MIT Press, 1982
 
: BTW, the edited collection that this appears in is generally interesting. Bresnan defends and develops lexicalized grammars in general; the idea of separate surface and semantic roles; and Bresnan & Kaplan's LFG in particular. You should know that she originated (in 1978) the extremely influential idea of lexicalized syntax -- the idea that a grammar is simply a collection of lexical entries to be assembled in standard language-independent ways, but that there are also "lexical redundancy rules" that relate, e.g., active and passive entries for the same verb. Some chapters address morphological and cognitive issues pertaining to lexicalization, including an essay by Pinker on lexicalist learning. ,
 
: BTW, the edited collection that this appears in is generally interesting. Bresnan defends and develops lexicalized grammars in general; the idea of separate surface and semantic roles; and Bresnan & Kaplan's LFG in particular. You should know that she originated (in 1978) the extremely influential idea of lexicalized syntax -- the idea that a grammar is simply a collection of lexical entries to be assembled in standard language-independent ways, but that there are also "lexical redundancy rules" that relate, e.g., active and passive entries for the same verb. Some chapters address morphological and cognitive issues pertaining to lexicalization, including an essay by Pinker on lexicalist learning. ,
  Slides from Erin's presentation can be found [http://www.clsp.jhu.edu/~erin/presentations/LFG.ppt here]. ,
+
  Slides from Erin's presentation can be found [http://www.clsp.jhu.edu/~erin/presentations/LFG.ppt here]. ,  
 +
 
;Oct. 11 (John Blatz)  
 
;Oct. 11 (John Blatz)  
 
 
: L.Xu, D. Wilkinson, F. Southey, & D. Schuurmans  ,
 
: L.Xu, D. Wilkinson, F. Southey, & D. Schuurmans  ,
  [http://www.cs.jhu.edu/~jblatz/nlp-reading-group/xu_et_al_ICML_2006.pdf      Discriminative Unsupervised Learning of Structured Predictors ] , ICML 2006
+
  [http://www.cs.jhu.edu/~jblatz/nlp-reading-group/xu_et_al_ICML_2006.pdf      Discriminative Unsupervised Learning of Structured Predictors ] ,  
 +
ICML 2006
  
 
;Oct. 4 (Nikesh Garera)  
 
;Oct. 4 (Nikesh Garera)  
 
 
: A. Culotta & J. Sorensen    ,
 
: A. Culotta & J. Sorensen    ,
  [http://acl.ldc.upenn.edu/acl2004/main/pdf/244_pdf_2-col.pdf      Dependency Tree Kernels for Relation Extraction ] , ACL 2004
+
  [http://acl.ldc.upenn.edu/acl2004/main/pdf/244_pdf_2-col.pdf      Dependency Tree Kernels for Relation Extraction ] ,  
 +
ACL 2004
 
-----
 
-----
 
D. Zelenko, C. Aone, & A. Richardella  
 
D. Zelenko, C. Aone, & A. Richardella  
Line 324: Line 320:
  
 
;Sept. 27 (David Smith)  
 
;Sept. 27 (David Smith)  
 
 
: C. Cortes, P. Haffner, & M. Mohri    ,
 
: C. Cortes, P. Haffner, & M. Mohri    ,
  [http://www.cs.nyu.edu/~mohri/postscript/kernel.ps      Rational Kernels ] , NIPS 2003
+
  [http://www.cs.nyu.edu/~mohri/postscript/kernel.ps      Rational Kernels ] ,  
 
+
NIPS 2003
 
: Papers extending rational kernels, including results on positive semidefinite cases, are at:[http://www.cs.nyu.edu/~mohri/rational.html] ,
 
: Papers extending rational kernels, including results on positive semidefinite cases, are at:[http://www.cs.nyu.edu/~mohri/rational.html] ,
  For the record, and not to be read, is an interesting parallel line of research in Fisher Kernels over strings, e.g. this paper by Saunders, Shawe-Taylor and Vinokourov: [http://citeseer.ist.psu.edu/524921.html] ,|-
+
  For the record, and not to be read, is an interesting parallel line of research in Fisher Kernels over strings, e.g. this paper by Saunders, Shawe-Taylor and Vinokourov: [http://citeseer.ist.psu.edu/524921.html] ,  
|Sept. 20
+
;Sept. 20 (Elliot Drabek)
|Elliot Drabek
 
 
 
 
: K.Q. Weinberger, F. Sha, & L.K. Saul      ,
 
: K.Q. Weinberger, F. Sha, & L.K. Saul      ,
  [http://www.cs.berkeley.edu/~feisha/pubs/learning_kernel04.pdf      Learning a kernel matrix for nonlinear dimensionality reduction ] , ICML 2004
+
  [http://www.cs.berkeley.edu/~feisha/pubs/learning_kernel04.pdf      Learning a kernel matrix for nonlinear dimensionality reduction ] ,  
 
+
ICML 2004
  
 
: S.T. Roweis & L.K. Saul      ,
 
: S.T. Roweis & L.K. Saul      ,
  [http://www.sciencemag.org/cgi/reprint/290/5500/2323.pdf      Nonlinear Dimensionality Reduction by Locally Linear Embedding ] , Science, 22 December 2000
+
  [http://www.sciencemag.org/cgi/reprint/290/5500/2323.pdf      Nonlinear Dimensionality Reduction by Locally Linear Embedding ] ,  
 +
Science, 22 December 2000
 
-----
 
-----
 
J.B. Tenenbaum, V. De Silva, & J.C. Langford   
 
J.B. Tenenbaum, V. De Silva, & J.C. Langford   
Line 347: Line 341:
  
 
;Sept. 13 (Roy Tromble)  
 
;Sept. 13 (Roy Tromble)  
 
 
: L. Xu, J. Neufeld, B. Larson, & D. Schuurmans      ,
 
: L. Xu, J. Neufeld, B. Larson, & D. Schuurmans      ,
  [http://books.nips.cc/papers/files/nips17/NIPS2004_0834.pdf      Maximum Margin Clustering ] , NIPS 2004
+
  [http://books.nips.cc/papers/files/nips17/NIPS2004_0834.pdf      Maximum Margin Clustering ] ,  
 
+
NIPS 2004
  
 
: } ,
 
: } ,
  ==  Summer 2006 == ,Topics:
+
  ==  Summer 2006 == ,  
 +
Topics:
 
*  Recent HLT-NAACL papers
 
*  Recent HLT-NAACL papers
  
Line 363: Line 357:
  
 
;Jun. 24 (David Smith)  
 
;Jun. 24 (David Smith)  
 
 
: Percy Liang, Ben Taskar, Dan Klein ,
 
: Percy Liang, Ben Taskar, Dan Klein ,
  [http://www.eecs.berkeley.edu/~pliang/papers/alignment-naacl2006.pdf Alignment by Agreement] , HLT-NAACL, 2006
+
  [http://www.eecs.berkeley.edu/~pliang/papers/alignment-naacl2006.pdf Alignment by Agreement] ,  
 +
HLT-NAACL, 2006
  
 
;Jun. 31 (Markus Dreyer)  
 
;Jun. 31 (Markus Dreyer)  
 
 
: Joakim Nivre, Johan Hall et al ,
 
: Joakim Nivre, Johan Hall et al ,
  [http://www.cnts.ua.ac.be/conll/pdf/22124.pdf Labeled Pseudo-Projective Dependency Parsing with Support Vector Machines] , Procceding of CoNLL, 2006
+
  [http://www.cnts.ua.ac.be/conll/pdf/22124.pdf Labeled Pseudo-Projective Dependency Parsing with Support Vector Machines] ,  
 +
Procceding of CoNLL, 2006
  
 
----
 
----
Line 381: Line 375:
  
 
;Jul. 6 (Keith Hall)  
 
;Jul. 6 (Keith Hall)  
 
 
: Charles Sutton, Michael Sindelar, Andrew McCallum ,
 
: Charles Sutton, Michael Sindelar, Andrew McCallum ,
  [http://www.cs.umass.edu/~casutton/publications/bags-hlt2006.pdf Reducing Weight Undertraining in Structured Discriminative Learning] , HLT-NAACL, 2006
+
  [http://www.cs.umass.edu/~casutton/publications/bags-hlt2006.pdf Reducing Weight Undertraining in Structured Discriminative Learning] ,  
 +
HLT-NAACL, 2006
  
 
;Jul. 20 (Roy Tromble)  
 
;Jul. 20 (Roy Tromble)  
 
 
: Mehryar Mohri, Brian Roark ,
 
: Mehryar Mohri, Brian Roark ,
  [http://www.cslu.ogi.edu/people/roark/spcfg.pdf  Probabilistic Context-Free Grammar Induction Based on Structural Zeros] , HLT-NAACL, 2006
+
  [http://www.cslu.ogi.edu/people/roark/spcfg.pdf  Probabilistic Context-Free Grammar Induction Based on Structural Zeros] ,  
 +
HLT-NAACL, 2006
  
 
;Aug. 4 (David Smith)  
 
;Aug. 4 (David Smith)  
 
 
: Sharon Goldwater, Thomas L. Griffiths, Mark Johnson ,
 
: Sharon Goldwater, Thomas L. Griffiths, Mark Johnson ,
  [http://acl.ldc.upenn.edu/P/P06/P06-1085.pdf  Contextual Dependencies in Unsupervised Word Segmentation] , ACL 2006
+
  [http://acl.ldc.upenn.edu/P/P06/P06-1085.pdf  Contextual Dependencies in Unsupervised Word Segmentation] ,  
 
+
ACL 2006
 
: Anyone looking for a more straight-up language modeling discussion can compare: ,
 
: Anyone looking for a more straight-up language modeling discussion can compare: ,
  Yee Whye Teh , [http://portal.acm.org/ft_gateway.cfm?id=1220299&type=pdf&coll=GUIDE&dl=GUIDE&CFID=15174251&CFTOKEN=31671821 A Hierarchical Bayesian Language Model Based On Pitman-Yor Processes]
+
  Yee Whye Teh ,  
 +
[http://portal.acm.org/ft_gateway.cfm?id=1220299&type=pdf&coll=GUIDE&dl=GUIDE&CFID=15174251&CFTOKEN=31671821 A Hierarchical Bayesian Language Model Based On Pitman-Yor Processes]
  
 
ACL 2006
 
ACL 2006
Line 412: Line 406:
  
 
Journal of the American Statistical Association, 2006
 
Journal of the American Statistical Association, 2006
 
  
  
 
: } ,
 
: } ,
  ==  Spring 2006 == ,Topics:
+
  ==  Spring 2006 == ,  
 +
Topics:
 
* Consensus decoding
 
* Consensus decoding
 
* Miscellous extraction (idioms)
 
* Miscellous extraction (idioms)
Line 429: Line 423:
  
 
;Feb. 9 (John Blatz)  
 
;Feb. 9 (John Blatz)  
 
 
: Dominic Widdows, Beate Dorow ,
 
: Dominic Widdows, Beate Dorow ,
  [http://acl.ldc.upenn.edu/W/W05/W05-1006.pdf Automatic Extraction of Idioms using Graph Analysis and Asymmetric Lexicosyntactic Patterns] , Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition, 2005
+
  [http://acl.ldc.upenn.edu/W/W05/W05-1006.pdf Automatic Extraction of Idioms using Graph Analysis and Asymmetric Lexicosyntactic Patterns] ,  
 +
Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition, 2005
  
 
----
 
----
Line 442: Line 436:
  
 
;Feb. 16 (Noah A Smith)  
 
;Feb. 16 (Noah A Smith)  
 
 
: Khalil Sima'an ,
 
: Khalil Sima'an ,
  [http://arxiv.org/abs/cmp-lg/9606019 Computational Complexity of Probabilistic Disambiguation by means of Tree-Grammars] , COLING 1996
+
  [http://arxiv.org/abs/cmp-lg/9606019 Computational Complexity of Probabilistic Disambiguation by means of Tree-Grammars] ,  
 +
COLING 1996
  
 
----
 
----
Line 453: Line 447:
  
 
LNAI 1981
 
LNAI 1981
 
  
 
: For more HMM/Comp, bio view, and extended results view: ,
 
: For more HMM/Comp, bio view, and extended results view: ,
  Rune B. Lyngsoe, Christian N. S. Pederson , The Consensus String Problem and the Complexity of Comparing Hidden
+
  Rune B. Lyngsoe, Christian N. S. Pederson ,  
 +
The Consensus String Problem and the Complexity of Comparing Hidden
 
   
 
   
 
Journal of Computer and System Sciences 65, 2002
 
Journal of Computer and System Sciences 65, 2002
Line 463: Line 457:
  
 
;Feb. 23 (Omar F. Zaidan)  
 
;Feb. 23 (Omar F. Zaidan)  
 
 
: Ravichandran, Pantel, Hovy ,
 
: Ravichandran, Pantel, Hovy ,
  [http://arxiv.org/abs/cmp-lg/9606019 Randomized Algorithms and NLP: Using Locality Sensitive Hash Function for High Speed Noun Clustering] , Proceedings of the 43rd Annual Meeting of the ACL, 2005
+
  [http://arxiv.org/abs/cmp-lg/9606019 Randomized Algorithms and NLP: Using Locality Sensitive Hash Function for High Speed Noun Clustering] ,  
 +
Proceedings of the 43rd Annual Meeting of the ACL, 2005
  
 
----
 
----
Line 476: Line 470:
  
 
;Mar.3 (Jason Riesa)  
 
;Mar.3 (Jason Riesa)  
 
 
: Hal Daume III, Daniel Marcu ,
 
: Hal Daume III, Daniel Marcu ,
  [http://www.isi.edu/~hdaume/docs/daume06megam.pdf Domain Adaptation for Statistical Classifiers] , Journal of Artificial Intelligence Research, 2006
+
  [http://www.isi.edu/~hdaume/docs/daume06megam.pdf Domain Adaptation for Statistical Classifiers] ,  
 +
Journal of Artificial Intelligence Research, 2006
  
 
;Mar.10 (Roy Tromble)  
 
;Mar.10 (Roy Tromble)  
 
 
: Terry Koo, Michael Collins ,
 
: Terry Koo, Michael Collins ,
  [http://www.aclweb.org/anthology/H/H05/H05-1064 Hidden-Variable Models for Discriminative Reranking] , Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, 2005
+
  [http://www.aclweb.org/anthology/H/H05/H05-1064 Hidden-Variable Models for Discriminative Reranking] ,  
 +
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, 2005
  
 
;Mar.17 (Elliott Franco Drabek)  
 
;Mar.17 (Elliott Franco Drabek)  
 
 
: Necip Fazil Ayan, Bonnie J. Dorr, Christof Monz ,
 
: Necip Fazil Ayan, Bonnie J. Dorr, Christof Monz ,
  [http://www.cs.umd.edu/~nfa/Publications/ayan-emnlp05-alp.pdf Alignment Link Projection Using Transformation-Based Learning] , HLT-EMNLP 2005
+
  [http://www.cs.umd.edu/~nfa/Publications/ayan-emnlp05-alp.pdf Alignment Link Projection Using Transformation-Based Learning] ,  
 +
HLT-EMNLP 2005
  
  
 
;Mar.31 (Eric Harley)  
 
;Mar.31 (Eric Harley)  
 
 
: Ben Taskar, Lacoste-Julien Simon, Klein Dan ,
 
: Ben Taskar, Lacoste-Julien Simon, Klein Dan ,
  [http://acl.ldc.upenn.edu/H/H05/H05-1010.pdf A Discriminative Matching Approach to Word Alignment] , ACL 2005
+
  [http://acl.ldc.upenn.edu/H/H05/H05-1010.pdf A Discriminative Matching Approach to Word Alignment] ,  
 +
ACL 2005
 
 
 
;Apr.6 (Eric Harley)  
 
;Apr.6 (Eric Harley)  
 
 
: Ben Taskar, Lacoste-Julien Simon, Klein Dan ,
 
: Ben Taskar, Lacoste-Julien Simon, Klein Dan ,
  [http://acl.ldc.upenn.edu/H/H05/H05-1010.pdf A Discriminative Matching Approach to Word Alignment] , ACL 2005
+
  [http://acl.ldc.upenn.edu/H/H05/H05-1010.pdf A Discriminative Matching Approach to Word Alignment] ,  
 +
ACL 2005
 
----
 
----
 
 
Line 510: Line 504:
  
 
;Apr. 20 (Balakrishnan V)  
 
;Apr. 20 (Balakrishnan V)  
 
 
: Richard M. Karp, Michael 0. Rabin ,
 
: Richard M. Karp, Michael 0. Rabin ,
  [http://www.research.ibm.com/journal/rd/312/ibmrd3102P.pdf Efficient randomized Pattern matching Algorithms] , IBM Journal of Research and Development, 1987
+
  [http://www.research.ibm.com/journal/rd/312/ibmrd3102P.pdf Efficient randomized Pattern matching Algorithms] ,  
 +
IBM Journal of Research and Development, 1987
  
 
;May 4 (David Smith)  
 
;May 4 (David Smith)  
 
 
: C. E. R. Alves, E. N. C′aceres F. Dehne ,
 
: C. E. R. Alves, E. N. C′aceres F. Dehne ,
  [http://citeseer.ist.psu.edu/724170.html Parallel dynamic programming for solving the string editing problem on a CGM/BSP] , SPAA 2002
+
  [http://citeseer.ist.psu.edu/724170.html Parallel dynamic programming for solving the string editing problem on a CGM/BSP] ,  
 +
SPAA 2002
  
 
;May 11 (John Blatz)  
 
;May 11 (John Blatz)  
 
 
: M. Gengler ,
 
: M. Gengler ,
  [http://www.cs.jhu.edu/~jblatz/gengler.pdf An introduction to parallel dynamic programming] , Lecture Notes in Computer Science, 1996
+
  [http://www.cs.jhu.edu/~jblatz/gengler.pdf An introduction to parallel dynamic programming] ,  
 +
Lecture Notes in Computer Science, 1996
  
 
;May 18 (Markus Dreyer)  
 
;May 18 (Markus Dreyer)  
 
 
: Jonathan May, Kevin Knight ,
 
: Jonathan May, Kevin Knight ,
  [http://www.isi.edu/~jonmay/pubs/naacl06.pdf A Better N-Best List: Practical Determinization of Weighted Finite Tree Automata] , Proc. NAACL-HLT, 2006
+
  [http://www.isi.edu/~jonmay/pubs/naacl06.pdf A Better N-Best List: Practical Determinization of Weighted Finite Tree Automata] ,  
 
+
Proc. NAACL-HLT, 2006
  
 
: } ,
 
: } ,
  ==  Fall 2005 == ,{| style="width:800px" border="1"
+
  ==  Fall 2005 == ,  
 +
{| style="width:800px" border="1"
 
!  width="10%"|Date/Time  
 
!  width="10%"|Date/Time  
 
!  width="10%"|Presenter  
 
!  width="10%"|Presenter  
Line 538: Line 532:
  
 
;Sept. 14 (Nikesh Garera)  
 
;Sept. 14 (Nikesh Garera)  
 
 
: M. Jordan      ,
 
: M. Jordan      ,
  Statistical Learning Theory Chapter 8 (Exponential family and Generalized linear models) ,|-
+
  Statistical Learning Theory Chapter 8 (Exponential family and Generalized linear models) ,  
|Sept. 21
+
;Sept. 21 (Arnab Ghoshal)
|Arnab Ghoshal
 
 
 
 
: M. Jordan      ,
 
: M. Jordan      ,
  Statistical Learning Theory Chapter 2&3 ,|-
+
  Statistical Learning Theory Chapter 2&3 ,  
|Oct. 20
+
;Oct. 20 (Roy Tromble)
|Roy Tromble
 
 
 
 
: Sheila M. Reynolds, Jeff A. Bilmes    ,
 
: Sheila M. Reynolds, Jeff A. Bilmes    ,
  [http://ssli.ee.washington.edu/people/bilmes/mypapers/sheila-hlt05.pdf Part-of-Speech Tagging using Virtual Evidence and Negative Training.] , Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing.  2005.  pp 459--466.
+
  [http://ssli.ee.washington.edu/people/bilmes/mypapers/sheila-hlt05.pdf Part-of-Speech Tagging using Virtual Evidence and Negative Training.] ,  
 +
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing.  2005.  pp 459--466.
  
 
;Oct. 27 (Markus Dreyer)  
 
;Oct. 27 (Markus Dreyer)  
 
 
: D. Roth and W. Yih  ,
 
: D. Roth and W. Yih  ,
  [http://l2r.cs.uiuc.edu/~danr/Papers/RothYi05.pdf Integer Linear Programming Inference for Conditional Random Fields.] , ICML '2005
+
  [http://l2r.cs.uiuc.edu/~danr/Papers/RothYi05.pdf Integer Linear Programming Inference for Conditional Random Fields.] ,  
 +
ICML '2005
  
  
 
;Nov. 4 (Jason Riesa)  
 
;Nov. 4 (Jason Riesa)  
 
 
: Luke S. Zettlemoyer, Michael Collins.  ,
 
: Luke S. Zettlemoyer, Michael Collins.  ,
  [http://people.csail.mit.edu/lsz/papers/uai05.pdf  Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial] , Proceedings of UAI 2005
+
  [http://people.csail.mit.edu/lsz/papers/uai05.pdf  Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial] ,  
 +
Proceedings of UAI 2005
  
 
;Nov. 16 (Safiullah Shareef)  
 
;Nov. 16 (Safiullah Shareef)  
 
 
: Hassan Sawaf, J?rg Zaplo, Hermann Ney ,
 
: Hassan Sawaf, J?rg Zaplo, Hermann Ney ,
  [http://www.elsnet.org/arabic2001/sawaf.pdf  Statistical Classification Methods for Arabic News Articles] ,|-
+
  [http://www.elsnet.org/arabic2001/sawaf.pdf  Statistical Classification Methods for Arabic News Articles] ,  
|Nov. 23
+
;Nov. 23 (Roy Tromble)
|Roy Tromble
 
 
 
 
: Sutton, Charles and McCallum, Andrew ,
 
: Sutton, Charles and McCallum, Andrew ,
  [http://www.aclweb.org/anthology/H/H05/H05-1094  Composition of Conditional Random Fields for Transfer Learning] , Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing 2005
+
  [http://www.aclweb.org/anthology/H/H05/H05-1094  Composition of Conditional Random Fields for Transfer Learning] ,  
 
+
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing 2005
  
 
: } ,
 
: } ,
  ==  Summer 2005 == ,Topics:
+
  ==  Summer 2005 == ,  
 +
Topics:
  
 
*Recent papers on ACL / CoNLL / parallel-text workshop
 
*Recent papers on ACL / CoNLL / parallel-text workshop
Line 596: Line 584:
  
 
;July 14 (Roy Tromble)  
 
;July 14 (Roy Tromble)  
 
 
: Goldwater and Johnson ,
 
: Goldwater and Johnson ,
  [http://www.cog.brown.edu:16080/~sgwater/papers/OTvar03.pdf Learning OT Constraint Rankings Using a Maximum  ,Entropy Model]
+
  [http://www.cog.brown.edu:16080/~sgwater/papers/OTvar03.pdf Learning OT Constraint Rankings Using a Maximum  ,  
 +
Entropy Model]
  
 
In Proceedings of the Workshop on Variation within Optimality Theory, 2003
 
In Proceedings of the Workshop on Variation within Optimality Theory, 2003
Line 604: Line 592:
  
 
;July 21 (Keith and Damianos)  
 
;July 21 (Keith and Damianos)  
 
 
: Sharon Goldwater, Mark Johnson ,
 
: Sharon Goldwater, Mark Johnson ,
  [http://www.aclweb.org/anthology/W/W05/W05-0615 Representational Bias in Unsupervised Learning of Syllable  ,Structure]
+
  [http://www.aclweb.org/anthology/W/W05/W05-0615 Representational Bias in Unsupervised Learning of Syllable  ,  
 +
Structure]
  
 
ACL 2005
 
ACL 2005
Line 622: Line 610:
  
 
;July 28 (Zak)  
 
;July 28 (Zak)  
 
 
: Takuya Matsuzaki, Yusuke Miyao, Jun'ichi Tsujii ,
 
: Takuya Matsuzaki, Yusuke Miyao, Jun'ichi Tsujii ,
  [http://www.aclweb.org/anthology/P/P05/P05-1010  Probabilistic CFG with Latent Annotations] , ACL 2005
+
  [http://www.aclweb.org/anthology/P/P05/P05-1010  Probabilistic CFG with Latent Annotations] ,  
 +
ACL 2005
  
 
;Aug 5 (Adam)  
 
;Aug 5 (Adam)  
 
 
: Duh, Kevin  and  Kirchhoff, Katrin ,
 
: Duh, Kevin  and  Kirchhoff, Katrin ,
  [http://www.aclweb.org/anthology/W/W05/W05-0708 Tagging of Dialectal Arabic: A Minimally Supervised  ,Approach]
+
  [http://www.aclweb.org/anthology/W/W05/W05-0708 Tagging of Dialectal Arabic: A Minimally Supervised  ,  
 +
Approach]
  
 
ACL 2005
 
ACL 2005
Line 635: Line 623:
  
 
;Aug 19 (John Blatz)  
 
;Aug 19 (John Blatz)  
 
 
: Niyogi, Sourabh ,
 
: Niyogi, Sourabh ,
  [http://www.aclweb.org/anthology/W/W05/W05-0511  Steps Toward Deep Lexical Acquisition] , ACL 2005
+
  [http://www.aclweb.org/anthology/W/W05/W05-0511  Steps Toward Deep Lexical Acquisition] ,  
 +
ACL 2005
  
 
;Aug 26 (Roy Tromble)  
 
;Aug 26 (Roy Tromble)  
 
 
: Jenny Rose Finkel, Trond Grenager, Christopher Manning ,
 
: Jenny Rose Finkel, Trond Grenager, Christopher Manning ,
  [http://www.aclweb.org/anthology/W/W05/W05-0511  Incorporating Non-local Information into Information  ,Extraction Systems by Gibbs Sampling]
+
  [http://www.aclweb.org/anthology/W/W05/W05-0511  Incorporating Non-local Information into Information  ,  
 +
Extraction Systems by Gibbs Sampling]
  
 
ACL 2005
 
ACL 2005
  
 
;Sep.1 (Markus Nikesh, John Blatz )  
 
;Sep.1 (Markus Nikesh, John Blatz )  
 
 
: B. Walsh ,
 
: B. Walsh ,
  [http://nitro.biosci.arizona.edu/courses/EEB581-2004/handouts/Gibbs.pdf  Markov Chain Monte Carlo ,and Gibbs Sampling]
+
  [http://nitro.biosci.arizona.edu/courses/EEB581-2004/handouts/Gibbs.pdf  Markov Chain Monte Carlo ,  
 +
and Gibbs Sampling]
  
 
Lecture Notes for EEB 581, version 26 April 2004
 
Lecture Notes for EEB 581, version 26 April 2004
 
  
  
  
 
: } ,
 
: } ,
  ==  Spring 2005 == ,Topics:
+
  ==  Spring 2005 == ,  
 +
Topics:
  
 
* Bayesian Nets / inference (tutorials in Michael Jordan's book)
 
* Bayesian Nets / inference (tutorials in Michael Jordan's book)
Line 670: Line 658:
  
 
;Feb. 25 (David Smith)  
 
;Feb. 25 (David Smith)  
 
 
: M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
 
: M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
  [http://www.cs.berkeley.edu/~jordan/papers/variational-intro.ps.gz Learning in Graphical Models] ,
+
  [http://www.cs.berkeley.edu/~jordan/papers/variational-intro.ps.gz Learning in Graphical Models] ,  
 +
 
MIT Press, 1999
 
MIT Press, 1999
  
 
;Mar. 4 (David Smith)  
 
;Mar. 4 (David Smith)  
 
 
: M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
 
: M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
  [http://www.cs.berkeley.edu/~jordan/papers/variational-intro.ps.gz Learning in Graphical Models] ,
+
  [http://www.cs.berkeley.edu/~jordan/papers/variational-intro.ps.gz Learning in Graphical Models] ,  
 +
 
MIT Press, 1999
 
MIT Press, 1999
  
 
;Mar. 11 (David Smith)  
 
;Mar. 11 (David Smith)  
 
 
: M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
 
: M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
  [http://www.cs.berkeley.edu/~jordan/papers/variational-intro.ps.gz Learning in Graphical Models] ,
+
  [http://www.cs.berkeley.edu/~jordan/papers/variational-intro.ps.gz Learning in Graphical Models] ,  
 +
 
MIT Press, 1999
 
MIT Press, 1999
  
 
;Apr. 2 (David Smith)  
 
;Apr. 2 (David Smith)  
 
 
: M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
 
: M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
  [http://www.cs.berkeley.edu/~jordan/papers/variational-intro.ps.gz Learning in Graphical Models] ,
+
  [http://www.cs.berkeley.edu/~jordan/papers/variational-intro.ps.gz Learning in Graphical Models] ,  
 +
 
MIT Press, 1999
 
MIT Press, 1999
  
 
;Apr. 9 (Noah A Smith)  
 
;Apr. 9 (Noah A Smith)  
 
 
: G. Elidan and N. Friedman. ,
 
: G. Elidan and N. Friedman. ,
  [http://www.cs.huji.ac.il/~nirf/Abstracts/ElF2.html The Information Bottleneck EM Algorithm] ,
+
  [http://www.cs.huji.ac.il/~nirf/Abstracts/ElF2.html The Information Bottleneck EM Algorithm] ,  
 +
 
UAI 2003
 
UAI 2003
  
Line 707: Line 695:
  
 
;Apr. 16 (Noah A Smith)  
 
;Apr. 16 (Noah A Smith)  
 
 
: V. Lavrenko, S.L Feng, R. Manmatha ,
 
: V. Lavrenko, S.L Feng, R. Manmatha ,
  [http://ciir.cs.umass.edu/pubfiles/mm-325.pdf  Statistical models for automatic video annotation and retrieval] ,
+
  [http://ciir.cs.umass.edu/pubfiles/mm-325.pdf  Statistical models for automatic video annotation and retrieval] ,  
 +
 
Acoustics, Speech, and Signal Processing, 2004. Proceedings.
 
Acoustics, Speech, and Signal Processing, 2004. Proceedings.
  
 
;Apr. 21 (Omar F. Zaidan)  
 
;Apr. 21 (Omar F. Zaidan)  
 
 
: Tin Kam Ho, Jonathan J. Hull, Sargur N. Stihari ,
 
: Tin Kam Ho, Jonathan J. Hull, Sargur N. Stihari ,
  [http://www.crc.ricoh.com/~hull/pubs/ho_pami94.pdf  Decision Combination in Multiple Classifier Systems] ,
+
  [http://www.crc.ricoh.com/~hull/pubs/ho_pami94.pdf  Decision Combination in Multiple Classifier Systems] ,  
 +
 
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.16. No I. Jan. 1994
 
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.16. No I. Jan. 1994
  
Line 726: Line 714:
  
 
;Apr. 28 (Damianos Karakos)  
 
;Apr. 28 (Damianos Karakos)  
 
 
: Alessandro Moschitti and Roberto Basili ,
 
: Alessandro Moschitti and Roberto Basili ,
  [http://ai-nlp.info.uniroma2.it/moschitti/publications.htm  Complex Linguistic Features for Text Classification: a comprehensive study] ,
+
  [http://ai-nlp.info.uniroma2.it/moschitti/publications.htm  Complex Linguistic Features for Text Classification: a comprehensive study] ,  
 +
 
In proceedings of the 26th European Conference on Information Retrieval Research (ECIR 2004)
 
In proceedings of the 26th European Conference on Information Retrieval Research (ECIR 2004)
  
 
;May 7 (Markus Dreyer)  
 
;May 7 (Markus Dreyer)  
 
 
: M. Diligenti, F.M. Coetzee, S. Lawrence, C.L. Giles, M. Gori ,
 
: M. Diligenti, F.M. Coetzee, S. Lawrence, C.L. Giles, M. Gori ,
  [http://citeseer.ist.psu.edu/diligenti00focused.html  Focused Crawling Using Context Graphs] ,
+
  [http://citeseer.ist.psu.edu/diligenti00focused.html  Focused Crawling Using Context Graphs] ,  
 +
 
26th International Conference on Very Large Databases, VLDB 2000
 
26th International Conference on Very Large Databases, VLDB 2000
  
Line 743: Line 731:
 
 
 
Computational Lingustics, 2003
 
Computational Lingustics, 2003
 
  
  
  
 
: } ,
 
: } ,
  ==  Fall 2004 == ,Topics:
+
  ==  Fall 2004 == ,  
 +
Topics:
  
 
* Recent papers from ACL/EMNLP 2004
 
* Recent papers from ACL/EMNLP 2004
Line 765: Line 753:
  
 
;Aug. 20 (Damianos Karakos, Charles Schafer)  
 
;Aug. 20 (Damianos Karakos, Charles Schafer)  
 
 
: P. Pantel and D. Lin ,
 
: P. Pantel and D. Lin ,
  [http://www.cs.ualberta.ca/~lindek/papers/kdd02.pdf Discovering word senses from text] , Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, 2002
+
  [http://www.cs.ualberta.ca/~lindek/papers/kdd02.pdf Discovering word senses from text] ,  
 +
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, 2002
  
 
----
 
----
Line 780: Line 768:
  
 
;Aug. 27 (David Smith)  
 
;Aug. 27 (David Smith)  
 
 
: I. Dan Melamed ,
 
: I. Dan Melamed ,
  [http://acl.ldc.upenn.edu/acl2004/main/pdf/113_pdf_2-col.pdf Statistical Machine Translation by Parsing] , ACL 2004
+
  [http://acl.ldc.upenn.edu/acl2004/main/pdf/113_pdf_2-col.pdf Statistical Machine Translation by Parsing] ,  
 +
ACL 2004
 
----
 
----
 
 
Line 794: Line 782:
  
 
;Sep. 2 (Gideon Mann)  
 
;Sep. 2 (Gideon Mann)  
 
 
: Xin Li, Paul Morie, and Dan Roth ,
 
: Xin Li, Paul Morie, and Dan Roth ,
  [http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/139_Paper.pdf Robust Reading: Identification and Tracing  ,of Ambiguous Names]
+
  [http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/139_Paper.pdf Robust Reading: Identification and Tracing  ,  
 +
of Ambiguous Names]
  
 
ACL 2004
 
ACL 2004
Line 811: Line 799:
  
 
;Sep. 9 (John Blatz)  
 
;Sep. 9 (John Blatz)  
 
 
: Pascale Fung and Percy Cheung ,
 
: Pascale Fung and Percy Cheung ,
  [http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Fung.pdf Mining Very-Non-Parallel Corpora: Parallel Sentence  ,and Lexicon Extraction via Bootstrapping and EM]
+
  [http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Fung.pdf Mining Very-Non-Parallel Corpora: Parallel Sentence  ,  
 +
and Lexicon Extraction via Bootstrapping and EM]
  
 
ACL 2004
 
ACL 2004
Line 828: Line 816:
  
 
;Sep. 24 (Roy Tromble)  
 
;Sep. 24 (Roy Tromble)  
 
 
: B. Taskar, C. Guestrin and D. Koller ,
 
: B. Taskar, C. Guestrin and D. Koller ,
  [http://robotics.stanford.edu/~btaskar/pubs/mmmn.ps Max-Margin Markov Networks] , Neural Information Processing Systems Conference (NIPS03), 2003
+
  [http://robotics.stanford.edu/~btaskar/pubs/mmmn.ps Max-Margin Markov Networks] ,  
 +
Neural Information Processing Systems Conference (NIPS03), 2003
  
 
----
 
----
Line 845: Line 833:
 
|-
 
|-
 
|Oct. 2
 
|Oct. 2
 
 
: Nguyen Bach ,
 
: Nguyen Bach ,
  |Background knowledge on SVM and Graphical Models ,[http://www.cse.msu.edu/~lawhiu/intro_SVM.ppt Intro SVM]
+
  |Background knowledge on SVM and Graphical Models ,  
 +
[http://www.cse.msu.edu/~lawhiu/intro_SVM.ppt Intro SVM]
  
 
[http://www.ai.mit.edu/~murphyk/Bayes/bnintro.html Intro Graphical Models]
 
[http://www.ai.mit.edu/~murphyk/Bayes/bnintro.html Intro Graphical Models]
Line 853: Line 841:
  
 
;Oct. 15 (Nguyen Bach)  
 
;Oct. 15 (Nguyen Bach)  
 
 
: Daichi Mochihashi, Genichiro Kikui, Kenji Kita ,
 
: Daichi Mochihashi, Genichiro Kikui, Kenji Kita ,
  [http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Mochihashi.pdf Learning Nonstructural Distance Metric by  ,Minimum Cluster Distortions]
+
  [http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Mochihashi.pdf Learning Nonstructural Distance Metric by  ,  
 +
Minimum Cluster Distortions]
 
 
 
EMNLP 2004
 
EMNLP 2004
  
 
;Oct. 22 (Michelle Vanni)  
 
;Oct. 22 (Michelle Vanni)  
 
 
: Lin and Och ,
 
: Lin and Och ,
  [http://acl.ldc.upenn.edu/acl2004/main/pdf/215_pdf_2-col.pdf Automatic Evaluation of Machine Translation  ,Quality Using Longest Common Subsequence]
+
  [http://acl.ldc.upenn.edu/acl2004/main/pdf/215_pdf_2-col.pdf Automatic Evaluation of Machine Translation  ,  
 +
Quality Using Longest Common Subsequence]
 
 
 
ACL 2004
 
ACL 2004
Line 877: Line 865:
  
 
;Oct. 29 (Eric Goldlust)  
 
;Oct. 29 (Eric Goldlust)  
 
 
: Clark and Curran ,
 
: Clark and Curran ,
  [http://web.comlab.ox.ac.uk/oucl/work/stephen.clark/papers/acl04.pdf Parsing the WSJ using CCG and Log- ,Linear Models]
+
  [http://web.comlab.ox.ac.uk/oucl/work/stephen.clark/papers/acl04.pdf Parsing the WSJ using CCG and Log- ,  
 +
Linear Models]
 
 
 
ACL 2004
 
ACL 2004
  
 
;Nov. 5 (Michelle Vanni)  
 
;Nov. 5 (Michelle Vanni)  
 
 
: Robert S. Swier and Suzanne Stevenson ,
 
: Robert S. Swier and Suzanne Stevenson ,
  [http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Swier.pdf Unsupervised Semantic Role Labelling] ,
+
  [http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Swier.pdf Unsupervised Semantic Role Labelling] ,  
 +
 
EMNLP 2004
 
EMNLP 2004
  
Line 898: Line 886:
  
 
;Nov. 13 (Michelle Vanni)  
 
;Nov. 13 (Michelle Vanni)  
 
 
: Robert S. Swier and Suzanne Stevenson ,
 
: Robert S. Swier and Suzanne Stevenson ,
  [nlp.cs.jhu.edu/~cschafer/david/Ch2.pdf Inexact Graph Matching Using Estimation of Distribution  ,Algorithms,Chapter 2, The graph matching problem]
+
  [nlp.cs.jhu.edu/~cschafer/david/Ch2.pdf Inexact Graph Matching Using Estimation of Distribution  ,  
 +
Algorithms,Chapter 2, The graph matching problem]
  
 
Submitted to the Ecole Nationale Supérieure des Télécommunications (Paris), for the Degree of Doctor of  
 
Submitted to the Ecole Nationale Supérieure des Télécommunications (Paris), for the Degree of Doctor of  
Line 913: Line 901:
  
 
Computer Vision and Pattern Recognition, 2003. Proceedings. 2003 IEEE
 
Computer Vision and Pattern Recognition, 2003. Proceedings. 2003 IEEE
 
  
 
: ...This chapter is general to the field although pretty sweeping and unspecific as a result. It probably makes a  ,
 
: ...This chapter is general to the field although pretty sweeping and unspecific as a result. It probably makes a  ,
  good introduction, since it gives an idea of the scope and diversity of the problem and proposed techniques... ,...this is a state of the art paper which is quite dense but quite interesting. solves a very general formulation  
+
  good introduction, since it gives an idea of the scope and diversity of the problem and proposed techniques... ,  
 +
...this is a state of the art paper which is quite dense but quite interesting. solves a very general formulation  
  
 
of inexact graph matching by first imbedding graphs into a normed space...
 
of inexact graph matching by first imbedding graphs into a normed space...
Line 922: Line 910:
  
 
;Nov. 20 (David Smith)  
 
;Nov. 20 (David Smith)  
 
 
: Olle H鋑gstr鰉 and Karin Nelander ,
 
: Olle H鋑gstr鰉 and Karin Nelander ,
  [http://nlp.cs.jhu.edu/~dasmith/mrfcftp.pdf  On Exact Simulation of Markov Random Fields Using Coupling from the Past] , Foundation of the Scandinavian Journal of Statistics, 1999
+
  [http://nlp.cs.jhu.edu/~dasmith/mrfcftp.pdf  On Exact Simulation of Markov Random Fields Using Coupling from the Past] ,  
 +
Foundation of the Scandinavian Journal of Statistics, 1999
  
 
----
 
----
Line 936: Line 924:
  
 
;Nov. 27 (Jia Cui)  
 
;Nov. 27 (Jia Cui)  
 
 
: David M. Blei, Andrew Y. Ng, Michael I. Jordan ,
 
: David M. Blei, Andrew Y. Ng, Michael I. Jordan ,
  [http://citeseer.ist.psu.edu/blei03latent.html Latent Dirichlet Allocation] , Journal of machine Learning Research 3, 2003
+
  [http://citeseer.ist.psu.edu/blei03latent.html Latent Dirichlet Allocation] ,  
 
+
Journal of machine Learning Research 3, 2003
  
 
: A additional related report on LDA ,
 
: A additional related report on LDA ,
  [www.cs.toronto.edu/~ywteh/research/npbayes/report.pdf] , Another introduction to LDA
+
  [www.cs.toronto.edu/~ywteh/research/npbayes/report.pdf] ,  
 +
Another introduction to LDA
  
 
[http://citeseer.ist.psu.edu/541352.html]
 
[http://citeseer.ist.psu.edu/541352.html]
 
  
  
 
: } ,
 
: } ,
  ==  Spring 2004 == ,Topics:
+
  ==  Spring 2004 == ,  
 +
Topics:
  
 
* combinatorial optimization (software)
 
* combinatorial optimization (software)
Line 962: Line 950:
  
 
;Feb. 5 (Brock)  
 
;Feb. 5 (Brock)  
 
 
: Jessica A. Barlow and Judith A. Gierut      ,
 
: Jessica A. Barlow and Judith A. Gierut      ,
  [http://www.cs.jhu.edu/~cschafer/15241_1.pdf Optimality theory in phonological acquisition] , Journal of Speech, Language and Hearing 42, 1999
+
  [http://www.cs.jhu.edu/~cschafer/15241_1.pdf Optimality theory in phonological acquisition] ,  
 +
Journal of Speech, Language and Hearing 42, 1999
  
 
----
 
----
Line 975: Line 963:
  
 
;Feb. 12 (Brock)  
 
;Feb. 12 (Brock)  
 
 
: Bob Frank, Giorgio Satta ,
 
: Bob Frank, Giorgio Satta ,
  [http://www.cogsci.jhu.edu/faculty/frank/papers/ot-revised.pdf Optimality theory and the Generative Complexity of Constraint Violability] , MIT Press
+
  [http://www.cogsci.jhu.edu/faculty/frank/papers/ot-revised.pdf Optimality theory and the Generative Complexity of Constraint Violability] ,  
 
+
MIT Press
  
 
: A glimpse (from MIT Press): ,
 
: A glimpse (from MIT Press): ,
  It has been argued that rule-based phonological descriptions can uniformly be expressed as mappings carried out by finite-state transducers, and therefore fall within the class of rational relations. If this property of generative capacity is an empirically correct characterization of phonological mappings, it should hold of any sufficiently restrictive theory of phonology, whether it utilizes constraints or rewrite rules. In this paper, we investigate the conditions under which the phonological descriptions that are possible within the view of constraint interaction embodied in Optimality Theory (Prince and Smolensky 1993) remain within the class of rational relations. We show that this is true when GEN is itself a rational relation, and each of the constraints distinguishes among finitely many regular sets of candidates. ,|-
+
  It has been argued that rule-based phonological descriptions can uniformly be expressed as mappings carried out by finite-state transducers, and therefore fall within the class of rational relations. If this property of generative capacity is an empirically correct characterization of phonological mappings, it should hold of any sufficiently restrictive theory of phonology, whether it utilizes constraints or rewrite rules. In this paper, we investigate the conditions under which the phonological descriptions that are possible within the view of constraint interaction embodied in Optimality Theory (Prince and Smolensky 1993) remain within the class of rational relations. We show that this is true when GEN is itself a rational relation, and each of the constraints distinguishes among finitely many regular sets of candidates. ,  
|Feb. 19
+
;Feb. 19 (David Smith)
|David Smith
 
 
 
 
: Barzilay and Lee ,
 
: Barzilay and Lee ,
  [http://www.google.com/url?sa=t&ct=res&cd=2&url=http%3A%2F%2Fpeople.csail.mit.edu%2Fregina%2Fmy_papers%2Fstatpar.ps&ei=RX-nR7CIBoTEebPsjPAC&usg=AFQjCNHksPHRtwentpXGd1GRVPS1j6rhVw&sig2=wmLuV0QR2BrkTBQtRmz-vg Learning to Paraphrase: An Unsupervise Approach Using Multiple-Sequen7:12 PM 2/4/2008ce Alignment] , HTL 2003
+
  [http://www.google.com/url?sa=t&ct=res&cd=2&url=http%3A%2F%2Fpeople.csail.mit.edu%2Fregina%2Fmy_papers%2Fstatpar.ps&ei=RX-nR7CIBoTEebPsjPAC&usg=AFQjCNHksPHRtwentpXGd1GRVPS1j6rhVw&sig2=wmLuV0QR2BrkTBQtRmz-vg Learning to Paraphrase: An Unsupervise Approach Using Multiple-Sequen7:12 PM 2/4/2008ce Alignment] ,  
 +
HTL 2003
  
 
;Mar. 5 (Charles Schafer)  
 
;Mar. 5 (Charles Schafer)  
 
 
: Daniel Marcu ,
 
: Daniel Marcu ,
  Theory and Practice of Discourse Parsing and Summarization, Chapters 2 & 3 , The MIT Press, 2000
+
  Theory and Practice of Discourse Parsing and Summarization, Chapters 2 & 3 ,  
 +
The MIT Press, 2000
  
 
;Mar. 18 (Markus Dreyer)  
 
;Mar. 18 (Markus Dreyer)  
Line 1,000: Line 986:
  
 
;Mar. 25 (Eric Goldlust)  
 
;Mar. 25 (Eric Goldlust)  
 
 
: Boyan and Moore ,
 
: Boyan and Moore ,
  [http://citeseer.ist.psu.edu/418699.html Learning Evaluation Functions to Improve Optimization by Local Search] , Journal of Machine Learning Research, 2000
+
  [http://citeseer.ist.psu.edu/418699.html Learning Evaluation Functions to Improve Optimization by Local Search] ,  
 +
Journal of Machine Learning Research, 2000
  
 
;Apr. 3 (Roy Tromble)  
 
;Apr. 3 (Roy Tromble)  
 
 
: Roman Bartak ,
 
: Roman Bartak ,
  [http://kti.ms.mff.cuni.cz/~bartak/downloads/WDS99.pdf Constraint Programming: In Pursuit of the Holy Grail] , 1999
+
  [http://kti.ms.mff.cuni.cz/~bartak/downloads/WDS99.pdf Constraint Programming: In Pursuit of the Holy Grail] ,  
 +
1999
  
 
;Apr. 10 (Noah Ashton Smith)  
 
;Apr. 10 (Noah Ashton Smith)  
 
 
: Denys Duchier ,
 
: Denys Duchier ,
  [http://www.ps.uni-sb.de/Papers/abstracts/duchier-mol6.html Axiomatizing Dependency Parsing Using Set Constraints] , Sixth Meeting on Mathematics of Language, 2000
+
  [http://www.ps.uni-sb.de/Papers/abstracts/duchier-mol6.html Axiomatizing Dependency Parsing Using Set Constraints] ,  
 +
Sixth Meeting on Mathematics of Language, 2000
  
 
;Apr. 10 (Noah Ashton Smith)  
 
;Apr. 10 (Noah Ashton Smith)  
 
 
: Denys Duchier ,
 
: Denys Duchier ,
  [http://www.ps.uni-sb.de/Papers/abstracts/duchier-mol6.html Axiomatizing Dependency Parsing Using Set Constraints] , Sixth Meeting on Mathematics of Language, 2000
+
  [http://www.ps.uni-sb.de/Papers/abstracts/duchier-mol6.html Axiomatizing Dependency Parsing Using Set Constraints] ,  
 +
Sixth Meeting on Mathematics of Language, 2000
  
 
;Apr. 17 (Elliott Franco Drabek)  
 
;Apr. 17 (Elliott Franco Drabek)  
 
 
: Rina Dechter ,
 
: Rina Dechter ,
  [http://www.ics.uci.edu/~dechter/publications/r62.html Mini-Buckets: A General Scheme for Generating Approximations in Automated Reasoning] , 2001
+
  [http://www.ics.uci.edu/~dechter/publications/r62.html Mini-Buckets: A General Scheme for Generating Approximations in Automated Reasoning] ,  
 +
2001
  
 
;Apr. 24 (David Smith)  
 
;Apr. 24 (David Smith)  
 
 
: McCallum and Jensen ,
 
: McCallum and Jensen ,
  [http://www.cs.umass.edu/~mccallum/papers/iedatamining-ijcaiws03.pdf Extraction and Data Mining using Conditional-Probability, Relational Models] ,   IJCAI'03 Workshop on Learning Statistical Models from Relational Data, 2003
+
  [http://www.cs.umass.edu/~mccallum/papers/iedatamining-ijcaiws03.pdf Extraction and Data Mining using Conditional-Probability, Relational Models] ,  
 
+
  IJCAI'03 Workshop on Learning Statistical Models from Relational Data, 2003
  
 
: The paper is a survey of recent trends in IE and data mining (biased of course towards the authors' work) and a proposal to unify them with conditional random fields. ,
 
: The paper is a survey of recent trends in IE and data mining (biased of course towards the authors' work) and a proposal to unify them with conditional random fields. ,
  |- ,|May. 1
+
  |- ,  
 +
|May. 1
 
|Izhak Shafran
 
|Izhak Shafran
 
 
: Eric J. Friedman ,
 
: Eric J. Friedman ,
  [http://citeseer.ist.psu.edu/377160.html Strong Monotonicity in Surplus Sharing] , 1999
+
  [http://citeseer.ist.psu.edu/377160.html Strong Monotonicity in Surplus Sharing] ,  
 
+
1999
  
 
: Used Tom Dietterich has a web page on probabilistic relational models: ,
 
: Used Tom Dietterich has a web page on probabilistic relational models: ,
  [http://web.engr.oregonstate.edu/~tgd/classes/539/] ,|-
+
  [http://web.engr.oregonstate.edu/~tgd/classes/539/] ,  
|May. 15
+
;May. 15 (Roy Tromble)
|Roy Tromble
 
 
 
 
: Fuchun Peng, Andrew McCallum ,
 
: Fuchun Peng, Andrew McCallum ,
  [http://www.cs.umass.edu/~mccallum/papers/hlt2004.pdf Accurate Information Extraction from Research Papers using Conditional Random Fields] , 2004
+
  [http://www.cs.umass.edu/~mccallum/papers/hlt2004.pdf Accurate Information Extraction from Research Papers using Conditional Random Fields] ,  
 
+
2004
  
 
: } ,
 
: } ,
  ==  Fall 2003 == ,{| style="width:800px" border="1"
+
  ==  Fall 2003 == ,  
 +
{| style="width:800px" border="1"
 
!  width="10%"|Date/Time  
 
!  width="10%"|Date/Time  
 
!  width="10%"|Presenter  
 
!  width="10%"|Presenter  
Line 1,054: Line 1,038:
 
!  Supporting Papers/Notes
 
!  Supporting Papers/Notes
 
;Sep.11 (Elliott Franco Drabek)  
 
;Sep.11 (Elliott Franco Drabek)  
 
 
: Bernard Comrie      ,
 
: Bernard Comrie      ,
  Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  ,Syntax and Morphology, Chapter 1
+
  Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  ,  
 +
Syntax and Morphology, Chapter 1
  
 
Blackwell Pub (1989)
 
Blackwell Pub (1989)
  
 
;Sep.18 (David Smith)  
 
;Sep.18 (David Smith)  
 
 
: Bernard Comrie      ,
 
: Bernard Comrie      ,
  Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  ,Syntax and Morphology, Chapter 2-3
+
  Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  ,  
 +
Syntax and Morphology, Chapter 2-3
  
 
Blackwell Pub (1989)
 
Blackwell Pub (1989)
  
 
;Oct. 3  (Michelle Vanni)  
 
;Oct. 3  (Michelle Vanni)  
 
 
: Bernard Comrie      ,
 
: Bernard Comrie      ,
  Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  ,Syntax and Morphology, Chapter 4-6
+
  Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  ,  
 +
Syntax and Morphology, Chapter 4-6
  
 
Blackwell Pub (1989)
 
Blackwell Pub (1989)
  
 
;Oct. 10 (David Smith)  
 
;Oct. 10 (David Smith)  
 
 
: Bernard Comrie      ,
 
: Bernard Comrie      ,
  Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  ,Syntax and Morphology, Chapter 6-7
+
  Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  ,  
 +
Syntax and Morphology, Chapter 6-7
  
 
Blackwell Pub (1989)
 
Blackwell Pub (1989)
  
 
;Oct. 24 (Markus Dreyer)  
 
;Oct. 24 (Markus Dreyer)  
 
 
: Stuart M. Shieber, Yves Schabes      ,
 
: Stuart M. Shieber, Yves Schabes      ,
  [http://acl.ldc.upenn.edu/C/C90/C90-3045.pdf Synchronous Tree-Adjoining Grammars] , Coling 1990
+
  [http://acl.ldc.upenn.edu/C/C90/C90-3045.pdf Synchronous Tree-Adjoining Grammars] ,  
 +
Coling 1990
  
 +
: An additional closely related paper ,
 +
Stuart M. Shieber, Yves Schabes ,
  
: An additional closely related paper ,
 
Stuart M. Shieber, Yves Schabes ,
 
 
[http://acl.ldc.upenn.edu/W/W90/W90-0102.pdf Generation and Synchronous Tree-Adjoining Grammars]
 
[http://acl.ldc.upenn.edu/W/W90/W90-0102.pdf Generation and Synchronous Tree-Adjoining Grammars]
  
Line 1,094: Line 1,078:
  
 
;Oct. 31 (Roy Tromble)  
 
;Oct. 31 (Roy Tromble)  
 
 
: Dekai Wu    ,
 
: Dekai Wu    ,
  [http://acl.ldc.upenn.edu/C/C90/C90-3045.pdf An algorithm for simultaneously bracketing parallel texts by aligning words] , ACL 1995
+
  [http://acl.ldc.upenn.edu/C/C90/C90-3045.pdf An algorithm for simultaneously bracketing parallel texts by aligning words] ,  
 +
ACL 1995
  
 
;Nov. 6 (Brock Pytlik)  
 
;Nov. 6 (Brock Pytlik)  
 
 
: Stuart M. Shieber    ,
 
: Stuart M. Shieber    ,
  [http://www.eecs.harvard.edu/~shieber/Courses/Esslli2003/esslli-slides.pdf Transducers as a Substrate for Natural Language Processing] ,|-
+
  [http://www.eecs.harvard.edu/~shieber/Courses/Esslli2003/esslli-slides.pdf Transducers as a Substrate for Natural Language Processing] ,  
|Nov. 13
+
;Nov. 13 (Markus Dreyer)
|Markus Dreyer
 
 
 
 
: Goldman and Zhou    ,
 
: Goldman and Zhou    ,
  [http://citeseer.nj.nec.com/goldman00enhancing.html Enhancing Supervised Learning with Unlabeled Data] , 27th Int. Conf. on Mach. Learn. 2000
+
  [http://citeseer.nj.nec.com/goldman00enhancing.html Enhancing Supervised Learning with Unlabeled Data] ,  
 
+
27th Int. Conf. on Mach. Learn. 2000
  
 
: An additional paper with some experiments ,
 
: An additional paper with some experiments ,
  Clark, Curran and Osborne ,
+
  Clark, Curran and Osborne ,  
 +
 
[http://www.cogsci.ed.ac.uk/~osborne/conll03-cco.pdf Bootstrapping POS taggers using Unlabelled Data]
 
[http://www.cogsci.ed.ac.uk/~osborne/conll03-cco.pdf Bootstrapping POS taggers using Unlabelled Data]
  
Line 1,116: Line 1,098:
  
 
;Nov. 20 (Noah A. Smith)  
 
;Nov. 20 (Noah A. Smith)  
 
 
: Rebecca Hwa, Miles Osborne, Anoop Sarkar, Mark Steedman    ,
 
: Rebecca Hwa, Miles Osborne, Anoop Sarkar, Mark Steedman    ,
  [http://www.cogsci.ed.ac.uk/~osborne/icmlworkshop03.ps.gz Corrected Co-training for Statistical Parsers] , ICML 2003
+
  [http://www.cogsci.ed.ac.uk/~osborne/icmlworkshop03.ps.gz Corrected Co-training for Statistical Parsers] ,  
 +
ICML 2003
  
 
;Dec. 12 (Paola Virga)  
 
;Dec. 12 (Paola Virga)  
 
 
: Kamal Nigam and Rayid Ghani  ,
 
: Kamal Nigam and Rayid Ghani  ,
  [http://www.kamalnigam.com/papers/cotrain-CIKM00.pdf  Analyzing the Effectiveness and Applicability of Co-training] , Ninth International Conference on Information and Knowledge Management 2000
+
  [http://www.kamalnigam.com/papers/cotrain-CIKM00.pdf  Analyzing the Effectiveness and Applicability of Co-training] ,  
 
+
Ninth International Conference on Information and Knowledge Management 2000
  
 
: } ,
 
: } ,
  ==  Spring 2003 == ,{| style="width:800px" border="1"
+
  ==  Spring 2003 == ,  
 +
{| style="width:800px" border="1"
 
!  width="10%"|Date/Time  
 
!  width="10%"|Date/Time  
 
!  width="10%"|Presenter  
 
!  width="10%"|Presenter  
Line 1,134: Line 1,116:
  
 
;Feb. 13 (David Smith)  
 
;Feb. 13 (David Smith)  
 
 
: K. Church      ,
 
: K. Church      ,
  [http://www.research.att.com/~kwc/published_2000_Coling.pdf Empirical Estimates of Adaptation: The chance of Two Noriega's is closer to p/2 than p^2] , Coling 2000, pp. 173-179
+
  [http://www.research.att.com/~kwc/published_2000_Coling.pdf Empirical Estimates of Adaptation: The chance of Two Noriega's is closer to p/2 than p^2] ,  
 +
Coling 2000, pp. 173-179
  
  
 
;Feb. 19 (Elliott Drabek)  
 
;Feb. 19 (Elliott Drabek)  
 
 
: A. Lopez??, M. Nossal??, R. Hwa, P. Resnik  ,
 
: A. Lopez??, M. Nossal??, R. Hwa, P. Resnik  ,
  [http://www.cs.umd.edu/users/alopez/pub/lrec02-lnhr.pdf Word-level Alignment for Multilingual Resource Acquisition] , Proceedings of the 2002 LREC Workshop on Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Language Data
+
  [http://www.cs.umd.edu/users/alopez/pub/lrec02-lnhr.pdf Word-level Alignment for Multilingual Resource Acquisition] ,  
 +
Proceedings of the 2002 LREC Workshop on Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Language Data
  
  
 
;Feb. 26 (Elliott Drabek)  
 
;Feb. 26 (Elliott Drabek)  
 
 
: Steven Abney  ,
 
: Steven Abney  ,
  [http://www.vinartus.net/spa/02a.pdf Bootstrapping] , ACL'02
+
  [http://www.vinartus.net/spa/02a.pdf Bootstrapping] ,  
 +
ACL'02
  
 
;Mar.6 (Paola Virga)  
 
;Mar.6 (Paola Virga)  
 
 
: Carl M. Kadie, Christopher Meek, David Heckerman  ,
 
: Carl M. Kadie, Christopher Meek, David Heckerman  ,
  [http://research.microsoft.com/~carlk/papers/cfw.htm A Collaborative Filtering System Using Posteriors Over Weights of Evidence] , Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence, 2002.
+
  [http://research.microsoft.com/~carlk/papers/cfw.htm A Collaborative Filtering System Using Posteriors Over Weights of Evidence] ,  
 +
Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence, 2002.
  
  
 
;Mar.20 (Roy Tromble)  
 
;Mar.20 (Roy Tromble)  
 
 
: Nikita Schmid, Ahmed Patel ,
 
: Nikita Schmid, Ahmed Patel ,
  [ttp://arXiv.org/abs/cs/0201008 Using Tree Automata and Regular Expressions to Manipulate Hierarchically Structured Data] ,|-
+
  [ttp://arXiv.org/abs/cs/0201008 Using Tree Automata and Regular Expressions to Manipulate Hierarchically Structured Data] ,  
 +
|-
 
|Apr.10
 
|Apr.10
 
 
: |V. N. Vapnik ,
 
: |V. N. Vapnik ,
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Intro and Chapters 1, 2A ,|-
+
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Intro and Chapters 1, 2A ,  
|Apr.17
+
;Apr.17 (Roy Tromble)
|Roy Tromble
 
 
 
 
: V. N. Vapnik ,
 
: V. N. Vapnik ,
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory],Chapters 2B - 4A ,|-
+
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory],Chapters 2B - 4A ,  
|Apr. 24
+
;Apr. 24 (Paola)
|Paola
 
 
 
 
: V. N. Vapnik ,
 
: V. N. Vapnik ,
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Chapters 4B - 5A ,|-
+
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Chapters 4B - 5A ,  
|May 1
+
;May 1 (Noah)
|Noah
 
 
 
 
: V. N. Vapnik ,
 
: V. N. Vapnik ,
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Chapters 5B - 6A ,|-
+
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Chapters 5B - 6A ,  
|May 8
+
;May 8 (Noah)
|Noah
 
 
 
 
: V. N. Vapnik ,
 
: V. N. Vapnik ,
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Chapters 6B - 7A ,|-
+
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Chapters 6B - 7A ,  
|May 15
+
;May 15 (Chal)
|Chal
 
 
 
 
: V. N. Vapnik ,
 
: V. N. Vapnik ,
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Chapters 7B - ,
+
  [http://www.cscs.umich.edu/~crshalizi/reviews/vapnik-nature/ The Nature of Statistical Learning Theory], Chapters 7B - ,  
 
: } ,
 
: } ,
  ==  Fall 2002 == ,{| style="width:800px" border="1"
+
  ==  Fall 2002 == ,  
 +
{| style="width:800px" border="1"
 
!  width="10%"|Date/Time  
 
!  width="10%"|Date/Time  
 
!  width="10%"|Presenter  
 
!  width="10%"|Presenter  
Line 1,197: Line 1,169:
  
 
;Sep. 10 (Noah A. Smith)  
 
;Sep. 10 (Noah A. Smith)  
 
 
: Collins, Duffy. ,
 
: Collins, Duffy. ,
  [http://www.research.att.com/~mcollins/papers/finalacl2002.ps New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron.] ,
+
  [http://www.research.att.com/~mcollins/papers/finalacl2002.ps New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron.] ,  
 +
 
ACL '2002
 
ACL '2002
  
 
;Sep. 19 (Paola Virga)  
 
;Sep. 19 (Paola Virga)  
 
 
: Yamada, Knight ,
 
: Yamada, Knight ,
  [http://acl.ldc.upenn.edu/P/P02/P02-1039.pdf A decoder for Syntax-based Statistical MT] ,
+
  [http://acl.ldc.upenn.edu/P/P02/P02-1039.pdf A decoder for Syntax-based Statistical MT] ,  
 +
 
ACL '2002
 
ACL '2002
  
 
;Sep. 26 (Paul Ruhlen)  
 
;Sep. 26 (Paul Ruhlen)  
 
 
: Hwa, Resnik, Weinberg, Kolak ,
 
: Hwa, Resnik, Weinberg, Kolak ,
  [http://acl.ldc.upenn.edu/P/P02/P02-1050.pdf Evaluating Translational Correspondence using Annotation Projection] ,
+
  [http://acl.ldc.upenn.edu/P/P02/P02-1050.pdf Evaluating Translational Correspondence using Annotation Projection] ,  
 +
 
ACL '2002
 
ACL '2002
  
 
;Oct. 2 (Gideon Mann)  
 
;Oct. 2 (Gideon Mann)  
 
 
: Gildea, Jurafsky ,
 
: Gildea, Jurafsky ,
  [http://www.colorado.edu/ling/jurafsky/cl01.ps Automatic Labeling of Semantics Roles] ,
+
  [http://www.colorado.edu/ling/jurafsky/cl01.ps Automatic Labeling of Semantics Roles] ,  
 +
 
ACL '2001
 
ACL '2001
  
 
;Oct. 8 (Elliott Franco Drabek)  
 
;Oct. 8 (Elliott Franco Drabek)  
 
 
: Ravichandran, Hovy ,
 
: Ravichandran, Hovy ,
  [http://www.isi.edu/~ravichan/papers/P0351.pdf Learning Surface Text Patterns for a Question Answering System.] ,
+
  [http://www.isi.edu/~ravichan/papers/P0351.pdf Learning Surface Text Patterns for a Question Answering System.] ,  
 +
 
ACL '2001
 
ACL '2001
 
  
 
: A similar paper ,
 
: A similar paper ,
  Lin, Pantel ,[http://www.cs.ualberta.ca/~ppantel/Download/Papers/kdd01-1.pdf Discovery of Inference Rules for Question Answwering]
+
  Lin, Pantel ,  
 +
[http://www.cs.ualberta.ca/~ppantel/Download/Papers/kdd01-1.pdf Discovery of Inference Rules for Question Answwering]
  
 
;Oct. 17 (David Smith)  
 
;Oct. 17 (David Smith)  
 
 
: Cotton, Bird ,
 
: Cotton, Bird ,
  [http://arxiv.org/abs/cs/0204007 An Integrated Framework for Treebanks and Multilayer Annotations] ,
+
  [http://arxiv.org/abs/cs/0204007 An Integrated Framework for Treebanks and Multilayer Annotations] ,  
 +
 
LREC '2002
 
LREC '2002
  
 
;Oct. 24 (Roy Tromble)  
 
;Oct. 24 (Roy Tromble)  
 
 
: Han, Benjamin ,
 
: Han, Benjamin ,
  [http://www.cs.cmu.edu/~benhdj/papers/bhan_naccl_2001.pdf Building a Bilingual Dictionary with Scarce Resources: A Genetic Algorithm Approach.] ,|-
+
  [http://www.cs.cmu.edu/~benhdj/papers/bhan_naccl_2001.pdf Building a Bilingual Dictionary with Scarce Resources: A Genetic Algorithm Approach.] ,  
|Nov. 1
+
;Nov. 1 (Chalaporn Hathaidharm)
|Chalaporn Hathaidharm
 
 
 
 
: J.Gao, J.Goodman, M.Li, K.Lee ,
 
: J.Gao, J.Goodman, M.Li, K.Lee ,
  [http://www.microsoft.com/china/research/dload_files/g-nlps/NLPSP/talip01-4th.pdf Toward A Unified Approach To Statistical Language Modeling For Chinese] ,
+
  [http://www.microsoft.com/china/research/dload_files/g-nlps/NLPSP/talip01-4th.pdf Toward A Unified Approach To Statistical Language Modeling For Chinese] ,  
 +
 
ACM Transactions on Asian Language Information Processing, Vol. 1, No. 1, pp 3-33. 2002.
 
ACM Transactions on Asian Language Information Processing, Vol. 1, No. 1, pp 3-33. 2002.
  
 
;Nov. 7 (Neda Khalili)  
 
;Nov. 7 (Neda Khalili)  
 
 
: Yamamoto, Church ,
 
: Yamamoto, Church ,
  [http://acl.ldc.upenn.edu/J/J01/J01-1001.pdf Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus] ,
+
  [http://acl.ldc.upenn.edu/J/J01/J01-1001.pdf Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus] ,  
 +
 
Computational Linguistics '2001
 
Computational Linguistics '2001
 
  
 
: A relative paper: ,
 
: A relative paper: ,
  Kageura ,[http://research.nii.ac.jp/~kyo/papers/qualico.ps Bigram Statistics Revisited A Comparative Examination of Some Statistical Measures in Morphological Analysis of Japanese Kanji Sequences]
+
  Kageura ,  
 +
[http://research.nii.ac.jp/~kyo/papers/qualico.ps Bigram Statistics Revisited A Comparative Examination of Some Statistical Measures in Morphological Analysis of Japanese Kanji Sequences]
  
 
;Nov. 14 (Michelle Vanni)  
 
;Nov. 14 (Michelle Vanni)  
 
 
: Hearst ,
 
: Hearst ,
  [http://www.sims.berkeley.edu/~hearst/papers/acl99/acl99-tdm.html Untangling Text Data Mining.] ,
+
  [http://www.sims.berkeley.edu/~hearst/papers/acl99/acl99-tdm.html Untangling Text Data Mining.] ,  
 +
 
  ACL '1999
 
  ACL '1999
  
 
;Nov. 21 (Silviu Cucerzan)  
 
;Nov. 21 (Silviu Cucerzan)  
 
 
: Ueda, Nakano, Ghahramani, Hinton ,
 
: Ueda, Nakano, Ghahramani, Hinton ,
  [http://www.cs.toronto.edu/~hinton/absps/ueda.html SMEM Algorithm for Mixture Models] ,
+
  [http://www.cs.toronto.edu/~hinton/absps/ueda.html SMEM Algorithm for Mixture Models] ,  
 +
 
Neural Information Processing Systems '1998
 
Neural Information Processing Systems '1998
  
 
;Dec.5 (Silviu Cucerzan)  
 
;Dec.5 (Silviu Cucerzan)  
 
 
: Pearce ,
 
: Pearce ,
  [http://www.cogs.susx.ac.uk/users/darrenp/academic/dphil/publications/data/Conferences/lrec2002/paper.pdf  A Comparative Evaluation of Collocation Extraction Techniques. Darren Pearce.] ,
+
  [http://www.cogs.susx.ac.uk/users/darrenp/academic/dphil/publications/data/Conferences/lrec2002/paper.pdf  A Comparative Evaluation of Collocation Extraction Techniques. Darren Pearce.] ,  
 +
 
Third International Conference on Language Resources and Evaluation. May. 2002
 
Third International Conference on Language Resources and Evaluation. May. 2002
  
Line 1,282: Line 1,252:
 
 
 
In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 317--324.
 
In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 317--324.
 
  
  
 
: } ,
 
: } ,
  ==  Summer 2002 == ,{| style="width:800px" border="1"
+
  ==  Summer 2002 == ,  
 +
{| style="width:800px" border="1"
 
!  width="10%"|Date/Time  
 
!  width="10%"|Date/Time  
 
!  width="10%"|Presenter  
 
!  width="10%"|Presenter  
Line 1,293: Line 1,263:
  
 
;July. 24 (Michelle Vanni)  
 
;July. 24 (Michelle Vanni)  
 
 
: Merlo ,
 
: Merlo ,
  [http://perun.si.umich.edu/clair/ACL02/ A Multilingual Paradigm for Automatic Verb Classification] ,
+
  [http://perun.si.umich.edu/clair/ACL02/ A Multilingual Paradigm for Automatic Verb Classification] ,  
 +
 
ACL '2002  
 
ACL '2002  
  
 
;July. 31 (Paola Virga)  
 
;July. 31 (Paola Virga)  
 
 
: Yamada, Knight ,
 
: Yamada, Knight ,
  [http://acl.ldc.upenn.edu/P/P02/P02-1039.pdf A decoder for Syntax-based Statistical MT] ,
+
  [http://acl.ldc.upenn.edu/P/P02/P02-1039.pdf A decoder for Syntax-based Statistical MT] ,  
 +
 
ACL '2002
 
ACL '2002
 
  
 
: } ,
 
: } ,
  ==  Spring 2002 == ,{| style="width:800px" border="1"
+
  ==  Spring 2002 == ,  
 +
{| style="width:800px" border="1"
 
!  width="10%"|Date/Time  
 
!  width="10%"|Date/Time  
 
!  width="10%"|Presenter  
 
!  width="10%"|Presenter  
Line 1,313: Line 1,283:
  
 
;Feb. 7 (Paola Virga)  
 
;Feb. 7 (Paola Virga)  
 
 
: Knight, Graehl ,
 
: Knight, Graehl ,
  [http://citeseer.nj.nec.com/knight97machine.html Machine Transliteration] ,
+
  [http://citeseer.nj.nec.com/knight97machine.html Machine Transliteration] ,  
 +
 
Proceedings of the Thirty-Fifth Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
 
Proceedings of the Thirty-Fifth Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
  
 
;Feb. 14 (Charles Schafer )  
 
;Feb. 14 (Charles Schafer )  
 
 
: Yaser, Germann ,
 
: Yaser, Germann ,
  [http://nlp.cs.jhu.edu/~cschafer/trans.ps Translating with Scarce Resources] ,
+
  [http://nlp.cs.jhu.edu/~cschafer/trans.ps Translating with Scarce Resources] ,  
 +
 
American Association for Arti?cial Intelligence 2000
 
American Association for Arti?cial Intelligence 2000
  
 
;Feb. 21 (Jia Cui)  
 
;Feb. 21 (Jia Cui)  
 
 
: Barzilay, McKeown ,
 
: Barzilay, McKeown ,
  [http://citeseer.nj.nec.com/452341.html Extracting Paraphrases from a Parallel Corpus] ,
+
  [http://citeseer.nj.nec.com/452341.html Extracting Paraphrases from a Parallel Corpus] ,  
 +
 
Computer Science Department Columbia.Univ.
 
Computer Science Department Columbia.Univ.
  
 
;Feb. 28 (Silviu Cucerzan)  
 
;Feb. 28 (Silviu Cucerzan)  
 
 
: Marcu ,
 
: Marcu ,
  [http://www.isi.edu/natural-language/projects/rewrite/transmem1.pdf Towards a Unified Approach to Memory- and Statistical-Based Machine Translation.] ,
+
  [http://www.isi.edu/natural-language/projects/rewrite/transmem1.pdf Towards a Unified Approach to Memory- and Statistical-Based Machine Translation.] ,  
 +
 
Annual Meeting of the ACL, Proceedings of the 39th Annual Meeting on Association for Computational Linguistics '2001
 
Annual Meeting of the ACL, Proceedings of the 39th Annual Meeting on Association for Computational Linguistics '2001
  
 
;Mar. 14 (Noah A. Smith)  
 
;Mar. 14 (Noah A. Smith)  
 
 
: Ratnaparkhi ,
 
: Ratnaparkhi ,
  [ftp://ftp.cis.upenn.edu/pub/ircs/tr/97-08.ps.Z A Simple Introduction to Maximum Entropy Models for NLP] ,
+
  [ftp://ftp.cis.upenn.edu/pub/ircs/tr/97-08.ps.Z A Simple Introduction to Maximum Entropy Models for NLP] ,  
 +
 
Institute for Research in Cognitive Science, Univ. of Penn.
 
Institute for Research in Cognitive Science, Univ. of Penn.
  
 
;Mar. 28 (Swapna Somasundaran)  
 
;Mar. 28 (Swapna Somasundaran)  
 
 
: Crestan, El-Beze ,
 
: Crestan, El-Beze ,
  [http://www.hcrc.ed.ac.uk/~sempro/papers/5.pdf Improving supervised WSD by including rough semantic features in a Multilevel view of the Context] ,
+
  [http://www.hcrc.ed.ac.uk/~sempro/papers/5.pdf Improving supervised WSD by including rough semantic features in a Multilevel view of the Context] ,  
 +
 
SEMPRO Workshop, Edinburgh, 2001.
 
SEMPRO Workshop, Edinburgh, 2001.
 
 
 
;Apr. 11 (Paola Virga)  
 
;Apr. 11 (Paola Virga)  
 
 
: Neal, Hinton ,
 
: Neal, Hinton ,
  [http://www.gatsby.ucl.ac.uk/Hinton/chronological.html A view of the EM algorithm that justifies incremental, sparse, and other variants] ,
+
  [http://www.gatsby.ucl.ac.uk/Hinton/chronological.html A view of the EM algorithm that justifies incremental, sparse, and other variants] ,  
 +
 
Learning in Graphical Models, 1999
 
Learning in Graphical Models, 1999
  
 
;Apr. 18 (Paul Ruhlen)  
 
;Apr. 18 (Paul Ruhlen)  
 
 
: NA. Rao, K. Rose ,
 
: NA. Rao, K. Rose ,
  [http://scl.ece.ucsb.edu/html/papers_B.htm Deterministically annealed design of hidden Markov model speech recognizers] ,
+
  [http://scl.ece.ucsb.edu/html/papers_B.htm Deterministically annealed design of hidden Markov model speech recognizers] ,  
 +
 
IEEE Trans. on Speech and Audio Processing, vol. 9, (no. 2), Feb. 2001
 
IEEE Trans. on Speech and Audio Processing, vol. 9, (no. 2), Feb. 2001
 
  
 
: following article builds on the Neal & Hinton paper that we read last week.  It tests an incremental version of EM (carefully choosing how incremental it will be), as well as a "lazy EM" version that visits "significant" cases more often. [http://ipsapp008.lwwonline.com/content/getfile/4984/53/3/fulltext.pdf] ,
 
: following article builds on the Neal & Hinton paper that we read last week.  It tests an incremental version of EM (carefully choosing how incremental it will be), as well as a "lazy EM" version that visits "significant" cases more often. [http://ipsapp008.lwwonline.com/content/getfile/4984/53/3/fulltext.pdf] ,
  |- ,|Apr. 25
+
  |- ,  
 +
|Apr. 25
 
|Paul Ruhlen
 
|Paul Ruhlen
 
 
: H. Al-Adhaileh, Kong, Melamed ,
 
: H. Al-Adhaileh, Kong, Melamed ,
  [http://www.cs.nyu.edu/~melamed/ftp/papers/redecs01.pdf Malay-English Bitext Mapping and Alignment Using SIMR/GSA Algorithms] ,
+
  [http://www.cs.nyu.edu/~melamed/ftp/papers/redecs01.pdf Malay-English Bitext Mapping and Alignment Using SIMR/GSA Algorithms] ,  
 +
 
Malaysian National Conference on Research and Development on Lingustics '2001
 
Malaysian National Conference on Research and Development on Lingustics '2001
 
  
 
: } ,
 
: } ,
  ==  Fall 2001 == ,{| style="width:800px" border="1"
+
  ==  Fall 2001 == ,  
 +
{| style="width:800px" border="1"
 
!  width="10%"|Date/Time  
 
!  width="10%"|Date/Time  
 
!  width="10%"|Presenter  
 
!  width="10%"|Presenter  
Line 1,378: Line 1,348:
  
 
;Dec. 14 (Jia Cui)  
 
;Dec. 14 (Jia Cui)  
 
 
: Bellegarda ,
 
: Bellegarda ,
  [http://ieeexplore.ieee.org/lpdocs/epic03/EarlierIssue.HTM?punumber=5&isyr=2000 Exploiting latent semantic information in statistical language models] ,
+
  [http://ieeexplore.ieee.org/lpdocs/epic03/EarlierIssue.HTM?punumber=5&isyr=2000 Exploiting latent semantic information in statistical language models] ,  
 +
 
Proceedings of the IEEE , Volume: 88 Issue: 8 , Aug. 2000
 
Proceedings of the IEEE , Volume: 88 Issue: 8 , Aug. 2000
  
 
;Nov. 29 (Silviu Cucerzan)  
 
;Nov. 29 (Silviu Cucerzan)  
 
 
: Mike Collins, Yoram Singer ,
 
: Mike Collins, Yoram Singer ,
  [http://www.research.att.com/~mcollins/papers/emnlp99.ps Unsupervised Models for Named Entity Classification] ,
+
  [http://www.research.att.com/~mcollins/papers/emnlp99.ps Unsupervised Models for Named Entity Classification] ,  
 +
 
EMNLP/VLC'99
 
EMNLP/VLC'99
  
 
;Nov. 20 (Radu Florian)  
 
;Nov. 20 (Radu Florian)  
 
 
: Blum, Mitchell ,
 
: Blum, Mitchell ,
  [http://nlp.cs.jhu.edu/~rflorian/cotraining.ps Combining Labeled and Unlabeled Data with Co-Training] ,
+
  [http://nlp.cs.jhu.edu/~rflorian/cotraining.ps Combining Labeled and Unlabeled Data with Co-Training] ,  
 +
 
Proceedings of 1998 Conference on Computational Learning Theory  
 
Proceedings of 1998 Conference on Computational Learning Theory  
  
 
;Nov. 16 (Richard Wicentowski)  
 
;Nov. 16 (Richard Wicentowski)  
 
 
: Eisner, Satta ,
 
: Eisner, Satta ,
  [http://cs.jhu.edu/~jason/papers/#acl99 Efficient parsing for bilexical context-free grammars and head automaton grammars] ,
+
  [http://cs.jhu.edu/~jason/papers/#acl99 Efficient parsing for bilexical context-free grammars and head automaton grammars] ,  
 +
 
ACL '99
 
ACL '99
 
  
 
: plagiarism detection systems might be relevant to bitext alignment.  A message to the Corpora list yesterday announced the following review paper:[http://www.dcs.shef.ac.uk/~cloughie/papers/Plagiarism.pdf] ,
 
: plagiarism detection systems might be relevant to bitext alignment.  A message to the Corpora list yesterday announced the following review paper:[http://www.dcs.shef.ac.uk/~cloughie/papers/Plagiarism.pdf] ,
  |- ,|Nov. 2
+
  |- ,  
 +
|Nov. 2
 
|Paul Ruhlen
 
|Paul Ruhlen
 
 
: Manning, Schuetze ,
 
: Manning, Schuetze ,
  Foundations of Statistical Natural Language Processing, Section 14 on clustering, pp. 495-527. ,
+
  Foundations of Statistical Natural Language Processing, Section 14 on clustering, pp. 495-527. ,  
 +
 
MIT Press
 
MIT Press
  
 
;Oct. 26 (Gideon Mann )  
 
;Oct. 26 (Gideon Mann )  
 
 
: Tishby, Pereira, Bialek ,
 
: Tishby, Pereira, Bialek ,
  [http://www.arxiv.org/find/physics/1/au:+Pereira_F/0/1/0/all/0/1 The information bottleneck method] ,
+
  [http://www.arxiv.org/find/physics/1/au:+Pereira_F/0/1/0/all/0/1 The information bottleneck method] ,  
 
+
 
: The paper describes a clustering method which is a generalization of their earlier work on "Distributional Clustering of English Words" (pereira,tishby and lee '93). ,
 
: The paper describes a clustering method which is a generalization of their earlier work on "Distributional Clustering of English Words" (pereira,tishby and lee '93). ,
 
  |} ,
 
  |} ,

Revision as of 18:38, 10 February 2008

The reading group attempts to keep abreast of current trends in natural language processing research. We typically read one or two recent NLP conference papers each week, and occasionally look at material from the machine learning, statistics, and linguistics communities as well.

Starting in 2008, we will be posting the weekly readings here. Past readings since 2001 are being filled in presently.

Spring 2008

First meeting of the term will be on Thursday, Jan. 31, at noon in NEB 317. Feel free to bring lunch.

Fall 2007

Topics:

  • Domain adaptation
  • Recent parsing work
  • Text compression
  • Semisupervised learning


Date/Time Presenter Paper(s) Supporting Papers/Notes
Sep.26 (Omar F Zaidan)
J. Blitzer, R. McDonald, F. Pereira ,
Domain Adaptation with Structural Correspondence Learning , 

EMNLP 2006

Oct.3 (David Smith)
Shai Ben-David, John Blitzer, Koby Crammer, Fernando Pereira. ,
Analysis of Representations for Domain Adaptation. , 
Oct. 10 (Nathaniel W Filardo)
Mahoney, Matthew ,
Adaptive Weighing of Context Models for Lossless Data Compression. , 

Florida Institue of Technology, CS Department, Technical report CS-2005-16

EMNLP-CoNLL 2007

Oct. 17 (Markus Dreyer)
Nakagawa, Tetsuji ,
Multilingual Dependency Parsing Using Global Features , 

EMNLP-CoNLL 2007

Oct. 26 (Christo Kirov)
Seginer, Yoav ,
Fast Unsupervised Incremental Parsing (syntax induction) , 

Proceedings ACL 2007


Nov. 3 (Christo Kirov)
I. Titov, J. Henderson ,
Constituent Parsing with Incremental Sigmoid Belief Networks , 

ACL 2007

Nov. 17 (David Smith)
X. Zhu ,
Semi-Supervised Learning Literature Survey , 
Dec. 12 (Delip Rao)
M. Belkin, P. Niyogi ,
Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 

ACM 2002


Mikhail Belkin, Partha Niyogi, Vikas Sindhwani

On Manifold Regularization

} ,
==  Summer 2007 == , 

Topics:

  • Good recent papers (mainly from 2007)


Date/Time Presenter Paper(s) Supporting Papers/Notes
May 10 (David Smith )
M. Johnson, T. Griffiths, and S. Goldwater ,
Bayesian Inference for PCFGs via Markov Chain Monte Carlo , 

HLT/NAACL 2007

May 17 (Markus Dreyer)
M. Galley, K. McKeown ,
Lexicalized Markov Grammars for Sentence Compression , 

HLT/NAACL 2007


June 2 (Erin Fitzgerald)
J. Jiang, C. Zhai ,
A Systematic Exploration of the Feature Space for Relation Extraction , 

HLT/NAACL 2007

June 6 (Nikesh Garera)
A. Alexandrescu, K. Kirchhoff ,
Data-Driven Graph Construction for Semi-Supervised Graph-Based Learning in NLP , 

HLT/NAACL 2007

June 14 (David Smith)
X. Zhu, Z. Ghahramani,J. Lafferty ,
Semi-supervised learning using Gaussian fields and harmonic functions. , 

ICML 2003

June 21 (Christopher White)
K. Murphy, Y. Weiss, M. Jordan ,
Propagation for approximate inference: An empirical study. , 

15th UAI, pages 467-?75, 1999

... discussing (loopy) belief propagation as background for survey propagation, a topic which has been getting more attention lately for its ability to "solve very large hard combinatorial problems, such as determining the satisfiability of Boolean formulas. ,
Chapter 8 of Chris Bishop's textbook is supposed to be a good treatment of graphical models overall.  It is available free here [1].  He covers BP in section 8.4.4 after first presenting factor graphs in 8.4.3. , 

David MacKay's treatment of BP, also in terms of factor graphs, is in chapter 26 of his book [2]. It's worth reading this chapter in full, perhaps first reading chapter 16. ... the update equations are given as (26.11) and (26.12) ... [substantial further discussion by jason was here]

Some people may prefer Bishop's style, others MacKay's.

July 6 (Christopher White)
A. Braunstein, M. Mezard, R. Zecchina. ,
Survey propagation: an algorithm for satisfiability. , 

Random Structures and Algorithms, 2005.

We sent some questions to Zecchina. ,
Lukas Kroc, Ashish Sabharwal and Bart Selman. , 

Survey Propagation Revisited: An Empirical Study.

23rd UAI, 2007.

July 18 (David Smith)
P. Liang, S. Petrov, M. Jordan, D. Klein ,
The Infinite PCFG Using Hierarchical Dirichlet Processes. , 

Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning,

Aug. 3 (Yi Su)
M. Galley, K. McKeown ,
Lexicalized Markov Grammars for Sentence Compression. , 

NAACL-HLT 2007

Aug. 11 (Nikesh Garera)
L. Shen, G. Satta, A. Joshi. ,
Guided learning for bidirectional sequence classification , 

ACL 2007

Aug. 18 (Markus Dreyer)
D. Talbot, M. Osborne ,
Randomised Language Modelling for Statistical Machine Translation , 

ACL 2007

They use a space-efficient randomized data structure (Bloom Filter) to store very large n-gram models. ,
There is a companion paper that people might want to have a quick look at as well, for comparison: , 

D. Talbot, M. Osborne

Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap

ACL 2007

Aug. 30 (Delip Rao)
Gideon S. Mann ,
Simple, Robust, Scalable Semi-supervised Learning via Expectation Regularization , 

Proceedings of the 24 th International Conference on Machine Learning 2007


} ,
==  Spring 2007 == , 

Topics:

  • Morphology (unsupervised learning)
  • Recent IR/QA papers (with an NLP or multilingual focus)
  • Integrating search and learning


Date/Time Presenter Paper(s) Supporting Papers/Notes
Apr. 19 (John Blatz)
A. Prieditis ,
Machine discovery of Effective Admissible Heuristics  , 

Machine Learning Journal, 1993

Apr. 12 (Markus Dreyer)
A. Haghighi, J. DeNero and D. Klein ,
Approximate Factoring for A* Search , 

NAACL-HLT 2007

Mar. 29 & Apr. 5 (Zhifei Li)
H. Daume III, J. Langford, and D. Marcu ,
Search-based structured prediction. , 

Machine Learning Journal, forthcoming

Mar. 8 (David Smith)
H. Daume III & D. Marcu ,
Learning as search optimization: approximate large margin methods for structured prediction. , 

ICML 2005

Mar. 1 (Wei Chen)
M. Kaisser, S. Scheible, and B. Webber ,
Experiments at the University of Edinburgh for the TREC 2006 QA track. , 

TREC-15

They do some fairly deep interpretation of sentences, extracting their predicate-argument structure. ,
Feb. 22 Eric Harley
K. Kan Lo & W. Lam ,
Using Semantic Relations with World Knowledge for Question Answering , 

TREC-15

Feb. 15 (Nikhil Bojja)
C. Monson et. al. ,
Unsupervised Induction of Natural Language Morphology Inflection Classes , 

ACL Student Workshop '04

Feb. 8 (Delip Rao)
P. Schone and D. Jurafsky ,
Knowledge-free induction of morphology using latent semantic analysis  , 

CoNLL 2000

However, there was an extension of this work reported in NAACL-2001 that looks at circumfixes and prefix/affix combinations. [3] ,
Feb. 1 Nikesh Garera
D. Yarowsky and R. Wicentowski ,
Minimally supervised morphological analysis by multimodal alignment  , 

ACL 2000

For more details refer to Chapter 4 of Wicentowski's thesis. ,
,

Fall 2006

Topics:

  • Machine learning: Margin methods and structured classification
  • Linguistics: Syntactic formalisms
  • Syntax-based MT


Date/Time Presenter Paper(s) Supporting Papers/Notes
Dec. 13 (Delip Rao)
J. Carbonell et. al. ,
Context-based machine translation , 

AMTA 2006

Dec. 6 (Jason Smith)
M. Galley et. al. ,
Scalable Inference and Training of Context-Rich Syntactic Translation Models , 

ACL 2006

It may also be helpful to look at: ,
M. Galley et. al. , 

What's in a translation rule?

HLT/NAACL 2004


Nov. 29 (Balakrishnan V)
D. Marcu et. al. ,
SPMT: Statistical Machine Translation with Syntactified Target Language Phrases  , 

EMNLP 2006

Nov. 15 (Eric Harley)
D. Chiang ,
An introduction to synchronous grammars , 

ACL 2006 Tutorial

Slides from the talk are also available. [4] ,
Nov. 8 Elliott Drabek
K.Shklovsky ,
A Grammatical Sketch of Petalcingo Tzeltal , 

Undergraduate Thesis, Reed College, 2005

It is 77 pages long, but not dense, and I will be skipping the following sections: ,
Pages , 

01-14 Phonetics and phonology

18-18 Polyvalence

21-21 Inherent possession and ...

46-55 Tense and aspect and other sections

Nov. 1 (Yi Su)
M. Steedman ,
Gapping as Constituent Coordination , 

Linguistics and Philosophy, Vol. 13, 1990, pp.207-264.

See Yi for photocopies. ,
Oct. 25 Markus Dreyer
S. Reizler et. al. ,
Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques , 

ACL 2002


Oct. 18 (Erin Fitzgerald)
J. Bresnan & R.M. Kaplan ,
Lexical-Functional Grammar: A Formal System for Grammatical Representation  , 

The Mental Representation of Grammatical Relations, MIT Press, 1982

BTW, the edited collection that this appears in is generally interesting. Bresnan defends and develops lexicalized grammars in general; the idea of separate surface and semantic roles; and Bresnan & Kaplan's LFG in particular. You should know that she originated (in 1978) the extremely influential idea of lexicalized syntax -- the idea that a grammar is simply a collection of lexical entries to be assembled in standard language-independent ways, but that there are also "lexical redundancy rules" that relate, e.g., active and passive entries for the same verb. Some chapters address morphological and cognitive issues pertaining to lexicalization, including an essay by Pinker on lexicalist learning. ,
Slides from Erin's presentation can be found here. , 
Oct. 11 (John Blatz)
L.Xu, D. Wilkinson, F. Southey, & D. Schuurmans ,
Discriminative Unsupervised Learning of Structured Predictors  , 

ICML 2006

Oct. 4 (Nikesh Garera)
A. Culotta & J. Sorensen ,
Dependency Tree Kernels for Relation Extraction  , 

ACL 2004


D. Zelenko, C. Aone, & A. Richardella

Kernel Methods for Relation Extraction

JMLR, Volume 3, 2003

Sept. 27 (David Smith)
C. Cortes, P. Haffner, & M. Mohri ,
Rational Kernels  , 

NIPS 2003

Papers extending rational kernels, including results on positive semidefinite cases, are at:[5] ,
For the record, and not to be read, is an interesting parallel line of research in Fisher Kernels over strings, e.g. this paper by Saunders, Shawe-Taylor and Vinokourov: [6] , 
Sept. 20 (Elliot Drabek)
K.Q. Weinberger, F. Sha, & L.K. Saul ,
Learning a kernel matrix for nonlinear dimensionality reduction  , 

ICML 2004

S.T. Roweis & L.K. Saul ,
Nonlinear Dimensionality Reduction by Locally Linear Embedding  , 

Science, 22 December 2000


J.B. Tenenbaum, V. De Silva, & J.C. Langford

A global geometric framework for nonlinear dimensionality reduction

Science, 22 December 2000

Sept. 13 (Roy Tromble)
L. Xu, J. Neufeld, B. Larson, & D. Schuurmans ,
Maximum Margin Clustering  , 

NIPS 2004

} ,
==  Summer 2006 == , 

Topics:

  • Recent HLT-NAACL papers
Date/Time Presenter Paper(s) Supporting Papers/Notes
Jun. 24 (David Smith)
Percy Liang, Ben Taskar, Dan Klein ,
Alignment by Agreement , 

HLT-NAACL, 2006

Jun. 31 (Markus Dreyer)
Joakim Nivre, Johan Hall et al ,
Labeled Pseudo-Projective Dependency Parsing with Support Vector Machines , 

Procceding of CoNLL, 2006


J. Nivre, J. Nilsson

Pseudo-Projective Dependency Parsing

ACL 2005

Jul. 6 (Keith Hall)
Charles Sutton, Michael Sindelar, Andrew McCallum ,
Reducing Weight Undertraining in Structured Discriminative Learning , 

HLT-NAACL, 2006

Jul. 20 (Roy Tromble)
Mehryar Mohri, Brian Roark ,
Probabilistic Context-Free Grammar Induction Based on Structural Zeros , 

HLT-NAACL, 2006

Aug. 4 (David Smith)
Sharon Goldwater, Thomas L. Griffiths, Mark Johnson ,
Contextual Dependencies in Unsupervised Word Segmentation , 

ACL 2006

Anyone looking for a more straight-up language modeling discussion can compare: ,
Yee Whye Teh , 

A Hierarchical Bayesian Language Model Based On Pitman-Yor Processes

ACL 2006


More resources:

Machine Learning MLPedia page on Dirichlet Processes

Michael Jordan's NIPS 2005 tutorial: Nonparametric Bayesian Methods: Dirichlet Processes, Chinese Restaurant Processes and All That

Y. Teh, M. Jordan, M. Beal, and D. Blei

Hierarchical Dirichlet processes

Journal of the American Statistical Association, 2006


} ,
==  Spring 2006 == , 

Topics:

  • Consensus decoding
  • Miscellous extraction (idioms)
  • Algorithmic speedups/search/dynmaic programming/hard problems
  • Disctance reranking
Date/Time Presenter Paper(s) Supporting Papers/Notes
Feb. 9 (John Blatz)
Dominic Widdows, Beate Dorow ,
Automatic Extraction of Idioms using Graph Analysis and Asymmetric Lexicosyntactic Patterns , 

Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition, 2005


Afsaneh Fazly, Suzanne Stevenson

Automatic Acquisition of Knowledge about Multiword Predicates

Proceedings of the 19th Pacific Asia Conference on Language, Information, and Computation (PACLIC 2005).

Feb. 16 (Noah A Smith)
Khalil Sima'an ,
Computational Complexity of Probabilistic Disambiguation by means of Tree-Grammars , 

COLING 1996


Francisco Casacuberta, Colin de la Higuera

Computational complexity of problems on probabilistic grammars and

LNAI 1981

For more HMM/Comp, bio view, and extended results view: ,
Rune B. Lyngsoe, Christian N. S. Pederson , 

The Consensus String Problem and the Complexity of Comparing Hidden

Journal of Computer and System Sciences 65, 2002


Feb. 23 (Omar F. Zaidan)
Ravichandran, Pantel, Hovy ,
Randomized Algorithms and NLP: Using Locality Sensitive Hash Function for High Speed Noun Clustering , 

Proceedings of the 43rd Annual Meeting of the ACL, 2005


J. Gorman, J. Curran

Approximate Searching for Distributional Similarity

Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition, 2005

Mar.3 (Jason Riesa)
Hal Daume III, Daniel Marcu ,
Domain Adaptation for Statistical Classifiers , 

Journal of Artificial Intelligence Research, 2006

Mar.10 (Roy Tromble)
Terry Koo, Michael Collins ,
Hidden-Variable Models for Discriminative Reranking , 

Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, 2005

Mar.17 (Elliott Franco Drabek)
Necip Fazil Ayan, Bonnie J. Dorr, Christof Monz ,
Alignment Link Projection Using Transformation-Based Learning , 

HLT-EMNLP 2005


Mar.31 (Eric Harley)
Ben Taskar, Lacoste-Julien Simon, Klein Dan ,
A Discriminative Matching Approach to Word Alignment , 

ACL 2005

Apr.6 (Eric Harley)
Ben Taskar, Lacoste-Julien Simon, Klein Dan ,
A Discriminative Matching Approach to Word Alignment , 

ACL 2005


Ryan McDonald, Fernando Pereira, Kiril Ribarov, Jan Hajie

Non-projective Dependency Parsing using Spanning Tree Algorithms

Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), 2005.

Apr. 20 (Balakrishnan V)
Richard M. Karp, Michael 0. Rabin ,
Efficient randomized Pattern matching Algorithms , 

IBM Journal of Research and Development, 1987

May 4 (David Smith)
C. E. R. Alves, E. N. C′aceres F. Dehne ,
Parallel dynamic programming for solving the string editing problem on a CGM/BSP , 

SPAA 2002

May 11 (John Blatz)
M. Gengler ,
An introduction to parallel dynamic programming , 

Lecture Notes in Computer Science, 1996

May 18 (Markus Dreyer)
Jonathan May, Kevin Knight ,
A Better N-Best List: Practical Determinization of Weighted Finite Tree Automata , 

Proc. NAACL-HLT, 2006

} ,
==  Fall 2005 == , 
Date/Time Presenter Paper(s) Supporting Papers/Notes
Sept. 14 (Nikesh Garera)
M. Jordan ,
Statistical Learning Theory Chapter 8 (Exponential family and Generalized linear models) , 
Sept. 21 (Arnab Ghoshal)
M. Jordan ,
Statistical Learning Theory Chapter 2&3 , 
Oct. 20 (Roy Tromble)
Sheila M. Reynolds, Jeff A. Bilmes ,
Part-of-Speech Tagging using Virtual Evidence and Negative Training. , 

Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing. 2005. pp 459--466.

Oct. 27 (Markus Dreyer)
D. Roth and W. Yih ,
Integer Linear Programming Inference for Conditional Random Fields. , 

ICML '2005


Nov. 4 (Jason Riesa)
Luke S. Zettlemoyer, Michael Collins. ,
Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial , 

Proceedings of UAI 2005

Nov. 16 (Safiullah Shareef)
Hassan Sawaf, J?rg Zaplo, Hermann Ney ,
Statistical Classification Methods for Arabic News Articles , 
Nov. 23 (Roy Tromble)
Sutton, Charles and McCallum, Andrew ,
Composition of Conditional Random Fields for Transfer Learning , 

Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing 2005

} ,
==  Summer 2005 == , 

Topics:

  • Recent papers on ACL / CoNLL / parallel-text workshop
  • Optimality Theory
  • Unsupervised/semisupervised/EM
  • AI
  • Graphical Models
  • Dependency Parsing
  • Kernels
  • Algorithms
  • Syntactic MT
  • MT Techniques
  • Non-dependency parsing
Date/Time Presenter Paper(s) Supporting Papers/Notes
July 14 (Roy Tromble)
Goldwater and Johnson ,
[http://www.cog.brown.edu:16080/~sgwater/papers/OTvar03.pdf Learning OT Constraint Rankings Using a Maximum  , 

Entropy Model]

In Proceedings of the Workshop on Variation within Optimality Theory, 2003


July 21 (Keith and Damianos)
Sharon Goldwater, Mark Johnson ,
[http://www.aclweb.org/anthology/W/W05/W05-0615 Representational Bias in Unsupervised Learning of Syllable  , 

Structure]

ACL 2005


Ando, Rie and Zhang, Tong

[http://www.aclweb.org/anthology/P/P05/P05-1001 A High-Performance Semi-Supervised Learning Method for Text

Chunking]

ACL 2005


July 28 (Zak)
Takuya Matsuzaki, Yusuke Miyao, Jun'ichi Tsujii ,
Probabilistic CFG with Latent Annotations , 

ACL 2005

Aug 5 (Adam)
Duh, Kevin and Kirchhoff, Katrin ,
[http://www.aclweb.org/anthology/W/W05/W05-0708 Tagging of Dialectal Arabic: A Minimally Supervised  , 

Approach]

ACL 2005


Aug 19 (John Blatz)
Niyogi, Sourabh ,
Steps Toward Deep Lexical Acquisition , 

ACL 2005

Aug 26 (Roy Tromble)
Jenny Rose Finkel, Trond Grenager, Christopher Manning ,
[http://www.aclweb.org/anthology/W/W05/W05-0511  Incorporating Non-local Information into Information  , 

Extraction Systems by Gibbs Sampling]

ACL 2005

Sep.1 (Markus Nikesh, John Blatz )
B. Walsh ,
[http://nitro.biosci.arizona.edu/courses/EEB581-2004/handouts/Gibbs.pdf  Markov Chain Monte Carlo , 

and Gibbs Sampling]

Lecture Notes for EEB 581, version 26 April 2004


} ,
==  Spring 2005 == , 

Topics:

  • Bayesian Nets / inference (tutorials in Michael Jordan's book)
  • Dependency Networks
  • Using the web as a corpus & extracting corpora from the web
Date/Time Presenter Paper(s) Supporting Papers/Notes
Feb. 25 (David Smith)
M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
Learning in Graphical Models , 

MIT Press, 1999

Mar. 4 (David Smith)
M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
Learning in Graphical Models , 

MIT Press, 1999

Mar. 11 (David Smith)
M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
Learning in Graphical Models , 

MIT Press, 1999

Apr. 2 (David Smith)
M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul ,
Learning in Graphical Models , 

MIT Press, 1999

Apr. 9 (Noah A Smith)
G. Elidan and N. Friedman. ,
The Information Bottleneck EM Algorithm , 

UAI 2003


G. Elidan, Nir Friedman

Learning Hidden Variable Networks

JMLR 2005

Apr. 16 (Noah A Smith)
V. Lavrenko, S.L Feng, R. Manmatha ,
Statistical models for automatic video annotation and retrieval , 

Acoustics, Speech, and Signal Processing, 2004. Proceedings.

Apr. 21 (Omar F. Zaidan)
Tin Kam Ho, Jonathan J. Hull, Sargur N. Stihari ,
Decision Combination in Multiple Classifier Systems , 

IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.16. No I. Jan. 1994


Dan Klein, Kristina Toutanova, H. Tolga Ilhan, Sepandar D. Kamvar and Christopher D. Manning

Combining Heterogeneous Classifiers forWord-Sense Disambiguation

ACL 2002

Apr. 28 (Damianos Karakos)
Alessandro Moschitti and Roberto Basili ,
Complex Linguistic Features for Text Classification: a comprehensive study , 

In proceedings of the 26th European Conference on Information Retrieval Research (ECIR 2004)

May 7 (Markus Dreyer)
M. Diligenti, F.M. Coetzee, S. Lawrence, C.L. Giles, M. Gori ,
Focused Crawling Using Context Graphs , 

26th International Conference on Very Large Databases, VLDB 2000


Adam Kilgarriff and Gregory Grefenstette

Introduction to the Special Issue on the Web as Corpus

Computational Lingustics, 2003


} ,
==  Fall 2004 == , 

Topics:

  • Recent papers from ACL/EMNLP 2004
  • Graph methods
  • Unification parsing
  • Parsing strategies
  • Syntax for MT or vice-versa
  • TAG-based noisy channel model of speech repairs
  • Collective information extraction with relational Markov networks
Date/Time Presenter Paper(s) Supporting Papers/Notes
Aug. 20 (Damianos Karakos, Charles Schafer)
P. Pantel and D. Lin ,
Discovering word senses from text , 

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, 2002


Diana McCarthy, Rob Koeling, Julie Weeds, John Carroll

[ftp://ftp.informatics.susx.ac.uk/pub/users/dianam/senseranks.pdf Finding Predominant Word Senses in

Untagged Text]

2004

Aug. 27 (David Smith)
I. Dan Melamed ,
Statistical Machine Translation by Parsing , 

ACL 2004


Daniel Gildea

[http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Gildea.pdf Dependencies vs. Constituents for Tree-Based

Alignment]

ACL 2004

Sep. 2 (Gideon Mann)
Xin Li, Paul Morie, and Dan Roth ,
[http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/139_Paper.pdf Robust Reading: Identification and Tracing  , 

of Ambiguous Names]

ACL 2004


Cheng Niu, Wei Li, Rohini K. Srihari

[http://acl.ldc.upenn.edu/acl2004/main/pdf/372_pdf_2-col.pdf Weakly Supervised Learning for Cross-Document

Person-Name Disambiguation Supported by Information Extraction]

ACL 2004

Sep. 9 (John Blatz)
Pascale Fung and Percy Cheung ,
[http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Fung.pdf Mining Very-Non-Parallel Corpora: Parallel Sentence  , 

and Lexicon Extraction via Bootstrapping and EM]

ACL 2004


Dragos Stefan Munteanu, Alexander Fraser and Daniel Marcu

[http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/93_Paper.pdf Improved Machine Translation Performance via

Parallel Sentence Extraction from Comparable Corpora]

ACL 2004

Sep. 24 (Roy Tromble)
B. Taskar, C. Guestrin and D. Koller ,
Max-Margin Markov Networks , 

Neural Information Processing Systems Conference (NIPS03), 2003


B. Taskar, D. Klein, M. Collins, D. Koller and C. Manning

Max-Margin Parsing

EMNLP 2004



Oct. 2
Nguyen Bach ,
Background knowledge on SVM and Graphical Models ,

Intro SVM

Intro Graphical Models


Oct. 15 (Nguyen Bach)
Daichi Mochihashi, Genichiro Kikui, Kenji Kita ,
[http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Mochihashi.pdf Learning Nonstructural Distance Metric by  , 

Minimum Cluster Distortions]

EMNLP 2004

Oct. 22 (Michelle Vanni)
Lin and Och ,
[http://acl.ldc.upenn.edu/acl2004/main/pdf/215_pdf_2-col.pdf Automatic Evaluation of Machine Translation  , 

Quality Using Longest Common Subsequence]

ACL 2004


Babych and Hartley

[http://acl.ldc.upenn.edu/acl2004/main/pdf/349_pdf_2-col.pdf Extending the BLEU MT Evaluation Method with

Frequency Weightings]

ACL 2004

Oct. 29 (Eric Goldlust)
Clark and Curran ,
[http://web.comlab.ox.ac.uk/oucl/work/stephen.clark/papers/acl04.pdf Parsing the WSJ using CCG and Log- , 

Linear Models]

ACL 2004

Nov. 5 (Michelle Vanni)
Robert S. Swier and Suzanne Stevenson ,
Unsupervised Semantic Role Labelling , 

EMNLP 2004


Nianwen Xue, Martha Palmer

Calibrating Features for Semantic Role Labelling

EMNLP 2004

Nov. 13 (Michelle Vanni)
Robert S. Swier and Suzanne Stevenson ,
[nlp.cs.jhu.edu/~cschafer/david/Ch2.pdf Inexact Graph Matching Using Estimation of Distribution  , 

Algorithms,Chapter 2, The graph matching problem]

Submitted to the Ecole Nationale Supérieure des Télécommunications (Paris), for the Degree of Doctor of

Philosophy. 2002


Yakov Keselman, Ali Shokoufandeh, M. Fatih Demirci, Sven Dickinson

[nlp.cs.jhu.edu/~cschafer/david/many-to-many-graph.pdf Many-to-Many Graph Matching via Metric Embedding]

Computer Vision and Pattern Recognition, 2003. Proceedings. 2003 IEEE

...This chapter is general to the field although pretty sweeping and unspecific as a result. It probably makes a ,
good introduction, since it gives an idea of the scope and diversity of the problem and proposed techniques... , 

...this is a state of the art paper which is quite dense but quite interesting. solves a very general formulation

of inexact graph matching by first imbedding graphs into a normed space...


Nov. 20 (David Smith)
Olle H鋑gstr鰉 and Karin Nelander ,
On Exact Simulation of Markov Random Fields Using Coupling from the Past , 

Foundation of the Scandinavian Journal of Statistics, 1999


James Fill and Mark Huber

The Randomness Recycler: A New Technique for erfect Sampling.

IEEE Symposium on Foundations of Computer Science, 2000


Nov. 27 (Jia Cui)
David M. Blei, Andrew Y. Ng, Michael I. Jordan ,
Latent Dirichlet Allocation , 

Journal of machine Learning Research 3, 2003

A additional related report on LDA ,
[www.cs.toronto.edu/~ywteh/research/npbayes/report.pdf] , 

Another introduction to LDA

[7]


} ,
==  Spring 2004 == , 

Topics:

  • combinatorial optimization (software)
  • optimality theory
  • information extraction
Date/Time Presenter Paper(s) Supporting Papers/Notes
Feb. 5 (Brock)
Jessica A. Barlow and Judith A. Gierut ,
Optimality theory in phonological acquisition , 

Journal of Speech, Language and Hearing 42, 1999

----

Paul Boersma, Joost Dekkers and Jeroen van de Weijer

Introduction. In Optimality Theory: Phonology, Syntax and Acquisition

Oxford University Press 2000

Feb. 12 (Brock)
Bob Frank, Giorgio Satta ,
Optimality theory and the Generative Complexity of Constraint Violability , 

MIT Press

A glimpse (from MIT Press): ,
It has been argued that rule-based phonological descriptions can uniformly be expressed as mappings carried out by finite-state transducers, and therefore fall within the class of rational relations. If this property of generative capacity is an empirically correct characterization of phonological mappings, it should hold of any sufficiently restrictive theory of phonology, whether it utilizes constraints or rewrite rules. In this paper, we investigate the conditions under which the phonological descriptions that are possible within the view of constraint interaction embodied in Optimality Theory (Prince and Smolensky 1993) remain within the class of rational relations. We show that this is true when GEN is itself a rational relation, and each of the constraints distinguishes among finitely many regular sets of candidates. , 
Feb. 19 (David Smith)
Barzilay and Lee ,
Learning to Paraphrase: An Unsupervise Approach Using Multiple-Sequen7:12 PM 2/4/2008ce Alignment , 

HTL 2003

Mar. 5 (Charles Schafer)
Daniel Marcu ,
Theory and Practice of Discourse Parsing and Summarization, Chapters 2 & 3 , 

The MIT Press, 2000

Mar. 18 (Markus Dreyer)
Eugene Charniak, Niyu Ge, John Hale

A Statistical Approach to Anaphora Resolution

Proceedings of the Sixth Workshop on Very Large Corpora, 1998

Mar. 25 (Eric Goldlust)
Boyan and Moore ,
Learning Evaluation Functions to Improve Optimization by Local Search , 

Journal of Machine Learning Research, 2000

Apr. 3 (Roy Tromble)
Roman Bartak ,
Constraint Programming: In Pursuit of the Holy Grail , 

1999

Apr. 10 (Noah Ashton Smith)
Denys Duchier ,
Axiomatizing Dependency Parsing Using Set Constraints , 

Sixth Meeting on Mathematics of Language, 2000

Apr. 10 (Noah Ashton Smith)
Denys Duchier ,
Axiomatizing Dependency Parsing Using Set Constraints , 

Sixth Meeting on Mathematics of Language, 2000

Apr. 17 (Elliott Franco Drabek)
Rina Dechter ,
Mini-Buckets: A General Scheme for Generating Approximations in Automated Reasoning , 

2001

Apr. 24 (David Smith)
McCallum and Jensen ,
Extraction and Data Mining using Conditional-Probability, Relational Models , 

IJCAI'03 Workshop on Learning Statistical Models from Relational Data, 2003

The paper is a survey of recent trends in IE and data mining (biased of course towards the authors' work) and a proposal to unify them with conditional random fields. ,
May. 1 Izhak Shafran
Eric J. Friedman ,
Strong Monotonicity in Surplus Sharing , 

1999

Used Tom Dietterich has a web page on probabilistic relational models: ,
[8] , 
May. 15 (Roy Tromble)
Fuchun Peng, Andrew McCallum ,
Accurate Information Extraction from Research Papers using Conditional Random Fields , 

2004

} ,
==  Fall 2003 == , 
Date/Time Presenter Paper(s) Supporting Papers/Notes
Sep.11 (Elliott Franco Drabek)
Bernard Comrie ,
Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  , 

Syntax and Morphology, Chapter 1

Blackwell Pub (1989)

Sep.18 (David Smith)
Bernard Comrie ,
Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  , 

Syntax and Morphology, Chapter 2-3

Blackwell Pub (1989)

Oct. 3 (Michelle Vanni)
Bernard Comrie ,
Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  , 

Syntax and Morphology, Chapter 4-6

Blackwell Pub (1989)

Oct. 10 (David Smith)
Bernard Comrie ,
Language Universals Linguistic Typology: Syntax and Morphology Language Universals Linguistic Typology:  , 

Syntax and Morphology, Chapter 6-7

Blackwell Pub (1989)

Oct. 24 (Markus Dreyer)
Stuart M. Shieber, Yves Schabes ,
Synchronous Tree-Adjoining Grammars , 

Coling 1990

An additional closely related paper ,
Stuart M. Shieber, Yves Schabes , 

Generation and Synchronous Tree-Adjoining Grammars

Fifth International Workshop on Natural Language Generation.

Oct. 31 (Roy Tromble)
Dekai Wu ,
An algorithm for simultaneously bracketing parallel texts by aligning words , 

ACL 1995

Nov. 6 (Brock Pytlik)
Stuart M. Shieber ,
Transducers as a Substrate for Natural Language Processing , 
Nov. 13 (Markus Dreyer)
Goldman and Zhou ,
Enhancing Supervised Learning with Unlabeled Data , 

27th Int. Conf. on Mach. Learn. 2000

An additional paper with some experiments ,
Clark, Curran and Osborne , 

Bootstrapping POS taggers using Unlabelled Data

CoNLL 2003

Nov. 20 (Noah A. Smith)
Rebecca Hwa, Miles Osborne, Anoop Sarkar, Mark Steedman ,
Corrected Co-training for Statistical Parsers , 

ICML 2003

Dec. 12 (Paola Virga)
Kamal Nigam and Rayid Ghani ,
Analyzing the Effectiveness and Applicability of Co-training , 

Ninth International Conference on Information and Knowledge Management 2000

} ,
==  Spring 2003 == , 
Date/Time Presenter Paper(s) Supporting Papers/Notes
Feb. 13 (David Smith)
K. Church ,
Empirical Estimates of Adaptation: The chance of Two Noriega's is closer to p/2 than p^2 , 

Coling 2000, pp. 173-179


Feb. 19 (Elliott Drabek)
A. Lopez??, M. Nossal??, R. Hwa, P. Resnik ,
Word-level Alignment for Multilingual Resource Acquisition , 

Proceedings of the 2002 LREC Workshop on Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Language Data


Feb. 26 (Elliott Drabek)
Steven Abney ,
Bootstrapping , 

ACL'02

Mar.6 (Paola Virga)
Carl M. Kadie, Christopher Meek, David Heckerman ,
A Collaborative Filtering System Using Posteriors Over Weights of Evidence , 

Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence, 2002.


Mar.20 (Roy Tromble)
Nikita Schmid, Ahmed Patel ,
[ttp://arXiv.org/abs/cs/0201008 Using Tree Automata and Regular Expressions to Manipulate Hierarchically Structured Data] , 
Apr.10
|V. N. Vapnik ,
The Nature of Statistical Learning Theory, Intro and Chapters 1, 2A , 
Apr.17 (Roy Tromble)
V. N. Vapnik ,
The Nature of Statistical Learning Theory,Chapters 2B - 4A , 
Apr. 24 (Paola)
V. N. Vapnik ,
The Nature of Statistical Learning Theory, Chapters 4B - 5A , 
May 1 (Noah)
V. N. Vapnik ,
The Nature of Statistical Learning Theory, Chapters 5B - 6A , 
May 8 (Noah)
V. N. Vapnik ,
The Nature of Statistical Learning Theory, Chapters 6B - 7A , 
May 15 (Chal)
V. N. Vapnik ,
The Nature of Statistical Learning Theory, Chapters 7B - , 
} ,
==  Fall 2002 == , 
Date/Time Presenter Paper(s) Supporting Papers/Notes
Sep. 10 (Noah A. Smith)
Collins, Duffy. ,
New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron. , 

ACL '2002

Sep. 19 (Paola Virga)
Yamada, Knight ,
A decoder for Syntax-based Statistical MT , 

ACL '2002

Sep. 26 (Paul Ruhlen)
Hwa, Resnik, Weinberg, Kolak ,
Evaluating Translational Correspondence using Annotation Projection , 

ACL '2002

Oct. 2 (Gideon Mann)
Gildea, Jurafsky ,
Automatic Labeling of Semantics Roles , 

ACL '2001

Oct. 8 (Elliott Franco Drabek)
Ravichandran, Hovy ,
Learning Surface Text Patterns for a Question Answering System. , 

ACL '2001

A similar paper ,
Lin, Pantel , 

Discovery of Inference Rules for Question Answwering

Oct. 17 (David Smith)
Cotton, Bird ,
An Integrated Framework for Treebanks and Multilayer Annotations , 

LREC '2002

Oct. 24 (Roy Tromble)
Han, Benjamin ,
Building a Bilingual Dictionary with Scarce Resources: A Genetic Algorithm Approach. , 
Nov. 1 (Chalaporn Hathaidharm)
J.Gao, J.Goodman, M.Li, K.Lee ,
Toward A Unified Approach To Statistical Language Modeling For Chinese , 

ACM Transactions on Asian Language Information Processing, Vol. 1, No. 1, pp 3-33. 2002.

Nov. 7 (Neda Khalili)
Yamamoto, Church ,
Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus , 

Computational Linguistics '2001

A relative paper: ,
Kageura , 

Bigram Statistics Revisited A Comparative Examination of Some Statistical Measures in Morphological Analysis of Japanese Kanji Sequences

Nov. 14 (Michelle Vanni)
Hearst ,
Untangling Text Data Mining. , 

ACL '1999

Nov. 21 (Silviu Cucerzan)
Ueda, Nakano, Ghahramani, Hinton ,
SMEM Algorithm for Mixture Models , 

Neural Information Processing Systems '1998

Dec.5 (Silviu Cucerzan)
Pearce ,
A Comparative Evaluation of Collocation Extraction Techniques. Darren Pearce. , 

Third International Conference on Language Resources and Evaluation. May. 2002


D. Lin

Automatic identification of non-compositional phrases.

In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 317--324.


} ,
==  Summer 2002 == , 
Date/Time Presenter Paper(s) Supporting Papers/Notes
July. 24 (Michelle Vanni)
Merlo ,
A Multilingual Paradigm for Automatic Verb Classification , 

ACL '2002

July. 31 (Paola Virga)
Yamada, Knight ,
A decoder for Syntax-based Statistical MT , 

ACL '2002

} ,
==  Spring 2002 == , 
Date/Time Presenter Paper(s) Supporting Papers/Notes
Feb. 7 (Paola Virga)
Knight, Graehl ,
Machine Transliteration , 

Proceedings of the Thirty-Fifth Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics

Feb. 14 (Charles Schafer )
Yaser, Germann ,
Translating with Scarce Resources , 

American Association for Arti?cial Intelligence 2000

Feb. 21 (Jia Cui)
Barzilay, McKeown ,
Extracting Paraphrases from a Parallel Corpus , 

Computer Science Department Columbia.Univ.

Feb. 28 (Silviu Cucerzan)
Marcu ,
Towards a Unified Approach to Memory- and Statistical-Based Machine Translation. , 

Annual Meeting of the ACL, Proceedings of the 39th Annual Meeting on Association for Computational Linguistics '2001

Mar. 14 (Noah A. Smith)
Ratnaparkhi ,
A Simple Introduction to Maximum Entropy Models for NLP , 

Institute for Research in Cognitive Science, Univ. of Penn.

Mar. 28 (Swapna Somasundaran)
Crestan, El-Beze ,
Improving supervised WSD by including rough semantic features in a Multilevel view of the Context , 

SEMPRO Workshop, Edinburgh, 2001.

Apr. 11 (Paola Virga)
Neal, Hinton ,
A view of the EM algorithm that justifies incremental, sparse, and other variants , 

Learning in Graphical Models, 1999

Apr. 18 (Paul Ruhlen)
NA. Rao, K. Rose ,
Deterministically annealed design of hidden Markov model speech recognizers , 

IEEE Trans. on Speech and Audio Processing, vol. 9, (no. 2), Feb. 2001

following article builds on the Neal & Hinton paper that we read last week. It tests an incremental version of EM (carefully choosing how incremental it will be), as well as a "lazy EM" version that visits "significant" cases more often. [9] ,
Apr. 25 Paul Ruhlen
H. Al-Adhaileh, Kong, Melamed ,
Malay-English Bitext Mapping and Alignment Using SIMR/GSA Algorithms , 

Malaysian National Conference on Research and Development on Lingustics '2001

} ,
==  Fall 2001 == , 
Date/Time Presenter Paper(s) Supporting Papers/Notes
Dec. 14 (Jia Cui)
Bellegarda ,
Exploiting latent semantic information in statistical language models , 

Proceedings of the IEEE , Volume: 88 Issue: 8 , Aug. 2000

Nov. 29 (Silviu Cucerzan)
Mike Collins, Yoram Singer ,
Unsupervised Models for Named Entity Classification , 

EMNLP/VLC'99

Nov. 20 (Radu Florian)
Blum, Mitchell ,
Combining Labeled and Unlabeled Data with Co-Training , 

Proceedings of 1998 Conference on Computational Learning Theory

Nov. 16 (Richard Wicentowski)
Eisner, Satta ,
Efficient parsing for bilexical context-free grammars and head automaton grammars , 

ACL '99

plagiarism detection systems might be relevant to bitext alignment. A message to the Corpora list yesterday announced the following review paper:[10] ,
Nov. 2 Paul Ruhlen
Manning, Schuetze ,
Foundations of Statistical Natural Language Processing, Section 14 on clustering, pp. 495-527. , 

MIT Press

Oct. 26 (Gideon Mann )
Tishby, Pereira, Bialek ,
The information bottleneck method , 
The paper describes a clustering method which is a generalization of their earlier work on "Distributional Clustering of English Words" (pereira,tishby and lee '93). ,
,