itLING 5200 Corpus Linguistics

LING 5200 Computational Corpus Linguistics

Spring, 2006

Instructor

Course Structure

Date Topic Readings Due
Jan. 17, 19 Overview and Unix Intro
Unix Basics
John Searle, Chomsky's Revolution in Linguistics, Sec I; Unix, Chaps 2,3 (skip 43-48), 4 to p. 69  
Lab: getting started with Unix  
Jan 24,26 More on Unix and Corpus Annotation In the Beginning was the Command Line, Neal Stephenson HW1 due Jan 24
  Lab: more Unix    
Jan 31, Feb 2 Corpus Search, Unix and babel Chaps 5,6,7,8 (ignore zsh/bash/ksh), Tutorial: Unix HW2 due Feb 2
Lab: grepping Regular Expressions  
Feb 7,9 More Regular expressions, Emacs & Xkwic pg 158-163 (Unix), hour 12 HW3 due Feb 14
2/7 history and Vicky's notes Friendlier Regular Expressions  
Feb 14,16 Linguistic Annotation, English and Chinese: Segmentation and POS tagging | Treebanking (1) Guest Lecturer: Nianwen Xue  
Feb 21, 23 Linguistic Annotation, English and Chinese: Treebanking (2) | English PropBank | Chinese Propbank Guest Lecturer: Nianwen Xue HW4 due Feb 28 Tree 1 Tree 2
Feb 28, Mar 2 Corpus Search: xkwic and tgrep Tgrep HW5 due March 9
  Lab: Introduction to xkwic and tgrep Corpus Workbench User Manual  
Mar 7, 9 More xkwic and tgrep   HW6 due March 16
     
Mar 14, 16 Introduction to Python Python Chaps 4,5
First Python Lab    
Mar 21, 23 Lists and Functions (scripts from the 03-21-2006 lecture) Python, Chap 6 and 12 HW7 Due March 23
Detailed project proposal
  Names and Lists Lab    
March 28, 30 SPRING BREAK!    
Apr 4, 6 Loops and Files Python Chaps 10,7 HW 8 Due April 11
  For Loops and Files Lab    
Apr 11, 13 Assignments, conditionals Python Chaps 8, 9 HW 9 Due April 18
  Conditionals Lab    
Apr 18, 20 Dictionaries and Classes Python Chap 7 and Chap 20 HW10, Due April 25
  Dictionaries Lab Python Code with Class def WordCount code  
Apr 25, 27 NLTK-Lite, More Tgrep Arabic keyboard input, Python version  
May 2, 4 Python labs    
May 10 Project presentations 3-6pm;