LING 5200 Computational Corpus Linguistics

Fall, 2008

Co-instructors

Course Structure

Date Topic Readings Due
August 26 Introduction and overview
   
August 28 Intro to Unix: I Ray and Ray 11-18, 21-24, 30-31, 34-36, 38, 115-116, 125  
Sept. 2 Intro to Unix: II Ray and Ray 18-20, 97-101, 106-108, 153-160, 117-120  
Sept. 4 Intro to Unix: III McEnery and Wilson, Ch. 1 HW 1 due
Sept. 9, 11 CLASS LOCATION: Old Main Chapel Word sense grouping
The Proposition Bank
 
Sept. 16 Regular expressions Ray and Ray 117-120  
Sept. 18 More regular expressions   HW2 (ignore question 1)
Sept. 23,25 Python: Slides 1 and 2 Learning Python (LP), ch 4, 8  
  Python LP ch 5, 6, 7  
Sept. 30, Oct. 2 Python: Slide 3 LP 9, 10  
Oct. 7 Python: Slide 4 NLTK ch 1, 2, 3  
Oct. 9 Tokenization: 5   HW 3 due
Oct. 14 Lemmatization/Normalization    
Oct. 16 Control of flow: 6    
Oct. 21 POS tagging: 7    
Oct. 23 POS tagging: 8   HW4 due
Oct. 28 Functions: 9    
Oct. 30 Classes: 10   Final project proposal due.
Nov 4 Treebanks: 11    
Nov. 6 Using treebanks: 11    
Nov. 11 Building a treebank Guest lecturer: Arrick Lanfranchi  
Nov. 13 Tgrep    
Nov. 18 Dominance    
Nov. 20 xkwic   HW5 due
Nov. 25, 27 Fall break and Thanskgiving
   
Dec. 2 xkwic practicum    
Dec. 4 Corpus Search: Lexicography   Interim project reports; xkwic homework due
Dec. 9, 11 Forensic linguistics    
Dec. 13 Project due at noon