Dalhousie University    [  http://web.cs.dal.ca/~vlado/csci6509/coursecalendar.html  ]
Fall 2021 (Sep7-Dec7)
Faculty of Computer Science
Dalhousie University

CSCI 4152/6509 — Course Calendar (tentative)

[ Home | Calendar | Project | P0 | Misc | A0 | A1 | A2 | Login ]
  Part I: Introduction
1 Tu Sep  7Course Introduction
(On-line) Course introduction: logistics, administrivia, references, evaluation, policies, schedule, A0; Introduction to NLP (reading Ch.1 [JM]): natural language and other languages, NLP applications, NLP as a research area, NLP Research Links and NLP Anthology http://aclweb.org/anthology/. Short history of NLP.
Files: Syllabus (PDF), slides, lecture notes. Reading: [JM] Ch.1
A0 out
2 Th Sep  9 Levels of NLP; Course Project
(On-line) NLP methodology overview; Levels of NLP; Why is NLP generally hard; Ambiguities at different levels of NLP. About Course Project: topics and teams, deliverables, P0, P1, P.
Files: slides, lecture notes.
  Part II: Stream-based Text Processing
L1 Tu Sep 14 Lab 1: FCS Computing Environment, Perl Tutorial 1
(On-line) Logging in using CSID, timberlea environment; Introduction to Perl programming language.
Files: lab notes, slides.
3 Tu Sep 14 Finite Automata and Regular Expressions
(In-person) About course project (continued): R (report), project types, choosing topic, resources, themse and previous topics. Part II: Stream-based Text Processing: Deterministic and Non-deterministic Automata. (Reading: Chapter 2 [JM]) Review of Deterministic Finite Automata (DFA) and Non-deterministic Finite Automata (NFA), and their use in NLP; NFA-to-DFA conversion. Review of regular expressions.
Files: slides, lecture notes. Reading: [JM] Ch.2
4 Th Sep 16 Text Processing in Perl
(In-person) Regular expressions review continued: some regex references, history, examples; Introduction to Perl, main Perl features, program examples, syntactic elements, I/O, regular expressions in Perl.
Files: slides, lecture notes.
  Fr Sep 17Last day to add/drop courses A0 due
  Sa Sep 18A1 out A1 out
L2 Tu Sep 21 Lab 2: Perl Tutorial 2
(On-line) Regular expressions and character n-grams in Perl.
Files: lab notes, slides.
5 Tu Sep 21 Elements of Morphology
(In-person) More on Perl regular expressions; Text processing examples: tokenization, countil letters. Elements of Morphology: reading: Section 3.1 [JM]; morphemes, stems, affixes, tokenization, stemming, lemmatization; morphological processes. Characters, Words, and N-grams: counting words, Zipf's law, n-grams.
Files: slides, lecture notes.
6 Th Sep 23 Elements of Information Retrieval
(In-person) Perl examples with n-gram collection. Elements of information retrieval: typical IR system architecture, vector space model. Reading: [JM] 23.1 (Information Retrieval), [MS] Ch.15 (Topics in Information Retrieval). Some interesting links: Lucene, IR book by Manning, Raghavan, and Schutze. IR Evaluation: precision, recall, F-measure, precision-recall curve.
Files: slides, lecture notes. Reading: [JM] 23.1 (Information Retrieval), [MS] Ch.15 (Topics in Information Retrieval)
L3 Tu Sep 28 Lab 3: Perl Tutorial 3
(On-line) Perl: Arrays or lists; associative arrays or hashes; references.
Files: lab notes, slides.
7 Tu Sep 28 Text Classification
(On-line) Interpolated Precision-Recall curve. Text mining. Text Classification: classifier evaluation, evaluation measures for text classification, evaluation methods for text classification; Text clustering; Similarity-based text classification: CNG classification method for authorship attribution.
Files: slides, lecture notes.
A1 due
  Th Sep 30National Day for Truth and Reconciliation, University closed  
  Fr Oct  1Last day to drop classes without "W", change audit to credit or vv. P0 due
L4 Tu Oct  5 Lab 4: Git and GitLab Tutorial
(On-line) Introduction to GitLab and Git; adding and modifying files, setting up SSH key, add, commit, and push commands, checkout; creating branches and working collaboratively, pull, merge, rebase, resolving conflicts.
Files: lab notes, slides.
  Part III: Probabilistic Approach to NLP
8 Tu Oct  5 Edit Distance; Probabilistic Modeling
(In-person) Edit distance: introduction, properties, dynamic programming approach, example, algorithm. Probabilistic approach to NLP: logical vs. plausible reasoning in AI and NLP; Brief review of elements of probability theory.
Files: slides, lecture notes.
9 Th Oct  7 P0 Topics Discussion (1)
(On-line) Projects discussion: P-01, P-02, P-03, P-04, P-05, P-07, P-08, P-09, P-10, P-11, P-12, P-13, P-14, P-15.
Files: P0 slides, slides, lecture notes.
  Mo Oct 11Thanksgiving Day, University closed  
L5 Tu Oct 12 Lab 5: Python NLTK Tutorial 1
(On-line) Introduction to Python: basics, lists, tuples, dictionaries; Introduction to NLTK: tokenization, stop-words, stemming, n-grams, frequency distribution, classification.
Files: lab notes, slides.
10 Tu Oct 12 P0 Topics Discussion (2); Probabilistic Modeling
(On-line) Projects discussion: P-06, P-16, P-17, P-18, P-19, P-21, P-23, P-24, P-26, P-27, P-28, P-29. Probabilistic modelling: probability theory review (continued)
Files: P0 slides, slides, lecture notes.
  We Oct 13 A2 out A2 out
11 Th Oct 14 Probabilistic Modeling
(In-person) Bayesian inference, generative models. Probabilistic modeling: random variables, configurations, and models; computational tasks; joint distribution model; fully independent model.
Files: slides, lecture notes.
L6 Tu Oct 19 Lab 6: Python NLTK Tutorial 2  
12 Tu Oct 19 Naive Bayes Classification Model Files: slides, lecture notes. 
13 Th Oct 21N-gram Model  
  Fr Oct 22 A2 due A2 due
L7 Tu Oct 26 Lab 7: Fetching Tweets with Python  
14 Tu Oct 26 N-gram Model Smoothing  
15 Th Oct 28 POS Tagging  
  Fr Oct 29P1 due P1 due
  Mo Nov  1Last day to drop classes with "W"  
L8 Tu Nov  2 Lab 8: Prolog Tutorial 1  
16 Th Nov  4 Hidden Markov Model (HMM)  
  Mo Nov  8Fall Study Break Nov 8-12, no classes, University open  
  Th Nov 11Remembrance Day, University closed  
  Part IV: Parsing (Syntactic Processing)
L9 Tu Nov 16 Lab 9: Prolog Tutorial 2  
17 Tu Nov 16 Introduction to Prolog and Unification  
18 Th Nov 18 Natural Language Syntax  
  Fr Nov 19 A3 due tentatively A3 due
19 Tu Nov 23 NL Parsing in Prolog  
20 Th Nov 25 Probabilistic Context-Free Grammars  
21 Tu Nov 30 Efficient Parsing with PCFGs  
22 Th Dec 2 CFGs for Natural Languages  
  Part VI: Student Presentations
23 Mo Dec  6 Student Presentations (during day)  
24 Tu Dec  7 Student Presentations (during day)
A4 due tentatively
A4 due
  Tu Dec  7Classes end, Monday schedule used, Report due Reports due
  Final Exam
  Fr Dec 17Final Exam (8:30-10:30am)
Final exam, duration 2 hours, starting at 08:30am. Exams schedule URL: http://www.dal.ca/academics/exam_schedule/halifax_campus_exam_schedule.html

Maintained by: Vlado Keselj, last update: 14-Oct-2021