ANLP 2019 Course Revision Guide

Disclaimer:

This page provides a list of concepts you should be familiar with and questions you should be able to answer if you are thoroughly familiar with the material in the course. It is safe to assume that if you have a good grasp of everything listed here, you will do well on the exam. However, we will not guarantee that only the topics mentioned here, and nothing else, will appear on the exam. In a few cases (mainly formulas) we do specify what *will not* be required for the exam, but otherwise we make no guarantees.

Exam rubric

The exam rubric is as follows:

"All questions are compulsory. Different questions may have different numbers of total marks. Take note of this in allocating time to questions. Calculators may not be used in this examination."

Past papers

Previous versions of this course were called Advanced Natural Language Processing through the 2014-15 academic year. You can look up past exam papers if you wish.

However, there was a major revision of the course in 2014-15, and several additional updates since then (in particular, about 25% of the material changed in 2017, and around 10% in 2018). Exams prior to 2014-15 therefore include a lot of material that we no longer cover (and vice versa), and use a somewhat different question style. These are not recommended as study guides.

In comparison to delivery in 2014-2016, the main topics we have eliminated are active chart parsing, details of advanced smoothing methods, and nearly all topics related to discourse. The main topics we have added (some in 2017 and some in 2018) are dependency parsing and more material on distributional semantics, data analysis, experiments, evaluation, and ethical issues.

In working through papers from 2014-2018, you should not expect to be able to answer the following questions:

In addition, the rubric of the exam is different this year, as noted above.

Generative probabilistic models

We have discussed the following generative probabilistic models:

For each of these, you should be able to

Discriminative probabilistic models

We have discussed the following discriminative probabilistic model:

For this model, you should be able to

Other formulas

In addition to the equations for the models listed above, you should know the formulas for the following concepts, what they may be used for, and be able to apply them appropriately. Where relevant you should be able to discuss strengths and weaknesses of the associated method, and alternatives.

Algorithms and computational methods

For each of the following algorithms, you should be able to explain what each of these computes (its input and output), what it is used for, and be able to hand simulate each one. Some of these algorithms are naive, or solve problems that more naive algorithms face. You should be able to explain what those problems are and how the better algorithms solve them.

For each of the following methods, we haven't discussed algorithms at the level of data structures or implemention, but you should still be able to explain what each method computes (its input and output), what it is used for, and be able to hand simulate each one.

For each of the following methods, you should be able to explain what it computes (its input and output), what it is used for, and be able to describe how it works in some detail.

Additional Mathematical and Computational Concepts

Overarching concepts:

In addition, for the following concepts you should be able to explain each one, give one or two examples where appropriate, and be able to identify examples if given to you. You should be able to say what NLP tasks these are relevant to and why.

Linguistic and Representational Concepts

You should be able to explain each of these concepts, give one or two examples where appropriate, and be able to identify examples if given to you. You should be able to say what NLP tasks these are relevant to and why.

Tasks

You should be able to explain each of these tasks, give one or two examples where appropriate, and discuss cases of ambiguity or what makes the task difficult. In most cases you should be able to say what algorithm(s) or general method(s) can be used to solve the task, and what evaluation method(s) are typically used.

Resources

You should be able to describe what linguistic information is captured in each of these resources, and how it might be used in an NLP system.

You should also be able to identify legal and ethical issues in the creation and collection of linguistic resources.

Evaluation concepts and methods

For each of the following, you should be able to explain what each of the specific methods measures, what tasks it would be appropriate for, and why.

In addition:

Additional ethical issues

In addition to the topics listed under Resources and Evaluation, you should be able to identify and briefly discuss other potential ethical issues arising from developing or deploying NLP tools (e.g., fairness/bias, licensing and privacy concerns), and say how they might be relevant when presented with an example task. You should be able to describe methods for measuring and/or mitigating bias that were discussed in lectures.


Home : Teaching : Courses : Anlp