Students should be able to:
- Program advanced agents using learning and planning techniques for solving sequential decision-making tasks that involve other agents.
- Analyse autonomy in dynamic, partially observable settings involving a single agent or multiple agents.
- Develop methods for optimising control policies in complex sequential decision making problems.
- Implement techniques to balance exploration and exploitation in decision-making tasks that require learning from the environment while acting on it.
- Use linear time logic as a specification language for formulating complex tasks as well as environment properties.
- Apply synthesis from LTL and LTLf specifications to solve planning problems in nondeterministic environments.