📚 Question Bank Q5 — Algorithms
Tags
Algorithms
Q5. Marks: +2.0 UGC NET Paper 2: Computer Science 2nd January 2026 Shift 1
Consider the following statements about reinforcement learning.
A. The adaptive dynamic programming agent learns the transition model between states and uses it to solve the corresponding Markov decision process using dynamic programming.
B. Temporal difference needs a transition model to perform its updates.
C. The prioritized sweeping heuristic focuses on adjusting states with successors that have undergone significant changes in utility estimates.
D. The modified policy iteration approach uses a simplified value iteration process to update the utility estimates after each change to the learned model.
Choose the correct answer from the options given below:
1. A, B & C Only
2. A, B & D Only
3. A, C & D Only ✓ Correct
4. B, C & D Only
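Why B is the false statement can be seen by comparing the two update rules. The sketch below is a minimal illustration, not part of the original question: the function names `td0_update` and `adp_backup`, the dictionary layout for utilities `U` and the learned model `P`, and the default values of `alpha` and `gamma` are all assumptions chosen for the example. The TD(0) update uses only one observed transition, while the ADP-style backup needs the learned transition probabilities.

```python
def td0_update(U, s, r, s_next, alpha=0.1, gamma=0.9):
    """One TD(0) update from a single observed transition (s, r, s_next).

    No transition model is consulted: only the sampled successor is used.
    """
    u_s = U.get(s, 0.0)
    u_next = U.get(s_next, 0.0)
    U[s] = u_s + alpha * (r + gamma * u_next - u_s)
    return U


def adp_backup(U, s, reward, P, gamma=0.9):
    """One fixed-policy Bellman backup using a learned model.

    P[s] is a hypothetical dict mapping each successor to its
    estimated probability; this is what an ADP agent has to learn.
    """
    expected_next = sum(p * U.get(s2, 0.0) for s2, p in P[s].items())
    U[s] = reward[s] + gamma * expected_next
    return U


if __name__ == "__main__":
    # TD(0): one observed step from "a" to "b" with reward 1.0.
    print(td0_update({}, "a", 1.0, "b"))

    # ADP: the same state, but the backup averages over the model P.
    model = {"a": {"b": 0.5, "c": 0.5}}
    print(adp_backup({"b": 2.0}, "a", {"a": 1.0}, model))
```

Only `adp_backup` takes the model `P` as an argument, which matches statement A and contradicts statement B.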