Reinforcement Learning Specialization
Language: English | Size:4.43 GB
Genre:eLearning
Files Included :
01 course-4-introduction.mp4 (22.11 MB)
MP4
02 meet-your-instructors.mp4 (43.87 MB)
MP4
01 initial-project-meeting-with-martha-formalizing-the-problem.mp4 (13.25 MB)
MP4
02 andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.mp4 (38.51 MB)
MP4
01 lets-review-markov-decision-processes.mp4 (12.36 MB)
MP4
02 lets-review-examples-of-episodic-and-continuing-tasks.mp4 (9.14 MB)
MP4
01 meeting-with-niko-choosing-the-learning-algorithm.mp4 (7.88 MB)
MP4
01 lets-review-expected-sarsa.mp4 (6.26 MB)
MP4
02 lets-review-what-is-q-learning.mp4 (7.84 MB)
MP4
03 lets-review-average-reward-a-new-way-of-formulating-control-problems.mp4 (19.08 MB)
MP4
04 lets-review-actor-critic-algorithm.mp4 (14.07 MB)
MP4
05 csaba-szepesvari-on-problem-landscape.mp4 (38.81 MB)
MP4
06 andy-and-rich-advice-for-students.mp4 (33.39 MB)
MP4
01 agent-architecture-meeting-with-martha-overview-of-design-choices.mp4 (15.62 MB)
MP4
01 lets-review-non-linear-approximation-with-neural-networks.mp4 (9.59 MB)
MP4
02 drew-bagnell-on-system-id-optimal-control.mp4 (31.29 MB)
MP4
03 susan-murphy-on-rl-in-mobile-health.mp4 (27.63 MB)
MP4
01 meeting-with-adam-getting-the-agent-details-right.mp4 (12.6 MB)
MP4
01 lets-review-optimization-strategies-for-nns.mp4 (14.28 MB)
MP4
02 lets-review-expected-sarsa-with-function-approximation.mp4 (7.63 MB)
MP4
03 lets-review-dyna-q-learning-in-a-simple-maze.mp4 (10.76 MB)
MP4
04 meeting-with-martha-in-depth-on-experience-replay.mp4 (21.42 MB)
MP4
05 martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.mp4 (23.54 MB)
MP4
01 meeting-with-adam-parameter-studies-in-rl.mp4 (11.49 MB)
MP4
01 lets-review-comparing-td-and-monte-carlo.mp4 (9.81 MB)
MP4
02 joelle-pineau-about-rl-that-matters.mp4 (29.5 MB)
MP4
01 meeting-with-martha-discussing-your-results.mp4 (10.95 MB)
MP4
02 course-wrap-up.mp4 (7.76 MB)
MP4
03 specialization-wrap-up.mp4 (18.62 MB)
MP4
01 specialization-introduction.mp4 (18.26 MB)
MP4
02 course-introduction.mp4 (32.39 MB)
MP4
03 meet-your-instructors.mp4 (43.87 MB)
MP4
04 your-specialization-roadmap.mp4 (14.88 MB)
MP4
03 sequential-decision-making-with-evaluative-feedback.mp4 (16.27 MB)
MP4
01 learning-action-values.mp4 (14.22 MB)
MP4
02 estimating-action-values-incrementally.mp4 (19.4 MB)
MP4
01 what-is-the-trade-off.mp4 (21.58 MB)
MP4
02 optimistic-initial-values.mp4 (13.13 MB)
MP4
03 upper-confidence-bound-ucb-action-selection.mp4 (11.77 MB)
MP4
04 jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.mp4 (11.94 MB)
MP4
05 week-1-summary.mp4 (9.48 MB)
MP4
03 markov-decision-processes.mp4 (12.36 MB)
MP4
04 examples-of-mdps.mp4 (12.2 MB)
MP4
01 the-goal-of-reinforcement-learning.mp4 (8.02 MB)
MP4
02 michael-littman-the-reward-hypothesis.mp4 (84.01 MB)
MP4
01 continuing-tasks.mp4 (12.67 MB)
MP4
02 examples-of-episodic-and-continuing-tasks.mp4 (9.14 MB)
MP4
03 week-2-summary.mp4 (5.42 MB)
MP4
03 specifying-policies.mp4 (14.99 MB)
MP4
04 value-functions.mp4 (21.1 MB)
MP4
05 rich-sutton-and-andy-barto-a-brief-history-of-rl.mp4 (48.75 MB)
MP4
01 bellman-equation-derivation.mp4 (17.03 MB)
MP4
02 why-bellman-equations.mp4 (11.87 MB)
MP4
01 optimal-policies.mp4 (18.46 MB)
MP4
02 optimal-value-functions.mp4 (10.19 MB)
MP4
03 using-optimal-value-functions-to-get-optimal-policies.mp4 (16.73 MB)
MP4
04 week-3-summary.mp4 (11.95 MB)
MP4
03 policy-evaluation-vs-control.mp4 (13.32 MB)
MP4
04 iterative-policy-evaluation.mp4 (18.79 MB)
MP4
01 policy-improvement.mp4 (9.99 MB)
MP4
02 policy-iteration.mp4 (17.86 MB)
MP4
01 flexibility-of-the-policy-iteration-framework.mp4 (12.44 MB)
MP4
02 efficiency-of-dynamic-programming.mp4 (14.03 MB)
MP4
03 warren-powell-approximate-dynamic-programming-for-fleet-management-short.mp4 (47.13 MB)
MP4
04 warren-powell-approximate-dynamic-programming-for-fleet-management-long.mp4 (145.35 MB)
MP4
05 week-4-summary.mp4 (9.61 MB)
MP4
01 congratulations.mp4 (11.18 MB)
MP4
01 course-3-introduction.mp4 (16.33 MB)
MP4
02 meet-your-instructors.mp4 (43.87 MB)
MP4
03 moving-to-parameterized-functions.mp4 (24.38 MB)
MP4
04 generalization-and-discrimination.mp4 (12.86 MB)
MP4
05 framing-value-estimation-as-supervised-learning.mp4 (10.69 MB)
MP4
01 the-value-error-objective.mp4 (10.86 MB)
MP4
02 introducing-gradient-descent.mp4 (15.1 MB)
MP4
03 gradient-monte-for-policy-evaluation.mp4 (15.24 MB)
MP4
04 state-aggregation-with-monte-carlo.mp4 (20.26 MB)
MP4
01 semi-gradient-td-for-policy-evaluation.mp4 (15.35 MB)
MP4
02 comparing-td-and-monte-carlo-with-state-aggregation.mp4 (11.54 MB)
MP4
03 doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.mp4 (55.29 MB)
MP4
01 the-linear-td-update.mp4 (9.9 MB)
MP4
02 the-true-objective-for-td.mp4 (13.66 MB)
MP4
03 week-1-summary.mp4 (16.31 MB)
MP4
03 coarse-coding.mp4 (9.59 MB)
MP4
04 generalization-properties-of-coarse-coding.mp4 (17.98 MB)
MP4
05 tile-coding.mp4 (7.57 MB)
MP4
06 using-tile-coding-in-td.mp4 (23.07 MB)
MP4
01 what-is-a-neural-network.mp4 (7.03 MB)
MP4
02 non-linear-approximation-with-neural-networks.mp4 (9.59 MB)
MP4
03 deep-neural-networks.mp4 (15.33 MB)
MP4
01 gradient-descent-for-training-neural-networks.mp4 (15.53 MB)
MP4
02 optimization-strategies-for-nns.mp4 (14.28 MB)
MP4
03 david-silver-on-deep-learning-rl-ai.mp4 (41.41 MB)
MP4
04 week-2-review.mp4 (8.5 MB)
MP4
03 episodic-sarsa-with-function-approximation.mp4 (18.05 MB)
MP4
04 episodic-sarsa-in-mountain-car.mp4 (15.47 MB)
MP4
05 expected-sarsa-with-function-approximation.mp4 (7.63 MB)
MP4
01 exploration-under-function-approximation.mp4 (11.05 MB)
MP4
01 average-reward-a-new-way-of-formulating-control-problems.mp4 (19.08 MB)
MP4
02 satinder-singh-on-intrinsic-rewards.mp4 (26.91 MB)
MP4
03 week-3-review.mp4 (8.88 MB)
MP4
03 learning-policies-directly.mp4 (17.1 MB)
MP4
04 advantages-of-policy-parameterization.mp4 (26.06 MB)
MP4
01 the-objective-for-learning-policies.mp4 (13.35 MB)
MP4
02 the-policy-gradient-theorem.mp4 (9.31 MB)
MP4
01 estimating-the-policy-gradient.mp4 (13.63 MB)
MP4
02 actor-critic-algorithm.mp4 (14.07 MB)
MP4
01 actor-critic-with-softmax-policies.mp4 (16.53 MB)
MP4
02 demonstration-with-actor-critic.mp4 (28.82 MB)
MP4
03 gaussian-policies-for-continuous-actions.mp4 (19.95 MB)
MP4
04 week-4-summary.mp4 (9.96 MB)
MP4
01 congratulations-course-4-preview.mp4 (22.11 MB)
MP4
01 course-introduction.mp4 (11.27 MB)
MP4
02 meet-your-instructors.mp4 (43.87 MB)
MP4
03 what-is-monte-carlo.mp4 (14.88 MB)
MP4
04 using-monte-carlo-for-prediction.mp4 (16.17 MB)
MP4
01 using-monte-carlo-for-action-values.mp4 (6.47 MB)
MP4
02 using-monte-carlo-methods-for-generalized-policy-iteration.mp4 (5.17 MB)
MP4
03 solving-the-blackjack-example.mp4 (13.91 MB)
MP4
01 epsilon-soft-policies.mp4 (12.69 MB)
MP4
01 why-does-off-policy-learning-matter.mp4 (14.39 MB)
MP4
02 importance-sampling.mp4 (7.41 MB)
MP4
03 off-policy-monte-carlo-prediction.mp4 (12.52 MB)
MP4
04 emma-brunskill-batch-reinforcement-learning.mp4 (37.38 MB)
MP4
05 week-1-summary.mp4 (9.59 MB)
MP4
03 what-is-temporal-difference-td-learning.mp4 (10.31 MB)
MP4
04 rich-sutton-the-importance-of-td-learning.mp4 (35.65 MB)
MP4
01 the-advantages-of-temporal-difference-learning.mp4 (9.1 MB)
MP4
02 comparing-td-and-monte-carlo.mp4 (9.81 MB)
MP4
03 andy-barto-and-rich-sutton-more-on-the-history-of-rl.mp4 (80.21 MB)
MP4
04 week-2-summary.mp4 (7.41 MB)
MP4
03 sarsa-gpi-with-td.mp4 (7.38 MB)
MP4
04 sarsa-in-the-windy-grid-world.mp4 (5.85 MB)
MP4
01 what-is-q-learning.mp4 (7.84 MB)
MP4
02 q-learning-in-the-windy-grid-world.mp4 (7.24 MB)
MP4
03 how-is-q-learning-off-policy.mp4 (9.96 MB)
MP4
01 expected-sarsa.mp4 (6.26 MB)
MP4
02 expected-sarsa-in-the-cliff-world.mp4 (5.69 MB)
MP4
03 generality-of-expected-sarsa.mp4 (5.21 MB)
MP4
04 week-3-summary.mp4 (3.68 MB)
MP4
03 what-is-a-model.mp4 (11.33 MB)
MP4
04 comparing-sample-and-distribution-models.mp4 (6.65 MB)
MP4
01 random-tabular-q-planning.mp4 (7.83 MB)
MP4
01 the-dyna-architecture.mp4 (9.59 MB)
MP4
02 the-dyna-algorithm.mp4 (11.24 MB)
MP4
03 dyna-q-learning-in-a-simple-maze.mp4 (10.76 MB)
MP4
01 what-if-the-model-is-inaccurate.mp4 (7.69 MB)
MP4
02 in-depth-with-changing-environments.mp4 (11.94 MB)
MP4
03 drew-bagnell-self-driving-robotics-and-model-based-rl.mp4 (35.21 MB)
MP4
04 week-4-summary.mp4 (4.25 MB)
MP4
01 congratulations.mp4 (4.36 MB)
MP4