[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.mp4
145.35MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/04_weekly-assessment/01_sequential-decision-making_quiz.html
210.3KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/01_course-4-introduction.en.txt
2.29KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/04_weekly-assessment/01_dynamic-programming_quiz.html
157.49KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/04_read-me-pre-requisites-and-learning-objectives_Course_2__Sample_Based_Learning_Methods_Learning_Objectives.pdf
83.14KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/06_read-me-pre-requisites-and-learning-objectives_Fundamentals_of_Reinforcement_Learning__Learning_Objectives.pdf
64.66KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/03_read-me-pre-requisites-and-learning-objectives_Prediction_and_Control_with_Function_Approximation_Learning_Objectives.pdf
59.93KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/03_reinforcement-learning-textbook_instructions.html
2.19KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/04_pre-requisites-and-learning-objectives_A_Complete_Reinforcement_Learning_System_Capstone__Learning_Objectives.pdf
56.79KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/04_emma-brunskill-batch-reinforcement-learning.en.srt
24.91KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/02_course-introduction.en.txt
5.62KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/05_reinforcement-learning-textbook_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.en.srt
40.71KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/04_weekly-assessment/02_graded-value-functions-and-bellman-equations_exam.html
31.06KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.en.txt
21.34KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/02_satinder-singh-on-intrinsic-rewards.en.srt
20.96KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/02_goal-of-reinforcement-learning/02_michael-littman-the-reward-hypothesis.en.srt
18.48KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.en.srt
15.92KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/03_meet-your-instructors.en.srt
15.89KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/03_lets-review-average-reward-a-new-way-of-formulating-control-problems.en.srt
15.17KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/02_project-resources/02_lets-review-examples-of-episodic-and-continuing-tasks.en.txt
2.52KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/01_average-reward-a-new-way-of-formulating-control-problems.en.srt
15.17KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/03_david-silver-on-deep-learning-rl-ai.en.srt
14.71KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/01_weekly-learning-goals/01_meeting-with-niko-choosing-the-learning-algorithm.en.txt
2.84KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.en.srt
14.04KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/01_gradient-descent-for-training-neural-networks.en.srt
13.98KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/01_lets-review-expected-sarsa.en.txt
2.8KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/04_iterative-policy-evaluation.en.srt
13.68KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/02_project-resources/02_joelle-pineau-about-rl-that-matters.en.srt
13.67KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/02_lets-review-what-is-q-learning.en.txt
2.6KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/02_meet-your-instructors.en.srt
13.43KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/02_meet-your-instructors.en.srt
13.43KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/02_meet-your-instructors.en.srt
13.43KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/02_policy-iteration-control/02_policy-iteration.en.srt
13.33KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/04_emma-brunskill-batch-reinforcement-learning.en.txt
13.16KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/03_gaussian-policies-for-continuous-actions.en.srt
12.82KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/01_final-project-milestone-1/02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.en.srt
12.5KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/01_what-is-the-trade-off.en.srt
12.17KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/01_optimal-policies.en.srt
12.16KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.en.srt
12.06KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/04_weekly-assesment/01_mdps_quiz.html
11.79KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/02_goal-of-reinforcement-learning/02_michael-littman-the-reward-hypothesis.en.txt
11.58KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.en.srt
11.34KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/04_rich-sutton-the-importance-of-td-learning.en.srt
11.24KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/02_satinder-singh-on-intrinsic-rewards.en.txt
10.98KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/02_demonstration-with-actor-critic.en.srt
10.86KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/03_using-optimal-value-functions-to-get-optimal-policies.en.srt
10.83KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/01_weekly-learning-goals/01_agent-architecture-meeting-with-martha-overview-of-design-choices.en.srt
10.8KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/04_using-monte-carlo-for-prediction.en.srt
10.6KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/03_what-is-monte-carlo.en.srt
10.5KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/02_drew-bagnell-on-system-id-optimal-control.en.srt
10.52KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/05_rich-sutton-and-andy-barto-a-brief-history-of-rl.en.srt
10.49KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/03_moving-to-parameterized-functions.en.srt
10.44KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/02_course-introduction.en.srt
10.43KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/03_susan-murphy-on-rl-in-mobile-health.en.srt
10.39KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/04_value-functions.en.srt
10.34KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/03_drew-bagnell-self-driving-robotics-and-model-based-rl.en.srt
10.32KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/04_state-aggregation-with-monte-carlo.en.srt
10.23KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/03_learning-policies-directly.en.srt
10.17KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/02_introducing-gradient-descent.en.srt
9.91KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/02_bellman-equations/01_bellman-equation-derivation.en.srt
9.64KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/03_markov-decision-processes.en.srt
9.62KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/02_lets-review-expected-sarsa-with-function-approximation.en.txt
2.08KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/02_project-resources/01_lets-review-markov-decision-processes.en.srt
9.62KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/05_csaba-szepesvari-on-problem-landscape.en.srt
9.57KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/03_david-silver-on-deep-learning-rl-ai.en.txt
9.52KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/01_average-reward-a-new-way-of-formulating-control-problems.en.txt
9.43KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/03_lets-review-average-reward-a-new-way-of-formulating-control-problems.en.txt
9.43KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/04_generalization-properties-of-coarse-coding.en.srt
9.38KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/03_gradient-monte-for-policy-evaluation.en.srt
9.31KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/02_policy-gradient-for-continuing-tasks/02_the-policy-gradient-theorem.en.srt
9.28KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/02_in-depth-with-changing-environments.en.srt
9.22KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/03_actor-critic-for-continuing-tasks/02_actor-critic-algorithm.en.srt
9.18KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/02_weekly-reading_instructions.html
1.16KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/01_the-k-armed-bandit-problem/02_weekly-reading_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/04_lets-review-actor-critic-algorithm.en.srt
9.18KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/02_policy-gradient-for-continuing-tasks/01_the-objective-for-learning-policies.en.srt
8.91KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/01_course-3-introduction.en.srt
8.91KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/02_project-resources/02_joelle-pineau-about-rl-that-matters.en.txt
8.75KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/01_the-k-armed-bandit-problem/03_sequential-decision-making-with-evaluative-feedback.en.srt
8.71KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/04_generalization-and-discrimination.en.srt
8.69KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/04_episodic-sarsa-in-mountain-car.en.srt
8.68KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/01_meeting-with-martha-discussing-your-results.en.txt
2.42KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/02_meet-your-instructors.en.txt
8.62KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/02_course-wrap-up.en.srt
2.95KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/02_course-wrap-up.en.txt
1.83KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/02_meet-your-instructors.en.txt
8.62KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/02_meet-your-instructors.en.txt
8.62KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/02_optimistic-initial-values.en.srt
8.5KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/03_meet-your-instructors.en.txt
8.41KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/02_optimization-strategies-for-nns.en.srt
8.41KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/01_specialization-introduction.en.txt
2.63KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/01_lets-review-optimization-strategies-for-nns.en.srt
8.41KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/02_optimal-value-functions.en.srt
8.34KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/06_using-tile-coding-in-td.en.srt
8.31KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/02_the-true-objective-for-td.en.srt
8.23KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.en.srt
8.17KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/01_the-advantages-of-temporal-difference-learning.en.srt
8.16KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/02_project-resources/01_lets-review-comparing-td-and-monte-carlo.en.srt
8.1KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/02_comparing-td-and-monte-carlo.en.srt
8.1KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/01_weekly-learning-goals/01_meeting-with-adam-parameter-studies-in-rl.en.srt
8.08KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.en.txt
8.07KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/05_reinforcement-learning-textbook_instructions.html
2.19KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/02_what-to-learn-estimating-action-values/02_estimating-action-values-incrementally.en.srt
8.05KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/04_weekly-assessment/01_practice-value-functions-and-bellman-equations_quiz.html
7.98KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/06_read-me-pre-requisites-and-learning-objectives_instructions.html
2.63KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/01_the-k-armed-bandit-problem/01_module-1-learning-objectives_instructions.html
2.8KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/01_the-k-armed-bandit-problem/02_weekly-reading_instructions.html
1.17KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/02_the-dyna-algorithm.en.srt
7.81KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/03_off-policy-monte-carlo-prediction.en.srt
7.8KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/03_what-is-temporal-difference-td-learning.en.srt
7.77KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/02_efficiency-of-dynamic-programming.en.srt
7.72KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/04_advantages-of-policy-parameterization.en.srt
7.66KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/01_continuing-tasks.en.srt
7.64KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/03_exploration-methods-for-monte-carlo/01_epsilon-soft-policies.en.srt
7.55KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/03_upper-confidence-bound-ucb-action-selection.en.srt
7.54KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/03_what-is-a-model.en.srt
7.53KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/03_specifying-policies.en.srt
7.52KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.en.txt
7.52KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/03_actor-critic-for-continuing-tasks/01_estimating-the-policy-gradient.en.srt
7.48KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/01_gradient-descent-for-training-neural-networks.en.txt
7.45KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/04_meeting-with-martha-in-depth-on-experience-replay.en.srt
7.36KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.en.txt
7.34KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/03_how-is-q-learning-off-policy.en.srt
7.23KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/04_iterative-policy-evaluation.en.txt
7.15KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/02_policy-iteration-control/02_policy-iteration.en.txt
7.12KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/01_weekly-learning-goals/01_meeting-with-adam-getting-the-agent-details-right.en.srt
7.09KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/01_flexibility-of-the-policy-iteration-framework.en.srt
7.08KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/01_what-if-the-model-is-inaccurate.en.srt
7KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/04_week-4-summary.en.srt
7KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/02_bellman-equations/02_why-bellman-equations.en.srt
6.99KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/05_week-1-summary.en.txt
2.68KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/02_what-to-learn-estimating-action-values/01_learning-action-values.en.srt
6.98KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/06_chapter-summary_instructions.html
1.19KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/03_gaussian-policies-for-continuous-actions.en.txt
6.94KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/01_the-dyna-architecture.en.srt
6.93KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/04_weekly-assessment/02_bandits-and-exploration-exploitation_instructions.html
1.13KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/01_module-2-learning-objectives_instructions.html
2.39KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/02_weekly-reading_instructions.html
1.16KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/03_lets-review-dyna-q-learning-in-a-simple-maze.en.srt
6.9KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/03_dyna-q-learning-in-a-simple-maze.en.srt
6.9KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/02_comparing-td-and-monte-carlo-with-state-aggregation.en.srt
6.85KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/04_examples-of-mdps.en.srt
6.85KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/02_drew-bagnell-on-system-id-optimal-control.en.txt
6.76KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/01_final-project-milestone-1/01_initial-project-meeting-with-martha-formalizing-the-problem.en.srt
6.76KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/03_using-optimal-value-functions-to-get-optimal-policies.en.txt
6.69KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/03_week-1-summary.en.srt
6.69KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/02_goal-of-reinforcement-learning/01_the-goal-of-reinforcement-learning.en.txt
2.62KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/03_drew-bagnell-self-driving-robotics-and-model-based-rl.en.txt
6.66KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/03_policy-evaluation-vs-control.en.srt
6.66KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/04_your-specialization-roadmap.en.srt
6.64KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/01_final-project-milestone-1/02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.en.txt
6.62KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/02_importance-sampling.en.srt
6.58KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/01_what-is-the-trade-off.en.txt
6.57KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/02_exploration-under-function-approximation/01_exploration-under-function-approximation.en.srt
6.53KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/02_policy-iteration-control/01_policy-improvement.en.srt
6.52KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/02_examples-of-episodic-and-continuing-tasks.en.txt
2.52KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/03_solving-the-blackjack-example.en.srt
6.49KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/03_week-2-summary.en.srt
2.77KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/03_week-2-summary.en.txt
1.46KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/01_optimal-policies.en.txt
6.42KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/04_week-3-summary.en.srt
6.38KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/04_weekly-assesment/02_graded-assignment-describe-three-mdps_peer_assignment_instructions.html
2.33KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/05_course-wrap-up/01_congratulations.en.srt
6.34KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/03_susan-murphy-on-rl-in-mobile-health.en.txt
6.32KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/01_the-linear-td-update.en.srt
6.28KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/05_framing-value-estimation-as-supervised-learning.en.srt
6.26KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/01_the-value-error-objective.en.srt
6.24KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/04_state-aggregation-with-monte-carlo.en.txt
6.24KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/03_episodic-sarsa-with-function-approximation.en.srt
6.23KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/02_introducing-gradient-descent.en.txt
6.15KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/03_sarsa-gpi-with-td.en.srt
6.11KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/01_lets-review-non-linear-approximation-with-neural-networks.en.srt
6.1KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/02_non-linear-approximation-with-neural-networks.en.srt
6.1KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.en.txt
6.08KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/05_csaba-szepesvari-on-problem-landscape.en.txt
6.06KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/01_actor-critic-with-softmax-policies.en.srt
5.99KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/01_why-does-off-policy-learning-matter.en.srt
5.93KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/04_rich-sutton-the-importance-of-td-learning.en.txt
5.88KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/03_deep-neural-networks.en.srt
5.88KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/02_demonstration-with-actor-critic.en.txt
5.87KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/06_andy-and-rich-advice-for-students.en.srt
5.84KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/01_weekly-learning-goals/01_agent-architecture-meeting-with-martha-overview-of-design-choices.en.txt
5.78KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/02_q-learning-in-the-windy-grid-world.en.srt
5.78KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/03_what-is-monte-carlo.en.txt
5.65KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/04_using-monte-carlo-for-prediction.en.txt
5.61KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/05_week-1-summary.en.srt
5.6KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/03_moving-to-parameterized-functions.en.txt
5.59KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/04_value-functions.en.txt
5.53KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/01_what-is-a-neural-network.en.srt
5.5KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/06_Resources/01_notebook-grading-faqs/01__resources.html
5.46KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/05_chapter-summary_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/05_chapter-summary_instructions.html
1.14KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/03_learning-policies-directly.en.txt
5.42KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/03_specialization-wrap-up.en.srt
5.41KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/05_rich-sutton-and-andy-barto-a-brief-history-of-rl.en.txt
5.4KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/01_module-4-learning-objectives_instructions.html
3KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/02_weekly-reading_instructions.html
1.17KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/02_planning/01_random-tabular-q-planning.en.srt
5.38KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/02_optimistic-initial-values.en.txt
5.36KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/03_markov-decision-processes.en.txt
5.18KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/02_project-resources/01_lets-review-markov-decision-processes.en.txt
5.18KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/05_tile-coding.en.srt
5.18KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/02_bellman-equations/01_bellman-equation-derivation.en.txt
5.14KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.en.txt
5.12KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/01_weekly-learning-goals/01_meeting-with-adam-parameter-studies-in-rl.en.txt
5.07KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/04_generalization-properties-of-coarse-coding.en.txt
5.04KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/02_lets-review-what-is-q-learning.en.srt
4.95KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/01_what-is-q-learning.en.srt
4.95KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/02_in-depth-with-changing-environments.en.txt
4.92KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/01_specialization-introduction.en.srt
4.92KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/02_policy-gradient-for-continuing-tasks/02_the-policy-gradient-theorem.en.txt
4.91KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/03_actor-critic-for-continuing-tasks/02_actor-critic-algorithm.en.txt
4.91KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/04_lets-review-actor-critic-algorithm.en.txt
4.91KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/03_gradient-monte-for-policy-evaluation.en.txt
4.9KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/02_goal-of-reinforcement-learning/01_the-goal-of-reinforcement-learning.en.srt
4.9KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/03_coarse-coding.en.srt
4.86KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/02_efficiency-of-dynamic-programming.en.txt
4.84KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/03_exploration-methods-for-monte-carlo/01_epsilon-soft-policies.en.txt
4.77KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/02_policy-gradient-for-continuing-tasks/01_the-objective-for-learning-policies.en.txt
4.75KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/01_using-monte-carlo-for-action-values.en.srt
4.73KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/04_advantages-of-policy-parameterization.en.txt
4.72KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/01_course-3-introduction.en.txt
4.67KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/05_week-4-summary.en.txt
2.37KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/02_project-resources/02_lets-review-examples-of-episodic-and-continuing-tasks.en.srt
4.66KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/06_chapter-summary_instructions.html
1.18KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/02_examples-of-episodic-and-continuing-tasks.en.srt
4.66KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/03_week-3-review.en.srt
4.65KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/04_weekly-assessment/02_optimal-policies-with-dynamic-programming_instructions.html
1.13KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/04_meeting-with-martha-in-depth-on-experience-replay.en.txt
4.65KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/01_the-k-armed-bandit-problem/03_sequential-decision-making-with-evaluative-feedback.en.txt
4.65KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/04_episodic-sarsa-in-mountain-car.en.txt
4.65KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/03_actor-critic-for-continuing-tasks/01_estimating-the-policy-gradient.en.txt
4.63KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/04_generalization-and-discrimination.en.txt
4.63KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/01_semi-gradient-td-for-policy-evaluation.en.srt
4.57KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/01_weekly-learning-goals/01_meeting-with-niko-choosing-the-learning-algorithm.en.srt
4.56KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/01_lets-review-optimization-strategies-for-nns.en.txt
4.52KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/02_optimization-strategies-for-nns.en.txt
4.52KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/01_lets-review-expected-sarsa.en.srt
4.52KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/01_expected-sarsa.en.srt
4.52KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/04_reinforcement-learning-textbook_instructions.html
2.19KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/02_optimal-value-functions.en.txt
4.51KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/05_week-4-summary.en.srt
4.48KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/02_weekly-reading-on-policy-prediction-with-approximation_instructions.html
1.17KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/02_bellman-equations/02_why-bellman-equations.en.txt
4.41KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/01_weekly-learning-goals/01_meeting-with-adam-getting-the-agent-details-right.en.txt
4.41KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/05_week-1-summary.en.srt
4.33KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/06_using-tile-coding-in-td.en.txt
4.31KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/02_the-true-objective-for-td.en.txt
4.31KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/01_the-advantages-of-temporal-difference-learning.en.txt
4.3KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/02_what-to-learn-estimating-action-values/02_estimating-action-values-incrementally.en.txt
4.3KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/01_the-dyna-architecture.en.txt
4.28KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/02_project-resources/01_lets-review-comparing-td-and-monte-carlo.en.txt
4.28KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/02_comparing-td-and-monte-carlo.en.txt
4.28KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/01_course-4-introduction.en.srt
4.24KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/05_course-wrap-up/01_congratulations-course-4-preview.en.srt
4.22KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/01_final-project-milestone-1/01_initial-project-meeting-with-martha-formalizing-the-problem.en.txt
4.22KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/03_lets-review-dyna-q-learning-in-a-simple-maze.en.txt
4.19KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/03_dyna-q-learning-in-a-simple-maze.en.txt
4.19KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/03_policy-evaluation-vs-control.en.txt
4.18KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/02_the-dyna-algorithm.en.txt
4.16KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/03_off-policy-monte-carlo-prediction.en.txt
4.15KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/03_what-is-temporal-difference-td-learning.en.txt
4.11KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/04_week-2-review.en.srt
4.07KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/03_upper-confidence-bound-ucb-action-selection.en.txt
4.03KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/02_using-monte-carlo-methods-for-generalized-policy-iteration.en.srt
4.02KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/03_specifying-policies.en.txt
4.01KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/01_semi-gradient-td-for-policy-evaluation.en.txt
2.86KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/01_course-introduction.en.srt
4KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/01_continuing-tasks.en.txt
3.99KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/03_how-is-q-learning-off-policy.en.txt
3.99KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/03_what-is-a-model.en.txt
3.98KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/01_module-1-learning-objectives_instructions.html
3.97KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/05_expected-sarsa-with-function-approximation.en.srt
3.93KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/02_lets-review-expected-sarsa-with-function-approximation.en.srt
3.93KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/01_meeting-with-martha-discussing-your-results.en.srt
3.9KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/04_sarsa-in-the-windy-grid-world.en.srt
3.89KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/04_comparing-sample-and-distribution-models.en.srt
3.87KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/03_episodic-sarsa-with-function-approximation.en.txt
3.85KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/02_non-linear-approximation-with-neural-networks.en.txt
3.85KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/01_lets-review-non-linear-approximation-with-neural-networks.en.txt
3.85KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/01_why-does-off-policy-learning-matter.en.txt
3.82KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/01_flexibility-of-the-policy-iteration-framework.en.txt
3.8KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/02_what-to-learn-estimating-action-values/01_learning-action-values.en.txt
3.8KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/01_what-if-the-model-is-inaccurate.en.txt
3.76KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/02_weekly-reading-on-policy-prediction-with-approximation-ii_instructions.html
1.24KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/02_expected-sarsa-in-the-cliff-world.en.srt
3.73KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/01_actor-critic-with-softmax-policies.en.txt
3.71KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/04_examples-of-mdps.en.txt
3.7KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/04_week-4-summary.en.txt
3.65KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/02_comparing-td-and-monte-carlo-with-state-aggregation.en.txt
3.61KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/04_pre-requisites-and-learning-objectives_instructions.html
3.59KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/03_week-1-summary.en.txt
3.58KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/01_module-4-learning-objectives_instructions.html
3.53KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/05_tile-coding.en.txt
2.79KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/06_andy-and-rich-advice-for-students.en.txt
3.52KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/02_exploration-under-function-approximation/01_exploration-under-function-approximation.en.txt
3.48KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/02_policy-iteration-control/01_policy-improvement.en.txt
3.51KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/02_importance-sampling.en.txt
3.47KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/04_your-specialization-roadmap.en.txt
3.46KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/01_what-is-a-neural-network.en.txt
2.96KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/01_the-value-error-objective.en.txt
3.43KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/03_solving-the-blackjack-example.en.txt
3.42KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/05_course-wrap-up/01_congratulations.en.txt
3.39KB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/03_specialization-wrap-up.en.txt
3.39KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/04_week-3-summary.en.txt
3.37KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/05_framing-value-estimation-as-supervised-learning.en.txt
3.35KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/05_course-wrap-up/01_congratulations.en.srt
3.34KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/01_the-linear-td-update.en.txt
3.3KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/03_sarsa-gpi-with-td.en.txt
3.23KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/01_module-3-learning-objectives_instructions.html
3.2KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/03_read-me-pre-requisites-and-learning-objectives_instructions.html
3.19KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/03_deep-neural-networks.en.txt
3.17KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/04_week-2-summary.en.srt
3.13KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/01_module-2-learning-objectives_instructions.html
3.09KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/02_q-learning-in-the-windy-grid-world.en.txt
3.03KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/01_module-1-learning-objectives_instructions.html
3.02KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/03_coarse-coding.en.txt
3.01KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/04_week-2-review.en.txt
2.16KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/04_read-me-pre-requisites-and-learning-objectives_instructions.html
2.96KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/01_module-3-learning-objectives_instructions.html
2.21KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/02_weekly-reading-on-policy-control-with-approximation_instructions.html
1.27KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/05_week-1-summary.en.txt
2.95KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/02_planning/01_random-tabular-q-planning.en.txt
2.93KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/03_generality-of-expected-sarsa.en.srt
2.88KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/01_module-4-learning-objectives_instructions.html
2.87KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/01_module-3-learning-objectives_instructions.html
2.84KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/01_expected-sarsa.en.txt
2.8KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/04_week-3-summary.en.srt
2.65KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/01_what-is-q-learning.en.txt
2.6KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/05_expected-sarsa-with-function-approximation.en.txt
2.08KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/04_week-4-summary.en.srt
2.58KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/01_using-monte-carlo-for-action-values.en.txt
2.51KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/03_week-3-review.en.txt
2.47KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/04_sarsa-in-the-windy-grid-world.en.txt
2.37KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/02_expected-sarsa-in-the-cliff-world.en.txt
2.31KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/05_course-wrap-up/01_congratulations-course-4-preview.en.txt
2.27KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/03_reinforcement-learning-textbook_instructions.html
2.19KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/01_course-introduction.en.txt
2.12KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/02_using-monte-carlo-methods-for-generalized-policy-iteration.en.txt
2.11KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/04_comparing-sample-and-distribution-models.en.txt
2.09KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/05_course-wrap-up/01_congratulations.en.txt
2.09KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/01_module-2-learning-objectives_instructions.html
1.73KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/04_week-2-summary.en.txt
1.68KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/04_week-3-summary.en.txt
1.6KB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/02_weekly-reading-policy-gradient-methods_instructions.html
1.17KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/03_generality-of-expected-sarsa.en.txt
1.52KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/04_week-4-summary.en.txt
1.36KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/05_chapter-summary_instructions.html
1.23KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/06_text-book-part-1-summary_instructions.html
1.21KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/06_chapter-summary_instructions.html
1.19KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/02_weekly-reading_instructions.html
1.19KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/02_weekly-reading_instructions.html
1.17KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/02_weekly-reading_instructions.html
1.17KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/05_chapter-summary_instructions.html
1.17KB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/02_weekly-reading_instructions.html
1.16KB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/06_chapter-summary_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/03_reinforcement-learning-textbook_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/02_weekly-reading_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/04_reinforcement-learning-textbook_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/02_weekly-reading_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/03_reinforcement-learning-textbook_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/02_weekly-reading-on-policy-control-with-approximation_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/06_chapter-summary_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/02_weekly-reading_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/02_weekly-reading_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/02_weekly-reading_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/02_weekly-reading-on-policy-prediction-with-approximation-ii_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/02_weekly-reading-policy-gradient-methods_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/05_chapter-summary_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/02_weekly-reading_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/02_weekly-reading_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/05_chapter-summary_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/06_text-book-part-1-summary_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/06_chapter-summary_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/02_weekly-reading-on-policy-prediction-with-approximation_RLbook2018.pdf
85.28MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/02_goal-of-reinforcement-learning/02_michael-littman-the-reward-hypothesis.mp4
84.01MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.mp4
80.21MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.mp4
55.29MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/05_rich-sutton-and-andy-barto-a-brief-history-of-rl.mp4
48.75MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.mp4
47.13MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/03_meet-your-instructors.mp4
43.87MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/02_meet-your-instructors.mp4
43.87MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/02_meet-your-instructors.mp4
43.87MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/02_meet-your-instructors.mp4
43.87MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/03_david-silver-on-deep-learning-rl-ai.mp4
41.41MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/05_csaba-szepesvari-on-problem-landscape.mp4
38.81MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/01_final-project-milestone-1/02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.mp4
38.51MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/04_emma-brunskill-batch-reinforcement-learning.mp4
37.38MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/04_rich-sutton-the-importance-of-td-learning.mp4
35.65MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/03_drew-bagnell-self-driving-robotics-and-model-based-rl.mp4
35.21MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/06_andy-and-rich-advice-for-students.mp4
33.39MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/02_course-introduction.mp4
32.39MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/02_drew-bagnell-on-system-id-optimal-control.mp4
31.29MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/02_project-resources/02_joelle-pineau-about-rl-that-matters.mp4
29.5MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/02_demonstration-with-actor-critic.mp4
28.82MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/03_susan-murphy-on-rl-in-mobile-health.mp4
27.63MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/02_satinder-singh-on-intrinsic-rewards.mp4
26.91MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/04_advantages-of-policy-parameterization.mp4
26.06MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/03_moving-to-parameterized-functions.mp4
24.38MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.mp4
23.54MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/06_using-tile-coding-in-td.mp4
23.07MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/05_course-wrap-up/01_congratulations-course-4-preview.mp4
22.11MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/01_welcome-to-the-final-capstone-course/01_course-introduction/01_course-4-introduction.mp4
22.11MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/01_what-is-the-trade-off.mp4
21.58MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/04_meeting-with-martha-in-depth-on-experience-replay.mp4
21.42MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/04_value-functions.mp4
21.1MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/04_state-aggregation-with-monte-carlo.mp4
20.26MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/03_gaussian-policies-for-continuous-actions.mp4
19.95MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/02_what-to-learn-estimating-action-values/02_estimating-action-values-incrementally.mp4
19.4MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/01_average-reward-a-new-way-of-formulating-control-problems.mp4
19.08MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/03_lets-review-average-reward-a-new-way-of-formulating-control-problems.mp4
19.08MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/04_iterative-policy-evaluation.mp4
18.79MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/03_specialization-wrap-up.mp4
18.62MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/01_optimal-policies.mp4
18.46MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/01_specialization-introduction.mp4
18.26MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/03_episodic-sarsa-with-function-approximation.mp4
18.05MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/04_generalization-properties-of-coarse-coding.mp4
17.98MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/02_policy-iteration-control/02_policy-iteration.mp4
17.86MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/01_learning-parameterized-policies/03_learning-policies-directly.mp4
17.1MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/02_bellman-equations/01_bellman-equation-derivation.mp4
17.03MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/03_using-optimal-value-functions-to-get-optimal-policies.mp4
16.73MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/01_actor-critic-with-softmax-policies.mp4
16.53MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/01_welcome-to-the-course/01_course-introduction/01_course-3-introduction.mp4
16.33MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/03_week-1-summary.mp4
16.31MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/01_the-k-armed-bandit-problem/03_sequential-decision-making-with-evaluative-feedback.mp4
16.27MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/04_using-monte-carlo-for-prediction.mp4
16.17MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/01_weekly-learning-goals/01_agent-architecture-meeting-with-martha-overview-of-design-choices.mp4
15.62MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/01_gradient-descent-for-training-neural-networks.mp4
15.53MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/04_episodic-sarsa-in-mountain-car.mp4
15.47MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/01_semi-gradient-td-for-policy-evaluation.mp4
15.35MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/03_deep-neural-networks.mp4
15.33MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/03_gradient-monte-for-policy-evaluation.mp4
15.24MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/02_introducing-gradient-descent.mp4
15.1MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/01_policies-and-value-functions/03_specifying-policies.mp4
14.99MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/01_welcome-to-the-course/01_course-introduction/04_your-specialization-roadmap.mp4
14.88MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/01_introduction-to-monte-carlo-methods/03_what-is-monte-carlo.mp4
14.88MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/01_why-does-off-policy-learning-matter.mp4
14.39MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/01_lets-review-optimization-strategies-for-nns.mp4
14.28MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/02_optimization-strategies-for-nns.mp4
14.28MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/02_what-to-learn-estimating-action-values/01_learning-action-values.mp4
14.22MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/03_actor-critic-for-continuing-tasks/02_actor-critic-algorithm.mp4
14.07MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/04_lets-review-actor-critic-algorithm.mp4
14.07MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/02_efficiency-of-dynamic-programming.mp4
14.03MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/03_solving-the-blackjack-example.mp4
13.91MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/02_the-true-objective-for-td.mp4
13.66MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/03_actor-critic-for-continuing-tasks/01_estimating-the-policy-gradient.mp4
13.63MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/02_policy-gradient-for-continuing-tasks/01_the-objective-for-learning-policies.mp4
13.35MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/01_policy-evaluation-prediction/03_policy-evaluation-vs-control.mp4
13.32MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/01_final-project-milestone-1/01_initial-project-meeting-with-martha-formalizing-the-problem.mp4
13.25MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/02_optimistic-initial-values.mp4
13.13MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/04_generalization-and-discrimination.mp4
12.86MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/03_exploration-methods-for-monte-carlo/01_epsilon-soft-policies.mp4
12.69MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/01_continuing-tasks.mp4
12.67MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/01_weekly-learning-goals/01_meeting-with-adam-getting-the-agent-details-right.mp4
12.6MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/03_off-policy-monte-carlo-prediction.mp4
12.52MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/01_flexibility-of-the-policy-iteration-framework.mp4
12.44MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/03_markov-decision-processes.mp4
12.36MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/02_project-resources/01_lets-review-markov-decision-processes.mp4
12.36MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/01_introduction-to-markov-decision-processes/04_examples-of-mdps.mp4
12.2MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/04_week-3-summary.mp4
11.95MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/02_in-depth-with-changing-environments.mp4
11.94MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.mp4
11.94MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/02_bellman-equations/02_why-bellman-equations.mp4
11.87MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/03_upper-confidence-bound-ucb-action-selection.mp4
11.77MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/03_the-objective-for-td/02_comparing-td-and-monte-carlo-with-state-aggregation.mp4
11.54MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/01_weekly-learning-goals/01_meeting-with-adam-parameter-studies-in-rl.mp4
11.49MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/03_what-is-a-model.mp4
11.33MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/01_welcome-to-the-course/01_course-introduction/01_course-introduction.mp4
11.27MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/02_the-dyna-algorithm.mp4
11.24MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/05_course-wrap-up/01_congratulations.mp4
11.18MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/02_exploration-under-function-approximation/01_exploration-under-function-approximation.mp4
11.05MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/01_meeting-with-martha-discussing-your-results.mp4
10.95MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/02_the-objective-for-on-policy-prediction/01_the-value-error-objective.mp4
10.86MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/03_lets-review-dyna-q-learning-in-a-simple-maze.mp4
10.76MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/03_dyna-q-learning-in-a-simple-maze.mp4
10.76MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/01_estimating-values-functions-with-supervised-learning/05_framing-value-estimation-as-supervised-learning.mp4
10.69MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/01_introduction-to-temporal-difference-learning/03_what-is-temporal-difference-td-learning.mp4
10.31MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/04_value-functions-bellman-equations/03_optimality-optimal-policies-value-functions/02_optimal-value-functions.mp4
10.19MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/02_policy-iteration-control/01_policy-improvement.mp4
9.99MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/04_policy-parameterizations/04_week-4-summary.mp4
9.96MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/03_how-is-q-learning-off-policy.mp4
9.96MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/02_on-policy-prediction-with-approximation/04_linear-td/01_the-linear-td-update.mp4
9.9MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/02_project-resources/01_lets-review-comparing-td-and-monte-carlo.mp4
9.81MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/02_comparing-td-and-monte-carlo.mp4
9.81MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/05_dynamic-programming/03_generalized-policy-iteration/05_week-4-summary.mp4
9.61MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/03_dyna-as-a-formalism-for-planning/01_the-dyna-architecture.mp4
9.59MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/05_week-1-summary.mp4
9.59MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/03_coarse-coding.mp4
9.59MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/02_non-linear-approximation-with-neural-networks.mp4
9.59MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/04_milestone-3-identify-key-performance-parameters/02_project-resources/01_lets-review-non-linear-approximation-with-neural-networks.mp4
9.59MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/02_an-introduction-to-sequential-decision-making/03_exploration-vs-exploitation-tradeoff/05_week-1-summary.mp4
9.48MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/05_policy-gradient/02_policy-gradient-for-continuing-tasks/02_the-policy-gradient-theorem.mp4
9.31MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/02_milestone-1-formalize-word-problem-as-mdp/02_project-resources/02_lets-review-examples-of-episodic-and-continuing-tasks.mp4
9.14MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/02_examples-of-episodic-and-continuing-tasks.mp4
9.14MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/01_the-advantages-of-temporal-difference-learning.mp4
9.1MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/03_average-reward/03_week-3-review.mp4
8.88MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/03_training-neural-networks/04_week-2-review.mp4
8.5MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/02_goal-of-reinforcement-learning/01_the-goal-of-reinforcement-learning.mp4
8.02MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/01_weekly-learning-goals/01_meeting-with-niko-choosing-the-learning-algorithm.mp4
7.88MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/02_lets-review-what-is-q-learning.mp4
7.84MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/01_what-is-q-learning.mp4
7.84MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/02_planning/01_random-tabular-q-planning.mp4
7.83MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/06_milestone-5-submit-your-parameter-study/03_congratulations/02_course-wrap-up.mp4
7.76MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/01_what-if-the-model-is-inaccurate.mp4
7.69MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/04_control-with-approximation/01_episodic-sarsa-with-function-approximation/05_expected-sarsa-with-function-approximation.mp4
7.63MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/05_milestone-4-implement-your-agent/02_project-resources/02_lets-review-expected-sarsa-with-function-approximation.mp4
7.63MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/01_feature-construction-for-linear-methods/05_tile-coding.mp4
7.57MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/04_off-policy-learning-for-prediction/02_importance-sampling.mp4
7.41MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/03_temporal-difference-learning-methods-for-prediction/02_advantages-of-td/04_week-2-summary.mp4
7.41MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/03_sarsa-gpi-with-td.mp4
7.38MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/02_off-policy-td-control-q-learning/02_q-learning-in-the-windy-grid-world.mp4
7.24MB
[TutsNode.net] - Reinforcement Learning Specialization/prediction-control-function-approximation/03_constructing-features-for-prediction/02_neural-networks/01_what-is-a-neural-network.mp4
7.03MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/01_what-is-a-model/04_comparing-sample-and-distribution-models.mp4
6.65MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/01_using-monte-carlo-for-action-values.mp4
6.47MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/01_expected-sarsa.mp4
6.26MB
[TutsNode.net] - Reinforcement Learning Specialization/complete-reinforcement-learning-system/03_milestone-2-choosing-the-right-algorithm/02_project-resources/01_lets-review-expected-sarsa.mp4
6.26MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/01_td-for-control/04_sarsa-in-the-windy-grid-world.mp4
5.85MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/02_expected-sarsa-in-the-cliff-world.mp4
5.69MB
[TutsNode.net] - Reinforcement Learning Specialization/fundamentals-of-reinforcement-learning/03_markov-decision-processes/03_continuing-tasks/03_week-2-summary.mp4
5.42MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/03_generality-of-expected-sarsa.mp4
5.21MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/02_monte-carlo-methods-for-prediction-control/02_monte-carlo-for-control/02_using-monte-carlo-methods-for-generalized-policy-iteration.mp4
5.17MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/05_course-wrap-up/01_congratulations.mp4
4.36MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/05_planning-learning-acting/04_dealing-with-inaccurate-models/04_week-4-summary.mp4
4.25MB
[TutsNode.net] - Reinforcement Learning Specialization/sample-based-learning-methods/04_temporal-difference-learning-methods-for-control/03_expected-sarsa/04_week-3-summary.mp4
3.68MB