
Fix n-step EV-SARSA reward indexing #552

Open
AI-Ahmed wants to merge 1 commit into yandexdataschool:master from AI-Ahmed:patch-11
Conversation

@AI-Ahmed

Contributor

Correct n-step return computation in NStepEVSarsaAgent by starting reward accumulation from the proper memory index, which fixes nstep_evsarsa_1 getting stuck at -10000.
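
For readers following the thread, here is a minimal sketch of how an n-step Expected SARSA target is typically formed and why the starting index of the reward sum matters. This is not the code from this PR or from NStepEVSarsaAgent: the function names, the `start` parameter, and the epsilon-greedy policy are assumptions made purely for illustration.

```python
def expected_value(q_values, epsilon):
    """Expected Q-value of a state under an epsilon-greedy policy.

    Hypothetical helper: with probability (1 - epsilon) the greedy action
    is taken, otherwise an action is drawn uniformly at random.
    """
    greedy = max(q_values)
    uniform = sum(q_values) / len(q_values)
    return (1.0 - epsilon) * greedy + epsilon * uniform


def n_step_target(rewards, gamma, bootstrap_value, start=0):
    """Discounted sum of rewards[start:] plus the discounted bootstrap value.

    `rewards` is the buffer of rewards stored for the last n transitions
    (oldest first). `start` is the index the accumulation begins at; an
    off-by-one here silently drops or shifts a reward in the target.
    """
    target = 0.0
    discount = 1.0
    for reward in list(rewards)[start:]:
        target += discount * reward
        discount *= gamma
    # Bootstrap from the state reached after the last accumulated reward.
    return target + discount * bootstrap_value


# Same buffer, two different starting indices, two different targets.
rewards = [1.0, 2.0, 3.0]
bootstrap = expected_value([1.0, 3.0], epsilon=0.5)
full_target = n_step_target(rewards, gamma=0.5, bootstrap_value=bootstrap)
shifted_target = n_step_target(rewards, gamma=0.5, bootstrap_value=bootstrap, start=1)
```

In this sketch, starting one slot too late both drops the first reward and changes the discount applied to the bootstrap term, so the agent learns from a systematically wrong target. Whether that is the exact failure mode behind nstep_evsarsa_1 is for the diff itself to show.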

@review-notebook-app


Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@dniku

dniku commented Mar 29, 2026

Collaborator

@kharitonov-ivan I think this PR is addressing an issue introduced in #546. Could you take a look?

@dniku dniku requested a review from kharitonov-ivan March 29, 2026 17:20


2 participants