Conversation

@nuance1979
Collaborator

  • Add Implicit Quantile Network
  • Reference: paper and code

I will try to get some results over the weekend, but our GPU machines have been a bit busy recently.

@codecov-commenter

codecov-commenter commented May 22, 2021

Codecov Report

Merging #371 (aae6360) into master (458028a) will increase coverage by 0.11%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #371      +/-   ##
==========================================
+ Coverage   94.53%   94.64%   +0.11%     
==========================================
  Files          55       56       +1     
  Lines        3569     3644      +75     
==========================================
+ Hits         3374     3449      +75     
  Misses        195      195              
Flag        Coverage Δ
unittests   94.64% <100.00%> (+0.11%) ⬆️
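
The percentages in the report can be reproduced from the hit and line counts. The sketch below is only a sanity check: the helper name `coverage_percent` is hypothetical, and truncating to two decimals (rather than rounding) is an assumption inferred from the figures, since rounding would give 94.54% and 94.65% instead.

```python
def coverage_percent(hits: int, lines: int) -> float:
    """Coverage as a percentage, truncated (not rounded) to two decimals.

    Truncation is an assumption made to match the figures in the report.
    """
    return (hits * 10_000 // lines) / 100


before = coverage_percent(3374, 3569)  # master
after = coverage_percent(3449, 3644)   # this pull request
delta = round(after - before, 2)
print(before, after, delta)  # 94.53 94.64 0.11
```

All 75 added lines are hit, which is why misses stay at 195 while overall coverage rises.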

Impacted Files                       Coverage Δ
tianshou/policy/__init__.py          100.00% <100.00%> (ø)
tianshou/policy/modelfree/iqn.py     100.00% <100.00%> (ø)
tianshou/utils/net/discrete.py       100.00% <100.00%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 458028a...aae6360.

Comment on lines 190 to 192
    def forward(  # type: ignore
        self, s: Union[np.ndarray, torch.Tensor], sample_size: int, **kwargs: Any
    ) -> Tuple[torch.Tensor, Any]:
Collaborator

@Trinkle23897 May 28, 2021

Hmm, does this change the definition of the network?
https://tianshou.readthedocs.io/en/master/tutorials/dqn.html#build-the-network
The first return value can take any form, but the second one should be the hidden state.
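
One way to satisfy this convention while still exposing the sampled quantile fractions is to pack them into the first return value. The following is a minimal sketch, not the code from this PR: `TinyQuantileNet` is a hypothetical toy model, and feeding tau in by plain concatenation is a simplification chosen only to keep the example short.

```python
from typing import Any, Optional, Tuple

import torch
from torch import nn


class TinyQuantileNet(nn.Module):
    """Hypothetical toy network illustrating the (output, state) convention."""

    def __init__(self, obs_dim: int, n_actions: int) -> None:
        super().__init__()
        # One extra input feature carries the sampled quantile fraction tau.
        self.body = nn.Linear(obs_dim + 1, n_actions)

    def forward(
        self,
        obs: torch.Tensor,
        sample_size: int,
        state: Optional[Any] = None,
        **kwargs: Any,
    ) -> Tuple[Tuple[torch.Tensor, torch.Tensor], Optional[Any]]:
        batch = obs.shape[0]
        taus = torch.rand(batch, sample_size, 1)
        # Repeat each observation once per sampled tau:
        # (batch, sample_size, obs_dim)
        x = obs.unsqueeze(1).expand(-1, sample_size, -1)
        out = self.body(torch.cat([x, taus], dim=-1))
        # First return value: any structure (here a tuple of two tensors).
        # Second return value: the hidden state, passed through unchanged.
        return (out.transpose(1, 2), taus.squeeze(-1)), state


net = TinyQuantileNet(obs_dim=4, n_actions=2)
(quantiles, taus), hidden = net(torch.zeros(3, 4), sample_size=5)
```

Here `quantiles` has shape `(3, 2, 5)` (batch, actions, samples) and `hidden` is `None` because the toy model is not recurrent.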

Collaborator Author

I see. I'll follow the convention.

            sample_size = self._online_sample_size
        else:
            sample_size = self._sample_size
        logits, h = model(obs_, sample_size, state=state, info=batch.info)
Collaborator

Better to pass it as a keyword argument: sample_size=sample_size
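
A small illustration of why the keyword form is safer, using a hypothetical stand-in `toy_model` rather than the actual network from this PR: both calls give the same result today, but only the keyword call keeps working if the parameter order of the callee ever changes.

```python
from typing import Any, List, Optional, Tuple


def toy_model(
    obs: List[int],
    sample_size: int,
    state: Optional[Any] = None,
    info: Optional[dict] = None,
) -> Tuple[List[List[int]], Optional[Any]]:
    """Hypothetical stand-in for the network call: returns (output, state)."""
    return [[o] * sample_size for o in obs], state


# Positional: correct only while sample_size stays the second parameter.
positional = toy_model([1, 2], 3, state=None, info={})

# Keyword: the intent is explicit and independent of parameter order.
keyword = toy_model([1, 2], sample_size=3, state=None, info={})
```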

@Trinkle23897 merged commit f3169b4 into thu-ml:master on May 29, 2021
@nuance1979 deleted the iqn branch on October 6, 2021 17:27
BFAnas pushed a commit to BFAnas/tianshou that referenced this pull request May 5, 2024

Development

Successfully merging this pull request may close these issues.

Implicit Quantile Network (IQN) and Fully parameterized Quantile Function (FQF)

3 participants