[WIP] Add VisitTensors traits and simplify nn internals. #460

nkoppel · 2023-02-16T17:55:48Z

Adds framework to run functions on all sets of corresponding tensors in groups of immutable and mutable references to modules of a single type, as discussed in #435. Closes #435, and will make #425 easier to implement.

Tasks:

Implement VisitTensorGroups for all Modules in nn
Implement GradientUpdate (disable for some fields in batchnorm2d)
Implement ResetParams (linear and conv2d need special treatment)
Implement SaveToNpz
Implement LoadFromNpz
Document internals in visit_tensors.rs
Implement VisitTensorGroups for Mlp in 07-custom-module

…he same type

…/loading

coreylowman

What's your sense for if this will actually help internals? I see that we've removed quite a bit of code juts with this draft, and probably can remove more of it.

If we could figure out how to use this to implement EMA, I think that'd be a key advantage, but I haven't been able to figure out how to do "zipped" visiting

src/nn/visit_tensors.rs

…move set_option

nkoppel · 2023-02-17T16:31:36Z

I think that this pr will grant us a lot of flexibility in implementing new modules and new functionality for modules. It will also make it much easier for users who implement custom modules to get access to a lot of features that they otherwise would've had to write a lot of boilerplate to use.

Also, I don't think that implementating VisitTensorGroups is very complicated, because it pretty much amounts to specifying how to access each field, and defining each field's name. TensorVisitor is also pretty simple to implement, as it amounts to getting the tensors you need from tensors and passing them to a function. For example, implementing CountParams for every nn module takes only 28 lines of fairly simple code.

src/nn/visit_tensors.rs

coreylowman · 2023-02-17T17:28:40Z

src/nn/add_into.rs

+    fn visit_groups<F: TensorVisitor<N, M, E, D>>(
+        mut self_refs: ModuleGroup<N, M, Self>,
+        func: &mut F,
+    ) -> Result<(), F::Err> {
+        self_refs.map(|s| &s.0, |s| &mut s.0, "0.").visit(func)
    }


Thoughts on including the visitor in the ModuleGroup object, and then call it inside .map?

If we can move the E/D generics to the call method, this shouldn't be an issue.

Something like:

Suggested change

fn visit_groups<F: TensorVisitor<N, M, E, D>>(

mut self_refs: ModuleGroup<N, M, Self>,

func: &mut F,

) -> Result<(), F::Err> {

self_refs.map(|s| &s.0, |s| &mut s.0, "0.").visit(func)

}

fn visit_groups<F: TensorVisitor<N, M>>(

&mut self_refs: ModuleGroup<N, M, Self, F>,

) -> Result<(), F::Err> {

self_refs.map(|s| &s.0, |s| &mut s.0, "0.")

}

We could even go a step further and make ModuleGroup itself a TensorVisitor (wrapped around another TensorVisitor):

Suggested change

fn visit_groups<F: TensorVisitor<N, M, E, D>>(

mut self_refs: ModuleGroup<N, M, Self>,

func: &mut F,

) -> Result<(), F::Err> {

self_refs.map(|s| &s.0, |s| &mut s.0, "0.").visit(func)

}

fn visit_groups<V: TensorVisitor<N, M>>(

&mut visitor: V,

) -> Result<(), F::Err> {

visitor.map(|s| &s.0, |s| &mut s.0, "0.")

}

I've implemented your first suggestion and have renamed some things to make the new functionality of ModuleGroups make more sense. I haven't implemented your second suggestion because I don't really think I understand what you're getting at. What would the call implementation of ModuleGroup look like? How would we treat TensorVisitors not wrapped in a ModuleGroup?

coreylowman · 2023-02-17T17:32:35Z

src/nn/visit_tensors.rs

+    }
+}
+
+pub trait VisitTensors<E: Dtype, D: DeviceStorage>: VisitTensorGroups<1, 0, E, D> + Debug {


Are 1, 0 and 0, 1 for VisitTensorsRef and VisitoTensorsMut the only two cases we would ever support?

I think we might be able to get rid of the VisitTensorGroups trait if we can separately impl some trait (not sure which one) for &T and &mut T

We will eventually support 1, 1 so we can add and multiply modules for EMA.

… TensorVisitor

coreylowman · 2023-02-21T00:35:43Z

@nkoppel i figured out how to do it without the arrays, will open another PR soon

…s to VisitTensors

coreylowman · 2023-02-22T17:22:17Z

Closing since the other PR was merged, thanks for the great work on this @nkoppel, the get_refs/get_muts is super clean/clever! 🚀

nkoppel added 3 commits February 16, 2023 11:37

add framework to visit all tensors in multiple Module references of t…

d9acb13

…he same type

rename VisitTensorsZipped to VisitZippedTensors

4ac9440

rename VisitZippedTensors to VisitTensorGroups

e8077ff

nkoppel changed the title ~~Add VisitTensors traits and simplify nn internals.~~ [WIP] Add VisitTensors traits and simplify nn internals. Feb 16, 2023

nkoppel marked this pull request as draft February 16, 2023 17:58

nkoppel added 7 commits February 16, 2023 12:11

finish CountParams implementation

8ab2883

implement VisitTensorGroups for all Modules

efc486d

implement GradientUpdate for all implementers of VisitTensorsMut

225eb83

run cargo fmt

bb91c18

change configuration of TensorVisitors to only effect the next call

61f2375

have TensorVisitors specify their own error type; implemnt npz saving…

3b46406

…/loading

run cargo fmt

8e728ab

coreylowman reviewed Feb 17, 2023

View reviewed changes

src/nn/visit_tensors.rs Show resolved Hide resolved

src/nn/visit_tensors.rs Outdated Show resolved Hide resolved

src/nn/visit_tensors.rs Outdated Show resolved Hide resolved

have TensorVisitor::call take a list of "TensorVisitorOption"s and re…

b2051d4

…move set_option

run cargo fmt

afc231d

coreylowman reviewed Feb 17, 2023

View reviewed changes

src/nn/visit_tensors.rs Outdated Show resolved Hide resolved

coreylowman reviewed Feb 17, 2023

View reviewed changes

nkoppel added 5 commits February 17, 2023 16:31

rename some utilities in visit_tensors; move TensorFunction inside of…

adbfed5

… TensorVisitor

run cargo fmt

86b2917

implement ResetParams

4085c6f

run cargo fmt

d0c5a08

add item-level documentation; reorganize slightly

06cfd9e

nkoppel added 2 commits February 20, 2023 20:14

remove const generics from VisitTensorGroups; rename VisitTensorGroup…

6d7b388

…s to VisitTensors

run cargo fmt

2b20011

coreylowman mentioned this pull request Feb 21, 2023

Adds TensorCollection #469

Merged

3 tasks

nkoppel mentioned this pull request Feb 21, 2023

Add TensorContainer trait to allow more argument types for TensorVisitors in #469 #472

Merged

coreylowman closed this Feb 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[WIP] Add VisitTensors traits and simplify nn internals. #460

[WIP] Add VisitTensors traits and simplify nn internals. #460

Uh oh!

nkoppel commented Feb 16, 2023 •

edited

Loading

Uh oh!

coreylowman left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nkoppel commented Feb 17, 2023

Uh oh!

Uh oh!

coreylowman Feb 17, 2023

Uh oh!

nkoppel Feb 17, 2023

Uh oh!

coreylowman Feb 17, 2023

Uh oh!

nkoppel Feb 17, 2023 •

edited

Loading

Uh oh!

coreylowman commented Feb 21, 2023

Uh oh!

coreylowman commented Feb 22, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[WIP] Add VisitTensors traits and simplify nn internals. #460

[WIP] Add VisitTensors traits and simplify nn internals. #460

Uh oh!

Conversation

nkoppel commented Feb 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coreylowman left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nkoppel commented Feb 17, 2023

Uh oh!

Uh oh!

coreylowman Feb 17, 2023

Choose a reason for hiding this comment

Uh oh!

nkoppel Feb 17, 2023

Choose a reason for hiding this comment

Uh oh!

coreylowman Feb 17, 2023

Choose a reason for hiding this comment

Uh oh!

nkoppel Feb 17, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coreylowman commented Feb 21, 2023

Uh oh!

coreylowman commented Feb 22, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nkoppel commented Feb 16, 2023 •

edited

Loading

coreylowman left a comment •

edited

Loading

nkoppel Feb 17, 2023 •

edited

Loading