Free CuArrays in the reverse pass by mcabbott · Pull Request #1340 · FluxML/Zygote.jl

mcabbott · 2022-12-18T21:48:30Z

This adds:

- A flag on Context to indicate that the pullback will never be called twice: set to true for gradient, false for jacobian.
- Modifications to many rules, especially for broadcasting, so that y = f(x) in the forward pass has finalize(y) in the reverse pass. This increases the largest Flux model that can run on a given GPU.
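To illustrate the idea in the two points above, here is a minimal CPU-only sketch. It is not Zygote's implementation: `ToyContext`, `poison!`, `maybe_free!` and `exp_with_pullback` are hypothetical names, and filling with NaN stands in for actually releasing GPU memory.

```julia
# Hypothetical stand-in for a context carrying the "pullback runs at most once" flag.
struct ToyContext
    only_once::Bool
end

# Stand-in for freeing memory: overwrite with NaN so any later read is obvious.
poison!(y::AbstractArray{<:AbstractFloat}) = (fill!(y, NaN); nothing)
poison!(y) = nothing  # anything else is left alone

# Release y only when the context promises the pullback is single-use.
maybe_free!(ctx::ToyContext, y) = ctx.only_once ? poison!(y) : nothing

# Forward pass of y = exp.(x), returning y and a pullback closure.
function exp_with_pullback(ctx::ToyContext, x::AbstractArray)
    y = exp.(x)
    function back(dy)
        dx = dy .* y          # last use of y in the reverse pass
        maybe_free!(ctx, y)   # safe only if back is never called again
        return dx
    end
    return y, back
end

x = [0.0, 1.0]
y_once, back_once = exp_with_pullback(ToyContext(true), x)
dx_once = back_once(ones(2))    # y_once is poisoned after this call

y_many, back_many = exp_with_pullback(ToyContext(false), x)
back_many(ones(2))
dx_many = back_many(ones(2))    # a second call is still valid here
```

With the flag set to false the pullback can be called repeatedly, which is what a Jacobian computation needs. With the flag set to true the forward result is released as soon as the reverse pass has used it.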

Applying such modifications everywhere led to many errors, some from rules like y = x .+ false, which return y === x under Zygote. So rules must now opt in through a separate macro, @adjoint_final.
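The aliasing hazard can be reproduced without Zygote. In the sketch below, `alias_rule` is a hypothetical rule whose forward pass returns its input (as y = x .+ false is said to do under Zygote), `with_final` is a hypothetical blanket wrapper, and `poison!` stands in for freeing memory.

```julia
# Stand-in for freeing memory: overwrite with NaN.
poison!(y::AbstractArray{<:AbstractFloat}) = (fill!(y, NaN); nothing)

# Hypothetical rule whose forward output is the input array itself.
alias_rule(x) = (x, dy -> dy)

# Blanket wrapper: poison the forward output once the pullback has run.
function with_final(rule, x)
    y, back = rule(x)
    function back_final(dy)
        dx = back(dy)
        poison!(y)
        return dx
    end
    return y, back_final
end

x = [1.0, 2.0, 3.0]
y, back = with_final(alias_rule, x)
dx = back([1.0, 1.0, 1.0])

# Because y === x, poisoning y has also destroyed the caller's x.
x_was_destroyed = all(isnan, x)
```

This is the kind of failure that makes a blanket policy unsafe and motivates an explicit opt-in for rules known to return a fresh array.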

At present this modification is applied to all CR rrules. This is probably unsafe and we should revert 2524163. It is unclear how best to opt in within ChainRules. Xref JuliaDiff/ChainRulesCore.jl#592 about the idea of a flag, though I'm not entirely sure that's the right approach.

Explicit finalising won't work well with thunks, which doesn't matter at all yet, but might after #966.

It also does not work with second derivatives, hence is disabled. Other uses of the context flag (like testing only_once(cfg) & then over-writing some array) probably also need to be disabled.

Needs FluxML/ZygoteRules.jl#23 so CI will fail. Locally, one failure, one failure to fail:

Global Params: Error During Test at /Users/me/.julia/dev/Zygote/test/features.jl:399
  Got exception outside of a @test
  KeyError: key :(Main.global_param) not found
  Stacktrace:
    [1] getindex(d::IdDict{Any, Any}, key::Any)
      @ Base ./iddict.jl:108
    [2] macro expansion
      @ ~/.julia/dev/Zygote/test/features.jl:404 [inlined]

Compiler: Error During Test at /Users/me/.julia/dev/Zygote/test/compiler.jl:35
 Unexpected Pass
 Expression: trace_contains(bt, :badly, "compiler.jl", 24)
 Got correct result, please change to @test if no longer broken.

chengchingwen · 2022-12-19T09:42:48Z

Could we combine this with JuliaDiff/ChainRulesCore.jl#592 ?

mcabbott · 2022-12-19T14:31:42Z

It's possible. I think that means having two distinct structs, ZygoteRuleConfig and ZygoteOnceRuleConfig or something.

At present, BTW, most of these maybe_finals seem not to be called & I'm not sure why.

chengchingwen · 2022-12-19T14:37:22Z

> It's possible. I think that means having two distinct structs, ZygoteRuleConfig and ZygoteOnceRuleConfig or something.

Or introduce another type parameter like ZygoteRuleConfig{once} where once?

> At present, BTW, most of these maybe_finals seem not to be called & I'm not sure why.

Do you mean the finalize is not called, or it is called but the memory is not freed?

mcabbott · 2022-12-19T14:51:41Z

> Do you mean the finalize is not called

With something like this

Zygote.maybe_final(x::CuArray) = begin CNT[]+=1; CUDA.unsafe_free!(x); nothing end

a big ResNet gradient used to give me CNT[] == 3 afterwards. ~~(Thought I had this working when I opened it...)~~ Fixed in 9f01eff.

> type parameter like ZygoteRuleConfig{once} where once

But I don't think that fits CR's mechanism; the current struct is <: RuleConfig{Union{HasReverseMode,NoForwardsMode}} and the new ones would need different supertypes.

We could also think about changing it to <: RuleConfig{Union{HasReverseMode,NoForwardsMode}, true}, in which case matching Context{..., true} would be easy.

chengchingwen · 2022-12-20T12:40:29Z

> But I don't think that fits CR's mechanism; the current struct is <: RuleConfig{Union{HasReverseMode,NoForwardsMode}} and the new ones would need different supertypes.

Couldn't it be done like struct ZygoteRuleConfig{P<:PullbackCapability} <: RuleConfig{Union{HasReverseMode,NoForwardsMode,P}}?
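A rough sketch of how such a parameterised config could dispatch is given below. The types are toy stand-ins defined locally so the block runs without ChainRulesCore. `PullbackCapability`, `SingleUsePullback`, `ReusablePullback`, `ToyRuleConfig` and `only_once` are assumptions made for illustration, not existing API.

```julia
# Toy stand-ins mirroring the shape of ChainRulesCore's config types.
abstract type RuleConfig{T} end
struct HasReverseMode end
struct NoForwardsMode end

# Hypothetical trait family describing how often a pullback may be called.
abstract type PullbackCapability end
struct SingleUsePullback <: PullbackCapability end
struct ReusablePullback <: PullbackCapability end

# The capability becomes one more member of the Union in the supertype.
struct ToyRuleConfig{P<:PullbackCapability} <: RuleConfig{Union{HasReverseMode,NoForwardsMode,P}} end

# Rules can then test for the capability with the usual `>:` pattern.
only_once(::RuleConfig{>:SingleUsePullback}) = true
only_once(::RuleConfig) = false

once_cfg = ToyRuleConfig{SingleUsePullback}()
many_cfg = ToyRuleConfig{ReusablePullback}()
```

Here a single struct covers both cases, and a rule that wants to free memory would dispatch on the single-use capability, leaving every other config on the conservative path.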

mcabbott · 2022-12-20T14:39:41Z

Oh right, that ought to work.

Current status is that some arrays are freed too early (e.g. with Metalhead's ResNet, at addact(relu)) but it's hard to isolate. Still happens if I disable all thunks. In Zygote's tests, some failures due to too-early fill!(x, NaN) (included here as a test), perhaps related.
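The fill!(x, NaN) trick mentioned above can be sketched on the CPU: replacing the real free with NaN poisoning turns a premature free into a visibly wrong gradient. The function below is a hypothetical hand-written reverse pass, not code from this PR, and its `early` keyword deliberately introduces the bug.

```julia
# Stand-in for freeing memory: overwrite with NaN.
poison!(y::AbstractArray{<:AbstractFloat}) = (fill!(y, NaN); nothing)

# Hand-written gradient of sum(y .* y) with y = 2 .* x, i.e. 8 .* x.
# With early = true, y is poisoned before its last use, mimicking a too-early free.
function toy_gradient(x; early::Bool)
    y = 2 .* x
    early && poison!(y)   # bug: y is still needed on the next line
    dy = 2 .* y           # derivative of sum(y .* y) with respect to y
    early || poison!(y)   # correct placement: y has no further uses
    dx = 2 .* dy          # chain rule through y = 2 .* x
    return dx
end

good = toy_gradient([1.0, 2.0]; early = false)
bad = toy_gradient([1.0, 2.0]; early = true)
```

Checking any(isnan, bad) in a test then flags the premature free, whereas a real free on the GPU might fail silently or only intermittently.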

mcabbott added 6 commits December 18, 2022 09:13

add only_once flag to Context, for pullback non-re-use

125e8a2

define maybe_final, insert into a few adjoint rules (map, broadcast)

424df8f

insert maybe_final into all CR rrules

2524163

insert maybe_final into all ZygoteRules at-adjoint rules

a1a6896

use adjoint_final from ZygoteRules, broadcasting should opt-in

759ca60

skip kron test on 1.9+

7e6e31e

CarloLucibello reviewed Dec 19, 2022

Comment thread src/compiler/chainrules.jl Outdated

fixup

9f01eff

ToucheSir mentioned this pull request Feb 3, 2023

Define @adjoint_final FluxML/ZygoteRules.jl#23

Closed

mcabbott closed this Mar 27, 2024

mcabbott wants to merge 7 commits into FluxML:master from mcabbott:auto_final


3 participants
