About EMA update in paper

Hi, authors.
Thanks for your great work! After reading the paper, I am confused by equation (1). In the original Mean Teacher framework, the update equation is written as $\phi_{t+1} = \mu \phi_t + (1-\mu) \theta_{t+1}$, which means the student model is first updated by the backward gradient, and the teacher model is then updated by EMA from the new student weights. In your paper, however, the order appears to be reversed, as shown below, which seems inconsistent with the original paper. Is this a typo, or am I misunderstanding something?

![image](https://user-images.githubusercontent.com/38565875/196872493-2bd7ca2a-ede1-4dc6-9d9f-a0ec99821ca4.png)
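
To make my reading of the original ordering concrete, here is a minimal sketch. It is not taken from the paper or from this repository: the function names `ema_update` and `train_step`, the plain-list parameters, and the vanilla SGD step are all assumptions made only for illustration.

```python
def ema_update(teacher, student, mu):
    """Element-wise EMA: mu * teacher + (1 - mu) * student."""
    return [mu * p + (1.0 - mu) * q for p, q in zip(teacher, student)]


def train_step(teacher, student, grads, lr, mu):
    """One step in the order I understand from Mean Teacher (hypothetical helper).

    teacher: phi_t, student: theta_t, grads: gradient w.r.t. theta_t.
    """
    # 1) Student step first (plain SGD assumed): theta_{t+1} = theta_t - lr * grad
    new_student = [q - lr * g for q, g in zip(student, grads)]
    # 2) Teacher EMA then uses the already-updated student theta_{t+1}:
    #    phi_{t+1} = mu * phi_t + (1 - mu) * theta_{t+1}
    new_teacher = ema_update(teacher, new_student, mu)
    return new_teacher, new_student


if __name__ == "__main__":
    phi, theta = train_step(teacher=[0.0], student=[2.0], grads=[1.0], lr=1.0, mu=0.5)
    print(phi, theta)
```

With this ordering the teacher averages in $\theta_{t+1}$; if the EMA ran before the student step, it would average in $\theta_t$ instead, which is the difference I am asking about.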

