refactor namespaces in criterion interface #1729

erip · 2020-02-21T01:17:44Z

Before submitting

Was this discussed/approved via a Github issue? (no need for typos, doc improvements)
Did you read the contributor guideline?
Did you make sure to update the docs?
Did you write any new necessary tests?

What does this PR do?

Fixes #1672 in part (part 1: context)

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

…s as an attribute.

… interface.

…new criterion interface.

facebook-github-bot

@myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

myleott · 2020-02-27T16:07:14Z

I'm almost done with this one, hoping to merge later today :) I had to make some changes to support backwards compatibility, since there's a lot of internal criterions that still use the old API.

erip · 2020-02-27T16:53:39Z

Sorry for all the extra work, but I'm excited about these changes. Wiring up hydra should be a breeze after this.

myleott · 2020-03-03T13:49:51Z

Sorry for the delay, still iterating on this. Will be merged soon...

facebook-github-bot

@myleott is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: # Before submitting - [x] Was this discussed/approved via a Github issue? (no need for typos, doc improvements) - [x] Did you read the [contributor guideline](https://github.com/pytorch/fairseq/blob/master/CONTRIBUTING.md)? - [x] Did you make sure to update the docs? - [x] Did you write any new necessary tests? ## What does this PR do? Fixes #1672 in part (part 1: [context](#1714 (comment))) ## PR review Anyone in the community is free to review the PR once the tests have passed. If we didn't discuss your PR in Github issues there's a high chance it will not be merged. ## Did you have fun? Make sure you had fun coding � Pull Request resolved: #1729 Differential Revision: D20049353 Pulled By: myleott fbshipit-source-id: 732077a1cc339c9f7ebe26dae42a7e8d7b5a07b4

erip · 2020-03-05T01:28:17Z

Woo! Glad to see this land -- thanks, @myleott. I see that this was a lot of work -- if there's anything you want to push back to me for the other components, please don't hesitate.

myleott · 2020-03-05T04:06:25Z

Yeah thanks for all your work getting this started!

The first one took a bit longer because it required some discussion and figuring out the right way to maintain backward compatibility. The key is providing a Legacy version of the base classes, for example the LegacyFairseqCriterion.

It’d be super helpful if you can add some of those for the other PRs, but in general I hope the next few of these should be easier to merge.

erip · 2020-03-05T12:31:59Z

Awesome. Happy to do that - looks like a light lift after the hard decision making was made up front. 😄 Should I also make PRs to pytorch/translate in tandem (like the associated commit w/ this PR)? I was given the impression that translate and fairseq were going to merge eventually, but I don't suspect that changes the need to keep them in lock step.

MultiPath · 2020-03-11T21:23:24Z

Hi, what is the purpose for this change which might cause all the user defined criterion broken?

erip · 2020-03-11T21:25:50Z

The purpose of this change is to divorce components in fairseq from argparsing. If you want to use some model/criterion/task/etc. outside of fairseq, you basically cannot. We have retained a LegacyFairseqCriterion interface which is a minimal change to keep compatibility with the previous interface. It doesn't offer perfect backwards compatibility, but fairseq isn't 1.0.0 yet so the API is subject to change some small amount.

MultiPath · 2020-03-11T21:46:47Z

I don't understand.
The original criterion design already contained "add_args" functions to add specific arguments. I am not sure the current change for "from_args" is necessary and I felt it made code even confusing.

erip · 2020-03-11T21:55:33Z

The original criterion design stored an argparse.Namespace as an instance variable which means that you need one to create a criterion which is inconvenient. Eventually all of these {add,from}_args methods will be replaced by proper configuration.

MultiPath · 2020-03-11T21:57:57Z

Not sure what do you mean by "you need one to create a criterion which is inconvenient. "
I think the design of args has more freedom to add different configurations as needed.

erip · 2020-03-11T21:58:56Z

Basically:

class Criterion:
    def __init__(self, args: argparse.Namespace):
        self.args = args # <-- this has nothing to do with what it means to be a criterion

The above code is very nasty and makes the code very hard to test, very hard to use in production situations, and very brittle.

MultiPath · 2020-03-11T22:03:43Z

Ok.. If it is the case, everything needs to be changed in fairseq...

erip · 2020-03-11T22:06:21Z

These changes are not as dramatic as they appear. Basically it's standardizing the code toward best practices which has a lot of benefit downstream in testing and consuming.

MultiPath · 2020-03-11T22:23:09Z

I think using "args" makes it much easier to pass arguments which might be defined outside of criterions.

erip · 2020-03-11T22:25:22Z

Why can't you pass it to the constructor directly?

MultiPath · 2020-03-11T22:26:00Z

Also, if a function requires 10+ more arguments as inputs to "init", it will also be very difficult to read and work with.

erip · 2020-03-11T22:26:54Z

Difficult to read: maybe. Consider abstracting something.
Difficult to work with: I don't think I agree.

MultiPath · 2020-03-11T22:32:13Z

Anyway, I still don't think it is a good idea to remove all "args" to build the current model/criterion/etc.
There may be a lot of places where you need to have this "args" as input, otherwise it is easily to get 10+ or 20+ inputs in a complicated model.
Just to be clear, since Myle has already accepted the PR, I just left my comments here.

erip · 2020-03-11T22:35:42Z

To make this exceeding clear: if someone develops an interesting optimizer in fairseq that I want to use in my pytorch code, there's currently no way to do that unless I mock argument parsing. Optimizers should not care about argument parsing much like models, lr schedulers, tokenizers, byte-pair encoders, loss functions, or just about any other component in software (outside of a main function) should not.

MultiPath · 2020-03-11T22:39:50Z

..but this PR is about "loss functions".

erip · 2020-03-11T22:42:12Z

I was giving an example. There are PRs for every other component, as well. See #1743, #1733, #1732, #1731, #1730

MultiPath · 2020-03-11T22:48:40Z

Yes, I understand.
I think it is super unnecessary and makes everything hard to read, change, and maintain our previous codebase. Especially for "tasks", "models".

myleott · 2020-03-13T19:37:10Z

Sorry just catching up on this.

@MultiPath, I shared some context/motivation in the other PR (#1743), but I agree that we need to be careful here for several reasons:

Currently fairseq uses args everywhere, so we need to be careful not to break existing code. We introduced the "LegacyFairseqCriterion" base class that should be a drop-in replacement to make any existing code continue to work.
There are cases where using a single args is very helpful and we should continue to support this where it's appropriate.
The hope is that this will make it easier to use fairseq as a library. The motivation is to let people initialize and use fairseq components as standalone pieces in other projects, rather than only supporting the fairseq command-line usage.

myleott · 2020-03-13T19:41:14Z

The original criterion design already contained "add_args" functions to add specific arguments. I am not sure the current change for "from_args" is necessary

Hmm, seems this is a github bug. Please ignore the "Files Changed" shown in this PR. The actual commit is 46b773a, which is different from what's shown in this PR. Notably, we didn't end up adding from_args and also introduced LegacyFairseqCriterion for backward compatibility.

Summary: # Before submitting - [x] Was this discussed/approved via a Github issue? (no need for typos, doc improvements) - [x] Did you read the [contributor guideline](https://github.com/pytorch/fairseq/blob/master/CONTRIBUTING.md)? - [x] Did you make sure to update the docs? - [x] Did you write any new necessary tests? ## What does this PR do? Fixes facebookresearch#1672 in part (part 1: [context](facebookresearch#1714 (comment))) ## PR review Anyone in the community is free to review the PR once the tests have passed. If we didn't discuss your PR in Github issues there's a high chance it will not be merged. ## Did you have fun? Make sure you had fun coding � Pull Request resolved: facebookresearch#1729 Differential Revision: D20049353 Pulled By: myleott fbshipit-source-id: 732077a1cc339c9f7ebe26dae42a7e8d7b5a07b4

Summary: # Before submitting - [x] Was this discussed/approved via a Github issue? (no need for typos, doc improvements) - [x] Did you read the [contributor guideline](https://github.com/pytorch/fairseq/blob/master/CONTRIBUTING.md)? - [x] Did you make sure to update the docs? - [x] Did you write any new necessary tests? ## What does this PR do? Fixes facebookresearch/fairseq#1672 in part (part 1: [context](facebookresearch/fairseq#1714 (comment))) ## PR review Anyone in the community is free to review the PR once the tests have passed. If we didn't discuss your PR in Github issues there's a high chance it will not be merged. ## Did you have fun? Make sure you had fun coding � Pull Request resolved: facebookresearch/fairseq#1729 Differential Revision: D20049353 Pulled By: myleott fbshipit-source-id: 732077a1cc339c9f7ebe26dae42a7e8d7b5a07b4

erip added 16 commits February 20, 2020 20:15

update base criterion to require from_args classmethod and remove arg…

9ac544b

…s as an attribute.

update adaptive loss to incorporate new criterion interface.

95ee916

update binary cross entropy loss to incorporate new criterion interface.

b502bfa

update cross entropy loss to incorporate new criterion interface.

367d7c1

update label-smoothed cross entropy loss to incorporate new criterion…

3e0a315

… interface.

update label-smoothed cross entropy w/ alignment loss to incorporate …

9b23590

…new criterion interface.

update legacy masked lm loss to incorporate new criterion interface.

7d87a8c

update masked lm loss to incorporate new criterion interface.

0d5d3c9

update NAT loss to incorporate new criterion interface.

ecb6465

update sent prediction loss to incorporate new criterion interface.

2f762a2

update sent ranking loss to incorporate new criterion interface.

77056ab

update composite loss to incorporate new criterion interface.

11d7c0d

use new from_args API in build_criterion.

80878e2

fix bug in label smoothed cross entropy super call.

85d90bc

fix label smoothing tests.

6aaed08

fix cross entropy for ASR and accompanying test.

02db5c8

facebook-github-bot added the CLA Signed label Feb 21, 2020

facebook-github-bot reviewed Feb 22, 2020

View reviewed changes

Merge branch 'master' into feature/refactor-namespaces-criterion

3d998a6

facebook-github-bot reviewed Mar 4, 2020

View reviewed changes

facebook-github-bot closed this in pytorch/translate@bb4f01b Mar 5, 2020

erip deleted the feature/refactor-namespaces-criterion branch March 5, 2020 01:36

facebook-github-bot added the Merged label Mar 5, 2020

prihoda mentioned this pull request Mar 12, 2020

Enable per-token classification in RoBERTa #1709

Closed

4 tasks

refactor namespaces in criterion interface #1729

refactor namespaces in criterion interface #1729

Uh oh!

Conversation

erip commented Feb 21, 2020

Before submitting

What does this PR do?

PR review

Did you have fun?

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

myleott commented Feb 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

erip commented Feb 27, 2020

Uh oh!

myleott commented Mar 3, 2020

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

erip commented Mar 5, 2020

Uh oh!

myleott commented Mar 5, 2020

Uh oh!

erip commented Mar 5, 2020

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

erip commented Mar 11, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

erip commented Mar 11, 2020

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

erip commented Mar 11, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

erip commented Mar 11, 2020

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

erip commented Mar 11, 2020

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

erip commented Mar 11, 2020

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

erip commented Mar 11, 2020

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

erip commented Mar 11, 2020

Uh oh!

MultiPath commented Mar 11, 2020

Uh oh!

myleott commented Mar 13, 2020

Uh oh!

myleott commented Mar 13, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

myleott commented Feb 27, 2020 •

edited

Loading

erip commented Mar 11, 2020 •

edited

Loading

erip commented Mar 11, 2020 •

edited

Loading

myleott commented Mar 13, 2020 •

edited

Loading