Sampling, batch_all, non-zero, optimizer-flag #33

lucasb-eyer · 2018-04-25T13:18:19Z

So this is a branch I had lying around for a while which adds a lot of things. I'm not merging it yet, as I still want to add these changes to the README and test it thoroughly, but here it is for others to try out.

It implements the trick I mentioned in #4 as well as some more variants we cover in the paper. If you give this a try, please give feedback here.

maxisme · 2018-04-25T13:56:10Z

train.py

+    if args.loss_ignore_zero is True:
+        nnz = tf.count_nonzero(losses, dtype=tf.float32)
+    else:
+        nnz = tf.reduce_sum(tf.to_float(tf.greater(losses, args.loss_ignore_zero or 1e-5)))


Is the point of this supposed to be just for logging?

No, actually. It's type-magic and can be a little obscure, hence why I still need to write documentation in the README :)

The else case happens when loss_ignore_zero is given an additional float argument, so one can call it as --loss_ignore_zero 1e-3 for example, in order to consider anything below 1e-3 to be counted as zero.

Read our paper, we explain them in there :) But really it's not a good time investment to play with that parameter.

I am going to delete that comment because it makes no sense sorry. 😆 Currently have it printed and highlighted in front of me trying to get to grips!

maxisme · 2018-04-25T15:48:46Z

train.py

    help='Enable the super-mega-advanced top-secret sampling stabilizer.')

+parser.add_argument(
+    '--loss_ignore_zero', default=False, const=True, nargs='?', type=common.positive_float,


Ohh misread this to mean it can only be boolean. I am going to start playing with this then. 🍾

lucasb-eyer added 4 commits November 27, 2017 12:23

Make the optimizer a flag that gets eval'd.

cfeecda

Add batch-all loss, small refactor.

f9f3c05

Implement the =/=0 version of losses from the paper.

c1b9efb

Add batch_sample as loss.

1a87aad

lucasb-eyer mentioned this pull request Apr 25, 2018

Unable to approach loss of less than 0.7 even when testing multiple learning rates. #30

Closed

maxisme reviewed Apr 25, 2018

View reviewed changes

lucasb-eyer mentioned this pull request Aug 14, 2018

Does anyone succeed on imagenet? #54

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sampling, batch_all, non-zero, optimizer-flag #33

Sampling, batch_all, non-zero, optimizer-flag #33

lucasb-eyer commented Apr 25, 2018

maxisme Apr 25, 2018

lucasb-eyer Apr 25, 2018

lucasb-eyer Apr 25, 2018

maxisme Apr 25, 2018 •

edited

Loading

maxisme Apr 25, 2018 •

edited

Loading

Sampling, batch_all, non-zero, optimizer-flag #33

Are you sure you want to change the base?

Sampling, batch_all, non-zero, optimizer-flag #33

Conversation

lucasb-eyer commented Apr 25, 2018

maxisme Apr 25, 2018

Choose a reason for hiding this comment

lucasb-eyer Apr 25, 2018

Choose a reason for hiding this comment

lucasb-eyer Apr 25, 2018

Choose a reason for hiding this comment

maxisme Apr 25, 2018 • edited Loading

Choose a reason for hiding this comment

maxisme Apr 25, 2018 • edited Loading

Choose a reason for hiding this comment

maxisme Apr 25, 2018 •

edited

Loading

maxisme Apr 25, 2018 •

edited

Loading