Docs: fp16 page #404

pcuenca · 2022-09-07T16:44:40Z

Part of #293.

HuggingFaceDocBuilderDev · 2022-09-07T16:48:18Z

The documentation is not available anymore as the PR was closed or merged.

keturn · 2022-09-07T17:42:36Z

LGTM. float16 made all the difference for me in terms of being able to run it on my hardware; it's good to have examples like this of its use.

A few other details to consider (but could also follow in a future update):

* Is attention slicing useful only for batch processing, or does it still give benefits for a single prompt with a batch size of 1?
* Do these options change the inference result, or are they purely a speed/memory tradeoff that arrives at the same place in the end?
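
To make the two questions above concrete, here is a small pure-Python sketch. It is not the diffusers implementation: it assumes attention slicing means processing the attention heads in chunks rather than all at once, and the helper names (`attention_head`, `attention_sliced`, `to_fp16`) and the toy numbers are made up for illustration. It suggests that slicing reorders the work without changing the math, even for a single input, whereas rounding to float16 perturbs values slightly.

```python
import math
import struct


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_head(q, k, v):
    """Scaled dot-product attention for one head; q, k, v are lists of vectors."""
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for q_vec in q:
        scores = [scale * sum(a * b for a, b in zip(q_vec, k_vec)) for k_vec in k]
        weights = softmax(scores)
        out.append([sum(w * v_vec[d] for w, v_vec in zip(weights, v))
                    for d in range(len(v[0]))])
    return out


def attention_all_heads(heads):
    """Unsliced reference: handle every head in one pass.

    A real batched kernel would hold all heads' score matrices in memory at once.
    """
    return [attention_head(q, k, v) for q, k, v in heads]


def attention_sliced(heads, slice_size=1):
    """Sliced variant: process `slice_size` heads at a time and collect the outputs."""
    outputs = []
    for start in range(0, len(heads), slice_size):
        chunk = heads[start:start + slice_size]
        outputs.extend(attention_head(q, k, v) for q, k, v in chunk)
    return outputs


def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


# Toy input (hypothetical numbers): one prompt, 2 heads, 2 tokens, head dimension 2.
heads = [
    ([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.5], [0.5, 1.0]], [[1.0, 2.0], [3.0, 4.0]]),
    ([[0.5, 0.5], [1.0, -1.0]], [[0.0, 1.0], [1.0, 0.0]], [[0.1, 0.2], [0.3, 0.4]]),
]

full = attention_all_heads(heads)
sliced = attention_sliced(heads, slice_size=1)
print("sliced matches unsliced:", full == sliced)

# float16 keeps fewer significant bits, so most values shift slightly when rounded.
print("0.1 rounded to fp16:", to_fp16(0.1))
```

The toy does not measure memory. It only shows that each head's output is independent of how the heads are grouped, which is why slicing can help even with a batch size of 1.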

docs/source/_toctree_new.yml

docs/source/optimization/fp16.mdx

patrickvonplaten

Looks good to me - feel free to merge. Left some suggestions as comments :-)


pcuenca · 2022-09-08T07:06:22Z

Added a couple of tweaks on top of your suggestions. Thanks a lot @patrickvonplaten and @keturn, very useful observations!

patrickvonplaten · 2022-09-08T07:17:47Z

Very nice!


* Initial version of `fp16` page.
* Fix typo in README.
* Change titles of fp16 section in toctree.
* PR suggestion. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* PR suggestion. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Clarify attention slicing is useful even for batches of 1. Explained by @patrickvonplaten after a suggestion by @keturn. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Do not talk about `batches` in `enable_attention_slicing`.
* Use Tip (just for fun), add link to method.
* Comment about fp16 results looking the same as float32 in practice.
* Style: docstring line wrapping.

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

pcuenca added 3 commits September 7, 2022 18:43

Initial version of fp16 page.

268065b

Fix typo in README.

6019d0d

Change titles of fp16 section in toctree.

fd04705

Merge remote-tracking branch 'origin/main' into docs-optim-fp16

0c2c595

natolambert mentioned this pull request Sep 7, 2022

[Docs] Doc Sprint on Wednesday Sep 7 #293

Closed

patrickvonplaten reviewed Sep 7, 2022

docs/source/_toctree_new.yml (outdated)

patrickvonplaten reviewed Sep 7, 2022

docs/source/optimization/fp16.mdx (outdated)

patrickvonplaten reviewed Sep 7, 2022

docs/source/optimization/fp16.mdx (outdated)

patrickvonplaten reviewed Sep 7, 2022

docs/source/optimization/fp16.mdx (outdated)

patrickvonplaten approved these changes Sep 7, 2022

pcuenca and others added 7 commits September 8, 2022 08:35

PR suggestion

d14c3aa

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

PR suggestion

214873c

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Clarify attention slicing is useful even for batches of 1

8ec1199

Explained by @patrickvonplaten after a suggestion by @keturn. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Merge branch 'docs-optim-fp16' of github.com:huggingface/diffusers in…

956e97c

…to docs-optim-fp16

Do not talk about batches in enable_attention_slicing.

9381d65

Use Tip (just for fun), add link to method.

116a61d

Comment about fp16 results looking the same as float32 in practice.

ebb5ed8

Style: docstring line wrapping.

6a08ebf

patrickvonplaten merged commit c29d81c into main Sep 8, 2022

patrickvonplaten deleted the docs-optim-fp16 branch September 8, 2022 07:21

PhaneeshB pushed a commit to nod-ai/diffusers that referenced this pull request Mar 1, 2023

(TESTING) Fix .whl assets path (huggingface#404)

3405607
