Fix batch size 1 by specifying squeeze dims by hecko-yes · Pull Request #166 · yl4579/StyleTTS2

hecko-yes · 2023-12-19T10:35:10Z

Fixes #104 (as far as I can tell).

yl4579 · 2023-12-22T06:12:41Z

Thanks for your fix, but what does `.squeeze(0)` do? Doesn't it squeeze over the batch dimension and cause issues?

hecko-yes · 2023-12-22T07:05:35Z

...hm. I'll be honest, I don't remember my reasoning behind it; I just know it doesn't break.

Looking at the relevant code again, it seems the `if len(sp) <= 1:` check makes it return early anyway if the first dimension is squeezable. Perhaps the `.squeeze(0)` calls should be removed entirely (maybe along with the early return?).

yl4579 · 2023-12-22T07:32:00Z

So does that mean it's probably just redundant? Does it still work with a batch size greater than 1? Also, it would be great if you could implement gradient accumulation for the accelerate version of fine-tuning with 1 GPU, but someone else could do it too. Sorry, I am too busy to test the code or add new features.

GUUser91 · 2024-02-05T22:36:36Z

@Sobsz
I tried fine-tuning a model on your branch, but I got `ZeroDivisionError: division by zero` after 1 epoch. Batch size is set to 1.

hecko-yes · 2024-02-05T23:01:59Z

Seems like the testing iterations are failing silently because of the `except: continue` at lines 673-674. (That, or your validation list is empty...?) Try replacing the `continue` with `raise` and see what error you get.
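For anyone following along, the failure mode described above can be reduced to a few lines. This is a hypothetical sketch, not the code from train_finetune.py: `run_validation`, `failing_step`, and the way the loss is averaged are assumptions made for illustration. If every validation iteration raises and the exception is swallowed, the iteration counter stays at zero and the final average divides by zero; re-raising surfaces the real error instead. An empty validation list ends the same way.

```python
def run_validation(batches, step, reraise=False):
    """Average a per-batch loss while skipping batches whose step raises."""
    loss_total = 0.0
    iters = 0
    for batch in batches:
        try:
            loss_total += step(batch)
            iters += 1
        except Exception:
            if reraise:
                raise  # surface the underlying error
            continue  # silently skip the failing batch
    # Raises ZeroDivisionError when no batch succeeded (or none were given).
    return loss_total / iters


def failing_step(batch):
    # Hypothetical stand-in for a forward pass that breaks on every batch.
    raise IndexError("Dimension out of range")
```

Calling `run_validation([1, 2, 3], failing_step)` ends in `ZeroDivisionError`, while passing `reraise=True` raises the original `IndexError`, which is the more useful thing to see while debugging.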

GUUser91 · 2024-02-06T01:59:34Z

@Sobsz
Now I get `IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)`. I'm using the ROCm 5.7 nightly PyTorch build, if it helps.

GUUser91 · 2024-02-06T15:05:44Z

@Sobsz
Here's a clearer version:

Traceback (most recent call last):
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/train_finetune.py", line 707, in <module>
    main()
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/click/core.py", line 1157, in __call__
    return self.main(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/click/core.py", line 1078, in main
    rv = self.invoke(ctx)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/click/core.py", line 1434, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/click/core.py", line 783, in invoke
    return __callback(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/train_finetune.py", line 614, in main
    d, p = model.predictor(d_en, s,
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
    return forward_call(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/parallel/data_parallel.py", line 183, in forward
    return self.module(*inputs[0], **module_kwargs[0])
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
    return forward_call(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/models.py", line 469, in forward
    d = self.text_encoder(texts, style, text_lengths, m)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
    return forward_call(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/models.py", line 550, in forward
    x = block(x.transpose(-1, -2), style).transpose(-1, -2)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
    return forward_call(*args, **kwargs)
  File "/run/media/user/e1745494-af46-4749-9e1a-89d2b2289699/StyleTTS2/models.py", line 431, in forward
    h = h.view(h.size(0), h.size(1), 1)
IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)
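The last frame suggests that `h` has only one dimension when `h.size(1)` is called, which is what you would see if a size-1 batch dimension had been squeezed away somewhere upstream. The snippet below is a minimal reproduction under that assumption, not the code from models.py: the `nn.Linear` layer, the tensor sizes, and the variable names are all hypothetical.

```python
import torch
import torch.nn as nn

fc = nn.Linear(4, 6)        # hypothetical stand-in for the layer that produces h
style = torch.zeros(1, 4)   # batch size 1, style dimension 4 (assumed sizes)

# With the batch dimension intact, the reshape works.
h_ok = fc(style)                                   # shape (1, 6)
h_ok = h_ok.view(h_ok.size(0), h_ok.size(1), 1)    # shape (1, 6, 1)

# An argument-free squeeze() removes every size-1 dimension, including the batch.
h_bad = fc(style.squeeze())                        # shape (6,)
try:
    h_bad.view(h_bad.size(0), h_bad.size(1), 1)
    error_message = None
except IndexError as err:
    error_message = str(err)                       # size(1) does not exist on a 1-D tensor
```

If that is indeed the cause, keeping the batch dimension (for example by squeezing a specific non-batch dimension) should avoid the error, though which call in the training script drops it would need to be confirmed with a debugger.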

korakoe

Validation losses have a range mismatch. Something likely has to be done here, as we can't just go without a validation loss.

korakoe · 2024-03-18T11:19:04Z

Turns out that this also breaks higher batch sizes.

brthor · 2024-04-12T23:09:35Z

.squeeze(0) is going to squeeze the wrong dimension when batch size > 1, need to use .squeeze(dim=1) in most of these places (from manually stepping through with a debugger).
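A quick way to see the difference is to compare output shapes for a tensor with a singleton dimension in position 1. This is an illustrative sketch with an assumed layout of (batch, 1, time); the actual tensors in the training script may be shaped differently. `torch.Tensor.squeeze(dim)` only removes the given dimension when its size is 1 and otherwise leaves the tensor unchanged.

```python
import torch


def squeeze_shapes(batch_size):
    """Return the shapes produced by each squeeze variant on a (batch, 1, 5) tensor."""
    x = torch.zeros(batch_size, 1, 5)  # assumed layout: (batch, singleton, time)
    return {
        "squeeze()": tuple(x.squeeze().shape),
        "squeeze(0)": tuple(x.squeeze(0).shape),
        "squeeze(1)": tuple(x.squeeze(1).shape),
    }


print(squeeze_shapes(1))  # {'squeeze()': (5,), 'squeeze(0)': (1, 5), 'squeeze(1)': (1, 5)}
print(squeeze_shapes(4))  # {'squeeze()': (4, 5), 'squeeze(0)': (4, 1, 5), 'squeeze(1)': (4, 5)}
```

With batch size 1, `squeeze(0)` and `squeeze(1)` happen to give the same shape, which may be why the change appeared to work. With batch size 4, `squeeze(0)` is a no-op and leaves the singleton dimension in place, while `squeeze(1)` gives the expected (batch, time) shape for both batch sizes.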

korakoe · 2024-05-08T11:03:30Z

> .squeeze(0) is going to squeeze the wrong dimension when batch size > 1, need to use .squeeze(dim=1) in most of these places (from manually stepping through with a debugger).

@brthor Can all of them be changed to dim=1, or do some need to remain 0? If so, which ones?

Fix batch size 1 by specifying squeeze dims

dd50370

rikabi89 approved these changes Feb 29, 2024

korakoe suggested changes Mar 1, 2024

hecko-yes wants to merge 1 commit into yl4579:main from hecko-yes:main

6 participants
