Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can I ask you some questions about mel-spectrogram? #11

Open
Dyongh613 opened this issue Jun 29, 2022 · 3 comments
Open

Can I ask you some questions about mel-spectrogram? #11

Dyongh613 opened this issue Jun 29, 2022 · 3 comments

Comments

@Dyongh613
Copy link

HI@keonlee9420, I have some questions to ask you about the mel-spectrogram. In the picture, image
The above mel-spectrogram alignment has been generated, but the horizontal details have not been released yet. What problem do you think caused it

@keonlee9420
Copy link
Owner

Hi @qw1260497397 , thanks for your attention. I need more information about your training. How many steps did you take to generate the mel-spectrogram? What dataset did you use? Did you follow the config in this repo or change something?

At first glance, it seems that more training will solve it.

@Dyongh613
Copy link
Author

Dyongh613 commented Jun 29, 2022 via email

@keonlee9420
Copy link
Owner

Oh, I see. Although I don't know any of details of your implementation, I can give you one tip which is to replace each module one by one with the simplest but surest architecture. For example, you may replace the encoder in PortaSpeech with FastSpeech2's text encoder to check whether the word-to-phoneme alignment was working or not.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants