Skip to content

Overfitted model #9236

Sep 17, 2021 · 1 comments · 2 replies
Discussion options

You must be logged in to vote

The most common cause of overfitting is a tiny dataset, but that doesn't appear to affect you so there are some other things you'll have to investigate.

One thing is you have a lot of misaligned tokens, you should look into why that's happening. Another is that you have significant overlap between your training and dev set - that won't cause overfitting directly, but you'll want to fix it.

Putting aside those problems, what kind of generalization issues is your model having specifically? Is it sensitive to case changes or something? In that case you'll want to look at data augmentation, which is an easy and important way to build robustness.

Another thing you can do is check what your lab…

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@farrandi
Comment options

@svlandeg
Comment options

Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
more-info-needed This issue needs more information perf / accuracy Performance: accuracy
3 participants