Skip to content

SpanCategorizer: SPAN context vs content #8947

Discussion options

You must be logged in to vote

There is not a flag or value you can modify to change this behavior. Basically the model will decide on how to represent context and token info based on the training data.

What you can do is augment your training data by replacing your labeled entities with other words. If you use common nouns then the model may be able to learn to rely on context more, though I think the effect might not be very significant.

If you're feeling very adventurous you could try detecting entities, masking them by replacing them with a filler token like XXX, and seeing how that's labelled, to avoid any influence from token identity. But I'm not sure I'd recommend that.

Are you doing something where you label r…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@mbrunecky
Comment options

Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / spancat Feature: Span Categorizer
2 participants