Simplify formatting process to lexeme based outputs rather than string based #142

Open
2 tasks done
andrewtavis opened this issue Jun 7, 2024 · 0 comments
Labels: -priority- (High priority), feature (New feature or request)

andrewtavis commented Jun 7, 2024

Terms

Description

This issue covers simplifying the data formatting process, specifically the noun formatting processes, so that they export lexeme-based files from the data. As of now the formatting processes are very long and migrate all the data from Wikidata into JSONs keyed by string, so lexemes that share the same string but have different meanings end up in the same entry. We do not want to do this anymore based on the decisions in #59 and #110 :)

  • Note: the Swedish nouns process should also be improved such that definite versions of nouns are also included :)
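
To make the difference concrete, here is a minimal sketch of the two output shapes. It is not the actual Scribe-Data formatting code: the function names, field names (`lexemeID`, `singular`, `plural`, `gender`) and lexeme IDs are all assumptions made up for illustration. It uses German "Bank", which means both "bench" (plural "Bänke") and "bank" (plural "Banken").

```python
# Hypothetical query results: the IDs and field names are placeholders,
# not real Wikidata lexeme IDs or the real Scribe-Data schema.
records = [
    {"lexemeID": "L_EXAMPLE_1", "singular": "Bank", "plural": "Bänke", "gender": "F"},
    {"lexemeID": "L_EXAMPLE_2", "singular": "Bank", "plural": "Banken", "gender": "F"},
]


def format_by_string(rows):
    """String-keyed output: lexemes with the same singular form share one entry."""
    formatted = {}
    for row in rows:
        entry = formatted.setdefault(row["singular"], {"plural": [], "gender": []})
        for field in ("plural", "gender"):
            # Merge values from every lexeme that has this string.
            if row[field] not in entry[field]:
                entry[field].append(row[field])
    return formatted


def format_by_lexeme(rows):
    """Lexeme-keyed output: one entry per lexeme, so meanings stay separate."""
    return {
        row["lexemeID"]: {key: value for key, value in row.items() if key != "lexemeID"}
        for row in rows
    }


by_string = format_by_string(records)
by_lexeme = format_by_lexeme(records)
print(len(by_string), len(by_lexeme))
```

In the string-keyed version both meanings collapse under the single key "Bank" and their plurals are mixed together, while the lexeme-keyed version keeps two separate entries and needs no merge logic at all.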

Contribution

I'll be working on this, and once it's done the old state of Scribe-Data will be officially cleaned up!

Projects
Status: Todo