corpus-linguistics
Here are 326 public repositories matching this topic...
Treebanks modified from PROIEL and Perseus.
-
Updated
Jun 1, 2018
A module to quickly create Corpus objects containing TTR, tokenized sentences, lexical density, class frequencies and more.
-
Updated
Jun 30, 2019 - Python
A tool for determinating distances between multimodal annotations.
-
Updated
Oct 16, 2023 - Python
2019 project - french wikipedia corpus data analysis
-
Updated
Aug 17, 2021 - Python
-
Updated
Aug 17, 2022 - R
Paper that Lena Baunaz and I are working on as part of my SNSF-funded 'Focus in diachrony' research project at the University of Cambridge, UK.
-
Updated
Jan 31, 2023 - Jupyter Notebook
All scripts needed to exploit French corpus and create the associated database for the CODIM Project.
-
Updated
Aug 22, 2023 - Jupyter Notebook
Heuristics and cognitive biases in public discourse on climate changes - lingustic data analysis
-
Updated
Jun 30, 2023 - Jupyter Notebook
Repository for the MA Digital Text Analysis thesis.
-
Updated
Jun 28, 2024 - Jupyter Notebook
The user interface for the Corpus & Repository of Writing, built in Angular
-
Updated
Oct 16, 2024 - TypeScript
Open Corpus Workbench with TEITOK Docker compose file
-
Updated
May 30, 2019 - Dockerfile
evenki-corpus
-
Updated
Jul 2, 2022 - Python
Corpus linguistics final project for the course COMM 313: Computational Text Analysis at the University of Pennsylvania. Aims to determine how the anti-vaccination movement has evolved on social media before and during the COVID-19 pandemic.
-
Updated
May 8, 2021 - Jupyter Notebook
Easy Text Annotator
-
Updated
Feb 1, 2023 - JavaScript
The data and code located in this repository introduce an international preparatory class learner corpus and its complexity analyses.
-
Updated
Oct 24, 2022 - R
(Ongoing module in development) Getting Wikipedia articles parsed content. Created for getting text corpuses data fast and easy. But can be freely used for other purpuses too
-
Updated
Jan 3, 2023 - Python
Kurdish Textbooks Corpus
-
Updated
Feb 9, 2024
The recordings of marwari speech by Bharti, the speaker of it. It Includes setences of all kinds using translation method and narrations of health care and lifecycle.
-
Updated
Jul 4, 2024
Improve this page
Add a description, image, and links to the corpus-linguistics topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the corpus-linguistics topic, visit your repo's landing page and select "manage topics."