Skip to content

Releases: rosettatype/hyperglot

0.7.0 & 0.7.1

07 Oct 13:18
Compare
Choose a tag to compare

0.7.1 (07.10.2024)

  • FIX: Fixed hyperglot-data error

0.7.0 (07.10.2024)

  • DATA: All language yaml documents now have their contributors listed, some have reviewers listed
  • DATA: *Massive improvement of language sources with proper source citations where possible
  • DATA: Added punctuation, numerals and currency attributes to orthographies - checking for these attributes will be added in the next update!
  • DATA: Added lib/hyperglot/extra_data/default.yaml to include inheritable defaults per script
  • DATA: Refined jpn, ryu and ain Katakana orthographies
  • FEATURE: Orthography attributes can inherit from other languages with <iso> syntax, see README
  • TWEAK: Improved loading time for repeat access by saving parsed language data cache file
  • TWEAK: Orthographies can no longer have an inherit attribute
  • TWEAK: Improved loading speed for repeat queries and indivudal language queries
  • TWEAK: Refactored Languages, Language and Orthography object instantiation to always return parsed and defaulted nested objects
  • TWEAK: Removed the --speakers and --autonym CLI options
  • TWEAK: Removed the --comparison CLI option (see examples instead)
  • TWEAK: Removed the --languages CLI option, use hyperglot-info LanguageName/ISO instead
  • TWEAK: Removed the --strict_iso CLI option; use the python library to access this option, particularly Language.get_name(script, strict_iso=True)

0.6.4

20 Jun 07:16
Compare
Choose a tag to compare
  • FIX: Fixed an issue where trying to log missing shaping glyphs would crash in FontChecker
  • FIX: Improved mark shaping detection to interpret ccmp substitutions of base + mark as correctly shaping (thanks @arialcrime)
  • TWEAK: Cleaned up hyperglot.language.Language class and added attribute properties for dict properties with computed defaults (as opposed to writing defaults for missing attributes) as well as more code annotation
  • TWEAK: hyperglot.orthography.Orthography object has script_iso attribute returning the mapped ISO 15924 script tag
  • DATA: Added lib/extra_data/script-names.yaml with a list of all current Hyperglot scripts and a mapping to their ISO 15924 code equivalent
  • DATA: Added di/tri-graphs to Czech and Hungarian orthographies and fixed their order
  • DATA: Added Squamish (squ) (thanks @justinpenner)
  • DATA: Unified "Geʽez" script with reversed comma, as opposed to previous mixed use of "Ge'ez/Fidel" and "Ge'ez"
  • DATA: Amended spelling "Tai Viet" script in title case to match other script names
  • DATA: Corrected spelling of "Bamum" script and language (instead of less used "Bamun" used in Hyperglot)
  • DATA: Use "Coptic" instead of "Coptic/Numbian" script name
  • DATA: Use "Burmese" script for language "Mon"
  • DATA: Use "Baybayin" script name instead of "Tagalog (Baybayin, Alibata)"
  • DATA: Fixed Toki Pona (tok) file name
  • TWEAK: Make sure Orthography.base_chars and Orthography.aux_chars return no duplicates for decomposed character sequences
  • TWEAK: Define Languages, Language and Orthography as module top level exports for easier importing, e.g. now: from hyperglot import Language

0.6.3

08 Apr 12:38
Compare
Choose a tag to compare
  • FIX: Set correct default values for Language.status and Orthography.preferred_as_group and provide validation and tests for these.
  • TWEAK: Deprecated plain list SUPPORTLEVELS, VALIDITYLEVELS, STATUSES, ORTHOGRAPHY_STATUSES and replaced them with SupportLevel, LanguageValidity, LanguageStatus, OrthographyStatus enums throughout the code base. The deprecated values will be removed in the next minor version.
  • TESTS: Added simple tox config for running test on all supported minor python versions

0.6.2

22 Mar 13:34
Compare
Choose a tag to compare
  • FIX: Fixed type hinting issue causing failure on python 3.8.x
  • DATA: Added Banjar (bjn) (thanks @mahalisyarifuddin)
  • DATA: Expanded Xavánte (xav) data (thanks @moyogo)

0.6.1

30 Jan 07:19
Compare
Choose a tag to compare

Minor update with just one language data refinement:

  • DATA: Refined Romanian by adding design_alternates explicitly

0.6.0

16 Jan 10:46
Compare
Choose a tag to compare

This update introduces some initial joining and mark attachment checks to the hyperglot CLI. Parts of the python library have been restructured to accommodate this, so some object and method signatures have changed.

  • DATA: Refined Klingon (tlh) orthography and added a draft version of Toki Pona (tok)
  • FEATURE: Implemented shaping checks for mark positioning when required by unencoded base + mark combinations or --decompose
  • FEATURE: Implemented shaping checks for connecting scripts to detect presence of required positional forms
  • FEATURE: Implemented hyperglot-report command with same options as hyperglot and additional --report-missing n, --report-marks n and --report-joining n — or --report-all n to toggle all aforementioned — parameters/flags for outputting languages almost supported by the font
  • TWEAK: Support checking is now done via hyperglot.checker objects for cleaner separation between language data and checking fonts
  • TWEAK: Various python APIs and objects changed and refactored
  • TWEAK: Bumped required python version to 3.8.0

0.5.3

13 Dec 11:03
Compare
Choose a tag to compare
  • DATA : Added Tlingit tli language data (thanks @jcrippen)
  • DATA: Fixed inconsistent note about Ŋ in various languages (thanks @moyogo)
  • TWEAK: Improved hyperglot-validate to spot lookalike characters in the wrong script, e.g. a (Latin U+0061) vs а (Cyrillic U+0430)
  • TWEAK: Explicitly ignore non-yaml files (e.g. operating system or other) in the data when parsing
  • TWEAK: Improved hyperglot-validate command to better catch yaml issues (thanks for reporting @jcrippen)

0.5.2

23 Nov 09:41
Compare
Choose a tag to compare
  • DATA: Removed orthography status deprecated and using historical for those instances
  • DATA: Added Ethiopic languages awn, byn, gez, har, sgw, tig, xan and updated tir (thanks @dyacob and @NeilSureshPatel)
  • DATA: Added Avestan
  • DATA: Corrections to jbo (thanks @berrymot)
  • DATA: Updated sco primary orthography (thanks @moyogo)
  • DATA: Some fixes to kkj orthography (thanks @moyogo)
  • DATA: Small note added to Dagbani (thanks @clauseggers and @moyogo)
  • DATA: Fix to Shan (shn) containing some stray Latin characters
  • FIX: Fix issue with file name conflicts on Windows systems