Other functions for working with Hebrew orthography

Published

July 11, 2024

The package includes a number of functions to simplify working with the orthography of Biblical Hebrew.

using BiblicalHebrew

Find the Unicode label for a codepoint.

ch = 'ד'
codept_name(ch)
"dalet"

Normalize or reduce string

Reduce a string to a consonantal representation by stripping off accents and vocalization markings. Since the font used by Julia’s Documenter package does not display these additional codepoints, we will verify that they in fact are present by checking the list of codepoints in the string.

Original pointed string:

s1 = "הַדְּבָרִ֗ים"
"הַדְּבָרִ֗ים"

Consonantal reduction:

stripped = BiblicalHebrew.unpointed(s1)
"הדברים"

Reduce a string to consonants and vowel points only by stripping off punctuation, accents or other cantillation marks.

accented = "בִלְהָ֛ה"
"בִלְהָ֛ה"

Consonants and vowel points only:

unaccented = BiblicalHebrew.rm_accents(accented)
"בִלְהָה"

Test individual characters

Determine if individual characters in a String are consonants or vowe points in Biblical Hebrew.

codepoints = collect(s1)
BiblicalHebrew.is_consonant(codepoints[1])
true
BiblicalHebrew.is_vowelpoint(codepoints[1])
false
BiblicalHebrew.is_consonant(codepoints[2])
false
BiblicalHebrew.is_vowelpoint(codepoints[2])
true

TBA: EXPANDED CHARACTER TESTING

  • all alphabetic have named constants
  • is_consonant, is_vowel, is_sheva
  • is_guttural

Vectors: - matres -