unfoldingWord Hebrew Bible
Jesse Griffin 96afdeeb03 Update name (#332) 5 months ago
01-GEN.usfm added book names after ids 7 months ago
02-EXO.usfm added book names after ids 7 months ago
03-LEV.usfm added book names after ids 7 months ago
04-NUM.usfm added book names after ids 7 months ago
05-DEU.usfm added book names after ids 7 months ago
06-JOS.usfm added book names after ids 7 months ago
07-JDG.usfm added book names after ids 7 months ago
08-RUT.usfm Added example link to famine for רָעָ֖ב 6 months ago
09-1SA.usfm added book names after ids 7 months ago
10-2SA.usfm added book names after ids 7 months ago
11-1KI.usfm added book names after ids 7 months ago
12-2KI.usfm added book names after ids 7 months ago
13-1CH.usfm added book names after ids 7 months ago
14-2CH.usfm added book names after ids 7 months ago
15-EZR.usfm added book names after ids 7 months ago
16-NEH.usfm added book names after ids 7 months ago
17-EST.usfm added book names after ids 7 months ago
18-JOB.usfm added book names after ids 7 months ago
19-PSA.usfm added book names after ids 7 months ago
20-PRO.usfm added book names after ids 7 months ago
21-ECC.usfm added book names after ids 7 months ago
22-SNG.usfm added book names after ids 7 months ago
23-ISA.usfm added book names after ids 7 months ago
24-JER.usfm added book names after ids 7 months ago
25-LAM.usfm added book names after ids 7 months ago
26-EZK.usfm added book names after ids 7 months ago
27-DAN.usfm added book names after ids 7 months ago
28-HOS.usfm added book names after ids 7 months ago
29-JOL.usfm added book names after ids 7 months ago
30-AMO.usfm added book names after ids 7 months ago
31-OBA.usfm added book names after ids 7 months ago
32-JON.usfm added book names after ids 7 months ago
33-MIC.usfm added book names after ids 7 months ago
34-NAM.usfm added book names after ids 7 months ago
35-HAB.usfm added book names after ids 7 months ago
36-ZEP.usfm added book names after ids 7 months ago
37-HAG.usfm added book names after ids 7 months ago
38-ZEC.usfm added book names after ids 7 months ago
39-MAL.usfm added book names after ids 7 months ago
LICENSE initial commit 2 years ago
Project Explanation.md Project Explanation 2 years ago
README.md Update 'README.md' 1 year ago
Volunteer job description.md corrected markdown formatting errors 2 years ago
manifest.yaml Update name (#332) 5 months ago

README.md

UHB

The resource we are using as our UHB is the Open Scriptures Hebrew Bible. That project is the Westminster Leningrad Codex with Strong's lexical data and morphological data marked up in OSIS files.

Parsing Status

See the parsing status for the whole Old Testament, or use the book-by-book links below.

Roadmap

Initial Inclusion in tC

Get tC to support OSIS XML files like https://github.com/openscriptures/morphhb/blob/master/wlc/Ruth.xml

  • Lexical data is encoded in the lemma attribute, which holds the word's Strong's number
  • Morphological data is encoded in the morph attribute (key here)
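As a rough illustration of what reading these two attributes could look like, here is a minimal Python sketch using only the standard library. The sample XML is made up for illustration (it is not copied from Ruth.xml), and the function names `local_name` and `read_words` are hypothetical; the only things taken from the notes above are the `lemma` and `morph` attribute names.

```python
import xml.etree.ElementTree as ET

# Illustrative sample only; the values are assumptions, not real OSHB data.
SAMPLE = """<osis xmlns="http://www.bibletechnologies.net/2003/OSIS/namespace">
  <verse osisID="Ruth.1.1">
    <w lemma="1961" morph="HVqw3ms">word1</w>
    <w lemma="3117">word2</w>
  </verse>
</osis>"""


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def read_words(xml_text):
    """Return (text, lemma, morph) tuples; morph is None when the attribute is absent."""
    root = ET.fromstring(xml_text)
    words = []
    for el in root.iter():
        if isinstance(el.tag, str) and local_name(el.tag) == "w":
            words.append(("".join(el.itertext()), el.get("lemma"), el.get("morph")))
    return words


for text, lemma, morph in read_words(SAMPLE):
    print(text, lemma, morph)
```

Matching on the local tag name keeps the sketch independent of the exact namespace URI used in the files.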

We may as well read the files directly from https://github.com/openscriptures/morphhb/blob/master/wlc/ unless we want to create a process that puts them into our container format.

Currently, I'm only seeing about 1% of the words in those files as having morphological data.
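A figure like this could be checked with a small count of `w` elements that carry a non-empty `morph` attribute. The sketch below assumes the XML text of one book has already been loaded into a string; the function name `morph_coverage` is hypothetical.

```python
import xml.etree.ElementTree as ET


def morph_coverage(xml_text):
    """Return (words_with_morph, total_words) for one OSIS document."""
    root = ET.fromstring(xml_text)
    total = 0
    with_morph = 0
    for el in root.iter():
        # Compare on the local tag name so the namespace URI does not matter.
        if isinstance(el.tag, str) and el.tag.rsplit("}", 1)[-1] == "w":
            total += 1
            if el.get("morph"):
                with_morph += 1
    return with_morph, total


def coverage_percent(xml_text):
    """Percentage of words that have morphological data (0.0 for an empty document)."""
    with_morph, total = morph_coverage(xml_text)
    return 100.0 * with_morph / total if total else 0.0
```

Summing the two counts over all 39 book files, rather than averaging the per-book percentages, would give the figure for the whole Old Testament.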

Finishing Morphological Data

Stage 1

Write a comparer script that can verify our proposed parsings from http://hb.openscriptures.org/OshbParse/ against an existing dataset (such as https://shebanq.ancient-data.org/shebanq/static/docs/tools/shebanq/plain.html). If they check out then they can be marked as verified and included in the XML files.
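The core of such a comparer might look like the sketch below. It assumes, hypothetically, that both the proposed parsings and the reference dataset have already been converted to dictionaries keyed by the same word identifier and using the same morphology code scheme; producing that shared form is the harder part and is not shown. The function name `compare_parsings` and the sample identifiers are illustrative only.

```python
def compare_parsings(proposed, reference):
    """Split proposed parsings into verified, conflicting, and unchecked.

    Both arguments map a word identifier to a morphology code.
    Returns (verified, conflicts, unchecked) where:
      verified  maps word id to the agreed code,
      conflicts maps word id to (proposed_code, reference_code),
      unchecked lists word ids that the reference does not cover.
    """
    verified = {}
    conflicts = {}
    unchecked = []
    for word_id, morph in proposed.items():
        if word_id not in reference:
            unchecked.append(word_id)
        elif reference[word_id] == morph:
            verified[word_id] = morph
        else:
            conflicts[word_id] = (morph, reference[word_id])
    return verified, conflicts, unchecked


# Hypothetical identifiers and codes, for illustration only.
proposed = {"Ruth.1.1.1": "HVqw3ms", "Ruth.1.1.2": "HNcmpc", "Ruth.1.1.3": "HVqc"}
reference = {"Ruth.1.1.1": "HVqw3ms", "Ruth.1.1.2": "HNcmsc"}
verified, conflicts, unchecked = compare_parsings(proposed, reference)
```

Only the entries in `verified` would be candidates for inclusion in the XML files; conflicts and unchecked words would still need a human editor.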

Stage 2

Create a process that takes verified parsings from https://github.com/openscriptures/morphhb/blob/master/wlc/ and programmatically guesses at the rest of the words in the OT (e.g. strip cantillation, then find and replace for unknowns). Feed these guesses back into the parsing system at http://hb.openscriptures.org/OshbParse/ and verify them against an existing dataset and/or editors.

If we can make this an iterative process then we would be able to cut down the amount of manual intervention necessary to get the morph data.
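One pass of the guessing step could be sketched as follows. The sketch assumes that "strip cantillation" means removing the Hebrew accent characters in the Unicode range U+0591 to U+05AE while leaving vowel points in place, and that a guess is only made when every verified occurrence of the stripped form has the same parsing. Both are assumptions rather than settled design, and the function names are hypothetical.

```python
import re

# Hebrew accents (cantillation marks) occupy U+0591 through U+05AE.
# Vowel points start at U+05B0 and are deliberately left untouched.
CANTILLATION = re.compile("[\u0591-\u05AE]")


def strip_cantillation(word):
    """Remove cantillation marks from a pointed Hebrew word."""
    return CANTILLATION.sub("", word)


def guess_parsings(verified, unparsed):
    """Propose parsings for unparsed words from verified ones.

    verified: iterable of (word_text, morph) pairs.
    unparsed: iterable of word_text strings.
    Returns {word_text: morph} for words whose accent-free form matches
    verified words that all share a single morph code.
    """
    by_form = {}
    for text, morph in verified:
        by_form.setdefault(strip_cantillation(text), set()).add(morph)

    guesses = {}
    for text in unparsed:
        candidates = by_form.get(strip_cantillation(text), set())
        if len(candidates) == 1:  # skip unknown and ambiguous forms
            guesses[text] = next(iter(candidates))
    return guesses
```

Each round's guesses would go back through verification, and newly verified words would enlarge the `verified` input for the next round, which is what would make the process iterative.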

Completion

After the morphology data is complete, the UHB project will effectively be completed. At the moment there are no further plans to mark up the text with other information.