-
-
Notifications
You must be signed in to change notification settings - Fork 5
Home
Robb Shecter edited this page Mar 11, 2022
·
24 revisions
The key idea is to split parsing into two stages. This reduces the complexity of the code. The first stage (this repo) retrieves and converts original sources to JSON. The actual schema of the JSON mirrors the original content. And so, each original source will have very different looking JSON. However, as JSON (instead of PDF, HTML, etc.) they're all easily read by the next stage. This allows the second stage to focus on converting the source schema to a particular app's schema.
Current project: International Law in support of Ukraine