This is a pretty significant upgrade vs 2.3.1. We added a number of features:
- We now support ingesting, rendering and annotating txt-based formats like plaintext, markdown, etc.
- Our document ingestion pipeline has a parser for txt-based formats.
- The task decorator for custom tasks will automatically switch from span-based to token-based annotations depending on the underlying format. At the moment this is just pdf vs non-pdf, but could be a richer taxonomy.
- Substantial styling improvements.
What's Changed
- Bump pytest from 8.2.2 to 8.3.3 by @dependabot in #227
- Bump pytz from 2022.7 to 2024.2 by @dependabot in #226
- Bump psycopg2 from 2.9.5 to 2.9.9 by @dependabot in #229
- Bump traefik from 3.1.4 to 3.1.5 in /compose/production/traefik by @dependabot in #232
- Bump actions/checkout from 4.1.7 to 4.2.0 by @dependabot in #231
- Bump cryptography from 43.0.0 to 43.0.1 by @dependabot in #228
- Bump traefik from 3.1.5 to 3.1.6 in /compose/production/traefik by @dependabot in #238
- Bump actions/checkout from 4.2.0 to 4.2.1 by @dependabot in #236
- Add Txt Annotator by @JSv4 in #233
Full Changelog: v2.3.1...v2.4.0