Skip to content

Commit

Permalink
chore: Update dead links to correct pages (#1127)
Browse files Browse the repository at this point in the history
Summary
Closes #1124

Updates dead links in repository README
- Quick Start > Install for local development
- Learn more > Batch Processing)

Updates document dependencies to include tesseract-lang for additional language support (requirement for tests to pass)

Testing
All tests pass
  • Loading branch information
newelh authored Aug 16, 2023
1 parent 0e887cc commit be093d2
Show file tree
Hide file tree
Showing 3 changed files with 10 additions and 4 deletions.
6 changes: 6 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,9 @@
## 0.10.1-dev0

### Fixes
* Fix dead links in repository README (Quick Start > Install for local development, and Learn more > Batch Processing)
* Update document dependencies to include tesseract-lang for additional language support (required for tests to pass)

## 0.10.0

### Enhancements
Expand Down
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -55,7 +55,7 @@ There are several ways to use the `unstructured` library:
* [Run the library in a container](https://github.com/Unstructured-IO/unstructured#using-the-library-in-a-container) or
* Install the library
1. [Install from PyPI](https://github.com/Unstructured-IO/unstructured#installing-the-library)
2. [Install for local development](https://github.com/Unstructured-IO/unstructured#coffee-installation-instructions-for-local-development)
2. [Install for local development](https://github.com/Unstructured-IO/unstructured#installation-instructions-for-local-development)
* For installation with `conda` on Windows system, please refer to the [documentation](https://unstructured-io.github.io/unstructured/installing.html#installation-with-conda-on-windows)

### Run the library in a container
Expand Down Expand Up @@ -117,7 +117,7 @@ installation.
Depending on what document types you're parsing, you may not need all of these.
- `libmagic-dev` (filetype detection)
- `poppler-utils` (images and PDFs)
- `tesseract-ocr` (images and PDFs)
- `tesseract-ocr` (images and PDFs, install `tesseract-lang` for additional language support)
- `libreoffice` (MS Office docs)
- `pandoc` (EPUBs, RTFs and Open Office docs)

Expand Down Expand Up @@ -244,4 +244,4 @@ Encountered a bug? Please create a new [GitHub issue](https://github.com/Unstruc
|-|-|
| [Company Website](https://unstructured.io) | Unstructured.io product and company info |
| [Documentation](https://unstructured-io.github.io/unstructured) | Full API documentation |
| [Batch Processing](Ingest.md) | Ingesting batches of documents through Unstructured |
| [Batch Processing](unstructured/ingest/README.md) | Ingesting batches of documents through Unstructured |
2 changes: 1 addition & 1 deletion unstructured/__version__.py
Original file line number Diff line number Diff line change
@@ -1 +1 @@
__version__ = "0.10.0" # pragma: no cover
__version__ = "0.10.1-dev0" # pragma: no cover

0 comments on commit be093d2

Please sign in to comment.