Skip to content

Commit

Permalink
version 0.0.80; bump to unstructured 0.15.10 (#458)
Browse files Browse the repository at this point in the history
### Summary

Bumps to `unstructured==0.15.10`.
  • Loading branch information
MthwRobinson authored Sep 10, 2024
1 parent 3c3b75a commit c52a2d1
Show file tree
Hide file tree
Showing 4 changed files with 101 additions and 107 deletions.
5 changes: 3 additions & 2 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
## 0.0.80-dev0
## 0.0.80

* Add `include_slide_notes` parameter, indicating whether slide notes in `ppt` and `pptx` files should be partitioned. Default is `True`. Now, when slide notes are present in the file, they will be included alongside other elements, which may shift the index numbers of non-note elements.
* Bump to `unstructured` 0.15.10
* Add `include_slide_notes` parameter, indicating whether slide notes in `ppt` and `pptx` files should be partitioned. Default is `True`. Now, when slide notes are present in the file, they will be included alongside other elements, which may shift the index numbers of non-note elements.

## 0.0.79

Expand Down
8 changes: 6 additions & 2 deletions requirements/base.in
Original file line number Diff line number Diff line change
@@ -1,10 +1,14 @@
-c constraints.in
unstructured[all-docs]>=0.8.1
unstructured[all-docs]
# Pinning click due to a unicode issue in black
# can remove after black drops support for Python 3.6
# ref: https://github.com/psf/black/issues/2964
click==8.1.3
fastapi
# NOTE(robinson) - fastapi>=0.114.0 causes the test listed below to fail, though it
# works if data if chunking strategy and new_after_n_chars are explicitly set. Pinning
# for now to preserve behavior
# test_parallel_mode_preserves_uniqueness_of_hashes_when_asssembling_page_splits
fastapi<0.114.0
uvicorn
ratelimit
requests
Expand Down
70 changes: 35 additions & 35 deletions requirements/base.txt
Original file line number Diff line number Diff line change
Expand Up @@ -20,13 +20,13 @@ beautifulsoup4==4.12.3
# via unstructured
cachetools==5.5.0
# via google-auth
certifi==2024.7.4
certifi==2024.8.30
# via
# httpcore
# httpx
# requests
# unstructured-client
cffi==1.17.0
cffi==1.17.1
# via cryptography
chardet==5.2.0
# via unstructured
Expand All @@ -43,17 +43,17 @@ click==8.1.3
# uvicorn
coloredlogs==15.0.1
# via onnxruntime
contourpy==1.2.1
contourpy==1.3.0
# via matplotlib
cryptography==43.0.0
cryptography==43.0.1
# via pdfminer-six
cycler==0.12.1
# via matplotlib
dataclasses-json==0.6.7
# via
# unstructured
# unstructured-client
deepdiff==7.0.1
deepdiff==8.0.1
# via unstructured-client
deprecated==1.2.14
# via pikepdf
Expand All @@ -65,9 +65,9 @@ et-xmlfile==1.1.0
# via openpyxl
exceptiongroup==1.2.2
# via anyio
fastapi==0.112.1
fastapi==0.113.0
# via -r requirements/base.in
filelock==3.15.4
filelock==3.16.0
# via
# huggingface-hub
# torch
Expand All @@ -78,35 +78,35 @@ flatbuffers==24.3.25
# via onnxruntime
fonttools==4.53.1
# via matplotlib
fsspec==2024.6.1
fsspec==2024.9.0
# via
# huggingface-hub
# torch
google-api-core[grpc]==2.19.1
google-api-core[grpc]==2.19.2
# via google-cloud-vision
google-auth==2.34.0
# via
# google-api-core
# google-cloud-vision
google-cloud-vision==3.7.4
# via unstructured
googleapis-common-protos==1.63.2
googleapis-common-protos==1.65.0
# via
# google-api-core
# grpcio-status
grpcio==1.65.5
grpcio==1.66.1
# via
# google-api-core
# grpcio-status
grpcio-status==1.65.5
grpcio-status==1.66.1
# via google-api-core
h11==0.14.0
# via
# httpcore
# uvicorn
httpcore==1.0.5
# via httpx
httpx==0.27.0
httpx==0.27.2
# via unstructured-client
huggingface-hub==0.24.6
# via
Expand All @@ -116,7 +116,7 @@ huggingface-hub==0.24.6
# unstructured-inference
humanfriendly==10.0
# via coloredlogs
idna==3.7
idna==3.8
# via
# anyio
# httpx
Expand All @@ -130,7 +130,7 @@ joblib==1.4.2
# via nltk
jsonpath-python==1.0.6
# via unstructured-client
kiwisolver==1.4.5
kiwisolver==1.4.7
# via matplotlib
langdetect==1.0.9
# via unstructured
Expand Down Expand Up @@ -191,15 +191,15 @@ onnx==1.16.2
# via
# unstructured
# unstructured-inference
onnxruntime==1.19.0
onnxruntime==1.19.2
# via unstructured-inference
opencv-python==4.10.0.84
# via
# layoutparser
# unstructured-inference
openpyxl==3.1.5
# via unstructured
ordered-set==4.1.0
orderly-set==5.2.2
# via deepdiff
packaging==24.1
# via
Expand All @@ -225,28 +225,28 @@ pdfminer-six==20231228
# unstructured
pdfplumber==0.11.4
# via layoutparser
pikepdf==9.1.1
pi-heif==0.18.0
# via unstructured
pikepdf==9.2.1
# via unstructured
pillow==10.4.0
# via
# layoutparser
# matplotlib
# pdf2image
# pdfplumber
# pi-heif
# pikepdf
# pillow-heif
# python-pptx
# torchvision
# unstructured-pytesseract
pillow-heif==0.18.0
# via unstructured
portalocker==2.10.1
# via iopath
proto-plus==1.24.0
# via
# google-api-core
# google-cloud-vision
protobuf==5.27.3
protobuf==5.28.0
# via
# google-api-core
# google-cloud-vision
Expand All @@ -271,13 +271,13 @@ pycparser==2.22
# via cffi
pycryptodome==3.20.0
# via -r requirements/base.in
pydantic==2.8.2
pydantic==2.9.1
# via fastapi
pydantic-core==2.20.1
pydantic-core==2.23.3
# via pydantic
pypandoc==1.13
# via unstructured
pyparsing==3.1.2
pyparsing==3.1.4
# via matplotlib
pypdf==4.3.1
# via
Expand Down Expand Up @@ -312,7 +312,7 @@ pyyaml==6.0.2
# omegaconf
# timm
# transformers
rapidfuzz==3.9.6
rapidfuzz==3.9.7
# via
# unstructured
# unstructured-inference
Expand All @@ -335,11 +335,11 @@ requests-toolbelt==1.0.0
# via unstructured-client
rsa==4.9
# via google-auth
safetensors==0.4.4
safetensors==0.4.5
# via
# timm
# transformers
scipy==1.14.0
scipy==1.14.1
# via layoutparser
six==1.16.0
# via
Expand All @@ -352,27 +352,27 @@ sniffio==1.3.1
# httpx
soupsieve==2.6
# via beautifulsoup4
starlette==0.38.2
starlette==0.38.5
# via fastapi
sympy==1.13.2
# via
# onnxruntime
# torch
tabulate==0.9.0
# via unstructured
timm==1.0.8
timm==1.0.9
# via
# effdet
# unstructured-inference
tokenizers==0.19.1
# via transformers
torch==2.4.0
torch==2.4.1
# via
# effdet
# timm
# torchvision
# unstructured-inference
torchvision==0.19.0
torchvision==0.19.1
# via
# effdet
# timm
Expand All @@ -383,7 +383,7 @@ tqdm==4.66.5
# nltk
# transformers
# unstructured
transformers==4.44.1
transformers==4.44.2
# via unstructured-inference
typing-extensions==4.12.2
# via
Expand All @@ -409,9 +409,9 @@ typing-inspect==0.9.0
# unstructured-client
tzdata==2024.1
# via pandas
unstructured[all-docs]==0.15.7
unstructured[all-docs]==0.15.10
# via -r requirements/base.in
unstructured-client==0.25.5
unstructured-client==0.25.8
# via unstructured
unstructured-inference==0.7.36
# via unstructured
Expand Down
Loading

0 comments on commit c52a2d1

Please sign in to comment.