getting Layout Error! Scheme found before folio entry. when i try to clone and run the code #20

Open
bharath4793 opened this issue Jan 13, 2021 · 17 comments

Comments

@bharath4793

bharath4793 commented Jan 13, 2021

Hi

When I try to run the example shown in README.md, I get the error below:

raise CASParseError("Layout Error! Scheme found before folio entry.")
casparser.exceptions.CASParseError: Layout Error! Scheme found before folio entry.

I'm not sure whether I have missed anything. The statement was downloaded from CAMS on 13/01/2021.

@codereverser
Owner

If you don't mind, could you please send me a cropped screenshot of just the first scheme entry from the CAS (with all personal data removed)? Something like this:

cas_screenshot

@bharath4793
Author

bharath4793 commented Jan 13, 2021 via email

@codereverser
Owner

Looks like you forgot to attach the file.

@bharath4793
Author

bharath4793 commented Jan 13, 2021 via email

@bharath4793
Author

bharath4793 commented Jan 13, 2021 via email

@codereverser
Owner

The image is still not visible here. It looks like GitHub doesn't support attaching images in "Reply via email".

Can you please email me the file if you are having issues uploading it here? My email is available in my profile. [codereverser .at. gmail .dot. com]

@bharath4793
Author

I have emailed you, and I am also attaching the snapshot here.

Screen Shot 2021-01-13 at 4 24 07 PM

@codereverser
Owner

Many thanks!
I've pushed a commit that should resolve this issue. Can you please update your code and test?

@bharath4793
Author

Any idea how to pull in the changes you have made? When I uninstall the requirements and reinstall them using pip, I think it reinstalls from the cache.

@bharath4793
Author

The previous error seems to be resolved, but I am now getting the error below:

raise CASParseError("Unable to parse investor data")

casparser.exceptions.CASParseError: Unable to parse investor data

@codereverser
Owner

I've pushed a new version to sort out the issues. Try installing it with:

pip install -U 'casparser[mupdf]==0.4.1'

If the error still persists, please give the force_pdfminer option a try and let me know.

via API,

data = read_cas_pdf(filename, password, force_pdfminer=True)

or via CLI,

casparser -p password --force-pdfminer filename
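
The retry described above can also be scripted. What follows is a minimal sketch, assuming the first attempt uses the default backend and that any exception from it justifies one retry with force_pdfminer=True. The helper name read_with_fallback is hypothetical and not part of casparser; the reader is passed in as a parameter so the helper can be tried without a real CAS file.

```python
from typing import Any, Callable


def read_with_fallback(reader: Callable[..., Any], filename: str, password: str) -> Any:
    """Call reader(filename, password); on failure retry with force_pdfminer=True.

    Hypothetical helper, not part of casparser. `reader` is expected to be
    casparser.read_cas_pdf or any callable with the same call shape.
    """
    try:
        return reader(filename, password)
    except Exception as first_error:
        # Retry once with the pdfminer backend; if that fails too,
        # chain both errors so neither traceback is lost.
        try:
            return reader(filename, password, force_pdfminer=True)
        except Exception as second_error:
            raise second_error from first_error
```

With casparser installed, this would be called as read_with_fallback(casparser.read_cas_pdf, filename, password).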

@bharath4793
Author

Even after adding the above parameter, the error still persists:

Traceback (most recent call last):
  File "<project dir>/parse_mf_cams_pdf.py", line 113, in <module>
    csv_json()
  File "<project dir>/parse_mf_cams_pdf.py", line 96, in csv_json
    data = casparser.read_cas_pdf(pdf_path, "password", force_pdfminer=True)
  File "<project dir>/venv/lib/python3.8/site-packages/casparser/parsers/__init__.py", line 25, in read_cas_pdf
    partial_cas_data = cas_pdf_to_text(filename, password)
  File "<project dir>/venv/lib/python3.8/site-packages/casparser/parsers/pdfminer.py", line 150, in cas_pdf_to_text
    investor_info = parse_investor_info(layout, *page.mediabox[2:])
  File "<project dir>/venv/lib/python3.8/site-packages/casparser/parsers/pdfminer.py", line 53, in parse_investor_info
    raise CASParseError("Unable to parse investor data")
casparser.exceptions.CASParseError: Unable to parse investor data

With the CLI, I get the error below:
Error parsing pdf file:: Unable to parse investor data

@codereverser
Owner

Fixed in 76cf582.

@developerdiary

I installed the latest package but am still getting the same issue: "Unable to parse investor data".

@codereverser
Owner

In that case, it would be difficult to fix this without looking at the actual CAS file (or at least the layout of the investor info). It is working for all the sample files I have. Maybe the contact details are in a different format from the ones I've seen.

Can you please send me the casparser version?

casparser --version
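
The same check can be done from Python, which also helps confirm whether a reinstall actually picked up a new release. This is a minimal sketch using the standard library's importlib.metadata (Python 3.8+); the helper name installed_version is hypothetical.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None
```

Calling installed_version("casparser") in the same virtual environment used to run the parser should report the version that is actually being imported.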

@codereverser codereverser reopened this Mar 12, 2024
@developerdiary

Version: 0.7.4
I can share my CAS file with you. Please share your email address.

@codereverser
Owner

The CAS file would be super helpful, many thanks! It may be a regression in one of the new releases.

My email ID is in my profile: https://github.com/codereverser
