Convert PDF to CSV or xlsx with python [closed] - python

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed last year.
Improve this question
I'm trying to convert the whole extension of a PDF into a CSV or an xlsx with python and I've hit a wall.
I know that there is an API called PDFTables that works perfectly but the number of documents that I would like to convert (over 400) and the fact that its use involves an economic investment that I can't afford makes its use unfeasible. There is another library that I've tried, tabula, however as far as I know it only works with the tables of the PDF.
With this problem in mind, are there any other options available?
Thank you in advance.

If you don't need it to be programmatic, have you seen https://www.adobe.com/la/acrobat/online/pdf-to-excel.html?

Related

Modules for handling excel specific formatting [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 1 year ago.
Improve this question
Does anyone know of any good Python modules for handling excel specific formatting? For example color coding cells or changing font size, or perhaps even more complicated stuff like pre-generating a pivot table etc. Would be handy to know for generating reports.
There are several; xlsxwriter is a good one. That's what pandas uses. https://xlsxwriter.readthedocs.io/
Technically, this is not a good StackOverflow question and I should be scolded or answering it.

modify libreoffice writer document from python for auto reporting [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 3 years ago.
Improve this question
I would like to generate automated reports from my python program. I was using pdf reportlab solution before, but my needs are evolving. I need the possibility to comment / apply modifications in the report.
As a result, I was imagining to create an empty LibreOffice writer document (but with logos, first page,
etc. like a template), and from my program, I planned to copy this document, feed it from my python prog data, txt and pictures, and save it with a new name.
Like this, the report is closed to be complete and I can adjust it by myself at the end.
Do you know if it is possible to do that ?
Thanks for your help!
One of the solution I found is to use python docx library. It can open and save docx like document. It should work with writer docs

API for spokeo? Scraping Spokeo [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
Is there any API for spokeo ? I wanted to get results in json or xml format and I tried finding an api for it but couldn't. Has anyone tried scraping spokeo with or without the api ? I'm sure we can scrape in a general way but I dont know how to proceed when search results come up with more than one location area. Thanks
According to Spokeo's terms of use, scrapers are explicitly prohibited, as are any "derivative works" - even if all such works do is frame content from their site.
If you publish this in a publicly available application, be prepared for some flak for it.
I think an easier answer would be to work with the FullContact Person API

Looking for generator of Python <--> XML bindings [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 8 years ago.
Improve this question
I'm looking for a generator which takes schema file(s) (XSD) on input and generates Python code (bindings) able to transform xml file to Python objects and the other way around. It should be customizable so that I could plug in my code dealing with transformation between specific types defined in schema and Python's objects (let's say xsd::dateTime to/from datetime.datetime).
Try PyXB.
http://www.rexx.com/~dkuhlman/generateDS.html
That's the first Google hit.
How to convert XSD to Python Class
That's the second Google hit.

Python Library to Generate VCF Files? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 2 years ago.
Improve this question
Know of any good libraries for this? I did some searches and didn't come across anything. Someone somewhere must have done this before, I hate to reinvent the wheel.
I would look at:
http://vobject.skyhouseconsulting.com/usage.html (look under "Usage examples")
Very easy parsing and generation of both vCal and vCard.
PyCoCuMa appears to have a VCF parser built into it, and it'll generate VCard output. You might have some luck with it. I played around with it a bit; it parsed some VCF files I have lying around without any problems. You'll most likely have to poke through the source to figure out how to use it, though.
See:
http://www.srcco.de/v/pycocuma
http://pycocuma.sourcearchive.com/documentation/0.4.5-6-5/vcard_8py-source.html

Categories