Unable to install Tabula with Anaconda - python

I am trying to install Tabula for Python, as it seems it is the way of extracting tables from PDFs. However I am unable to install it.
I am using Anaconda and have followed the step on Tabula's Anaconda page (https://anaconda.org/auto/tabula) to attempt to install it:
conda install -c auto tabula
But I just get an error message:
link here
As far as I'm aware, I have added the "auto" channel so it should be able to install it. But I guess I must be missing something. Any help much appreciated!

You could just try:
conda install -c conda-forge tabula-py
It works fine for me.

Since you are using Windows and in the link you provide I just see Linux-64 and Linux-32 I think that installing Tabula with Conda can return errors.
Activate your Conda environment and install Tabula using pip:
pip3 install tabula-py
Note
As pointed out in a comment by chezou the conda-forge way of installing Tabula seems not the best way to go if you want to keep it updated:
Conda package is supported by someone else and it seems not maintained
well.
As of Feb 24th, 2019, conda version is v1.1.1 while the latest
pypi package is 1.3.1.
I would recommend installing via pip.

Conda sources are limited to packages available in the channels you have set up. You will need to either:
Set up a channel in Conda that contains tabula.(I tried this with other pacages but couldn't figure out a working method.)
https://conda.io/docs/user-guide/tasks/manage-environments.html
Install tabula into your Anaconda environment from source.
https://docs.python.org/2/install/
use pip to install tabula in Conda environment.(you will need to install pip first)
https://github.com/ContinuumIO/anaconda-issues/issues/1429
Then:
If tabula is in pipy, this will probably work.

https://anaconda.org/conda-forge/tabula-py
To install this package with conda run one of the following:
> conda install -c conda-forge tabula-py conda install -c
> conda-forge/label/cf201901 tabula-py

Related

Conda Install Rasterio without Updating Dependencies

I am trying to install Rasterio to read a .tif image in my jupyter notebook. Normally I would just run conda install -c forge rasterio and then y to install the package, but I do not want to update any of the dependencies because if I do, the rest of my code breaks. I've never needed to not update my packages, so I am do not understand how to go about this issue.
Any suggestions?
To prevent existing packages from updating, use the --freeze-installed option.
conda install -c conda-forge rasterio --freeze-installed
conda install -c conda-forge rasterio --no-update-deps
These two commands must work. For further knowledge you can visit the official website

How can I install a particular version of a conda package?

I am trying to install the v2.7.0 of the eccodes package.
But this is not working:
conda install -c conda-forge eccodes=2.7.0
I can see the versions here: https://anaconda.org/conda-forge/eccodes/files?version=2.7.0&page=2
What I am doing wrong with the syntax?
The latest version is 2.13, which I don't want.
If you look through all of the package versions on the first page, you will notice that their label is 'main'. However, the version you want to install has the label 'cf201901'.
You should therefore change your install command to:
conda install -c conda-forge/label/cf201901 eccodes=2.7.0
By including the label in the channel it will be able to find the correct older version.
Alternatively, you can copy the link of the old package directly from the website you were browsing and just run:
conda install https://anaconda.org/conda-forge/eccodes/2.7.0/download/osx-64/eccodes-2.7.0-0.tar.bz2
If you want to install version 2.7.0 for osx

Error installing geopandas:" A GDAL API version must be specified " in Anaconda

This error raised while installing geopandas. I've looking for its solution on the web, but none of them really explain what happened and how to solve it..
This is the full error:
Collecting geopandas
Using cached https://files.pythonhosted.org/packages/24/11/d77c157c16909bd77557d00798b05a5b6615ed60acb5900fbe6a65d35e93/geopandas-0.4.0-py2.py3-none-any.whl
Requirement already satisfied: shapely in c:\users\alvaro\anaconda3\envs\tfdeeplearning\lib\site-packages (from geopandas) (1.6.4.post2)
Requirement already satisfied: pandas in c:\users\alvaro\anaconda3\envs\tfdeeplearning\lib\site-packages (from geopandas) (0.20.3)
Collecting fiona (from geopandas)
Using cached https://files.pythonhosted.org/packages/3a/16/84960540e9fce61d767fd2f0f1d95f4c63e99ab5d8fddc308e8b51b059b8/Fiona-1.8.4.tar.gz
Complete output from command python setup.py egg_info:
A GDAL API version must be specified. Provide a path to gdal-config using a GDAL_CONFIG environment variable or use a GDAL_VERSION environment variable.
----------------------------------------
Command "python setup.py egg_info" failed with error code 1 in C:\Users\Alvaro\AppData\Local\Temp\pip-install-oxgkjg8l\fiona\
pip install wheel
pip install pipwin
pipwin install numpy
pipwin install pandas
pipwin install shapely
pipwin install gdal
pipwin install fiona
pipwin install pyproj
pipwin install six
pipwin install rtree
pipwin install geopandas
here are the source links:
http://geopandas.org/install.html#installation
https://pip.pypa.io/en/latest/user_guide/#installing-from-wheels
https://www.lfd.uci.edu/~gohlke/pythonlibs/#numpy
If you still have problems, consider uninstalling the above (pip uninstall) and reinstalling.
I solved this problem by running the following commands:
pip install pipwin
pipwin install gdal
pipwin install fiona
pip install geopandas
Works successfully on Windows.
Geospatial Data Abstraction Library (GDAL) is a library designed for vector geospatial data formats. It's a prerequisite for installing Fiona, the Python API for OGR (which doesn't really stand for anything), which is in turn a prerequisite for Geopandas. On UNIX-like systems the gdal-config script tells Fiona stuff about your particular gdal installation.
It seems that your gdal-config is not in one of the usual places on your PATH, so Fiona was unable to find it.
If you're using Anaconda, best is to remove gdal with conda remove gdal and then do a fresh conda install geopandas.
As a general rule, if you're using Conda you should never use pip to install something inside it unless you're absolutely sure conda offers no support for it. (Many package can be found on conda by specifying the right channel - -c argument.) And specifically in the case of geopandas, the maintainers recommend using conda over pip, since pip requires you to install the dependencies correctly.
I had a lot of issues myself installing geopandas, mostly showing error when downloading fiona and gdal. I did every step above and did a conda install geopandas but failed. The only thing worked for me is to install fiona and gdal wheel separately.
go to the link by Christoph: gohlke:https://www.lfd.uci.edu/~gohlke/pythonlibs/#fiona
You can search for fiona and gdal wheel files. Make sure you choose the file as per your python version, if it is 3.7 then there would be cp37.
Download the file
go to command prompt, put cd and then pip install , install GDAL wheel file, then fiona, then just do pip install geopandas.
This solution worked for me.
To install gdal, I followed the following steps:
downloaded the version that satisfies my computer (64 bit) from
https://www.lfd.uci.edu/~gohlke/pythonlibs/ . The file was GDAL-3.1.4-cp37-cp37m-win_amd64.whl
Put the file in a folder on the desktop.
From cmd, i moved to that directory and executed python -m pip install GDAL-3.1.4-cp37-cp37m-win_amd64.whl
This is followed by installing fiona the same way: python -m pip install Fiona-1.8.18-cp37-cp37m-win_amd64.whl
For shapely, i executed conda install -c conda-forge shapely
After that, i was able to install keplergl as usual: pip install keplergl
install descartes: conda install -c conda-forge descartes (or python -m pip install descartes).
In this way, i didn't have to play around with the 'Environmental Variables' as this may affect other programs
Cheers..
Installing geopandas
Geopandas has very complex multi-language dependencies, some of which need to be built with consistent compiler versions across packages. Because of this, the geopandas docs recommend installing using conda in a new environment using conda-forge only. Here are some general best practices to keep in mind:
conda is the recommended installation method. You can install geopandas from pip or source, but it's going to be a bumpy ride and it's not recommended. If you're installing conda for the first time, I recommend you start with miniconda (or better yet miniforge, a conda-forge-first miniconda variant), not anaconda, to keep your base env lean.
When using conda, you should not mix and match conda channels.
When installing geopandas, try creating a fresh environment rather than installing into your base environment. If you have anaconda installed, it comes with a large number of packages from the "defaults" channel installed in your base environment. I recommend deleting anaconda and installing miniconda, then installing into a new environment.
Try to create a new environment with everything you plan to use all at once rather than iteratively modifying the environment. In other words, if you want to use geopandas with scikit_learn, folium, and rasterio, install them together with a single conda create command
As a last resort, delete your conda installation and re-install miniconda. Desperate times call for desperate measures, and this usually resolves gnarly installation nightmares.
To create a fresh conda environment in which you install all necessary dependencies at the same time, using the conda-forge channel:
conda create -n my-geopandas-env -c conda-forge geopandas [all other packages you need]
For example, I might set up an environment with something along the lines of...
conda create -n my-geopandas-env -c conda-forge python=3.9 \
ipython ipykernel geopandas scipy seaborn fiona matplotlib cartopy
Bundling your installations into a single environment creation step like this reduces the chance of packages falling out of sync. To speed this process up, you could first install mamba or mambaforge, a faster drop-in replacement for conda, into your base environment and then run the above commands with mamba instead of conda.
Generally, it's best to avoid installing much of anything in your base environment (cross-environment system utilities like mamba are some of the few exceptions). If you already have a complex base environment (maybe you started with anaconda rather than miniconda) this may be the time to delete your entire conda installation and start from scratch (I know that's terrifying... sorry! but it'll save you heartache in the future). mamba is great for speeding this process up.
Connecting your editor to the conda environment
Once you have installed all of the packages you need, activate your environment with conda activate my-geopandas-env. See the conda guide to managing environments for more info.
Jupyter/ipython
Some editors/IDEs including jupyter require additional packages - jupyter requires that ipython and ipykernel be installed in order to load the environment within the notebook or editor - that's why I included ipykernel in my list above. See the ipykernel docs for more info.
Other IDES
To link this environment to an IDE such as VSCODE, spider, etc., find the location of this python version with conda run -n my-geopandas-env which python then point your editor to this python executable. Check the docs of your specific editor to get more targeted info about how to set up a conda environment for use with your editor:
Spider: FAQ on using an existing environment and Spider wiki guide to working with packages and environments
VSCode: Using python environments in vscode
PyCharm: Configure a conda virtual environment
I don't have conda installed, then using just pip I followed these steps:
Download GDAL and Fiona wheels directly on:
GDAL: https://www.lfd.uci.edu/~gohlke/pythonlibs/#gdal
FIONA: https://www.lfd.uci.edu/~gohlke/pythonlibs/#fiona
Then:
pip install <gdal.whl>
pip install <fiona.whl>
In my case I did pip install GDAL-3.4.1-cp38-cp38-win_amd64.whl and Fiona-1.8.21-cp38-cp38-win_amd64.whl. Where cp38 stands for python 3.8.
After that you are able to install geopandas with pip as well.
pip install geo pandas
For me, the only solution was to install the ready binaries from here
https://www.lfd.uci.edu/~gohlke/pythonlibs/#gdal
Then just install locally
pip install GDAL-3.1.4-cp38-cp38-win_amd64.whl
One way in which I could install geopandas was through the Anaconda Navigator. Get into the environment and install the package 'geopandas'. After that I could import the geopandas package in spyder
I will add
!pip install descartes
to #JDOaktown list.
I started with pip install geopandas and got the error, but later tried with conda install --channel conda-forge geopandas and the error disappeared.
Successfully installed in RHEL 7.8.
It automatically downloaded the required packages. This might be helpful
Installing collected packages: certifi, pyproj, shapely, attrs, click, click-plugins, munch, cligj, fiona, geopandas
Successfully installed attrs-20.3.0 certifi-2020.11.8 click-7.1.2 click-plugins-1.1.1 cligj-0.7.0 fiona-1.8.17 geopandas-0.8.1 munch-2.5.0 pyproj-3.0.0.post1 shapely-1.7.1
If you want to install GDAL, Geopandas, Shapely, Fiona etc in a windows Virtual Environment download .whl files for all of them and first install GDAL using
pip install gdal-.whl
Following this command edit the activate.bat file in you venv\Scripts folder and add
GDAL_CONFIG = \venv\Lib\site-packages\osgeo
Then you can install rest using pip install
I started off with a clean environment gdal_test in Conda environments, but made the mistake of using the old activate gdal_test instead of conda activate gdal_test. This made Conda Environment resolving take forever, which is why I resolved to other methods at first.
Takeaway: let conda handle it, with a proper new environment.
I ran into this problem not with anaconda/windows, but with python:3.6 Docker image. Google search always led me to this question, so I think I will share how I resolve my issue in case others also end up here.
Based on here, you need to install system relevant packages in the Dockerfile before running pip install geopandas or pip install requirements.txt:
RUN apt-get update && apt-get install -y --no-install-recommends \
build-essential \
libatlas-base-dev \
libgdal-dev \
gfortran
The following worked on macOS:
brew install gdal --HEAD
Verify the installation by running gdal-config --version
Following that pip installation as normal worked without a problem.

How to install PyTables 2.3.1 with Anaconda, missing HDF5 library

I need to run an older verion of PyTables, that is 2.3.1, in and Anaconda environment on Linux. But I cannot install it.
conda install -n myenv pytables=2.3.1
fails finding the appropriate version.
conda install -n myenv pytables=2
installs PyTables 2.4.0 successfully. But I need 2.3.1.
Also activating the environment and installing via pip does not work.
pip install tables==2.3.1
fails with the following error:
.. ERROR:: Could not find a local HDF5 installation.
You may need to explicitly state where your local HDF5 headers and
library can be found by setting the HDF5_DIR environment
variable or by using the --hdf5 command-line option.
Where can I find the HDF5 installation of Anaconda? And how do I pass the --hdf5 option to pip? I already tried
pip install tables==2.3.1 --install-option="--hdf5=/home/me/Programme/anaconda"
But it also fails with the same error as above.
You can try
env HDF5_DIR="/home/me/Programme/anaconda" pip install tables==2.3.1
It worked for me.
I was trying to install a completely different package with pip on a new conda environment when I got the same error.
conda install -c conda-forge pytables
This helped me to get rid of the error and successfully install the package.

Installing pydot and graphviz packages in Anaconda environment

I want to be able to create graphical decision trees in Python, and I am currently trying to install both pydot and graphviz.
I am using Anaconda as my environment (along with Spyder), and have tried to run the following lines of code
conda install -c https://conda.binstar.org/t/TOKEN/j14r pydot
with the result
Error: unknown host: http://repo.continuum.io/pkgs/pro/win-32/
Error: unknown host: http://repo.continuum.io/pkgs/free/win-32/
Error: unknown host: https://conda.binstar.org/t/TOKEN/j14r/win-32/
Error: No packages found matching: pydot
I have also tried using pip install pydot and pip install graphviz with similar results:
Downloading/unpacking pydot
Cannot fetch index base URL https://pypi.python.org/simple/
Could not find any downloads that satisfy the requirement pydot
Cleaning up...
No distributions at all found for pydot
Storing complete log in [...]
I am getting pretty bored at trying to figure out how to go about this, so I was hoping anyone out there could give me some tips perhaps.
Thanks
I had the same issue and solved it by (order is important):
Installing graphviz, simply via sudo apt-get install graphviz
Installing graphviz for Python via conda sudo ~/anaconda2/bin/conda install graphviz
Finally, by installing pydot using conda sudo ~/anaconda2/bin/conda install pydot
This answer is overdue but this post helped me (together with this one that mentions the installation order), so hopefully this answer will help someone else. I'm using Ubuntu 14.04 and Python 2.7.
P.S. apparently, there could be some issues with step 1 of the above algorithm, this post mentions how to fix them.
the problem is solved for me by installing
pydot conda install -c anaconda pydot
graphviz conda install -c conda-forge python-graphviz
pip install pydotplus
conda install -c anaconda graphviz=2.38.0
(see here for latest versions https://anaconda.org/anaconda/graphviz)
worked for me.
pip install pydot should now install version 1.2.3 from PyPI. Since the time of the OP, a distribution for pydot has been uploaded to PyPI.
Please see if this works for you...
1) Open the "Anaconda Prompt" by simply pressing WINDOW + S (for Windows OS Users) and type CMD. Then Select accordingly.
2) Type the command "pip install pydot"
3) Follow the onscreen information.
NOTE: I'm using Conda version 4.3.21 and Python 3.6
And Decision Tree Implementation below:
Graphical Visualization of the Decision Tree
installing graphviz first and then installing pydotplus on mac, helped me. I was not able to install pydot through pip or conda or even through jupiter notebook.
after installing, imported pydotplus(instead of usual pydot)

Categories