I suggest you check out 'tabula' (it works with PDFs; you should verify whether it also works with images). You can install the Python wrapper with either of these:
- conda install -c conda-forge tabula-py
- pip install tabula-py
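Once it is installed, something like the sketch below should get you started. The `extract_tables` helper and its extension check are my own illustration, not part of tabula-py; the only library call used is `tabula.read_pdf`, which returns a list of pandas DataFrames. Note that tabula-py needs a Java runtime on the machine, and I'm assuming here that non-PDF inputs (PNG, JPG, ...) should simply be rejected until you have checked how it behaves with them.

```python
from pathlib import Path


def extract_tables(pdf_path, pages="all"):
    """Return a list of pandas DataFrames, one per table tabula detects.

    Hypothetical helper for illustration; only tabula.read_pdf is a real
    tabula-py call.
    """
    path = Path(pdf_path)
    if path.suffix.lower() != ".pdf":
        # Assumption: image files are refused up front rather than passed on.
        raise ValueError(f"expected a .pdf file, got: {path.name}")

    # Imported lazily so the check above works even without tabula-py/Java.
    import tabula

    return tabula.read_pdf(str(path), pages=pages, multiple_tables=True)
```

You would call it as `tables = extract_tables("report.pdf")` (the file name is a placeholder) and then inspect each DataFrame, e.g. `tables[0].head()`.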