微信公众号搜"智元新知"关注
微信扫一扫可直接关注哦!

无法获取页数 poppler 是否已安装并在 PATH 中?

如何解决无法获取页数 poppler 是否已安装并在 PATH 中?

我一直在尝试执行这部分代码,但错误不断弹出。我已经将 poppler 添加到 evn 路径中。还有什么我可以做的吗?

代码

!pip install pdf2image
!pip install opencv-python
!pip install PILLOW
!pip install PyTesseract
!pip install poppler-utils
from pdf2image import convert_from_path
import cv2
from PIL import Image
import PyTesseract

from pdf2image import convert_from_bytes 
pdfs = r"C:\Users\sreeh\OneDrive\Desktop\OCR\Invoice/pdf"
# (Above statements executed well)

pages = convert_from_path('Invoice.pdf',350)
Error after execution of thia statement:-
FileNotFoundError                         Traceback (most recent call last)
~\anaconda3\lib\site-packages\pdf2image\pdf2image.py in pdfinfo_from_path(pdf_path,userpw,poppler_path,rawdates,timeout)
    444             env["LD_LIBRARY_PATH"] = poppler_path + ":" + env.get("LD_LIBRARY_PATH","")
--> 445         proc = Popen(command,env=env,stdout=PIPE,stderr=PIPE)
    446 

~\anaconda3\lib\subprocess.py in __init__(self,args,bufsize,executable,stdin,stdout,stderr,preexec_fn,close_fds,shell,cwd,env,universal_newlines,startupinfo,creationflags,restore_signals,start_new_session,pass_fds,encoding,errors,text)
    799                                 errread,errwrite,--> 800                                 restore_signals,start_new_session)
    801         except:

~\anaconda3\lib\subprocess.py in _execute_child(self,p2cread,p2cwrite,c2pread,c2pwrite,errread,unused_restore_signals,unused_start_new_session)
   1206                                          os.fspath(cwd) if cwd is not None else None,-> 1207                                          startupinfo)
   1208             finally:

FileNotFoundError: [WinError 2] The system cannot find the file specified

During handling of the above exception,another exception occurred:

PDFInfonotinstalledError                  Traceback (most recent call last)
<ipython-input-6-6623e242222a> in <module>
----> 1 pages = convert_from_path('Invoice.pdf',poppler_path = 'C:\Program Files\poppler-0.68.0\bin')
      2 #convert_from_bytes(open('Invoice.pdf').read())

~\anaconda3\lib\site-packages\pdf2image\pdf2image.py in convert_from_path(pdf_path,dpi,output_folder,first_page,last_page,fmt,jpegopt,thread_count,use_cropBox,strict,transparent,single_file,output_file,grayscale,size,paths_only,use_pdftocairo,timeout)
     95         poppler_path = poppler_path.as_posix()
     96 
---> 97     page_count = pdfinfo_from_path(pdf_path,poppler_path=poppler_path)["Pages"]
     98 
     99     # We start by getting the output format,the buffer processing function and if we need pdftocairo

~\anaconda3\lib\site-packages\pdf2image\pdf2image.py in pdfinfo_from_path(pdf_path,timeout)
    470     except OSError:
    471         raise PDFInfonotinstalledError(
--> 472             "Unable to get page count. Is poppler installed and in PATH?"
    473         )
    474     except ValueError:

PDFInfonotinstalledError: Unable to get page count. Is poppler installed and in PATH?

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。

相关推荐


Selenium Web驱动程序和Java。元素在(x,y)点处不可单击。其他元素将获得点击?
Python-如何使用点“。” 访问字典成员?
Java 字符串是不可变的。到底是什么意思?
Java中的“ final”关键字如何工作?(我仍然可以修改对象。)
“loop:”在Java代码中。这是什么,为什么要编译?
java.lang.ClassNotFoundException:sun.jdbc.odbc.JdbcOdbcDriver发生异常。为什么?
这是用Java进行XML解析的最佳库。
Java的PriorityQueue的内置迭代器不会以任何特定顺序遍历数据结构。为什么?
如何在Java中聆听按键时移动图像。
Java“Program to an interface”。这是什么意思?