Fatal Error In Ghostscript When Processing Multiple Files
Python 3.7.5
OS: Windows Server 2016
Ghostscript version: 9.5
I am trying to extract text from multiple PDFs in a directory using Ghostscript. The directory currently contains
Solution 1:
I ran into the same issue today and found that ghostscript.Ghostscript is supposed to be used in a with block. In addition, before creating a new instance of ghostscript.Ghostscript, I have to call ghostscript.cleanup().
Try this:
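import locale
import os
import sys

import ghostscript


def pdf2txt(directory, file):
    # args[0] stands in for the program name; Ghostscript ignores it.
    args = [file, "-dBATCH", "-dNOPAUSE", "-dNOPROMPT", "-sDEVICE=txtwrite",
            "-sOutputFile=" + directory + "\\output\\" + file + "-%d.txt",
            directory + "\\" + file]
    # Encode the arguments to bytes before passing them to the bindings.
    encoding = locale.getpreferredencoding()
    args = [a.encode(encoding) for a in args]
    print(args)
    with ghostscript.Ghostscript(*args) as g:
        # Release the shared instance so the next call can create a new one.
        ghostscript.cleanup()


# The directory to process is taken from the first command-line argument.
directory = sys.argv[1]
files = os.listdir(directory)
for file in files:
    print("Trying " + directory + "\\" + file)
    pdf2txt(directory, file)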
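One more thing that might help: the loop above hands every entry in the directory to Ghostscript, including anything that is not a PDF, and builds the paths by string concatenation. Below is a minimal sketch of how the file list and the argument list could be prepared separately. The helper names list_pdfs and build_gs_args and the out_subdir parameter are my own, not part of the ghostscript package, and the sketch assumes the output subfolder already exists. It returns str arguments, so encode them as in the answer if your version of the bindings expects bytes.

```python
import os


def list_pdfs(directory):
    """Return the names of the PDF files in `directory`, sorted."""
    return sorted(
        name for name in os.listdir(directory)
        if name.lower().endswith(".pdf")
        and os.path.isfile(os.path.join(directory, name))
    )


def build_gs_args(directory, filename, out_subdir="output"):
    """Build the txtwrite argument list for one PDF (hypothetical helper)."""
    stem = os.path.splitext(filename)[0]
    # %d is expanded by Ghostscript to the page number.
    out_pattern = os.path.join(directory, out_subdir, stem + "-%d.txt")
    return [
        "gs",  # placeholder program name; Ghostscript ignores the first item
        "-dBATCH", "-dNOPAUSE", "-dNOPROMPT",
        "-sDEVICE=txtwrite",
        "-sOutputFile=" + out_pattern,
        os.path.join(directory, filename),
    ]
```

You would then loop over list_pdfs(directory) and pass build_gs_args(directory, name) to ghostscript.Ghostscript in the with block, calling ghostscript.cleanup() for each file as described in the answer.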