
Fatal Error In Ghostscript When Processing Multiple Files

Python: 3.7.5. OS: Windows Server 2016. Ghostscript version: 9.5. I am trying to do text extraction using Ghostscript for multiple PDFs in a directory. The directory currently contains

Solution 1:

I ran into the same issue today and found that ghostscript.Ghostscript is supposed to be used in a with block. In addition, before creating a new instance of ghostscript.Ghostscript, I had to call ghostscript.cleanup().

Try this:

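import locale
import os
import sys

import ghostscript

def pdf2txt(directory, file):
    # The first entry stands in for the program name; Ghostscript ignores it.
    args = [
        file,
        "-dBATCH", "-dNOPAUSE", "-dNOPROMPT",
        "-sDEVICE=txtwrite",
        # %d is replaced by the page number, so each page gets its own text file.
        "-sOutputFile=" + os.path.join(directory, "output", file + "-%d.txt"),
        os.path.join(directory, file),
    ]
    encoding = locale.getpreferredencoding()
    args = [a.encode(encoding) for a in args]
    print(args)
    with ghostscript.Ghostscript(*args) as g:
        # Discard the module's shared instance so the next file starts fresh.
        ghostscript.cleanup()

directory = sys.argv[1]

for file in os.listdir(directory):
    # Skip the output folder and anything else that is not a PDF.
    if not file.lower().endswith(".pdf"):
        continue
    print("Trying " + os.path.join(directory, file))
    pdf2txt(directory, file)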
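
If it helps with debugging, the argument building and the file selection can be pulled out of the Ghostscript call so they can be checked on their own. This is only a sketch: build_gs_args and list_pdfs are hypothetical helper names, not part of the ghostscript package, and the "output" sub-folder layout is assumed from the script above.

```python
import locale
import os


def build_gs_args(directory, filename):
    """Return the encoded txtwrite argument list for one PDF (hypothetical helper)."""
    out_dir = os.path.join(directory, "output")
    # Make sure the output folder exists before Ghostscript writes to it.
    os.makedirs(out_dir, exist_ok=True)
    args = [
        "gs",  # placeholder for the program name; Ghostscript ignores args[0]
        "-dBATCH", "-dNOPAUSE", "-dNOPROMPT",
        "-sDEVICE=txtwrite",
        "-sOutputFile=" + os.path.join(out_dir, filename + "-%d.txt"),
        os.path.join(directory, filename),
    ]
    encoding = locale.getpreferredencoding()
    return [a.encode(encoding) for a in args]


def list_pdfs(directory):
    """Return the PDF file names in directory, sorted, skipping sub-folders."""
    return sorted(
        name for name in os.listdir(directory)
        if name.lower().endswith(".pdf")
        and os.path.isfile(os.path.join(directory, name))
    )
```

With these in place, the loop could pass build_gs_args(directory, name) to ghostscript.Ghostscript(*args) for each name in list_pdfs(directory), keeping the with block and the cleanup() call as they are above.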
