ps2ascii - Ghostscript translator from PostScript or PDF to ASCII
Synopsis
Description
Files
See Also
Version
Author
Attributes
Notes
ps2ascii [ input.ps [ output.txt ] ]
ps2ascii input.pdf [ output.txt ]
ps2ascii uses gs(1) to extract ASCII text from PostScript(tm) or Adobe Portable Document Format (PDF) files. If no files are specified on the command line, gs reads from standard input; but PDF input must come from an explicitly-named file, not standard input. If no output file is specified, the ASCII text is written to standard output.ps2ascii doesnt look at font encoding, and isnt very good at dealing with kerning, so for PostScript (but not currently PDF), you might consider pstotext (see below).
Run "gs -h" to find the location of Ghostscript documentation on your system, from which you can get more details.
pstotext(1), http://www.research.digital.com/SRC/virtualpaper/pstotext.html
This document was last revised for Ghostscript version 8.15.
artofcode LLC and Artifex Software, bug-gs at ghostscript.com, are the primary maintainers of Ghostscript. David M. Jones <dmjones@theory.lcs.mit.edu> made subtantial improvements to ps2ascii.
See attributes(5) for descriptions of the following attributes:
ATTRIBUTE TYPE ATTRIBUTE VALUE Availability SUNWgscr Interface Stability Volatile
Source for ESP gs is available on http://opensolaris.org.
| 8.15 | PS2ASCII (1) | 22 September 2004 |