How to convert PDFs to images and extract text on Linux

On Linux, poppler is a package for working with PDF files.
poppler is based on Xpdf.

Installing poppler

dnf -y install poppler poppler-utils

After installation, the following commands are available:
pdffonts
pdfimages
pdfinfo
pdftohtml
pdftops
pdftotext
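
To confirm that the install worked, the commands listed above can be looked up on PATH. This is a minimal sketch; the function name check_poppler_tools is made up for illustration and is not part of poppler.

```shell
#!/bin/sh
# Hypothetical helper: report which of the poppler-utils commands
# listed above can be found on PATH. Prints one line per command.
check_poppler_tools() {
    for tool in pdffonts pdfimages pdfinfo pdftohtml pdftops pdftotext; do
        if command -v "$tool" >/dev/null 2>&1; then
            echo "found:   $tool"
        else
            echo "missing: $tool"
        fi
    done
}

check_poppler_tools
```

If any line says "missing", the poppler-utils package is probably not installed, or its directory is not on PATH.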

Usage

pdftotext [PDF file] [output file]

For text extraction, adding the -raw option (which keeps strings in content stream order) tends to give better results.

pdftotext -raw hoge.pdf hoge.txt
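
The same command can be applied to a whole directory. The following is a rough sketch, not a definitive script: extract_all is a hypothetical helper name, and it assumes each output file should sit next to its PDF with a .txt extension.

```shell
#!/bin/sh
# Hypothetical helper: run "pdftotext -raw" on every *.pdf in a directory
# (default: current directory), writing NAME.txt next to NAME.pdf.
extract_all() {
    dir=${1:-.}
    for pdf in "$dir"/*.pdf; do
        [ -e "$pdf" ] || continue      # glob matched nothing
        txt="${pdf%.pdf}.txt"
        if pdftotext -raw "$pdf" "$txt"; then
            echo "ok: $txt"
        else
            echo "failed: $pdf" >&2    # keep going with the next file
        fi
    done
}
```

For example, extract_all ~/docs would process every PDF in ~/docs; files that fail to convert are reported on stderr and skipped.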

help

$ pdftotext -h
pdftotext version 20.11.0
Copyright 2005-2020 The Poppler Developers - http://poppler.freedesktop.org
Copyright 1996-2011 Glyph & Cog, LLC
Usage: pdftotext [options] <PDF-file> [<text-file>]
  -f <int>             : first page to convert
  -l <int>             : last page to convert
  -r <fp>              : resolution, in DPI (default is 72)
  -x <int>             : x-coordinate of the crop area top left corner
  -y <int>             : y-coordinate of the crop area top left corner
  -W <int>             : width of crop area in pixels (default is 0)
  -H <int>             : height of crop area in pixels (default is 0)
  -layout              : maintain original physical layout
  -fixed <fp>          : assume fixed-pitch (or tabular) text
  -raw                 : keep strings in content stream order
  -nodiag              : discard diagonal text
  -htmlmeta            : generate a simple HTML file, including the meta information
  -enc <string>        : output text encoding name
  -listenc             : list available encodings
  -eol <string>        : output end-of-line convention (unix, dos, or mac)
  -nopgbrk             : don't insert page breaks between pages
  -bbox                : output bounding box for each word and page size to html.  Sets -htmlmeta
  -bbox-layout         : like -bbox but with extra layout bounding box data.  Sets -htmlmeta
  -opw <string>        : owner password (for encrypted files)
  -upw <string>        : user password (for encrypted files)
  -q                   : don't print any messages or errors
  -v                   : print copyright and version info
  -h                   : print usage information
  -help                : print usage information
  --help               : print usage information
  -?                   : print usage information
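
As an illustration of the options above, -f and -l limit the page range and -layout keeps the physical layout. Passing "-" as the text file sends the output to stdout. This is a sketch only; extract_pages is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: print pages FIRST..LAST of a PDF to stdout,
# keeping the original physical layout.
# Usage: extract_pages FIRST LAST FILE.pdf
extract_pages() {
    if [ "$#" -ne 3 ]; then
        echo "usage: extract_pages FIRST LAST FILE.pdf" >&2
        return 2
    fi
    # "-" as the output file tells pdftotext to write to stdout.
    pdftotext -f "$1" -l "$2" -layout "$3" -
}
```

For example, extract_pages 2 5 hoge.pdf | less would page through pages 2 to 5 of hoge.pdf without creating a file.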
