Useful OCR commands (Ubuntu)
Prepare
APT GET
sudo apt-get install ocrfeeder
sudo apt-get install tesseract-ocr
sudo apt-get install imagemagick ghostscript poppler-utils
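The commands further down rely on mogrify and convert (ImageMagick), ps2pdf (Ghostscript) and pdfimages (poppler-utils) in addition to tesseract. A minimal sketch for checking that they are all on the PATH before starting; missing_tools is a made-up helper name, not part of any of these packages.

```shell
# Print every command from the argument list that is not on the PATH.
# missing_tools is a hypothetical helper name, not part of any package.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Usage: lists whatever still needs installing (prints nothing if all are present).
# missing_tools tesseract mogrify convert ps2pdf pdfimages
```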
Cropping examples
mogrify -format jpg -crop 3778x2002+2070+124 -quality 100 +repage *.png
convert *.jpg -crop 3778x2002+2070+124 +repage -quality 100 x%d.jpg
mogrify -format jpg -crop 627x985 -quality 100 +repage *.jpg
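The -crop argument has the form WIDTHxHEIGHT+X+Y, where X and Y are the offsets of the top-left corner of the area to keep. Leaving the offset out, as in the 627x985 example, cuts the image into tiles of that size instead. A small sketch for building the geometry string from two corner points read off in an image viewer; crop_geometry is a hypothetical helper name.

```shell
# Build an ImageMagick crop geometry (WIDTHxHEIGHT+X+Y) from the
# top-left (x1,y1) and bottom-right (x2,y2) corners of the wanted area.
# crop_geometry is a hypothetical helper, not an ImageMagick command.
crop_geometry() {
    x1=$1; y1=$2; x2=$3; y2=$4
    printf '%dx%d+%d+%d\n' "$((x2 - x1))" "$((y2 - y1))" "$x1" "$y1"
}

# Usage (writes the cropped copies into ./cropped, leaving the originals alone):
# mkdir -p cropped
# mogrify -path cropped -format jpg -crop "$(crop_geometry 2070 124 5848 2126)" -quality 100 +repage *.png
```

With the corners 2070,124 and 5848,2126 the helper prints 3778x2002+2070+124, the geometry used in the examples above. The -path option makes mogrify write into another directory rather than overwrite the input files.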
Resize examples
mogrify -resize 300% -format png *.jpg
mogrify -format jpg -quality 60 -resize 50% *.bmp
Image quality adjustment
mogrify -brightness-contrast -50x+80 jackfaust_swanwick185.png
OCR
Single page
tesseract book_page1.png stdout -l eng
tesseract book_page2.png outtext -l eng
OCR script
for i in *.png; do tesseract "$i" "${i%.png}" -l eng; done
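For larger batches it can help to make the loop restartable. The following is only a sketch under a few assumptions: ocr_dir is a hypothetical helper name, pages are PNG files in one directory, and a page counts as done once a non-empty text file with the same base name exists. tesseract adds the .txt extension to the output base on its own, so the base is passed without it.

```shell
# OCR every PNG in a directory, one text file per page.
# ocr_dir is a hypothetical helper; the language defaults to eng.
ocr_dir() {
    dir=${1:-.}
    lang=${2:-eng}
    for img in "$dir"/*.png; do
        [ -e "$img" ] || continue          # no PNGs: the glob stays literal
        base=${img%.png}                   # tesseract appends .txt itself
        [ -s "$base.txt" ] && continue     # skip pages that already have text
        tesseract "$img" "$base" -l "$lang"
    done
}

# Usage:
# ocr_dir .            # current directory, English
# ocr_dir scans deu    # directory "scans", German (needs tesseract-ocr-deu)
```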
Other commands
PS to PDF
ps2pdf directory.ps
Images to PDF
convert *.jpg book.pdf
PDF to images
pdfimages -j /home/amarcinkowski/Documents/SampleWithImages.pdf /home/amarcinkowski/Documents/ExtractedImages/image
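The extraction step can be chained with tesseract to get text out of a scanned PDF. A rough sketch, assuming the PDF holds one embedded image per page and that the installed pdfimages supports the -png option; pdf_to_text is a hypothetical helper name.

```shell
# Extract the embedded images of a scanned PDF and OCR them in order.
# pdf_to_text is a hypothetical helper; it prints the recognised text to stdout.
pdf_to_text() {
    pdf=$1
    lang=${2:-eng}
    work=$(mktemp -d) || return 1
    pdfimages -png "$pdf" "$work/page"     # writes page-000.png, page-001.png, ...
    for img in "$work"/page-*.png; do
        [ -e "$img" ] || continue          # nothing extracted: skip the literal glob
        tesseract "$img" stdout -l "$lang"
    done
    rm -rf "$work"
}

# Usage:
# pdf_to_text SampleWithImages.pdf > SampleWithImages.txt
```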