Detecting text block position and size in tesseract's command-line mode

Tesseract OCR has a command-line interface that can recognize the text in an image, controlled by a few parameters.

The input arguments are imagename (the path to the image), outputbase (the base name of the file that receives the recognized text) and the -psm pagesegmode parameter.

pagesegmode values are:
0 = Orientation and script detection (OSD) only.
1 = Automatic page segmentation with OSD.
2 = Automatic page segmentation, but no OSD, or OCR.
3 = Fully automatic page segmentation, but no OSD. (Default)
4 = Assume a single column of text of variable sizes.
5 = Assume a single uniform block of vertically aligned text.
6 = Assume a single uniform block of text.
7 = Treat the image as a single text line.
8 = Treat the image as a single word.
9 = Treat the image as a single word in a circle.
10 = Treat the image as a single character.
-l lang and/or -psm pagesegmode must occur before any configfile.
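To make the argument order concrete, here is a minimal Python sketch that assembles such a command line. The helper names build_tesseract_command and run_tesseract, and the file names in the usage example, are hypothetical and chosen only for illustration. The sketch assumes a tesseract binary on the PATH that accepts the single-dash -psm spelling quoted above; newer Tesseract releases spell the option --psm, so adjust accordingly.

```python
import subprocess

# Range of pagesegmode values listed in the usage text above.
PSM_MIN, PSM_MAX = 0, 10


def build_tesseract_command(imagename, outputbase, psm=3, lang=None, configfiles=()):
    """Return the argument list for one tesseract invocation.

    -l and -psm are placed before any configfile, as the usage text requires.
    """
    if not PSM_MIN <= psm <= PSM_MAX:
        raise ValueError(f"pagesegmode must be between {PSM_MIN} and {PSM_MAX}, got {psm}")
    cmd = ["tesseract", imagename, outputbase]
    if lang is not None:
        cmd += ["-l", lang]
    cmd += ["-psm", str(psm)]  # assumption: older single-dash spelling
    cmd += list(configfiles)   # config files always come last
    return cmd


def run_tesseract(imagename, outputbase, **options):
    """Run tesseract; requires the binary to be installed (not checked here)."""
    subprocess.run(build_tesseract_command(imagename, outputbase, **options), check=True)


if __name__ == "__main__":
    # Hypothetical file names: treat page.png as a single uniform block of text.
    print(" ".join(build_tesseract_command("page.png", "page_text", psm=6, lang="eng")))
```

Building the command as a list rather than a shell string avoids quoting problems when the image path contains spaces.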

But is it possible to write the positions and sizes of the recognized text blocks to a specific file, or is that internal-only information?

Source

2012-01-22 Ivan Kochurkin