textractor


It's not my app; I just compiled it. I only include the aarch64 versions of leptonica and tesseract and their respective devel packages, because packages for the other two Sailfish architectures are available here: http://repo.merproject.org/obs/home:/skvark/. Here is the description from GitHub:

Textractor

Textractor is an OCR application for Sailfish OS. Main features:

OCR can be run on:

  • an image taken with the app
  • an image selected from the device
  • a PDF file (one or multiple pages)

Cropping is supported in any reasonable quadrilateral arrangement, and perspective correction is applied to the selection. The user has access to advanced image preprocessing settings.

Found text can be edited or copied to the clipboard. As SFOS is a true multitasking OS, the whole OCR process can run in the background while the user uses the device for other purposes.

sources: https://github.com/skvark/Textractor

My fix for a build error where a header file was referenced in camel case although its filename is lowercase: https://github.com/smatkovi/Textractor

Application versions: 
Attachment                                       Size       Date
tesseract-ocr-3.05.00-1.164.1.aarch64.rpm        1.05 MB    28/01/2023 - 14:23
leptonica-devel-1.74.1-1.2.1.aarch64.rpm         1008.9 KB  28/01/2023 - 14:23
leptonica-1.74.1-1.2.1.aarch64.rpm               854.17 KB  28/01/2023 - 14:23
tesseract-ocr-vie-3.05.00-1.164.1.aarch64.rpm    1.55 MB    30/01/2023 - 18:15
tesseract-ocr-ukr-3.05.00-1.164.1.aarch64.rpm    685.43 KB  30/01/2023 - 18:15
tesseract-ocr-tur-3.05.00-1.164.1.aarch64.rpm    2.5 MB     30/01/2023 - 18:15
tesseract-ocr-tha-3.05.00-1.164.1.aarch64.rpm    2.7 MB     30/01/2023 - 18:15
tesseract-ocr-tgl-3.05.00-1.164.1.aarch64.rpm    1.06 MB    30/01/2023 - 18:15
tesseract-ocr-tel-3.05.00-1.164.1.aarch64.rpm    4.14 MB    30/01/2023 - 18:15
tesseract-ocr-tam-3.05.00-1.164.1.aarch64.rpm    2.49 MB    30/01/2023 - 18:15
tesseract-ocr-swe-3.05.00-1.164.1.aarch64.rpm    1.72 MB    30/01/2023 - 18:15
tesseract-ocr-swa-3.05.00-1.164.1.aarch64.rpm    551.99 KB  30/01/2023 - 18:15
tesseract-ocr-srp-3.05.00-1.164.1.aarch64.rpm    1.25 MB    30/01/2023 - 18:15
tesseract-ocr-sqi-3.05.00-1.164.1.aarch64.rpm    1.17 MB    30/01/2023 - 18:15
tesseract-ocr-spa-3.05.00-1.164.1.aarch64.rpm    650.43 KB  30/01/2023 - 18:15
tesseract-ocr-slv-3.05.00-1.164.1.aarch64.rpm    1.14 MB    30/01/2023 - 18:15
tesseract-ocr-slk-3.05.00-1.164.1.aarch64.rpm    1.55 MB    30/01/2023 - 18:15
tesseract-ocr-rus-3.05.00-1.164.1.aarch64.rpm    4.16 MB    30/01/2023 - 18:15
tesseract-ocr-ron-3.05.00-1.164.1.aarch64.rpm    675.43 KB  30/01/2023 - 18:15
tesseract-ocr-por-3.05.00-1.164.1.aarch64.rpm    670.58 KB  30/01/2023 - 18:15
tesseract-ocr-pol-3.05.00-1.164.1.aarch64.rpm    4.92 MB    30/01/2023 - 18:15
tesseract-ocr-nor-3.05.00-1.164.1.aarch64.rpm    1.54 MB    30/01/2023 - 18:15
tesseract-ocr-nld-3.05.00-1.164.1.aarch64.rpm    821.74 KB  30/01/2023 - 18:15
tesseract-ocr-devel-3.05.00-1.164.1.aarch64.rpm  1.19 MB    29/01/2023 - 03:57
harbour-textractor-0.7-0.armv7hl.rpm             5.24 MB    06/02/2023 - 19:38
harbour-textractor-0.7-0.i486.rpm                5.9 MB     06/02/2023 - 19:42
harbour-textractor-0.7-0.aarch64.rpm             5.74 MB    06/02/2023 - 19:44
Changelog: 

I forgot to upload tesseract-ocr-devel, so I have uploaded it now.

Comments

Seven_of_nine's picture

Endless spinning wheel on installation on a Volla phone. Error message:
Nothing provides 'tesseract-language >= 3.02', which is needed by the to be installed tesseract-ocr.3.05.00-1.164.1.aarch64
(exactly these words, in German apart from the English fragment)

Pkcon reports: Poppler already installed.

What can I try next?

Pohli's picture

Got the same problem on Xperia 10 III, SFOS 4.5.0.24, Storeman 0.3.7:

Nothing provides 'tesseract-language >= 3.02' needed by the to be installed tesseract-ocr-3.05.00-1.164.1.aarch64

But

pkcon install harbour-tesseract

worked without problems.

eson's picture

I can't install textractor on SFOS 4.5.0.

Storeman says...

"Nothing provides 'libpoppler.so.112()(64bit)' needed by the to be installed harbour-textractor-0.7-0.aarch64"

aviarus's picture

I think Jolla should provide it. You can try installing it from the terminal: devel-su zypper in poppler

olf's picture

… or without zypper: devel-su pkcon install poppler

lpr's picture

SFOS 4.5.0 provides libpoppler.so.124 but not libpoppler.so.112, so it will not work anymore.
The best approach would be to link only against libpoppler-qt5.so.1 or libpoppler-glib.so.8, whose sonames do not change between poppler versions (sailfish-office, for example, does it that way).
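
To illustrate why the version bump breaks installation, here is a minimal Python sketch, assuming that rpm treats an soname requirement as a plain name that some installed package has to provide verbatim. The helper `missing_requirements` and the package lists are hypothetical; only the library names come from this thread.

```python
def missing_requirements(requires, provides):
    """Return the required capabilities that no installed package provides.

    Soname capabilities are compared as exact strings here, so a bump from
    libpoppler.so.112 to libpoppler.so.124 leaves the old requirement unmet.
    """
    provided = set(provides)
    return [cap for cap in requires if cap not in provided]


# Illustrative list of what SFOS 4.5.0 offers, based on the names in this thread.
sfos_450_provides = [
    "libpoppler.so.124()(64bit)",
    "libpoppler-qt5.so.1()(64bit)",
    "libpoppler-glib.so.8()(64bit)",
]

# Assumed requirements of a build linked directly against libpoppler,
# and of a build linked only against the Qt5 wrapper.
old_build = ["libpoppler.so.112()(64bit)"]
new_build = ["libpoppler-qt5.so.1()(64bit)"]

print(missing_requirements(old_build, sfos_450_provides))
print(missing_requirements(new_build, sfos_450_provides))
```

The first call reports the unmet libpoppler.so.112 requirement, while the second finds nothing missing, because the wrapper's soname is the same before and after the poppler upgrade.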

aviarus's picture

Thanks. Something to figure out, then. A dirty fix would be a symbolic link to the new .so.

P.S.: it doesn't even work.

aviarus's picture

Thanks, I changed it to libpoppler-qt5.so.1; now I have to do the same for the other architectures. Installing on aarch64 should work now.

eson's picture

Thanks for the quick fix, although I couldn't install it with Storeman. I had to download the rpm and install it manually.

zipotron's picture

Thanks for the app, it seems really interesting, but that's a lot of packages... Could you please write a bit about how to install it? I mean, which packages are required and which are optional. Thanks

obexd_contentfilter_olf's picture

@zipotron, even at the command line a pkcon install harbour-textractor is fully sufficient (or alternatively, when zypper is installed: zypper install harbour-textractor).

I.e., why do you care about dependencies which are automatically resolved just fine?

slava's picture

I see at least two reasons to avoid extra dependencies in 3rd-party apps:

  1. they don't get automatically uninstalled when you remove the app
  2. conflicts between dependency packages provided by different 3rd-party apps and/or the system ones.

That only doesn't matter if you install exactly one 3rd-party app, never remove it and never upgrade the OS.
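
The first point can be sketched in Python. This is only an illustration under stated assumptions: the dependency graph below is invented, and `orphans_after_removal` is a hypothetical helper, not something pkcon or zypper offers.

```python
def orphans_after_removal(requires, explicit, app):
    """List packages that stay installed although nothing needs them anymore.

    requires: maps each installed package to the packages it depends on.
    explicit: packages the user installed on purpose.
    app:      the explicitly installed package being removed.
    """
    keep = set()
    stack = [pkg for pkg in explicit if pkg != app]
    while stack:  # walk everything still reachable from the remaining apps
        pkg = stack.pop()
        if pkg in keep:
            continue
        keep.add(pkg)
        stack.extend(requires.get(pkg, []))
    return sorted(pkg for pkg in requires if pkg != app and pkg not in keep)


# Invented example graph; the real dependencies may differ.
requires = {
    "harbour-textractor": ["tesseract-ocr", "leptonica", "poppler"],
    "tesseract-ocr": ["leptonica"],
    "leptonica": [],
    "poppler": [],
    "sailfish-office": ["poppler"],
}
explicit = {"harbour-textractor", "sailfish-office"}

print(orphans_after_removal(requires, explicit, "harbour-textractor"))
```

In this made-up graph, poppler is still needed by sailfish-office, while leptonica and tesseract-ocr are left behind as orphans once the app is removed.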

aviarus's picture

Don't you install through Storeman? Maybe I should have separated out the development packages. Only harbour-textractor is necessary for running the app; all the others are dependencies for development purposes.

zipotron's picture

True, and it is working perfectly. Thanks!