set marker for searchable/non searchable PDF files

mgroen
Posts: 147
Joined: 07.02.2020, 15:03

set marker for searchable/non searchable PDF files

#1 Post by mgroen » 03.03.2021, 11:26

This is a feature request:
Could FreeCommander set a marker so that the user can distinguish searchable from non-searchable PDF files?
For example, by creating a column "Searchable" and putting an S (or any other marker) on each line for searchable PDF files?
(and setting no marker for non-searchable ones)

Explanation:
There are 2 types of PDF files:
1. PDFs which are searchable, either because they were created with a virtual printer or because OCR (Optical Character Recognition) was applied to an image-based PDF file.
2. PDFs which are NOT searchable (they basically contain only an image).

I have lots of PDF files; some of them are searchable, some are not.
I want to get an overview of the searchable and non-searchable ones without opening each file manually to check whether it's searchable.

The main goal behind this request is to make all of my PDF files searchable, but to do that I first need an overview of which ones are already searchable and which are not.

I just noticed that Total Commander (another file manager) has a plugin with this functionality. More info here: https://totalcmd.net/plugring/pdfOCR.html
Although I am not really a fan of Total Commander, I was hoping this functionality could be implemented in FreeCommander.

Thanks,
Mathijs

mgroen
Posts: 147
Joined: 07.02.2020, 15:03

Re: set marker for searchable/non searchable PDF files

#2 Post by mgroen » 06.03.2021, 14:58

To make it clearer what I need, I made a screenshot.

In short, again: I need an overview of files showing the filenames and a marker indicating whether each PDF file is searchable or not.

Here is what I need:

[Image: screenshot]

mgroen
Posts: 147
Joined: 07.02.2020, 15:03

Re: set marker for searchable/non searchable PDF files

#3 Post by mgroen » 02.01.2023, 19:23

Any update on this? Anyone?

Note:
Basically, in the thread above, "searchable" means the PDF has been (partially) OCRed (processed with OCR software).

horst.epp
Posts: 460
Joined: 15.11.2008, 20:18

Re: set marker for searchable/non searchable PDF files

#4 Post by horst.epp » 02.01.2023, 20:10

I use one of the xpdf tools in a batch file which produces a list of PDFs that need OCR.

@echo off
setlocal
rem echo on
rem Work in the folder that contains this batch file
pushd "%~dp0"
cls
::____________________________________________________________
::
::				SETTINGS
::____________________________________________________________
::
	chcp 1252 >nul
	set OUT-List=.\need_ocr.txt
	rem Remove the list left over from a previous run, if there is one
	if exist "%OUT-List%" del "%OUT-List%"

::____________________________________________________________
::
::				ACTION!
::____________________________________________________________
::

	for %%X in (*.pdf) do (
		echo.    [%%X]
		rem Extract the text layer into a temporary file
		C:\Tools\xpdf-tools\pdftotext.exe -simple "%%X" .\checkthis.txt
		rem Fewer than 25 bytes of text: add the full path of the PDF to the list
		for %%C in (checkthis.txt) do if %%~zC LSS 25 ( echo %%~fX>>"%OUT-List%" )
		del checkthis.txt
	)
popd
pause
goto :EOF

Windows 11 Home x64 Version 23H2 (OS Build 22631.3296)
Everything Version 1.5.0.1371a (x64), Everything Toolbar 1.3.2, Listary Pro 6.3.0.67
FreeCommander XE 2024 Build 905 64-bit donor

mgroen
Posts: 147
Joined: 07.02.2020, 15:03

Re: set marker for searchable/non searchable PDF files

#5 Post by mgroen » 03.01.2023, 10:29

@horst.epp how to implement this?

horst.epp
Posts: 460
Joined: 15.11.2008, 20:18

Re: set marker for searchable/non searchable PDF files

#6 Post by horst.epp » 03.01.2023, 14:04

mgroen wrote: 03.01.2023, 10:29 @horst.epp how to implement this?
In short:
Download the free XPDF command line tools from
https://www.xpdfreader.com/download.html
Use my batch as a template and change it according to your environment.
Create an FC favorite button to invoke the batch with the current dir as parameter.
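
For readers who would rather not maintain a batch file, the same check can be sketched in Python. This is only a minimal sketch under assumptions, not a tested replacement: the pdftotext.exe location and the 25-byte cutoff are copied from the batch above, the function names are made up for illustration, and the folder to scan is read from the first command-line argument (the batch as posted works in its own folder instead).

```python
import subprocess
import sys
from pathlib import Path

# Assumption: same install location as in the batch above; adjust to your system.
PDFTOTEXT = r"C:\Tools\xpdf-tools\pdftotext.exe"
# Assumption: same cutoff as the batch (fewer than 25 bytes of text means "needs OCR").
MIN_TEXT_BYTES = 25


def needs_ocr(extracted: bytes, min_bytes: int = MIN_TEXT_BYTES) -> bool:
    """Return True when the extracted text is too short to count as searchable."""
    return len(extracted) < min_bytes


def extract_text(pdf: Path) -> bytes:
    """Run pdftotext and return the raw text; "-" sends the text to stdout."""
    result = subprocess.run(
        [PDFTOTEXT, "-simple", str(pdf), "-"],
        capture_output=True,
        check=False,  # a failed conversion yields empty output and is flagged too
    )
    return result.stdout


def find_pdfs_needing_ocr(folder: Path, extract=extract_text) -> list[Path]:
    """List the PDFs directly inside folder whose extracted text is below the cutoff."""
    return [pdf for pdf in sorted(folder.glob("*.pdf")) if needs_ocr(extract(pdf))]


def main(folder: Path) -> None:
    """Write the full paths of the flagged PDFs to need_ocr.txt inside folder."""
    flagged = find_pdfs_needing_ocr(folder)
    out = folder / "need_ocr.txt"
    out.write_text("".join(f"{pdf.resolve()}\n" for pdf in flagged), encoding="utf-8")
    print(f"{len(flagged)} PDF(s) listed in {out}")


if __name__ == "__main__":
    if len(sys.argv) == 2:
        main(Path(sys.argv[1]))
    else:
        print("usage: python find_need_ocr.py <folder>")
```

Saved under a name of your choice (find_need_ocr.py is just an example), it would be started with the folder as its only parameter, which is the value the FC favorite button would have to pass.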
