Can't open library in editor when saved as pdf

  • 1
  • Question
  • Updated 12 months ago
Saved my screen captures as pdf's as I captured them. They all appear in the library as pdfs. The recent captures appear in the editor and, if I hover over them, they say xxx.pdf. I'm using the 'Grab text' function. I have Adobe Acrobat and can save the files as word .docx to get the text, but the resolution in the pdf is far worse, and most of the text is not recognised. The resolution in Snagit is much better, and the Grab Text is almost perfect. However, if I open the pdfs from the Snagit library, they always open with Acrobat. Is there ANY way I can open the pdfs in the library, using the Editor, so I can use the Snagit Grab Text function?

Things I've tried already:
Acrobat save as Word/Excel etc. - resolution in Acrobat too low, so OCR not accurate
Changed file association in Windows 10, to associate pdfs with Snagit - no, Snagit won't open pdfs. However, the recent files in Snagit Editor open fine
Checked in the Registry using RegEdit, to see if I can replace the 'recent files' with older pdfs, so I could cycle through and Grab Text of older files. I found the place in the registry that shows the 'recent files' but there were no pdf's there, despite pdfs showing in Snagit Editor recent files.
Lots of other ways of OCR'ing from pdfs, but the resolution of the pdf seems much worse in Acrobat etc, than the Editor Recent Files.
Checked Snagit folder to see if they have been saved as Snagit files as well as pdfs, and that's why I can still get good resolution in the Editor Recent Files, but no, they are not there.

So I'm stumped. Unfortunately, the original internet pages that I captured are no longer there, so I can't redo the process, and use the 'Grab Text' option to capture text directly from the pages, which is what I should have done in the first place.

Last resort is to copy type the text from the pdf files, but that's 150 captures, with 3 pages of text on each capture. Even using Fiverr, that's going to cost a pretty penny.

Any ideas very welcome. Thank you in advance.

Snagit version 2019.1.2 64 Bit Build 3596 23/05/2019
Photo of rob

rob

  • 3 Posts
  • 1 Reply Like
  • frustrated and almost resigned

Posted 12 months ago

  • 1
Photo of Joe Morgan

Joe Morgan

  • 9083 Posts
  • 4787 Reply Likes
Try running a free trial of Abby FineReader. https://www.abbyy.com/en-us/finereader/

It's the same people that provide Grab Text for SnagIt. But it's better at text recognition. I've used it to read text SnagIt cannot.

And, it opens PDF's.


I was playing around with this, and saving pdf's to a folder. The pfd's that showed up in the recent capture tray for me, were actually listed as unsaved captures. Which is normally reserved for .snags. Yet I save all my captures to the Library, where they were all pdf's.



I suspect this is some sort of bug.

SnagIt has never opened pdf's natively. So opening them in SnagIt isn't going to happen.

If you try FineReader, I'd like to hear if it worked out.

Regards,Joe

Photo of rob

rob

  • 3 Posts
  • 1 Reply Like
Thank you for your kind reply.

Your recommendation is a very good solution. The output is far, far better than Adobe Acrobat 'Save As Word'. I am buying the full version of Fine Reader, because I have many more than 100 pages. Shame I couldn't do the same thing with SnagIt, but at least I now have a solution.
Very kind of you to take your time to help me.

Thank you again,

Rob
Photo of Rick Stone

Rick Stone

  • 6638 Posts
  • 3196 Reply Likes
Hey Rob, hopefully you will see this reply today. (Saturday, July 13)

I regularly follow a site called "Giveaway of the Day". And today's offering is a PDF to OCR package. These are free, but the catch is that you have to download, install, and activate today in order to take advantage.

Seeing today's offering made me think of this post.

https://www.giveawayoftheday.com/pdf-ocr-4-7/

Cheers... Rick :)
Photo of rob

rob

  • 3 Posts
  • 1 Reply Like
Thanks Rick, I've bought Abby Fine Reader and it was perfect at deciphering the pdfs.
Photo of Joe Morgan

Joe Morgan

  • 9083 Posts
  • 4787 Reply Likes

While the free software suggestion may have its merits to someone. It’s unimpressive. And highly under featured. That’s why it’s free. In my opinion.

I like being in a position to suggest alternatives. ABBYY being such an expensive route to take. But you can choose format settings. Whereas you cannot with the free program. Which is also a previous version of the software that cannot be updated. It’s Giveaway’s modus operandi.

I downloaded it to see if I would recommend it to others. I’d recommend on online converter, free alternative instead.

It strips the bulk of the formatting. All the fonts become the same size, bold type is no longer bold. And everything is reduced to a single file column.

I opened this pdf in the program.



It was converted it into this.


I’ll give it credit for this. The ORC engine seems accurate. However, without options to preserve any of the formatting. It seems closer to one step forward and 3 steps back. From my perspective.

ABBYYS expensive for a reason. And I wish it were cheaper. Plus, it does a lot more than just recognize text.



I wish I could recommend the free program. That’s why I downloaded and tested it.The ORC text is opened with Notepad. I’m thinking that’s why the formatting gets stripped away?

SnagIt doesn’t maintain all the formatting either. But it recognizes bold type and font size. I’d recommend SnagIt over this program. However, SnagIt reduces everything to a single column of text as well. However, it’s still closer to what was actually captured.  

 I even tested the New version. It’s no different from my brief observations and tests. It’s a forgettable application to me.


 I’m posting this information primarily for others, that search this thread down the road. If I didn’t, I’d feel I’ve completely wasted my time in testing the program.     

Enjoy ABBYY, Joe