"Importing" a PDF


Posts 979
Deacon Steve | Forum Activity | Posted: Tue, Jan 30 2018 6:04 AM

Is there a way to "import" a PDF into Verbum/Logos?

Posts 3225
Whyndell Grizzard | Forum Activity | Replied: Tue, Jan 30 2018 6:08 AM

Simple answer..... wait for it.......NO!!!!

Posts 3225
Whyndell Grizzard | Forum Activity | Replied: Tue, Jan 30 2018 6:09 AM

Well, yes: convert the PDF to JPG or PNG files, then create a personal book. But inside Logos it is no better than the PDF, since it is unsearchable, etc.

Posts 3225
Whyndell Grizzard | Forum Activity | Replied: Tue, Jan 30 2018 6:11 AM

I have converted hundreds of books from PDF to docx then built PBB resources, but depending on what you are looking for it can be simple or very time consuming.

Posts 28374
Forum MVP
JT (alabama24) | Forum Activity | Replied: Tue, Jan 30 2018 7:28 AM

Steve:
Is there a way to "import" a PDF into Verbum/Logos?

As has been answered, no... at least not directly.

PDF is an unusual file format... Some PDFs are essentially photographs, while others have selectable text. If you are able to convert the PDF into selectable text and get that text somehow into a Microsoft Word document (.docx), you can import the text into Logos as a personal book. NOTE: This process can be VERY simple or VERY complicated based on several factors.
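For anyone wanting to check which kind of PDF they have before starting, here is a minimal sketch in Python. It is not part of Logos or Verbum. It assumes the third-party `pypdf` package is installed, and the function names and the 50-character threshold are hypothetical choices made for illustration. The idea is simply that a "photograph" PDF yields little or no extractable text per page.

```python
def looks_scanned(page_texts, min_chars_per_page=50):
    """Guess whether a PDF is the "photograph" kind.

    page_texts: one string of extracted text per page.
    min_chars_per_page: illustrative threshold, not a standard value.
    Returns True when more than half the pages have almost no text.
    """
    if not page_texts:
        return True  # nothing extractable at all
    sparse = sum(
        1 for text in page_texts
        if len((text or "").strip()) < min_chars_per_page
    )
    return sparse / len(page_texts) > 0.5


def extract_page_texts(pdf_path):
    """Read the text layer of each page (assumes pypdf is installed)."""
    from pypdf import PdfReader  # third-party: pip install pypdf

    reader = PdfReader(pdf_path)
    return [page.extract_text() or "" for page in reader.pages]
```

Usage would be along the lines of `looks_scanned(extract_page_texts("book.pdf"))`, where `book.pdf` is a placeholder path. A result of `True` suggests the PDF needs OCR before it can become a searchable personal book. A result of `False` suggests the text can probably be carried over to a .docx more directly.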


Posts 28374
Forum MVP
JT (alabama24) | Forum Activity | Replied: Tue, Jan 30 2018 7:32 AM

Whyndell Grizzard:
Well yes- convert the PDF to JPG or PNG file then create a personal book- but it is no better than the PDF inside Logos since it is unsearchable, etc

I didn't understand what you meant at first, but I do now. If the PDF is along the lines of a "photograph," you could do as you suggest and create a personal book of "photographs." In some cases, this would be perfect (e.g. if the original PDF was a photo book of biblical artifacts). The answer, however, isn't an absolute "no" to creating a "searchable book" from a photo-type PDF... but it is unlikely and potentially very costly. It would involve having excellent OCR software, time & patience to fix typos, etc. For most people and most projects, the "possible yes" would be a "practical no."


Posts 10691
Forum MVP
NB.Mick | Forum Activity | Replied: Tue, Jan 30 2018 7:58 AM

JT (alabama24):
If you are able to convert the PDF into selectable text and get that text somehow into a Microsoft Word document (.docx),

which, in more recent versions of MS Word, may mean simply opening the PDF in Word, as that will invoke an automated conversion. However, this may or may not provide satisfactory results, depending on the nature and quality of the PDF. The "photograph" type usually doesn't convert well, and that is how many old books available from e.g. archive.org have been preserved: as scanned pages. There are ePub and txt versions of such books, but they are worthless due to abysmally bad OCR.

Running Logos 8 latest beta version on Win 10

Posts 979
Deacon Steve | Forum Activity | Replied: Tue, Jan 30 2018 8:26 AM

Thanks everyone.  That helps.

Posts 2842
Michael Childs | Forum Activity | Replied: Tue, Jan 30 2018 2:56 PM

Some PDFs are just pictures of the text, not actually text.  If you want to convert that to a Logos Personal Book that is searchable, then you must use an OCR program to convert the PDF to actual text.  I have done that on a few books.  It takes quite a bit of editing of the OCR document.

Nuance Omnipage is probably the best OCR, and it will convert the picture-based text directly to a docx file, which is what it takes to make a Logos personal book.  However, it will still require considerable editing before being ready to convert to a Logos Personal Book.

So, yes, it can be done.  I am more optimistic than those who say it just can't be done.  I have done it several times and got pretty good results.  Of course, results vary according to how good the PDF was.  The more blurry the PDF, the worse the OCR result.

It had better be a book that you really want, and one you are willing to spend the time editing.
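Some of the routine OCR editing mentioned above can be scripted before the text goes into Word. The following is only a sketch in Python using the standard library. The function name `tidy_ocr_text` is hypothetical, and it assumes the OCR output has first been saved as plain text. It rejoins words split by a hyphen at a line end, turns single line breaks into spaces, and collapses repeated spaces. It does not fix misread letters, which still need a human eye.

```python
import re


def tidy_ocr_text(raw):
    """Apply a few mechanical fixes to plain-text OCR output."""
    text = raw.replace("\r\n", "\n")
    # Rejoin words hyphenated across a line break: "conver-\nsion" -> "conversion".
    # Caveat: a real compound split at a line end ("well-\nknown") loses its hyphen.
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # A single line break inside a paragraph becomes a space;
    # blank lines are left alone as paragraph breaks.
    text = re.sub(r"(?<!\n)\n(?!\n)", " ", text)
    # Collapse runs of spaces or tabs into one space.
    text = re.sub(r"[ \t]{2,}", " ", text)
    return text.strip()
```

The cleaned text would still need proofreading against the page images before building the personal book.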

"In all cases, the Church is to be judged by the Scripture, not the Scripture by the Church," John Wesley

Posts 6
Kenibell | Forum Activity | Replied: Tue, Jan 30 2018 7:06 PM

You are right enough, OP. Text can be easy to convert, but pictures aren't. It jumbles all the time.

Posts 1
Glen Colley | Forum Activity | Replied: Fri, May 8 2020 8:37 AM

What are PBB resources?

Posts 28374
Forum MVP
JT (alabama24) | Forum Activity | Replied: Fri, May 8 2020 8:40 AM

Welcome Glen!

Glen Colley:

What are PBB resources?

PBB = Personal Book Builder tool. You can import material that is in Microsoft Word (.docx) file format. If you have more questions, feel free to create a new post in the L8 forum. 


Posts 1063
JohnB | Forum Activity | Replied: Fri, May 8 2020 2:49 PM

Michael Childs:

Some PDFs are just pictures of the text, not actually text.  If you want to convert that to a Logos Personal Book that is searchable, then you must use an OCR program to convert the PDF to actual text.  I have done that on a few books.  It takes quite a bit of editing of the OCR document.

Nuance Omnipage is probably the best OCR, and it will convert the picture-based text directly to a docx file, which is what it takes to make a Logos personal book.  However, it will still require considerable editing before being ready to convert to a Logos Personal Book.

So, yes, it can be done.  I am more optimistic than those who say it just can't be done.  I have done it several times and got pretty good results.  Of course, results vary according to how good the PDF was.  The more blurry the PDF, the worse the OCR result.

It had better be a book that you really want, and one you are willing to spend the time editing.

Agreed on all points. Omnipage is excellent but still often needs some tweaking to get it to produce its best work.
