Process to fix the verse number on PDF to Docx conversion

Nycholas Maia
Nycholas Maia Member Posts: 19
edited November 20 in English Forum

Hi, I'm trying to create my Personal Book from a PDF.

This image below is my PDF. As you can see, it have the both chapter and verse number in it.

I converted it from PDF to DocX using Adobe Acrobat Reader PRO 2021 and I got a good output.

My only problem is that: the output joins/concatenate the chapter/verse number with the first word of the line.

So, for this example above, the output text is:

1.1Irmãos, pelas desgraças....

I would like to know if is there some way to keep the chapter/verse numbers, but avoid the concatenation process in the output text.

Could you help, me?

Thank you,

Tagged:

Comments

  • Robert M. Warren
    Robert M. Warren Member Posts: 2,452 ✭✭✭

    Hi Nycholas:

    Do you have Microsoft Word 2013 or later available to you? It's possible it might provide a better outcome converting than Adobe. If your PDF is not under copyright, you could post it here and I could run it through Word and see if it works well.

    macOS (Logos Pro - Beta) | Android 13 (Logos Stable)

    Smile

  • Nycholas Maia
    Nycholas Maia Member Posts: 19

    Thank you Robert!

    Yes, I have Microsoft Office 2019. So I converted my PDF using Microsoft Word and I got a better output!
    Now I have the correct blank spaces between the chapter/verse numbers and the first word of the line. Great!

    You can see it in Microsoft Word:

    So, the next problem is that: When I imported this new *.docx file to my Personal Book in Logos 9, I saw that there is another verses when the original PDF concatenates the verse number with the first word of the line. As you can see below:

    So, in this second case, the Word output is a concatened number/text too.
    The problem, is when I tried to search this word in Logos, I need to write the concatenated number too. As you can see here:

    So, Is there a way to solve/fix these cases where the original PDF verses are concatenated with the first word of the line?

    Thank youm again!

  • Robert M. Warren
    Robert M. Warren Member Posts: 2,452 ✭✭✭

    OK, try this:

    (Make a safe copy of the Word docx.)  In Word, on the Home tab, click the Find & Replace button (at the right) and then click Replace. In the dialog box that appears, click the 'More' button at the bottom. Click the 'Use wildcards' box. In the 'Find what' box, type this     ([0-9])([A-Z])

    In the 'Replace with' box, type     \1 \2     (that's a space between 1 and \)

    (If your Portuguese character set has different 'first and last' letters, you'll edit that part.)

    Click the 'Replace all' button. What this should do is find anywhere there is a number followed by an upper-case letter with no spaces in between and insert one space. If you want two spaces between verse and text, add another space. There are probably some verses that begin with a lower case letter; to fix those, run the Replace again with    a-z   in 'Find what'.

    Examine your docx and see if all went well. The good thing about Find & Replace is that is does exactly what you tell it to do. The bad thing about Find & Replace is that is does exactly what you tell it to do. [:)]

    macOS (Logos Pro - Beta) | Android 13 (Logos Stable)

    Smile

  • Robert M. Warren
    Robert M. Warren Member Posts: 2,452 ✭✭✭

    Also, Nycholas, I should add this, in case you don't know:

    You'll be able to read your Personal Book you make from this as you would an ordinary book, and it will be searchable in the full-text index. However, you should also be aware that it won't behave as other bibles you might have from Logos, unless you do some tagging on it. If you are able to add the necessary tagging, you'll be able to use it in link sets with commentaries, Bible Search, Text Comparison, Copy Bible Verses, and a lot of other things I can't think of right now. It's a lot of work, but if this is a translation you value, it might be worth it. I have only tagged a bible personal book once (https://community.logos.com/forums/t/151799.aspx ) and I had to use MS Excel as an intermediate tool. If you need to do this, I'll be able to help.

    macOS (Logos Pro - Beta) | Android 13 (Logos Stable)

    Smile

  • Nycholas Maia
    Nycholas Maia Member Posts: 19

    Thank you so much for your good help Robert!

    In the field 'Find What', I wrote: ([0-9])([A-zÀ-ú])
    Because portuguese language have accent letters. It works great!

  • Nycholas Maia
    Nycholas Maia Member Posts: 19

    I saw your bible work! Amazing! Thank you for sharing this MS Excel file.

    My goal is create a Personal Book that any people that copy and paste a selected text from it, when paste, the output text cames with a good bibliography citation like this example below:

    ---------------------

    "This is the copied text from the Nyck Personal Book."

    Maia, Nycholas. Personal Book Name (2022). Chapter/Verse 10:20-30. Page 150.

    ---------------------

    Is this possible for Personal Books?

  • Nycholas Maia
    Nycholas Maia Member Posts: 19

    Ahhh...One more question:

    My original PDF have hyperlinks that are pointed to the same PDF (to the last page). As you can see in the image below:

    When MS Office 2019 converted the PDF to *.docx, these links stop to work, even they are in blue color.

    Is there a way to keep these hyperlinks working in the *.docx file?

    Thank you again!

  • Robert M. Warren
    Robert M. Warren Member Posts: 2,452 ✭✭✭

    Hi Nycholas:

    Sorry, I didn't realize until this morning that you were converting an Apostolic Father's epistle (I think), and not a bible. But, it was good information if you're new to Logos Personal Books.

    My goal is create a Personal Book that any people that copy and paste a selected text from it, when paste, the output text cames with a good bibliography citation like this example below:

    ---------------------

    "This is the copied text from the Nyck Personal Book."

    Maia, Nycholas. Personal Book Name (2022). Chapter/Verse 10:20-30. Page 150.

    ---------------------

    Is this possible for Personal Books?

    First, let us talk about how Personal Books are shared. You would send the Word docx to other Logos users and they would compile it as a Personal Book on their desktop computer. (I will note here that personal books do not synchronize to the mobile devices, and they are not available on the Logos Web App, 'app.logos.com'.)

    Each person who creates the personal book on their computer needs to enter the 'metadata' (title, author, type, copyright, etc.). The individual also controls how citations behave when pasted from the personal book on their computer, via Program Settings. (They would also control the format of the citation via Program Settings.) So, the citation information when they copy and paste depends on their accuracy and fastidiousness in entering the metadata. Beyond Title, Author, Copyright, Type, and Language, there are other fields of metadata that can be added via the 'Add field' drop-down. So, in addition to the docx file, you would provide each person with the metadata they need to enter in the Personal Book Tool to produce the resource you want them to have. (I should note here also that 'Language' in this specification is library-metadata-related only. The Word docx would control the marking of specific languages [Greek, Spanish, Portuguese, etc.]. These are important for various word look-up functions. MS Word generally does a good job at automatically marking some languages.)

    The next issue to consider in how your personal book is cited is Indexes. (The primary purpose of indexes is to enhance the general experience of using a personal book.) When I mentioned bible tagging above, much of that tagging involves what are called 'milestones'. For example, when you tag a bible, each verse is tagged with a bible milestone, (See the Green's Literal Bible docx I referenced; the '@Bible.....' tags are milestones.) If you look in your library at a bible or a noted work (Calvin, Augustine, Aquinas, various Church Fathers' works, etc.), you can look at the resource's Information pane and scroll down to the Indexes section and see those indexes listed. When the personal book is compiled in Logos, the milestones create one or more indexes for the resource. The types of indexes I have experience with are: Bible, English Headword (for dictionaries, etc.), and Page. A pasted citation to your work will refernce one or more of the indexes in the work. Regarding the specific Clement Epistle, there is an 'Apostolic Fathers' set of milestones. I don't think that every possible milestone is 'exposed' to users for use in personal books.

    You might want to see this page in the Logos WIki:  https://wiki.logos.com/Personal_Books  It has a lot of information and instruction regarding personal books and milestones, etc.

    This is what I am hoping:  There are other Logos users with much more experience than I in using certain milestones and in controlling citation content and styles (I have no academic background). Maybe someone can add a few specific comments to my many words of general observations.

    macOS (Logos Pro - Beta) | Android 13 (Logos Stable)

    Smile

  • Nycholas Maia
    Nycholas Maia Member Posts: 19

    Thank you very much Robert for your precious tips on Logos and the construction of "Personal Books".

    I read the online documentation you quoted, but I couldn't find a way to add the chapter and verse information to the text in Microsoft Word.

    I tried changing the HEADING of each session to HEADING TITLE 1, but it didn't work in Logos.

    I looked for how to add this chapter/verse information using the "milestones" but I couldn't find in the online documentation how to do it specifically.

    Yes, I am doing a test using the book of the First Epistle of Clement from the Patristic edition.
    This text is divided into chapters and verses.
    So I would like to edit the Microsoft Word file so that Logos can recognize this internal division.

    My ultimate goal is: When the user copies an excerpt of this text from Logos to Microsoft Word, the output of the text in the collage comes with the complete bibliographic reference, including the chapter number and verse.

    If possible, it would also be very good if the bibliographic reference comes with the page number.

    Example:

    -------------------
    "This is the copied text from the 1 Clement"

    Clement of Rome. Chapter 1:3. Patristic Edition - Paulus (2022) Page 2.

    -------------------

    At this moment, when I copied the text from Logos, it cames without the chapter/verse. And without the page number.

    Example:

    --------------

    Padres Apostólicos, Patrística - Clemente Romano, vol. 1, Patrística (Paulus, 2022).

    --------------


    Do you have ny ideas about how to enable Logos to include the chapter/verse and page number in the biibliografy?

  • Robert M. Warren
    Robert M. Warren Member Posts: 2,452 ✭✭✭

    Hi Nycholas:

    I'll have to do some additional thinking on this, so it will probably be tomorrow before I'll be able to give you some more information.

    I want to make sure I understand your issue thoroughly:

    1. You want to create a personal book for others to use.
    2. The docx for this resource will contain excerpts from Clement (or other Fathers) along with your notes / observations.
    3. When your friends have compiled the docx in the Personal Book Tool on their installation of Logos, you want them to be able to copy from this resource and paste into a Word document they are composing.
    4. You want that text which they have pasted into their document to have a citation from the personal book that specifies you as the author, provides the page number(s), and the Clement citation consisting of Chapter and Verse.

    I would like to appeal to other users also, if they have experience / information on:

    1. What Nycholas might expect in the way of citation / bibliographical info  (I'll note again my lack of academic experience)
    2. Are the milestones for the various Fathers available / knowable for use in personal books.

    macOS (Logos Pro - Beta) | Android 13 (Logos Stable)

    Smile

  • Nycholas Maia
    Nycholas Maia Member Posts: 19

    Thanks for the clarity of thought and the organization of ideas Robert.

    I would just like to modify item 4 of your topic list.

    I'm creating a "Personal Book", but I don't want to appear in the bibliographic references as the "Author of the Book".

    The idea is that the book is "by Clement".

    Example:

    ----------------

    "Here is a phrase copied and pasted from the Personal Book"
    Clement of Rome, I Epistle to the Corinthians, Chapter 1:3. Ed. Paulus (2022), page 3.

    -----------------

    See what information is present in this bibliographic citation:
    a) Author of the Book: Clement of Rome
    b) Title of Book: I Epistle to the Corinthians
    c) Location of the quoted phrase: Chapter 1 verse 3
    d) Publisher: Paulus
    e) Year of publication: 2022
    f) Page Number: 3


    -------

    So far, when I copy and paste a text from this Personal Book, I get the following bibliographic information:
    a) Author of the Book: Clement of Rome
    b) Title of Book: I Epistle to the Corinthians
    c) Publisher: Paulus
    d) Year of publication: 2022

    What I want is to add to the bibliography 3 items:
    1- Location: chapter
    2- Location: verse
    3- Page Number

    In summary: What is the means/method to be done in the *.docx text so that it contains the necessary metadata to generate this complete bibliography?

  • NB.Mick
    NB.Mick Member, MVP Posts: 15,837 ✭✭✭

    In summary: What is the means/method to be done in the *.docx text so that it contains the necessary metadata to generate this complete bibliography?

    I'm not sure about the chapter/verse thing appearing at all outside of bibles - even more so if you want to include page numbers, which would make a citation technically "complete" without the much more relevant location info - I'd try to check other PBs how that comes around. For this to work you would definitely need milestones, which also makes the PB much more usable in Logos.

    The page numbers: again, milestones. Those are pretty straight forward, you'll need to insert [[@page xx]] at every location where there's a page break in the original edition you try to replicate. 

    Have joy in the Lord! Smile

  • Robert M. Warren
    Robert M. Warren Member Posts: 2,452 ✭✭✭

    I'll have to do some additional thinking on this

    OK, Nycholas:

    First, the bad news:  as NB.NMick indicated (thanks!), I can find no method for customizing the automated citation styles supplied with Logos to cause them to display a non-Page index range (that is, non-canonical documents). I will suggest a workaround or alternative later in this novella.

    Next, here's how to tag the Clement references:

    Navigate to this post in the Logos Forums:  https://community.logos.com/forums/t/66338.aspx (thank you, Ken McGuire!), which has a docx of the Richardson translation of the Apostolic Fathers. Beginning at Page 26 in that document, note how Ken tagged 1 Clement. That's how you'll need to tag your personal book. Note also that the tag with Chapter 0 is for the introduction. Recall from the Wiki page, that if you were tagging a typical bible reference, it would follow the pattern  [[@Bible:Book Chapter:Verse]] . So, you can see the analogy to the non-canonical reference  [[af:1Cl 1.1]] . When all the verses are tagged and your users compile their personal books, they will have the Apostolic Fathers index in the resource. This means they can:

    1. Type a 1 Clement reference in the reference box (the text box in the upper-left corner of the resource) and go straight to that reference.
    2. Open a commentary or other resource that is indexed with the Apostolic Fathers and create a Link Set that scrolls together.

    Next, Page Numbering:

    On the WIki page for Personal Books (https://wiki.logos.com/personal_books ), scroll down the the Milestones section and see the format for Page milestones, that is:  [[@Page:number]] . This will need to be at the top of each page. (But don't put them in the Word header section; the Personal Book compiler ignores headers and footers.) You'll want to do this as the last editing step. You can place them at the beginning of the first sentence on the page, since it won't be part of the surface text of the compiled resource. If your document is brief, manually entering this will be simple. If your document is long, consider reading this forum thread:  https://community.logos.com/forums/p/35832/269112.aspx#269112 (thanks, Dominick Sela!). Make sure to get the revision Dominick added in Post #3 in the thread.

    Next, Compile Your Personal Book, review it, and determine which citation style you want your users to specify:

    Make sure the compiled resource looks as you want it to. Copy some sections of text to a Word document and see how that looks. Check the citation format. If you have entered footnotes in MS Word, those will be compiled as functional Logos footnotes and will be pasted in the Word document, if Copy Footnotes is specified as Yes in Program Settings. In that case, the citation footnote will be the last one listed (I think) and it should be distinguished from the regular footnotes by having a slightly larger font size. If you want to experiment with other citation styles, specify another style in Program Settings and test again. Once you have determined that, here's the workaround or alternative for getting the Book-Chapter-Verse reference: you'll have to tell your users to manually edit the citation footnote that is generated when the text is pasted. (By the way, the citation footnote does have a hyperlink to the copied text, but it is worthless when sent to someone else, since each Personal Book has a resource ID that is distinct for each user. If you select that hyperlink, then press Shift+F9, it will reveal the hyperlink code, which will at least have the index starting point of the referenced text, but that's a lot of trouble.

    When you are test-compiling the personal book, note the metadata items you want your users to specify and what values you want them to use.

    What You Will Want to Provide to Your Users:

    1. The docx file
    2. A list of metadata items to include in the Personal Book Tool and the values for those. Don't forget the Add Fields list.
    3. The citation style you want used and tell them how to change to that in Program Settings
    4. Whether or not you want the footnotes in the personal book to also be in the pasted data in their individual Word documents and how to specify Yes or No in Program Settings
    5. How to edit the citation footnote for the Book-Chapter-Verse references, and the format for that.

    I hope all of this makes sense.  [:)]

    macOS (Logos Pro - Beta) | Android 13 (Logos Stable)

    Smile

  • Nycholas Maia
    Nycholas Maia Member Posts: 19

    Amazing answer Robert!

    Thank you so much for your patience and great help!