Hyphenation in Logos Personal Books

Lasda
Lasda Member Posts: 17 ✭✭
edited November 2024 in English Forum

I've been using ReadIris to OCR some .pdf files into .docx files for use as personal books. I'm noticing an interesting issue with wrapping the text. I made a sample file to test this and you can see the results in the attached image.

6087.Logos test wrap file.docx

The text on the left breaks "leadership" up in a bad way. The text on the right is correct. The difference is that the text on the left is as it comes out of the OCR program. The text on the right has been cut and re-pasted into the document without formatting. 

Thoughts on what is going on? Cutting and pasting fixes the problem, but is extra work...

Comments

  • Lasda
    Lasda Member Posts: 17 ✭✭

    Interesting -- I don't see the pasted image in the text. If I edit the post, the image is there, but viewing the post does not show it... I also don't see a place where I can attach a test file so someone can try to duplicate the problem.

  • Lew Worthington
    Lew Worthington Member Posts: 1,661 ✭✭✭

    This forum software is going away soon (let us rejoice), but for now, the paperclip icon is the way to attach images.

  • Fabian
    Fabian Member Posts: 1,099 ✭✭✭

    I see the same with your test file. 

    If you export it as HTML and check the source file with a text editor, you can see how the text is stored:

    style='color:#191919'>, </span>a data <span style='color:#2B2B2B'>s</span><span
    style='color:#191919'>e</span>t ba<span style='color:#191919'>se</span>d <span
    style='color:#191919'>o</span>n 1<span style='color:#191919'>33 </span>MBA p<span
    style='color:#191919'>a</span>rt<span style='color:#191919'>-</span>tim<span
    style='color:#191919'>e </span><span style='color:#2B2B2B'>s</span>tud<span
    style='color:#191919'>e</span>nt<span style='color:#191919'>s </span>fr<span
    style='color:#191919'>o</span>m T<span style='color:#191919'>s</span>inghu<span
    style='color:#191919'>a </span>Uni<span style='color:#2B2B2B'>v</span><span
    style='color:#191919'>e</span>r<span style='color:#2B2B2B'>s</span>i<span

    So the words are completely cut up into separate spans because of the different font colors. It seems Logos has issues with this. 
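    To make the fragmentation concrete, here is a minimal Python sketch (standard library only) that counts the color spans in a piece of the exported HTML and shows that the text underneath is perfectly ordinary. The `FRAGMENT` string is an assumption: it is a few words of the source above, trimmed to whole tags so it parses on its own. The class and function names are made up for the illustration.

```python
from html.parser import HTMLParser

# A few words from the exported HTML quoted above, trimmed to whole tags
# (assumption: the trimming is mine, the tags and text are from the export).
FRAGMENT = (
    "fr<span style='color:#191919'>o</span>m "
    "T<span style='color:#191919'>s</span>inghu"
    "<span style='color:#191919'>a </span>"
    "Uni<span style='color:#2B2B2B'>v</span>"
    "<span style='color:#191919'>e</span>r"
)


class TextAndSpanCounter(HTMLParser):
    """Collects the plain text and counts opening <span> tags."""

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.span_count = 0

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            self.span_count += 1

    def handle_data(self, data):
        self.text_parts.append(data)


def summarize(fragment):
    """Return (plain_text, number_of_spans) for an HTML fragment."""
    parser = TextAndSpanCounter()
    parser.feed(fragment)
    parser.close()
    return "".join(parser.text_parts), parser.span_count


if __name__ == "__main__":
    text, spans = summarize(FRAGMENT)
    print(repr(text), "is split across", spans, "color spans")
```

    The point is only that the OCR output wraps individual letters in their own color spans, so a single word is no longer one continuous run of text.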

    This worked for me: select all and set the font color to Automatic. 
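    The same idea can be sketched on the exported HTML, purely as an illustration: remove the spans that only set a color and keep everything else (bold, italics) as it is. This is not the fix tested in this thread, which was done in Word, and whether Logos would accept the result is an assumption. The function name `strip_color_spans` is hypothetical, and the regular expression only handles simple, non-nested color spans like the ones in the export.

```python
import re

# Matches a <span> whose only attribute is a style holding a single hex
# color declaration, e.g. <span style='color:#191919'>e</span>.
# Limitation: it does not handle spans nested inside a color span.
COLOR_ONLY_SPAN = re.compile(
    r"<span\s+style=(['\"])\s*color\s*:\s*#[0-9A-Fa-f]{3,6}\s*;?\s*\1\s*>"
    r"(.*?)</span>",
    re.DOTALL,
)


def strip_color_spans(html):
    """Unwrap color-only spans, leaving their text and all other tags."""
    return COLOR_ONLY_SPAN.sub(lambda match: match.group(2), html)


if __name__ == "__main__":
    sample = (
        "Uni<span style='color:#2B2B2B'>v</span><span\n"
        "    style='color:#191919'>e</span>r"
        "<span style='color:#2B2B2B'>s</span>ity <b>bold</b>"
    )
    print(strip_color_spans(sample))
```

    Spans that carry other styles are left alone, so formatting such as bold or italic is not lost.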

    Maybe there is a preference in ReadIris you can change. Otherwise, ask ReadIris support. 

    Maybe Logos can improve their app too.

    Χριστὸς ἐν ὑμῖν, ἡ ἐλπὶς τῆς δόξης· 

  • Antony Brennan
    Antony Brennan Member Posts: 842 ✭✭✭

    This forum software is going away soon (let us rejoice)

    I predict a mass literary riot 

    👁️ 👁️

  • Lasda
    Lasda Member Posts: 17 ✭✭

    Many thanks -- that works much better than cutting and pasting the text without formatting. I had to be careful to watch for bold and italic in the original text so I could re-apply the formatting to the final text.

    I will look carefully at the ReadIris program settings to see if there is an option I am missing.

  • Fabian
    Fabian Member Posts: 1,099 ✭✭✭

    Bradley,

    Maybe the font could be a reason for the issues I have with my PB dictionary.

    I guess it would be good if Logos fixed it anyway.

    • So that words are no longer broken even if the color is changed
    • So that maybe the [[label >> links]] still work even if the color, font, bold, or italics are changed

    Thanks

    Χριστὸς ἐν ὑμῖν, ἡ ἐλπὶς τῆς δόξης·