For some PDF files, the output document does not display correctly after conversion to Microsoft Word, Excel, or PowerPoint:

*Text is garbled or displays as gibberish characters
*Certain letter combinations are replaced with strange symbols, e.g. letter pairs such as "ff" or "ie" turn into symbols like $, and a space becomes %

The reason for the encoding problem:

Within text strings in a PDF, characters are represented by character codes, which map to glyphs in the current font through an encoding. There are a number of encodings, and a font can even have its own built-in encoding. Encoding information is a must-have for any PDF conversion task. If the fonts in a PDF don't use a standard encoding for mapping glyph indices to characters, or the font's encoding information is missing, you'll get garbage characters after converting the file to Word.

The problem is not limited to PDF conversion. If you open this type of PDF file with Adobe Reader, Preview, or any other PDF reader, copy a word or a sentence, and paste it into the default text editor, such as TextEdit on Mac or Notepad on Windows, you'll get the same result.
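To make the mechanism concrete, here is a minimal Python sketch of how the same character codes produce readable text with the font's own encoding and garbage without it. The encoding table, the codes, and the function names are all invented for illustration; they are chosen to mirror the symptoms described above and do not come from any real font or PDF library.

```python
# Hypothetical built-in encoding of a font: character code -> text.
# These codes and mappings are invented for illustration only.
CUSTOM_ENCODING = {
    0x24: "ff",  # this imaginary font stores the "ff" ligature at code 0x24
    0x25: " ",   # and the space glyph at code 0x25
    0x61: "a",
    0x63: "c",
    0x65: "e",
    0x6E: "n",
    0x6F: "o",
    0x74: "t",
}


def decode_with_encoding(codes: bytes, encoding: dict) -> str:
    """Map each character code to text using the font's own encoding."""
    # Unknown codes become U+FFFD, the Unicode replacement character.
    return "".join(encoding.get(code, "\ufffd") for code in codes)


def decode_as_standard(codes: bytes) -> str:
    """Simulate a converter that lacks the encoding info and falls back
    to treating every code as a standard Latin-1 character."""
    return codes.decode("latin-1")


# Character codes as they might appear in a PDF text string.
codes = bytes([0x61, 0x24, 0x65, 0x63, 0x74, 0x25, 0x6F, 0x6E])

print(decode_with_encoding(codes, CUSTOM_ENCODING))  # affect on
print(decode_as_standard(codes))                     # a$ect%on
```

The page still renders correctly because the viewer only needs the glyphs; it is the step from character code back to text that fails when the encoding is missing or non-standard, which is why both conversion and copy-paste produce the same garbage.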