Get text out of a pdf-file? #4892
Replies: 2 comments
-
|
The pdf misses proper back-translation information from the glyphs to the originating Unicodes. Some background: what you see on any PDF page are glyphs only = graphical representations of the Unicode characters that were used to select them. In the end, glyph numbers are all you have - the connection to the Unicode is lost ... except when the font explicitly or implicitly provide a backward mapping. This is obviously not the case here. So all you can do is using OCR. |
Beta Was this translation helpful? Give feedback.
-
|
Sorry: I have to take back my previous comment: |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello - i try to get the text out of the attached pdf-file using the following code:
But i only get this text:
Is there any way to get the text out of this pdf-file?
test.pdf
Beta Was this translation helpful? Give feedback.
All reactions