New font [Source] |
Old font [Source] |
The font can be downloaded from SIPA - however use that link with care, as apparently a spammer was already able to insert his malicious links at the top of the article. I cannot confirm if the install wasn't replaced with malware spreading one.
The biggest advantage of this new font is however that the PDFs now contain real Unicode text. With the previous font, it was almost Unicode, but some of the vowels and tone marks were using private Unicode characters, probably to make these characters be positioned correctly on the higher consonants. But then when copying the text to clipboard, or trying to let Google translate such a PDF completely, fails as then many words loose their vowels and the translation turns into glibbersh. But that was already an improvement to the very first full digitally PDFs found in the Royal Gazette website, as those were done with a Mac and had a 8bit codepage instead - making it almost impossible to get the Thai text into someting readable. For a little comparison see the following table:
- 2001-2004: Example PDF, Google translate
- 2005-2010: Example PDF, Google translate
- 2010-12-13 onwards: Example PDF, Google translate
No comments:
Post a Comment