Bug #1071
closedIllegal XML characters in EML modules
0%
Description
One of the people working with Christy Bowles creatd a package and then could
not open it. A look at the dataset module (knb2003.67.1) showed that several
unusual ASCII control characters appeared within the XML data (hex characters
'19', '1C', and '1D'). This caused a parsing error in Morpho. Removing these
characters fixed the error opening the package, but it is unknown how these
ASCII control characters got in the package in the first place!
We probably need to filter out all but CR/LF control characters.
Updated by Dan Higgins over 21 years ago
Apparently, the source of the 'strange' control characters is from pasting text
from a PDF file into the editor text boxes!
Need to filter ASCII characters with values less than 32.
Updated by Dan Higgins over 21 years ago
added filter to 'normalize' method to remove any characters with values less
than 32 except for CR and LF (10 and 13 decimal).