I’m ETLing html pieces from one platform (origin – EPIC) to another (destination – GHC store) and the source platform rejects (on validation) “Unicode character points below 32”. The origin has “U+0010” though.
I see that U+0010 is “Data Link Escape” (https://codepoints.net/U+0010?lang=en),
but I can’t really figure out if this symbol even effects somehow the “look and feel” when this HTML is presented (I was trying to put it in HTML and see it in a browser and couldn’t figure out any special effects – looks like space to me).
SO my intuition is just to replace it by a space character, but wanted to hear from the community if I miss anything and there is a better way to deal with it.
I can’t change the destination platform validation rules to accept this character, neither how it used once loaded to the destination platform – the only thing I have control over is the ETL itself.