Comment Re:Maybe once he figures on /. (Score 1) 103
So the site has a Unicode whitelist of supported characters - basically the printable ASCII set, and as a protection against hacks, it also strips off the high bit.
So knowing very little about unicode but putting my PHB hat on - does this mean that all that needs to happen is some junior developer identifies the top 80% of unicode characters used in comments in the past year (looking at comments rated 4 or 5 to avoid the spam), do a quick manual review and then add them to the whitelist?
That sounds like a job which would take no more than about half a day - followed by whatever time it takes for the usual testing and deployment process.