|
Thanks for the posting.
Yes, you have some very good points. The purpose of using those example is to demonstrate the concept, not necessarily to offer details of implementation.
In fact there are lots of things uncovered by this article:
1. encoding: how to compare a UCS-2 string to a UTF-8 or UTF-16 encoded token
2. entity reference: to compare a UCS-2 string to a token containing entities such as (#&s;)
3. Normalized compare: to compare a UCS-2 string against the normalized UC2 view of the token
overall, I am only trying to offer a starting point, definitely not a whole picture.
Thanks again,
Jimmy
|