|
Thanks for posting the question.
I am not particularly familiar with StAX, so there might be ways to refer to tokens after the StAX parser consumes the entire document.
For "non-extractive" parsing, encoded characters can be decoded on the fly when compared against a Java string.
Entities (especially built-in ones) can also be resolved on the fly during comparison.
For "ne_parseInt(String s1, int offset, int length)," one may get the reference to the character array internally or retrieve individual character by the member methods charAt(int).
|