|
I'd like to better understand the relationship between the original XML document and the mapped xml:tm document you illustrate in Figure 1.
Does xml:tm assume that people create content in XML using customized document types or in xml:tm itself? If the former, then I assume that moving from my own document type to xml:tm is a straight XSLT transformation. In that case, I have a question about sentence segmentation.
Most customized document types do not use markup to distinguish sentences. Does xml:tm assume that the source document type will segment sentences? (Does the <text> element in Figure 1 perform the function of segmenting sentences in the source document type?)
Also, the PDF at http://www.xml-intl.com/docs/xml-tm-whitepaper.pdf appears to be broken: when I tried to download it, I got a seven-page document with nothing in it and a locked-up web browser.
|