Davisor Offisor provides pure Java implementation for going directly from Word doc to XML (no "save as RTF" needed). Offisor can be used from command line or through API. There are also several XSL-T examples for XSL-FO, Docbook and XHTML. Learn more from Davisor Offisor pages.