Oxygen XML Author Eclipse plugin integrates the entire DITA for Publishers plugins suite and provides some
possibilities for migrating content from Microsoft Office® (and other Office-type formats) to
DITA. There are also possibilities for migrating various other types of formats. For more
information, see Migrating Various Document Formats to and from DITA.
Migration from Office-type formats to XML is rarely perfect and manual changes may need to be
made to the converted content, but the methods described below should help you find the best
approach for your particular case.
Smart Paste (Single Document)
- Open the document in MS Office (or other similar application), select all the content,
and copy it.
- Open Oxygen XML Author Eclipse plugin and create a new DITA topic.
- Paste the selected content in Author mode. The Smart Paste
functionality will attempt to convert the content to DITA structure.
HTML to DITA (Single Document)
- Save your document as HTML.
- Once you have converted it to HTML, you have several possibilities:
- In Oxygen XML Author Eclipse plugin, select to import it as XHTML. Then, open the XHTML in Oxygen XML Author Eclipse plugin
and use one of the XHTML to DITA transformation scenarios to convert
the content to DITA structure.
- Open the HTML file in any web browser, select all of its content, and copy it. Then,
open Oxygen XML Author Eclipse plugin, create a new DITA topic, and paste the selected
content in Author mode. The Smart Paste
functionality will attempt to convert the HTML content to DITA
structure.
Word to LibreOffice to DITA (Single Document)
- Open the document in the LibreOffice application and save it as DocBook.
- Open the DocBook document in Oxygen XML Author Eclipse plugin.
- Run the built-in DocBook to DITA transformation scenario.
- You may need to make some manual adjustments for elements that could not be
mapped.
Word to DITA using DITA For Publishers (Single Document)
- Save the document in the MS Word DOCX format.
- Open it in the Archive Browser view in Oxygen XML Author Eclipse plugin and
then open the document.xml file contained in the archive.
- Run the built-in DOCX DITA transformation scenario. This
scenario runs a build file over the DOCX archive and should produce a DITA project that
contains a DITA map and multiple topics.
- You may need to do some manual reconfiguration to map DOCX styles to DITA content. The
XSLT conversion is part of the DITA For Publishers plugin and there is
documentation for it available here: http://www.dita4publishers.org/d4p-users-guide/user_docs/d4p-users-guide/word2dita/word2dita-intro.html.
Word to DocBook to DITA (Multiple Documents)
Migrating Excel and Other Types of Spreadsheets to DITA
It is possible to convert Microsoft Excel (or other similar types of documents) to DITA. To
do this, copy the spreadsheet content and paste it in an open DITA topic in
Author mode. The Smart Paste
functionality will attempt to convert the content to DITA structure.
Resources
For more information about migrating to DITA, see the following
resources: