Features: HTML Conversion

The "Docx2HTML version," which specializes in converting docx files to HTML, was added in Office Server Document Converter V9.0.

Docx2HTML Features

Docx2HTML converts docx files edited and saved in Microsoft Word into simple, easy-to-edit HTML, so you can create Web pages from documents written in the familiar Microsoft Word environment.

Proprietary program to convert Word documents to HTML

  • An independent conversion engine that analyzes docx files edited and saved in Microsoft Word and converts them to HTML.

  • Conversion does not require Microsoft Word to be installed.

Create simple HTML free of extraneous tags

  • Converts Word documents to accurate, easy-to-edit HTML that conforms to the HTML standard specifications, unlike Microsoft Word's built-in HTML output feature.

  • Only the content (HTML) is generated as output, so it is easier to edit and manage, and can be easily incorporated into existing Web pages.

Vary the HTML output by specifying options

Command-line conversion can be controlled with conversion option parameters.

Here are some examples of command line options:

  • -emptyP

    By default, empty lines (lines with only line breaks) in Word are ignored in HTML output. If this option is specified, <p> </p> tags are output for them. The right example below has this option enabled.





  • -nonrefid

    Repeated editing in Word leaves many unreferenced IDs in a docx file. By default, Docx2HTML scans the document file and removes unreferenced IDs internally. If this option is specified, they are not removed. The right example below has this option enabled.





  • -endl

    This option outputs a line break at the end of each block tag, which makes the output easier to edit and manage. The left example below has this option enabled.





  • -xhtml

    By default, tags are output in HTML syntax. If this option is specified, tags are output in XML syntax. The right example below has this option enabled.
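
The options in this list can be combined in one invocation. Separately from the product's own illustrations, the following is a minimal Python sketch of how a script might collect the flags before launching the converter. Only the four flag strings come from this page; the function name `option_flags` and its keyword names are hypothetical, and the converter's executable name and the way input and output files are passed are deliberately left out because this page does not state them.

```python
from typing import List

# Flag strings are taken from the option list on this page.
# The keyword names on the left are invented for this sketch.
FLAGS = {
    "empty_p": "-emptyP",                   # emit <p> </p> for empty lines
    "keep_unreferenced_ids": "-nonrefid",   # do not remove unreferenced IDs
    "endl": "-endl",                        # line break at the end of block tags
    "xhtml": "-xhtml",                      # output tags in XML syntax
}


def option_flags(**options: bool) -> List[str]:
    """Return the command-line flags for every option set to True."""
    unknown = sorted(set(options) - set(FLAGS))
    if unknown:
        raise ValueError(f"unknown option(s): {unknown}")
    # Iterate over FLAGS so the flag order is stable regardless of call order.
    return [flag for name, flag in FLAGS.items() if options.get(name, False)]


if __name__ == "__main__":
    print(option_flags(xhtml=True, endl=True))
```

The returned list can then be spliced into whatever command line the product manual prescribes, for example as part of the argument list given to `subprocess.run`.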




Conversion Option Setting File

  • The conversion option setting file stores parameter values for conversion options.

  • When executing the command line, specify the name of the conversion option setting file to be read.

  • If you want to repeat a conversion with the same settings, specify a conversion option setting file instead of specifying options each time.

  • Example usage of a conversion option setting file:

    (In the following example, the file is named settings.xml)


    Command-line parameter:

    -settings settings.xml

    Contents of settings.xml:

    <?xml version="1.0"?>
        <enable-XHTML enable="true"/>
        <enable-endl enable="true"/>
        <enable-empty-paragraph enable="false"/>
        <enable-non-reference-id enable="false"/>
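
When the same settings are reused across many conversions, the setting file itself can be generated by a script. The following Python sketch writes the four elements in the same layout as the fragment above. The element names and the `enable` attribute come from that fragment; the function name `render_settings` is hypothetical. Note that the fragment shows no enclosing root element, and this sketch does not add one, so the product manual should be checked for the complete file format.

```python
# Element names copied from the settings fragment on this page.
ELEMENTS = (
    "enable-XHTML",
    "enable-endl",
    "enable-empty-paragraph",
    "enable-non-reference-id",
)


def render_settings(values: dict) -> str:
    """Render the settings lines; elements not listed in `values` default to false."""
    unknown = sorted(set(values) - set(ELEMENTS))
    if unknown:
        raise ValueError(f"unknown element(s): {unknown}")
    lines = ['<?xml version="1.0"?>']
    for name in ELEMENTS:
        state = "true" if values.get(name, False) else "false"
        lines.append(f'    <{name} enable="{state}"/>')
    # Assumption: no root element is emitted, mirroring the fragment shown above.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    text = render_settings({"enable-XHTML": True, "enable-endl": True})
    with open("settings.xml", "w", encoding="utf-8") as handle:
        handle.write(text)
```

Keeping the element names in one tuple means a misspelled setting is rejected before a file is written.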


Command-line, .NET, and Java interfaces are available for embedding, so the converter can be integrated into applications easily.


  • .doc format documents cannot be converted.

  • Folders and files on Microsoft's OneDrive cannot be specified as the target or destination for conversion.

  • This product discards all layout specifications in a Word-created document and represents content as pure HTML tags.

  • Settings such as table widths, ruled-line thicknesses, backgrounds, and text alignment within tables are ignored. Specify these in CSS for the output HTML.
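
Because the converter emits only content and leaves presentation to CSS, a common follow-up step is to place the output inside a page that supplies the styles. The following Python sketch shows one way to do that, assuming the converter output is an HTML fragment suitable for a page body. The function name `wrap_fragment` and the sample table rules are illustrative, not part of the product.

```python
from html import escape

# Illustrative table styling that restores borders and alignment
# discarded during conversion; adjust to match the target site.
DEFAULT_CSS = """\
table { border-collapse: collapse; width: 100%; }
th, td { border: 1px solid #666; padding: 4px 8px; text-align: left; }
"""


def wrap_fragment(fragment: str, title: str, css: str = DEFAULT_CSS) -> str:
    """Return a complete HTML page containing `fragment` and an inline stylesheet."""
    return (
        "<!DOCTYPE html>\n"
        "<html>\n"
        "<head>\n"
        '<meta charset="utf-8">\n'
        f"<title>{escape(title)}</title>\n"   # the title is escaped; the fragment is not
        f"<style>\n{css}</style>\n"
        "</head>\n"
        f"<body>\n{fragment}\n</body>\n"
        "</html>\n"
    )


if __name__ == "__main__":
    sample = "<table><tr><td>Q1</td><td>42</td></tr></table>"
    print(wrap_fragment(sample, "Report & Summary"))
```

The fragment is inserted unchanged, so any markup the converter produced is preserved exactly; only the page title is escaped.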