Microsoft Office XML formats


The Microsoft Office XML formats are XML-based document formats introduced in versions of Microsoft Office prior to Office 2007. Microsoft Office XP introduced an XML format for storing Excel spreadsheets (XML Spreadsheet), and Office 2003 added an XML-based format for Word documents (WordProcessingML).
These formats were succeeded by Office Open XML in Microsoft Office 2007.

File formats

Beyond the differences in their schemas, the earlier Office XML formats differ from Office Open XML in several other ways.
  • Office Open XML documents are stored as multiple parts compressed in a ZIP file conforming to the Open Packaging Conventions, so the header, footer, comments and similar items of a document are all stored separately. Microsoft Office XML documents, by contrast, are stored as single monolithic plain XML files, and embedded items such as pictures are stored as binary-encoded blocks within the XML.
  • XML Spreadsheet documents cannot store Visual Basic for Applications macros, auditing tracer arrows, charts and other graphic objects, custom views, drawing object layers, outlining, scenarios, shared workbook information and user-defined function categories. In contrast, the newer Office Open XML formats support full document fidelity.
  • The Office XML formats have poor backward compatibility with versions of Word and Excel earlier than the one in which they were introduced. For example, Word 2002 cannot open Word 2003 XML files unless a third-party converter add-in is installed. Microsoft has released a Word 2003 XML Viewer, which allows WordProcessingML files saved by Word 2003 to be viewed as HTML from within Internet Explorer. For Office Open XML, Microsoft provides converters for Office 2003, Office XP and Office 2000.
  • Office Open XML formats are also defined for PowerPoint 2007, equation editing, vector drawing, charts and text art.
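
The packaging difference described in the list above can be checked programmatically. The following Python sketch is a heuristic only, not an official detection method: the function name `sniff_office_format` and its return labels are made up for illustration. It assumes that an Office Open XML file is a ZIP package containing a `[Content_Types].xml` part, and that a file saved in one of the earlier formats starts with an XML declaration followed by an `mso-application` processing instruction, as Word 2003 and Excel normally write it.

```python
import io
import zipfile


def sniff_office_format(data: bytes) -> str:
    """Classify raw file bytes as 'office-open-xml', 'office-2003-xml' or 'unknown'.

    Hypothetical helper for illustration; the labels are not standard names.
    """
    buffer = io.BytesIO(data)
    # Office Open XML: a ZIP package following the Open Packaging Conventions,
    # which always contains a [Content_Types].xml part.
    if zipfile.is_zipfile(buffer):
        with zipfile.ZipFile(buffer) as package:
            if "[Content_Types].xml" in package.namelist():
                return "office-open-xml"
        return "unknown"
    # Earlier Office XML formats: one plain XML file. Skip a UTF-8 byte order
    # mark and leading whitespace, then look for the processing instruction
    # that names the owning application (assumed to be present).
    head = data[:1024].lstrip(b"\xef\xbb\xbf").lstrip()
    if head.startswith(b"<?xml") and b"<?mso-application" in head:
        return "office-2003-xml"
    return "unknown"


if __name__ == "__main__":
    sample = (b'<?xml version="1.0"?>\n'
              b'<?mso-application progid="Word.Document"?>\n'
              b'<w:wordDocument/>')
    print(sniff_office_format(sample))
```

A file saved without the processing instruction, or in an encoding other than UTF-8, would be reported as unknown by this sketch, so a real tool would need a more thorough check.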

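Word XML format example

    <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
    <?mso-application progid="Word.Document"?>
    <w:wordDocument
       xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml"
       xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint"
       xmlns:o="urn:schemas-microsoft-com:office:office"
       w:macrosPresent="no"
       w:embeddedObjPresent="no"
       w:ocxPresent="no"
       xml:space="preserve">
      <o:DocumentProperties>
        <o:Title>This is the title</o:Title>
        <o:Author>Darl McBride</o:Author>
        <o:LastAuthor>Bill Gates</o:LastAuthor>
        <o:Revision>1</o:Revision>
        <o:TotalTime>0</o:TotalTime>
        <o:Created>2007-03-15T23:05:00Z</o:Created>
        <o:LastSaved>2007-03-15T23:05:00Z</o:LastSaved>
        <o:Pages>1</o:Pages>
        <o:Words>6</o:Words>
        <o:Characters>40</o:Characters>
        <o:Company>SCO Group, Inc.</o:Company>
        <o:Lines>1</o:Lines>
        <o:Paragraphs>1</o:Paragraphs>
        <o:CharactersWithSpaces>45</o:CharactersWithSpaces>
        <o:Version>11.6359</o:Version>
      </o:DocumentProperties>
      <w:fonts>
        <w:defaultFonts
           w:ascii="Times New Roman"
           w:fareast="Times New Roman"
           w:h-ansi="Times New Roman"
           w:cs="Times New Roman" />
      </w:fonts>
      <!-- ... style definitions omitted ... -->
      <w:body>
        <wx:sect>
          <w:p>
            <w:r>
              <w:t>This is the first paragraph</w:t>
            </w:r>
          </w:p>
          <wx:sub-section>
            <w:p>
              <w:pPr>
                <w:pStyle w:val="Heading1" />
              </w:pPr>
              <w:r>
                <w:t>This is a heading</w:t>
              </w:r>
            </w:p>
          </wx:sub-section>
          <w:sectPr>
            <w:pgMar
               w:top="1440"
               w:right="1800"
               w:bottom="1440"
               w:left="1800"
               w:header="720"
               w:footer="720"
               w:gutter="0" />
          </w:sectPr>
        </wx:sect>
      </w:body>
    </w:wordDocument>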

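Because a Word 2003 XML document is a single plain XML file, an ordinary XML parser is enough to read it. The Python sketch below shows one possible way to pull the paragraph text out of such a file using the standard library. The `SAMPLE` document is a cut-down stand-in written for this illustration, and `paragraph_texts` is a hypothetical helper name; only the `w` namespace URI and the `w:p`, `w:r` and `w:t` elements come from the format itself.

```python
import xml.etree.ElementTree as ET

# Namespace URI of the Word 2003 XML vocabulary (the "w" prefix).
W = "http://schemas.microsoft.com/office/word/2003/wordml"

# Cut-down sample document, written for this illustration only.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<?mso-application progid="Word.Document"?>
<w:wordDocument xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml">
  <w:body>
    <w:p><w:r><w:t>This is the first paragraph</w:t></w:r></w:p>
    <w:p><w:r><w:t>This is a heading</w:t></w:r></w:p>
  </w:body>
</w:wordDocument>"""


def paragraph_texts(xml_bytes: bytes) -> list[str]:
    """Return the text of each w:p element, joining the w:t runs inside it."""
    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for paragraph in root.iter(f"{{{W}}}p"):
        runs = [t.text or "" for t in paragraph.iter(f"{{{W}}}t")]
        paragraphs.append("".join(runs))
    return paragraphs


if __name__ == "__main__":
    for text in paragraph_texts(SAMPLE):
        print(text)
```

The sketch ignores styles, tables and embedded binary data; it only illustrates that no unpacking step is needed before parsing.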
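Excel XML spreadsheet example

    <?xml version="1.0" encoding="UTF-8"?>
    <?mso-application progid="Excel.Sheet"?>
    <Workbook xmlns="urn:schemas-microsoft-com:office:spreadsheet"
              xmlns:ss="urn:schemas-microsoft-com:office:spreadsheet">
      <Worksheet>
        <Table>
          <Row>
            <Cell><Data ss:Type="String">ID</Data></Cell>
            <Cell><Data ss:Type="String">Project</Data></Cell>
            <Cell><Data ss:Type="String">Reporter</Data></Cell>
            <Cell><Data ss:Type="String">Assigned To</Data></Cell>
            <Cell><Data ss:Type="String">Priority</Data></Cell>
            <Cell><Data ss:Type="String">Severity</Data></Cell>
            <Cell><Data ss:Type="String">Reproducibility</Data></Cell>
            <Cell><Data ss:Type="String">Product Version</Data></Cell>
            <Cell><Data ss:Type="String">Category</Data></Cell>
            <Cell><Data ss:Type="String">Date Submitted</Data></Cell>
            <Cell><Data ss:Type="String">OS</Data></Cell>
            <Cell><Data ss:Type="String">OS Version</Data></Cell>
            <Cell><Data ss:Type="String">Platform</Data></Cell>
            <Cell><Data ss:Type="String">View Status</Data></Cell>
            <Cell><Data ss:Type="String">Updated</Data></Cell>
            <Cell><Data ss:Type="String">Summary</Data></Cell>
            <Cell><Data ss:Type="String">Status</Data></Cell>
            <Cell><Data ss:Type="String">Resolution</Data></Cell>
            <Cell><Data ss:Type="String">Fixed in Version</Data></Cell>
          </Row>
          <Row>
            <Cell><Data ss:Type="String">0000033</Data></Cell>
            <Cell><Data ss:Type="String">CognaLearn Intedashboard</Data></Cell>
            <Cell><Data ss:Type="String">janardhana.l</Data></Cell>
            <Cell><Data ss:Type="String"></Data></Cell>
            <Cell><Data ss:Type="String">normal</Data></Cell>
            <Cell><Data ss:Type="String">text</Data></Cell>
            <Cell><Data ss:Type="String">always</Data></Cell>
            <Cell><Data ss:Type="String"></Data></Cell>
            <Cell><Data ss:Type="String">GUI</Data></Cell>
            <Cell><Data ss:Type="String">2016-10-14</Data></Cell>
            <Cell><Data ss:Type="String"></Data></Cell>
            <Cell><Data ss:Type="String"></Data></Cell>
            <Cell><Data ss:Type="String"></Data></Cell>
            <Cell><Data ss:Type="String">public</Data></Cell>
            <Cell><Data ss:Type="String">2016-10-14</Data></Cell>
            <Cell><Data ss:Type="String">IE8 browser_Modules screen tool tip text is shown twice</Data></Cell>
            <Cell><Data ss:Type="String">new</Data></Cell>
            <Cell><Data ss:Type="String">open</Data></Cell>
            <Cell><Data ss:Type="String"></Data></Cell>
          </Row>
        </Table>
      </Worksheet>
    </Workbook>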
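
An XML Spreadsheet file can be read in the same way as any other XML document. The Python sketch below is one possible approach, not a complete reader: it treats the first `Row` as a header and turns each later row into a dictionary. The `SAMPLE` workbook is a shortened stand-in written for this illustration, and `read_rows` and `as_records` are hypothetical helper names. The sketch also assumes that every cell is written out explicitly, so it does not handle cells that are skipped with the `ss:Index` attribute.

```python
import xml.etree.ElementTree as ET

# Namespace URI of the XML Spreadsheet vocabulary.
SS = "urn:schemas-microsoft-com:office:spreadsheet"

# Shortened sample workbook, written for this illustration only.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<?mso-application progid="Excel.Sheet"?>
<Workbook xmlns="urn:schemas-microsoft-com:office:spreadsheet"
          xmlns:ss="urn:schemas-microsoft-com:office:spreadsheet">
  <Worksheet>
    <Table>
      <Row>
        <Cell><Data ss:Type="String">ID</Data></Cell>
        <Cell><Data ss:Type="String">Assigned To</Data></Cell>
        <Cell><Data ss:Type="String">Status</Data></Cell>
      </Row>
      <Row>
        <Cell><Data ss:Type="String">0000033</Data></Cell>
        <Cell><Data ss:Type="String"></Data></Cell>
        <Cell><Data ss:Type="String">new</Data></Cell>
      </Row>
    </Table>
  </Worksheet>
</Workbook>"""


def read_rows(xml_bytes: bytes) -> list[list[str]]:
    """Return every Row as a list of cell strings (empty cells become '')."""
    root = ET.fromstring(xml_bytes)
    rows = []
    for row in root.iter(f"{{{SS}}}Row"):
        cells = []
        for cell in row.findall(f"{{{SS}}}Cell"):
            data = cell.find(f"{{{SS}}}Data")
            cells.append("" if data is None or data.text is None else data.text)
        rows.append(cells)
    return rows


def as_records(rows: list[list[str]]) -> list[dict[str, str]]:
    """Treat the first row as the header and map each later row onto it."""
    header, *body = rows
    return [dict(zip(header, row)) for row in body]


if __name__ == "__main__":
    print(as_records(read_rows(SAMPLE)))
```

All values are returned as strings, which preserves details such as the leading zeros in the ID column.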