53

Microsoft Excel's ".xlsx" files are zip files that each contain a set of files.

Could someone please provide a link that concisely describes the full structure/syntax/markup/format of the embedded .xml files (the headers are less interesting)?

For example, it's hard to find online explanations on what the c, t, and s elements represent.

Mauricio Cortazar
  • 4,049
  • 2
  • 17
  • 27
none
  • 4,669
  • 14
  • 62
  • 102
  • 2
    Have you seen [Microsoft Office XML Formats](https://secure.wikimedia.org/wikipedia/en/wiki/Microsoft_Office_XML_formats) and [Office Open XML](https://secure.wikimedia.org/wikipedia/en/wiki/Office_Open_XML) ? Both of these links contain reference links to the content you probably want. – Nic Gibson Feb 03 '11 at 12:31
  • yes , i googled it a lot. nothing useful – none Feb 03 '11 at 12:33
  • what did you google *for*? I think you might be looking up the wrong things. – Nic Gibson Feb 03 '11 at 13:34
  • 12
    I agree with the OP, this format is not well documented. It's a reasonably simple format (in essence at least), but I could not find any decent tutorial-style documentation. Was able to figure out myself, with a bit of help from stackoverflow. Did not bother to read offical docs since they are ridiculously large, must have been written by complete chuckle-heads. I might blog on it, then link to my post here as a preferable answer. – Sam Watkins Jun 26 '12 at 09:59
  • 2
    The documentation for all ms office files can be found [here](http://www.ecma-international.org/publications/standards/Ecma-376.htm). – Matti Virkkunen Feb 03 '11 at 12:19
  • This document has some of the more Excel specific markup used in the files [MS-XLSX](http://msdn.microsoft.com/en-us/library/dd922181.aspx) – Jimmy Feb 03 '11 at 12:23
  • 2
    You can find a useful description of the documentation here - [part 1](http://blogs.msdn.com/b/chrisrae/archive/2010/09/25/where-is-the-documentation-for-office-2010-s-docx-xlsx-pptx-formats.aspx) and [part 2](http://blogs.msdn.com/b/chrisrae/archive/2010/10/06/where-is-the-documentation-for-office-s-docx-xlsx-pptx-formats-part-2-office-2010.aspx). This is a good starting point for diving into the standard by Chris Rae. – Vladimir Grigorov Sep 25 '12 at 12:44

2 Answers2

41

This PowerPoint deck, entitled "SpreadsheetML Basics", from Microsoft, provides a good overview. It is a 30-page presentation including diagrams, and it is not practical to include its contents here.

Four different stackoverflow reviewers have jumped on this post to delete it because the stackoverflow policy objects to simple links. I do not think that is appropriate to this question, because this document is exactly what has been requested. However I cannot be bothered to spend more time on it and (for example) spend an hour trying to summarise some portions to make them happy. If they want to delete again, then good luck to them.

Oliver Bock
  • 4,829
  • 5
  • 38
  • 62
6

This is an old question -- but the Excel XML (ISO/IEC 29500) standard is now well documented in varying degrees of detail. For example see this and this and parsing discussion here.

ashleedawg
  • 20,365
  • 9
  • 72
  • 105
  • 8
    It's almost 2018, and the claims by the OP are still true, IMHO. If you want to write a XLSX parser, you don't want to read the several-thousand pages specification. And the other links you posted are quite incomplete (no mention of the "f" value for formulas, for example). A fairly complete description of the XLSX format could be written in 10 pages or so (at least containing all the fields and values for the 95% of XLSX files out there). But there's no such document (at least I didn't find anything close to it). – cesss Dec 29 '17 at 00:14
  • I have a related problem. I have generated an Open XML document using Ruby on Rails. I am able to open the generated XML in notepad and view the XML which looks all good. But I am not able to open it in Excel software. Would you be able to help me with this please? I have posted the issue in detail here - https://stackoverflow.com/questions/56038034/how-to-save-and-open-an-open-xml-document-as-excel-file – Biju May 09 '19 at 03:08
  • If you could ignore the Ruby on Rails aspect and just give some pointers as to how to save an Open XML document as Excel, that'd be VERY helpful starter! Currently I have no clue what am missing. – Biju May 09 '19 at 03:09
  • In simple words, how do I open an Office XML document in Excel software? – Biju May 09 '19 at 05:50