Table of Contents
KWord uses two open source, independently developed standards for its file format. The combination was chosen for its balance between convenience and open development models.
First, it should be noted that all KWord files are multiple XML files that are compressed to reduce their space requirements.
Select the KWord version you are interested in:
The XML files are compressed into a single file using the same algorithm as used by tar.
You can uncompress the files with the following command:
%
tar -xzvf
filename
This will expand the KWord document file into its component files.
The text portion of all KWord files are XML (eXtensible Markup Language) files.
For more information on XML documents, processors and technology, please visit
World Wide Web Consortium XML pages |
XML.org Resource Guide |
The XML FAQ |
All KWord documents consist of at least two XML files:
maindoc.xml
This file contains the bulk of the KWord text, tables and formula information. It is marked with XML tags according to the official DTD. A copy of the KWord 1.1 DTD is located at: http://www.koffice.org/DTD/kword-1.1.dtd.
documentinfo.xml
This file contains the document information. This is information entered into the dialog boxes when selecting -> from the menubar. This information is useful for tracking authors, contact information etc.
The DTD for KOffice 1.1 is located at: http://www.koffice.org/DTD/document-info-1.1.dtd.
In addition, there may be other files included in the KWord document file. Pictures, embedded documents and other binary information are stored within the KWord document as separate files.
For more specific information on KWord file storage or other internal information, please see The KOffice API and the General TDE developer information pages.
The text files are compressed into a single file using the same algorithm as used by zip. This change was made because of its broad use in other open source office suites and its improved performance with lower memory requirements.
You can uncompress the files with the following command:
%
unzip
filename
This will expand the KWord document file into its component files.
The text portion of all KWord files are XML (eXtensible Markup Language) files.
For more information on XML documents, processing and technology, please visit
World Wide Web Consortium XML pages |
XML.org Resource Guide |
The XML FAQ |
All KWord documents consist of at least three files:
maindoc.xml
This file contains the bulk of the KWord text, tables and formula information. It is marked with XML tags according to the official DTD.
A copy of the KWord 1.2 DTD is located at: http://www.koffice.org/DTD/kword-1.2.dtd.
documentinfo.xml
This file contains the document information. This is information entered into the dialog boxes when selecting -> from the menubar. This information is useful for tracking authors, contact information etc.
The DTD for KOffice 1.2 is located at: http://koffice.kde.org/DTD/document-info-1.2.dtd.
mimetype
This file contains the mimetype for KWord files. This information is used by TDE to determine that this is a KWord file.
This file always contains: application/x-kword
In addition, there may be other files included in the KWord document file. Pictures, embedded documents and other binary information are stored within the KWord document as separate files.
For more specific information on KWord file storage or other internal information, please see The KOffice API and the General TDE developer information pages.
The text files are compressed into a single file using the same algorithm as used by zip. This change was made because of its broad use in other open source office suites and its improved performance with lower memory requirements.
You can uncompress the files with the following command:
%
unzip
filename
This will expand the KWord document file into its component files.
The text portion of all KWord files are XML (eXtensible Markup Language) files.
For more information on XML documents, processing and technology, please visit
World Wide Web Consortium XML pages |
XML.org Resource Guide |
The XML FAQ |
All KWord documents consist of at least three files:
maindoc.xml
This file contains the bulk of the KWord text, tables and formula information. It is marked with XML tags according to the official DTD.
A copy of the KWord 1.3 DTD is located at: http://www.koffice.org/DTD/kword-1.3.dtd.
documentinfo.xml
This file contains the document information. This is information entered into the dialog boxes when selecting -> from the menubar. This information is useful for tracking authors, contact information etc.
The DTD for KOffice 1.3 is located at: http://koffice.kde.org/DTD/document-info-1.3.dtd.
mimetype
This file contains the mimetype for KWord files. This information is used by TDE to determine that this is a KWord file.
This file always contains: application/x-kword
In addition, there may be other files included in the KWord document file. Pictures, embedded documents and other binary information are stored within the KWord document as separate files.
For more specific information on KWord file storage or other internal information, please see The KOffice API and the General TDE developer information pages.
Would you like to comment or contribute an update to this page?
Send feedback to the TDE Development Team