Guest post: Manuel López-Ibáñez on (open?) document formats

I am always interested in people working on, or writing about, the tools we use every day. As translators, we do not get to pick our document format of choice. Wouldn’t life be easier if people used compatible formats? Ah, Utopia, we are walking towards you…

Let me introduce this (very witty) short article that my friend Manuel López-Ibáñez has written on the subject. He is currently pursuing a PhD at the School of the Built Environment of Napier University in Edinburgh (UK).
Feel free to agree, comment (or even disagree!) in the comments section. It is moderated exclusively to get rid of spam.

Microsoft’s new trap: don’t get caught!

Perhaps you have heard of a free (both free as in beer and free as in
speech) office suite called OpenOffice [1]. For those who haven’t, in
short, OpenOffice is like Word, Excel and PowerPoint, but different.

Apart from being software libre [2] and gratis, another interesting
feature of OpenOffice is that its document formats are standardized. A
document format describes how a document (like a presentation) is
saved on your hard drive. For a program to read your presentation and
show it on your screen, it must perfectly understand the
presentation’s format. OpenOffice uses a format that is based on a
cool and modern technology called XML.

The very best bit is that the OpenOffice XML format is an
international standard called the *OpenDocument* format. Being an
international standard means that the complete description of the
format is publicly available and everybody is welcome to use this
description in their own software. Actually, you are encouraged to
follow the description exactly; otherwise you should not say that your
program handles OpenDocument formats.
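
To make this a bit more concrete: an OpenDocument file is a ZIP archive,
and the text of the document lives in an XML file inside it called
content.xml. The Python sketch below is only an illustration under
stated assumptions. The helper names (make_toy_document,
read_paragraphs) are made up for this example, and the toy file it
builds is a simplified stand-in, not a complete, valid OpenDocument
file (a real one contains more parts, such as a manifest and styles).

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# XML namespaces defined by the OpenDocument specification.
OFFICE_NS = "urn:oasis:names:tc:opendocument:xmlns:office:1.0"
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"


def make_toy_document(paragraphs):
    """Build a tiny, simplified stand-in for an .odt file in memory.

    Hypothetical helper for this example only: a real document has
    more parts than the two entries written here.
    """
    root = ET.Element(f"{{{OFFICE_NS}}}document-content")
    body = ET.SubElement(root, f"{{{OFFICE_NS}}}body")
    text = ET.SubElement(body, f"{{{OFFICE_NS}}}text")
    for paragraph in paragraphs:
        ET.SubElement(text, f"{{{TEXT_NS}}}p").text = paragraph

    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("mimetype", "application/vnd.oasis.opendocument.text")
        archive.writestr("content.xml", ET.tostring(root, encoding="utf-8"))
    return buffer.getvalue()


def read_paragraphs(odt_bytes):
    """Return the text of every paragraph element found in content.xml."""
    with zipfile.ZipFile(io.BytesIO(odt_bytes)) as archive:
        root = ET.fromstring(archive.read("content.xml"))
    return ["".join(p.itertext()) for p in root.iter(f"{{{TEXT_NS}}}p")]


toy = make_toy_document(["Hello, Utopia!", "Nothing is hidden here."])
print(read_paragraphs(toy))
```

Nothing here is specific to OpenOffice: any program, in any language,
that can open a ZIP archive and parse XML could do the same, which is
the practical meaning of the description being public.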

On the other hand, Word, Excel and PowerPoint use closed, proprietary,
non-standard formats. Only Microsoft knows how they work. If you want
your program to use those formats, you have to pay Microsoft to see
the description and sign a document stating that you are not going to
make that information public before hell freezes over, and perhaps not
even after that. Microsoft may decide that they don’t like your
program (or you, or your country), and refuse to give you the
description of Word documents. Your only chance then is to «guess» how
Word documents work. And it is amazingly difficult to guess such
things. And yet, OpenOffice is able to handle Word documents almost
perfectly.

Moreover, nobody knows what is going on in a Word document: you may
think that you have deleted something but it might be just hidden
there. This last part may sound like conspiracy mumbo-jumbo, or like
theoretical things that never happen in practice [3]. Go tell that to
Alastair Campbell [4]. All these cases have been discovered by people
while trying to guess how Microsoft formats work. Microsoft knows
exactly what is going on, which information is hidden and how to
recover it. That any government is using such documents to transfer
sensitive information is just creepy.

For the reasons above, OpenDocument, the format used by OpenOffice and
many other programs, is very interesting. It is so interesting that
Microsoft didn’t want the name of the format to include the word
OpenOffice. So Microsoft lobbied the standards committee to change the
original name of «OpenOffice Document Format» into OpenDocument.
Well, this doesn’t actually seem such an evil thing to do, does it? It
is a standard format. It would be nice to honor its inventors, that
is, OpenOffice, but it is also fine if the name is not associated with
any particular program, since anybody is welcome to use it.

Interestingly, Microsoft is going to change its document formats for
Microsoft Office 2007 and they have decided to also use XML
technology. But instead of using the standard OpenDocument format,
Microsoft is going to use their own closed, non-standard format:
Office Open XML. I actually needed to read this again to fully grasp
it:

1) Microsoft lobbied to change the name of the public standard format
from «OpenOffice Document Format» to OpenDocument. This is the format
used by the OpenOffice suite of programs.
2) Microsoft’s new closed, non-standard format will be called Office Open.

If you still don’t get it, don’t worry. That is precisely the point
[5]: to confuse people so they are tricked into saying things like
«Word uses an open format» (no, it uses a format called Office Open
that is actually closed) or «Open Office is a format supported by
Word» (no, Office Open is the non-standard format supported by
Microsoft; OpenOffice uses the standard format OpenDocument).

Don’t let Microsoft make a fool of you or your friends. Don’t get
caught in Microsoft’s trap [6].