xlsx2txt, excel2txt – translate Open XML Microsoft Excel(tm) documents


xlsx2txt [ –b ] [ –crange ] [ –Csymb ] [ –dstr ] [ –q ] [ –snum ] [ –t ] [ –T ] [ file.xlsx ]
excel2txt [ options ] [ opc–dir ]


Microsoft's new format for Office documents is a zipped directory hierarchy containing XML files. This format is known as the ``Open Packaging Conventions'' or OPC.
xlsx2txt is an rc(1) script that uses fs/zipfs (see tapefs(4)) and opc/excel2txt to extract a static table from the body of an Excel xlsx document and write it on the standard output. Any option that can be passed to xlsx2txt may also be passed to excel2txt.
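
To make the package layout concrete, here is a minimal Python sketch (not part of xlsx2txt, which is an rc script) that builds a small stand-in package in memory and lists its parts. The part names follow the usual xlsx layout but are only illustrative; a real workbook carries more parts, and list_parts is a hypothetical helper.

```python
import io
import zipfile

# Build a tiny stand-in package in memory. The part names mimic the
# usual xlsx layout; the XML bodies are empty placeholders.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as z:
    z.writestr("[Content_Types].xml", "<Types/>")
    z.writestr("xl/workbook.xml", "<workbook/>")
    z.writestr("xl/worksheets/sheet1.xml", "<worksheet/>")


def list_parts(data):
    """Return the names of the parts stored in a zipped package."""
    with zipfile.ZipFile(io.BytesIO(data)) as z:
        return z.namelist()


parts = list_parts(buf.getvalue())
for name in parts:
    print(name)
```

Mounting the same archive with fs/zipfs presents these parts as ordinary files, which is the directory hierarchy that excel2txt reads.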

–b    Allow blank rows to be printed; by default these are elided.
–c range
Select the column ranges to be printed. The format of the range is similar to a standard Plan 9 character class specification: any combination of comma-separated numbers and number ranges (two numbers separated by a dash) may be specified.
–C symb
Currency symbol for financial number formats, defaults to UK pounds (£).
–d str
Set the field delimiter to the string str. This also disables the whitespace padding that otherwise aligns cells into columns.
–q    Quote cells using the usual Plan 9 conventions (see quote(2)).
–s num
Select the sheet number to print; if not specified, sheet 1 is printed.
–t    Truncate long fields so they lay out nicely on screen.
–T    Add tbl(1) formatting instructions to the output. Tables are often wide, but switching to landscape mode can help (see lp(1)).
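
The range syntax accepted by –c can be illustrated with a short Python sketch. It is an assumption-laden model, not the program's own parser: it assumes columns are numbered from 1, that ranges are inclusive, and that the dash is an ASCII hyphen. The names parse_ranges and select_columns are hypothetical.

```python
def parse_ranges(spec):
    """Expand a spec such as "1,3-5" into a sorted list of column numbers.

    Assumes 1-based columns and inclusive ranges.
    """
    cols = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            lo, hi = part.split("-", 1)
            lo, hi = int(lo), int(hi)
            if lo > hi:
                raise ValueError("bad range: " + part)
            cols.update(range(lo, hi + 1))
        else:
            cols.add(int(part))
    return sorted(cols)


def select_columns(row, cols):
    """Keep the cells of row whose 1-based position appears in cols."""
    return [row[c - 1] for c in cols if 1 <= c <= len(row)]


row = ["a", "b", "c", "d", "e", "f"]
print(select_columns(row, parse_ranges("1,3-5")))
```

Under these assumptions the spec 1,3-5 selects the first, third, fourth and fifth cells of each row.
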
Tbl(1) and troff(1) cannot handle arbitrarily large tables and may die with copious diagnostics. Reducing the number of columns printed and truncating long cells can help, but this is a poor solution at best.
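
How truncation, padding and an explicit delimiter affect the width of the output can be modelled with a small Python sketch. This only approximates the behaviour described for –t and –d; the truncation width used here is a hypothetical parameter, since the width the program actually uses is not stated, and format_rows is an illustrative name.

```python
def truncate(cell, width):
    """Cut cell down to at most width characters."""
    return cell if len(cell) <= width else cell[:width]


def format_rows(rows, delim=None, width=None):
    """Render rows of cells as lines of text.

    With delim set, cells are joined by it and left unpadded.
    Otherwise each column is padded to the width of its widest cell.
    """
    if width is not None:
        rows = [[truncate(c, width) for c in r] for r in rows]
    if delim is not None:
        return [delim.join(r) for r in rows]
    ncols = max((len(r) for r in rows), default=0)
    widths = [0] * ncols
    for r in rows:
        for i, c in enumerate(r):
            widths[i] = max(widths[i], len(c))
    return [
        " ".join(c.ljust(widths[i]) for i, c in enumerate(r)).rstrip()
        for r in rows
    ]


rows = [["name", "note"], ["x", "a very long remark"]]
for line in format_rows(rows, width=6):
    print(line)
```

In this model a single long cell widens its whole column, so bounding cell length bounds the table width, which is why truncation eases the load on tbl(1).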




docx2troff(1), tapefs(4), xml(2)
``2007 Office Document: Open XML Markup Explained'',–us/download/details.aspx?id=15359