This program extracts text from MS-Word files, trying to preserve as many special printable characters as possible. catdoc supports everything up to Word-97. Also supported are MS Write documents and RTF files.
It doesn't even try to preserve fancy Word formatting, because Word users usually don't care about document structure, and it is this very thing which is important to LaTeX users.
Also provided is xls2csv, which extracts data from Excel spreadsheets and outputs it in comma-separated-value format and catppt, which extracts data from PowerPoint presentations.
This package suggests tk because it also includes wordview, an optional Tk-based GUI for catdoc. The MIME config provided in this package will use wordview is X is running, or catdoc directly if it is not.
Homepage: http://freshmeat.net/projects/catdoc
|
|
|
| 硬件架构 | 软件包大小 | 安装后大小 | 文件 |
|---|---|---|---|
| amd64 | 630.0 kB | 2664 kB | [文件列表] |
| armel | 620.6 kB | 2648 kB | [文件列表] |
| hppa | 595.6 kB | 2664 kB | [文件列表] |
| i386 | 580.6 kB | 2648 kB | [文件列表] |
| ia64 | 621.9 kB | 2764 kB | [文件列表] |
| mips | 600.1 kB | 2688 kB | [文件列表] |
| mipsel | 600.0 kB | 2688 kB | [文件列表] |
| powerpc | 587.5 kB | 2648 kB | [文件列表] |
| s390 | 591.3 kB | 2656 kB | [文件列表] |
| sparc | 580.5 kB | 2648 kB | [文件列表] |