Actually I created a simple infrastructure of ReadableAttachmentProviders so any binary attachment type can be interpreted and indexed. For now I have support for Word, PowerPoint, Excel, OpenDocument Text, PDF, HTML and XML documents. Contact me if you'd like to see how they work or I could submit the code changes to the core JSPWiki. There's one small change in LuceneSearchProvider, some additional code for handling the attachments and the actual readable attachment providers code. I'm using odfstream library for OpenDocument, PDFBox for parsing PDFs and POI libraries for Word, Excel and PowerPoint, also wrote my own for stripping markup from xml and html.
--MaciejR, 23-Aug-2007
Add new attachment
List of attachments
| Kind | Attachment Name | Size | Version | Date Modified | Author | Change note |
|---|---|---|---|---|---|---|
PDF |
N198603.PDF | 355.7 kB | 1 | 10-Apr-2007 12:25 | 203.101.42.112 |