|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||
See:
Description
| Interface Summary | |
|---|---|
| Importer | Interface for all importers. |
| Class Summary | |
|---|---|
| AbstractImporter | Abstract class for all importers to extend from. |
| HtmlImporter | This importer uses org.htmlpraser to obtain plain text from an HTML file. |
| PdfImporter | |
| TextImporter | TextImporter implements the simples importer of plain text, therefore it just returns the content as it is. |
Importers implement text cleaning from different file formats.
Each importer parses a file depending on its extension, and implements a method to return the content as plain text.
|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||