Interface Summary | |
---|---|
Importer | Interface for all importers. |
Class Summary | |
---|---|
AbstractImporter | Abstract class for all importers to extend from. |
HtmlImporter | This importer uses org.htmlparser to obtain plain text from an HTML file. |
PdfImporter | |
TextImporter | The simplest importer: it handles plain text and returns the content as it is. |
Importers implement text cleaning from different file formats.
Each importer parses files of a particular format, selected by file extension, and implements a method that returns the content as plain text.
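
The following is a minimal sketch of how extension-based dispatch to importers could look. It is not taken from this package: the method name `getPlainText`, the `PlainTextImporter` stand-in, and the `ImporterRegistry` helper are all hypothetical, and the real `Importer` interface may declare different methods.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

public class ImporterSketch {

    /** Hypothetical shape of an importer; the real interface may differ. */
    interface Importer {
        String getPlainText(Path file) throws IOException;
    }

    /** Stand-in for a plain-text importer: returns the file content unchanged. */
    static class PlainTextImporter implements Importer {
        @Override
        public String getPlainText(Path file) throws IOException {
            // Assumes UTF-8; the real importer's encoding handling is unknown.
            return new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
        }
    }

    /** Hypothetical registry that maps lower-case file extensions to importers. */
    static class ImporterRegistry {
        private final Map<String, Importer> byExtension = new HashMap<>();

        void register(String extension, Importer importer) {
            byExtension.put(extension.toLowerCase(Locale.ROOT), importer);
        }

        /** Picks the importer registered for the file's extension. */
        Importer forFile(Path file) {
            String name = file.getFileName().toString();
            int dot = name.lastIndexOf('.');
            String ext = dot < 0 ? "" : name.substring(dot + 1).toLowerCase(Locale.ROOT);
            Importer importer = byExtension.get(ext);
            if (importer == null) {
                throw new IllegalArgumentException("No importer for extension: " + ext);
            }
            return importer;
        }
    }

    public static void main(String[] args) throws IOException {
        // Write a small temporary .txt file and read it back through the registry.
        Path tmp = Files.createTempFile("sample", ".txt");
        Files.write(tmp, "hello".getBytes(StandardCharsets.UTF_8));

        ImporterRegistry registry = new ImporterRegistry();
        registry.register("txt", new PlainTextImporter());

        System.out.println(registry.forFile(tmp).getPlainText(tmp));
        Files.delete(tmp);
    }
}
```

In this sketch, an HTML or PDF importer would be registered under `"html"` or `"pdf"` in the same way, so callers only ever deal with the common interface.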