Package tml.storage.importers

Importers extract clean plain text from files in different formats.


Interface Summary
Importer Interface for all importers.
 

Class Summary
AbstractImporter Abstract class for all importers to extend from.
HtmlImporter This importer uses org.htmlparser to obtain plain text from an HTML file.
PdfImporter  
TextImporter TextImporter implements the simplest importer, for plain text: it returns the content as it is.
 

Package tml.storage.importers Description

Importers extract clean plain text from files in different formats.

Package Specification

Each importer parses a file depending on its extension, and implements a method to return the content as plain text.
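
The specification above can be illustrated with a minimal sketch. It is not the package's actual API: the type names (SketchImporter, SketchTextImporter, SketchHtmlImporter, SketchImporters) and the method names (supports, toPlainText, forFile) are hypothetical, and the HTML case uses a naive regular expression as a stand-in for the org.htmlparser library that HtmlImporter relies on.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Optional;

// Hypothetical stand-in for the Importer interface; method names are assumptions.
interface SketchImporter {
    // True if this importer handles the given lower-case file extension.
    boolean supports(String extension);

    // Returns the raw file content reduced to plain text.
    String toPlainText(String rawContent);
}

// Mirrors the documented TextImporter behavior: content is returned as it is.
final class SketchTextImporter implements SketchImporter {
    public boolean supports(String extension) {
        return extension.equals("txt");
    }

    public String toPlainText(String rawContent) {
        return rawContent;
    }
}

// Naive illustration only: strips tags with a regex instead of a real HTML parser.
final class SketchHtmlImporter implements SketchImporter {
    public boolean supports(String extension) {
        return extension.equals("html") || extension.equals("htm");
    }

    public String toPlainText(String rawContent) {
        return rawContent
                .replaceAll("<[^>]*>", " ") // replace each tag with a space
                .replaceAll("\\s+", " ")    // collapse runs of whitespace
                .trim();
    }
}

// Hypothetical helper that selects an importer from a file name's extension.
final class SketchImporters {
    private static final List<SketchImporter> IMPORTERS =
            Arrays.asList(new SketchTextImporter(), new SketchHtmlImporter());

    // Extracts the lower-case extension, or "" when the name has no dot.
    static String extensionOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        return dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    // Returns the first importer that supports the file's extension, if any.
    static Optional<SketchImporter> forFile(String fileName) {
        String extension = extensionOf(fileName);
        return IMPORTERS.stream().filter(i -> i.supports(extension)).findFirst();
    }
}
```

Under these assumptions, SketchImporters.forFile("page.html") would select the HTML sketch and SketchImporters.forFile("notes.txt") the plain-text one, while an unrecognized extension yields an empty Optional, so the caller decides how to handle unsupported files.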