Note
This documentation is for the new OMERO 5.1 version. If you have not upgraded yet, see the latest OMERO 5.0.x version or the previous versions page to find documentation for the OMERO version you are using.
File parsers extract text from various file types and provide it as a Reader to the FullTextBridge for use during search indexing. Plain text formats can use the default fileParser bean, but any specialized format, such as PDF or RTF, requires special libraries and special registration.
Currently, configuration takes place solely in service-ome.api.Search.xml. Eventually, it should be possible to replace file parsers at configuration time or even at runtime.
File type | Parser |
application/pdf | http://pdfbox.apache.org |
text/xml | (internal) |
text/plain | (internal) |
text/csv | (internal) |
The base class for file parsers is FileParser.java.
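The idea of a parser that hands back a Reader, selected by file type as in the table above, can be illustrated with a minimal, self-contained sketch. This is not the OMERO API: `TextParserSketch`, `PlainTextParserSketch`, and `ParserRegistrySketch` are hypothetical names introduced only for illustration, the UTF-8 encoding is an assumption, and returning an empty result for an unregistered type is an assumption about how a missing parser might be handled. Consult FileParser.java and service-ome.api.Search.xml for the actual signatures and registration.

```java
import java.io.File;
import java.io.IOException;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

/** Hypothetical stand-in for a file parser: turns a file into a Reader over its text. */
interface TextParserSketch {
    Reader parse(File file) throws IOException;
}

/** Plain text needs no extra library: the file content is already the text to index. */
class PlainTextParserSketch implements TextParserSketch {
    @Override
    public Reader parse(File file) throws IOException {
        // Assumption: the file is UTF-8 encoded.
        return Files.newBufferedReader(file.toPath(), StandardCharsets.UTF_8);
    }
}

/** Hypothetical registry keyed by MIME type, mirroring the file type / parser table. */
class ParserRegistrySketch {
    private final Map<String, TextParserSketch> parsers = new HashMap<>();

    /** Associates a parser with a MIME type such as "text/plain". */
    void register(String mimeType, TextParserSketch parser) {
        parsers.put(mimeType, parser);
    }

    /** Returns a Reader for the file, or empty if no parser is registered for the type. */
    Optional<Reader> readerFor(String mimeType, File file) throws IOException {
        TextParserSketch parser = parsers.get(mimeType);
        if (parser == null) {
            return Optional.empty();
        }
        return Optional.of(parser.parse(file));
    }
}
```

In this sketch, a specialized format such as PDF would be supported by writing another `TextParserSketch` implementation that delegates to the relevant library and registering it under `application/pdf`. The caller is responsible for closing the returned Reader.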
See also