OMERO.dropbox
=============

DropBox was originally designed as the first stage of the file
system changes referred to as :doc:`/developers/Server/FS`. It utilizes a file
system monitor to find newly uploaded files and run a fully automatic
import on those files if possible. This release of OMERO.dropbox runs on
the same machine as the OMERO.server and watches designated areas of the
local filesystem for new or modified files. If those files are
importable, then an automatic import is initiated. OMERO.dropbox is
started automatically when the OMERO.server starts and it will run if
the prerequisites below are met.

Prerequisites
-------------

In addition to the general :doc:`system-requirements` OMERO.dropbox has
the following more specific requirements:

-   OMERO.dropbox is built on underlying OS file-notification system, and so
    is only available for specific versions of certain operating systems.
    OMERO.dropbox has been tested on the following systems:

    -   Linux with kernel 2.6.13 and higher.
    -   Mac OS 10.6 and later.
    -   Windows XP, Vista, 7, Server 2003, Server 2008 and Server 2008R2,
        but see note below.

-   In addition some platforms require further Python packages to be
    available:

    -   Mac OS systems that use a macports install of Python will need to
        have FSEvents available in the PYTHONPATH. This will require a
        path of the form
        ``/System/Library/Frameworks/Python.framework/Versions/2.X/Extras/lib/python/PyObjC/``
        to be added, according to the version of Python used.

-   The filesystem which OMERO.dropbox watches must be local to the given
    operating system. Watching a network-attached share (NAS) is strictly
    ***not*** supported.

.. note:: It is likely, but not guaranteed, that DropBox will run on later
    versions of Windows, however only tested versions are supported by default.
    
    Setting the property :property:`omero.fs.platformCheck` to ``False``
    in :file:`etc/grid/templates.xml`
    
    or using :omerocmd:`config set`
    
    and restarting the server will allow DropBox to be started on an untested 
    version of Windows. If you do successfully run DropBox on an as yet untested
    version of Windows please do let us know via the 
    :community_plone:`forums and mailing lists <>`.

Using DropBox
-------------

In its default configuration the monitored area of the file system is a
``DropBox`` subdirectory of the OmeroBinaryRepository directory. The
system administrator should create ``DropBox`` and then under that a
directory for each user, using their omero username. The ownership and
permissions should be set so that a user can copy files into their
DropBox directory:

::

    /OMERO/DropBox/amy
                  /emily
                  /edgar
                  /root
                  /zak

Experimenters can add subdirectories under their named directory for
convenience. Copying or moving a file of an importable file type into a
named directory or nested subdirectory will initiate an automatic import
of that file for that user. Multi-file formats will be imported after
the last required file of a set is copied into the directory. Images and
plates will be imported into the group the user was last logged into, with
images placed into ``Orphaned images``.

Acquisition systems can then be configured to drop a user's images into
a given DropBox.

.. note::

    -   The DropBox system is designed for image files to be copied in
        at normal acquisition rates. Copying many files en masse may
        result in files failing to import.

    -   It is also intended as a write-once system. Modifying an image
        after it has been imported may result in that modified image also
        being imported depending on the operating system and how the image
        was modified.

    -   Once directories are created within DropBox or files are copied or
        moved into DropBox they should not be moved, renamed or otherwise
        changed. Images may be imported again or already imported images may
        become unreadable.

Permissions
-----------

Changing the permissions of a directory within DropBox may result in duplicate
imports as a newly readable directory appears identical to a new directory. If
directories need to be modified it is recommended that the DropBox system is
stopped and then restarted around any changes, as below.
::

    $ bin/omero admin ice server disable DropBox
    $ bin/omero admin ice server stop DropBox
    $ bin/omero admin ice server disable MonitorServer
    $ bin/omero admin ice server stop MonitorServer

    # make any directory changes

    $ bin/omero admin ice server enable MonitorServer
    $ bin/omero admin ice server enable DropBox

.. note::

    Any new files copied into DropBox during this disabled period will not
    be detected and thus not imported.


Log files
---------

The log files :file:`var/log/FileServer.log`, :file:`var/log/MonitorServer.log`
and :file:`var/log/DropBox.log` will indicate success or otherwise of start-up
of the two components.
Once running, :file:`var/log/MonitorServer.log` will log file events seen within
designated file areas and :file:`var/log/DropBox.log` will log the progress
of any file imports.

Unicode path and file names
---------------------------

If file or path names contain Unicode characters this can cause DropBox to
fail. This can be remedied by the use of a :file:`sitecustomize.py` or
:file:`usercustomize.py` file containing the following::

    import sys
    reload(sys)
    sys.setdefaultencoding('utf-8')

For more details on using customization files in Python see:
`site — Site-specific configuration hook <https://docs.python.org/2.7/library/site.html>`_.
For more discussion on this issue within OMERO see the forum post:
`Dropbox halts on certain unicode characters <https://www.openmicroscopy.org/community/viewtopic.php?f=4&t=7810#p15910>`_.

.. note::
    If a customization file is used and the OMERO server is upgraded please
    ensure the file is still available to DropBox after the upgrade.

Advanced use
------------

OMERO.dropbox can be configured in several ways through
:file:`etc/grid/templates.xml`. In its default configuration, as detailed
above, it monitors the subdirectory ``DropBox`` of the OMERO data
directory for all users.

A number of the properties in :file:`templates.xml` accept a semi-colon
separated list of values. This extended configuration allows a site to
watch multiple directories, and configure each for a different user, a
different type of file, etc. Any value missing from the configuration
(e.g. ``value="1;;2"``) will be replaced by the default value.

One example alternative configuration would be to watch specific
directories for specific users. In the example below two directories are
monitored, one for user ``amy`` and one for ``zak``:

::

    <property name="omero.fs.importUsers"  value="amy;zak"/>
    <property name="omero.fs.watchDir"  value="/home/amy/myData;/home/zak/work/data"/>

The remaining properties have been left at their default values for both
users.

To limit DropBox to import only files belonging to specific image types
the following property can be set,

::

    <property name="omero.fs.readers"  value="/home/amy/my_readers.txt;"/>

Here only the image types listed in :file:`my_readers.txt` will be imported
for the user ``amy`` while the system-wide :file:`readers.txt` will be used
for ``zak``.

For a full description of the properties see below.

Properties
^^^^^^^^^^

Each property takes the form of a single item or a semi-colon separated
list of items. Where the item is a list, values within that list should
be comma separated.

-   importUsers

    The importUsers is either ``default`` or a list of OMERO user names. In the
    case of the value being ``default``, the same configuration is applied to
    all users and each subsequent configuration setting should be a single
    value. In the case of this value being a list of users, each subsequent
    value should be a list of the same length as the number of users. The
    default value is ``default``.

    ::

        <property name="omero.fs.importUsers"  value="default"/>


-   watchDir

    The absolute directory path of interest for each user. The default is
    empty.

    ::

        <property name="omero.fs.watchDir"  value=""/>

-   eventTypes

    For automatic import Creation and Modification events are monitored. It
    is also possible to monitor Deletion events though these are not used by
    DropBox. The default is Creation,Modification.

    ::

        <property name="omero.fs.eventTypes"  value="Creation,Modification"/>

-   pathMode

    By default existing and newly created subdirectories are monitored. It
    is possible to restrict monitoring to a single directory ("Flat"), only
    existing subdirectories ("Recurse"), or all subdirectories ("Follow").
    For DropBox to function correctly the mode should be Follow. The default
    is Follow.

    ::

        <property name="omero.fs.pathMode"  value="Follow"/>

-   whitelist

    A list of file extensions of interest. An empty list implies all file
    extensions are monitored. The default is an empty list.

    ::

        <property name="omero.fs.whitelist"  value=""/>

-   blacklist

    A list of subdirectories to ignore. Not currently supported.

    ::

        <property name="omero.fs.blacklist"  value=""/>

-   timeout

    This timeout in seconds is used by one-shot monitors. This property is
    not used by DropBox.

    ::

        property name="omero.fs.timeout"  value="0.0"/>

-   blockSize

    The number of events that should be propagated to DropBox in one go.
    Zero implies all events possible. The default is zero.

    ::

        <property name="omero.fs.blockSize"  value="0"/>

-   ignoreSysFiles

    If this is True events concerning system files, such as filenames
    beginning with a dot or default new folder names, are ignored. The exact
    events ignored will be OS-dependent. The default is True.

    ::

        <property name="omero.fs.ignoreSysFiles"  value="True"/>

-   ignoreDirEvents

    If this is True then the creation and modification of subdirectories is
    not reported to DropBox. The default is True.

    ::

        <property name="omero.fs.ignoreDirEvents"  value="True"/>

-   dirImportWait

    The time in seconds that DropBox should wait after being notified of a
    file before starting an import on that file. This allows for companion
    files or filesets to be copied. If a new file is added to a fileset
    during this wait period DropBox begins waiting again. The default is 60
    seconds.

    ::

        <property name="omero.fs.dirImportWait"  value="60"/>

-   fileBatch

    The number of files that can be copied in before processing the batch.
    In cases where there are large numbers of files in a typical file set it
    may be more efficient to set this value higher. The default is 10.

    ::

        <property name="omero.fs.fileBatch"  value="10"/>

-   throttleImport

    The time in seconds that DropBox should wait after initiating an import
    before initiating a second import. If imports are started too close
    together connection issues can arise. The default is 10 seconds.

    ::

        <property name="omero.fs.throttleImport"  value="10"/>

-   readers

    A file of readers. If this is a valid file then it is used to filter
    those events that are of interest. Only files corresponding to a reader
    in the file will be imported. The default is empty.

    ::

        <property name="omero.fs.readers"  value=""/>

-   importArgs

    A string of extra arguments supplied to the importer. This could
    include, for example, an email address to report failed imports to:
    ``--report --email test@example.com``. The default is empty. For
    details on available extra arguments see :doc:`/users/cli/import`.

    ::

        <property name="omero.fs.importArgs"  value=""/>

Example
^^^^^^^

Here's a full example of a configuration for two users:

::

    <property name="omero.fs.importUsers"     value="amy;zak"/>
    <property name="omero.fs.watchDir"        value="/home/amy/myData;/home/zak/work/data"/>
    <property name="omero.fs.eventTypes"      value="Creation,Modification;Creation,Modification"/>
    <property name="omero.fs.pathMode"        value="Follow;Follow"/>
    <property name="omero.fs.whitelist"       value=";"/>
    <property name="omero.fs.blacklist"       value=";"/>
    <property name="omero.fs.timeout"         value="0.0;0.0"/>
    <property name="omero.fs.blockSize"       value="0;0"/>
    <property name="omero.fs.ignoreSysFiles"  value="True;True"/>
    <property name="omero.fs.ignoreDirEvents" value="True;True"/>
    <property name="omero.fs.dirImportWait"   value="60;60"/>
    <property name="omero.fs.fileBatch"       value="10;10"/>
    <property name="omero.fs.throttleImport"  value="10;10"/>
    <property name="omero.fs.readers"         value="/home/amy/my_readers.txt;"/>
    <property name="omero.fs.importArgs"      value="--report;--report --email zak@example.com"/>

.. seealso:: 

    :doc:`/users/cli/import`

    :doc:`/users/cli/import-target`

    :doc:`/sysadmins/in-place-import`