OMERO Virtual Knowledge Graph

This repository contains the code to create a virtual knowledge graph for OMERO using ontop-vkg mappings.

Install the Ontop command line client

The utility script in utils/install_ontop.sh can be used to install the ontop cli into ontop-cli. We will assume the binary ontop is located in that directory. The script also installs the postgresql jdbc driver into ontop-cli/jdbc/.

Deployment

To deploy your own OMERO–VKG instance, use the interactive deployment script included in this repository (it assumes cookiecutter is installed and in your $PATH. If not, check https://www.cookiecutter.io).

1. Generate a deployment directory

From the top-level directory, run:

./deployment_cookiecutter.sh

The script will guide you through a series of questions, including:

PostgreSQL username, password, and host.
The RDF prefix for the deployment (used as folder name and ontology prefix).
The site URI (base URI for your instance).
A SQL filter controlling which OMERO users' data are exposed. Only data owned by the user(s) matching the filter and by users in the same groups as the filtered user(s) are mapped and accessible in the virtual KG. (e.g. =2 for a particular public OMERO user, or >=0 for all users).
Whether to create a QLever SPARQL endpoint and its UI.

After completing the prompts, a new directory named after your chosen prefix (e.g. ome_instance) will be created.

2. Contents of the deployment directory

A typical deployment directory includes:

PREFIX.ttl – The mapping ontology
PREFIX.obda – OBDA mappings with your site prefix and URI
PREFIX.properties – Database connection settings for ONTOP
catalog-v001.xml – Imported third-party ontologies
portal.toml – Metadata portal configuration
PREFIX-ontop-endpoint.sh – Script to start the ONTOP SPARQL endpoint
PREFIX-ontop-materialize.sh – Script to materialize the RDF graph
qlever/ (optional) – Helper scripts for QLever indexing, server and qlever UI

3. Start the ONTOP SPARQL endpoint

cd PREFIX
./PREFIX-ontop-endpoint.sh

This will launch the ontop sparql query interface at http://localhost:8080, the endpoint is at http://localhost:8080/sparql (the Ontop endpoint will use the properties file that was automatically setup by the interactive deployment script).

4. (Optional) Use QLever as a high-performance SPARQL endpoint

cd PREFIX/
 ./PREFIX-ontop-materialize.sh  # Materialize RDF graph (.ttl format) 
cd qlever
./reindex_ome_data.sh           # Build QLever index
./start_qlever.sh               # Start QLever SPARQL server
./launch_qlever-ui-mpiebkg.sh   # Start the QLever web UI (optional)

Note: Materialization and QLever reindexing should be performed periodically. Otherwise the data will gradually become outdated.

For more details see
➡️ Qlever configuration

Create read-only OMERO DB user

Consult utils/setup_ontop_dbuser.sh and queries/sql/ontop_user.sql to setup the read-only DB user.

Test setup

Run

ontop-cli/ontop validate -m PREFIX/PREFIX.obda -t PREFIX/PREFIX.ttl -p PREFIX/PREFIX.properties -x PREFIX/catalog-v001.xml

to validate your deployment.

Launch OMERO-VKG

Change into the deployment directory

cd PREFIX

and run the omero-ontop.sh script

bash omero-ontop.sh

This will launch the OMERO Virtual Knowledge Graph SPARQL endpoint at http://localhost:8080. You may wish to configure a different port and/or hostname. Consult the ontop-cli user manual to this effect (ontop-cli/ontop help endpoint).

Development

For development, the omero-test-infra docker-compose file can be used. Follow these step to set it up:

Get omero-test-infra

In the root of this repository:

git clone https://github.com/ome/omero-test-infra .omero

Patch the port mapping configuration

We need to access omero's postgresql database. Inside the container, it runs on port 5432 but is not mapped to the host. We patch the docker-compose file to have the database served on postgresql://localhost:15432.

cp utils/portmapping.patch .omero
cd .omero
patch -p1 < portmapping.patch
cd ..

Get omero-py

Install omero-py via pip or from conda-forge. The script /utils/install_omero-py.sh/ downloads and installs miniconda to the user's home directory and install omero-py as well as pytest and rdflib into the base environment.

source utils/install_omero-py.sh

Launch omero-test-infra

.omero/docker dev start_up

Add ontop database user

This step must be redone every time after resetting the test infrastructure.

utils/setup_ontop_dbuser.sh

Populate omero with test data.

We need something to play with, so let's create some projects and datasets, import a few images and annotate with key-value pairs (map annotations) and tags.

utils/insert_data.sh

Launch ontop endpoint

Assuming ontop is in your path:

utils/test_infra-ontop.sh

The commandline arguments point to the mappings file, mapping ontology, database connection details (properties), ontology import catalog, respectively. The --dev flag starts ontop int development mode. Edits to the mappings or ontology will trigger a restart of the endpoint. By default, the ontop endpoint is served at http://localhost:8080/sparql , the query editor is at http://localhost:8080 . Use the --port option to configure a different port.

Run tests

Finally,

pytest

will run the python test suite.

Reset database

To restart from a blank omero-test-infra (without images, datasets, projects, or annotations), run

.omero/docker srv

Don't forget to restart it according to above.

Acknowledgments

This project was developed with support from the Biohackathon 2024

This work is further supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – 501864659 (NFDI4BIOIMAGE).

Name		Name	Last commit message	Last commit date
Latest commit History 411 Commits
.github/workflows		.github/workflows
docs		docs
img		img
omero-ontop-mappings		omero-ontop-mappings
qlever		qlever
queries		queries
templates		templates
test		test
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
deployment_cookiecutter.sh		deployment_cookiecutter.sh
prepare_obda_template.py		prepare_obda_template.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
tutorial.org		tutorial.org

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OMERO Virtual Knowledge Graph

Install the Ontop command line client

Deployment

1. Generate a deployment directory

2. Contents of the deployment directory

3. Start the ONTOP SPARQL endpoint

4. (Optional) Use QLever as a high-performance SPARQL endpoint

Create read-only OMERO DB user

Test setup

Launch OMERO-VKG

Development

Get omero-test-infra

Patch the port mapping configuration

Get omero-py

Launch omero-test-infra

Add ontop database user

Populate omero with test data.

Launch ontop endpoint

Run tests

Reset database

Acknowledgments

About

Uh oh!

Releases

Packages

Languages

License

German-BioImaging/omero-ontop-mappings

Folders and files

Latest commit

History

Repository files navigation

OMERO Virtual Knowledge Graph

Install the Ontop command line client

Deployment

1. Generate a deployment directory

2. Contents of the deployment directory

3. Start the ONTOP SPARQL endpoint

4. (Optional) Use QLever as a high-performance SPARQL endpoint

Create read-only OMERO DB user

Test setup

Launch OMERO-VKG

Development

Get omero-test-infra

Patch the port mapping configuration

Get omero-py

Launch omero-test-infra

Add ontop database user

Populate omero with test data.

Launch ontop endpoint

Run tests

Reset database

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages