The workflow of the pipeline behind Vapur represented below. We first split the abstracts to sentences and use BERN to detect the entities in the text. We then identify the biochemical relations with the relation extraction model that we trained and reform the output as an inverted index of relations. Vapur leverages this inverted index to retrieve relevant publications to the query as categorized by related entities.
@inproceedings{koksal2020vapur,
title={Vapur: A Search Engine to Find Related Protein-Compound Pairs in COVID-19 Literature},
author={Köksal, Abdullatif and Dönmez, Hilal and Özçelik, Rıza and Ozkirimli, Elif and Özgür, Arzucan},
booktitle={Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020},
year={2020}
}
We would like to thank Enes Çakır for his efforts on the deployment of Vapur.
Icons including favicon, drug, and gene, made by Freepik from www.flaticon.com.
Logo is made by Ali Ozgon from Shutterstock.
BERN
Genia
SysAdmins.co.za
arzucan.ozgurboun.edu.tr