Customize Consent Preferences

We use cookies to help you navigate efficiently and perform certain functions. You will find detailed information about all cookies under each consent category below.

The cookies that are categorized as "Necessary" are stored on your browser as they are essential for enabling the basic functionalities of the site. ... 

Always Active

Necessary cookies are required to enable the basic features of this site, such as providing secure log-in or adjusting your consent preferences. These cookies do not store any personally identifiable data.

No cookies to display.

Functional cookies help perform certain functionalities like sharing the content of the website on social media platforms, collecting feedback, and other third-party features.

No cookies to display.

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, bounce rate, traffic source, etc.

No cookies to display.

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.

No cookies to display.

Advertisement cookies are used to provide visitors with customized advertisements based on the pages you visited previously and to analyze the effectiveness of the ad campaigns.

No cookies to display.

Technology-Assisted Reviews

Introduction

Technology Assisted Reviews (TAR) is a relatively new computer science field that employs information retrieval, machine learning and naturallanguage processing (NLP) techniques, to reduce the workload on screening for Systematic Literature Reviews (SLRs).  SLRs  seek to collect evidence from research publications that fit strict pre-specified eligibility criteria to answer a specific research question. They aim to minimize bias by using explicit, systematic methods documented in advance with a protocol.  The preparation stage includes the tasks of developingthe research question, searching for relevant SLRs, writing the needed protocol, and defining a search strategy. The search strategy includes Boolean queries  adapted for each medical database. These Boolean queries typically have very complicated syntax and are usually built by highly trained information specialists. The queries are submitted to medical databases during the retrieval stage, resulting in a vast set of possibly relevant studies.Subsequently, every study is screened in the appraisal stage, using the title and abstract (abstract-level assessment), and irrelevant studies are removed. Additional assessment is conducted based on the full-text (content-level assessment) of the remaining studies. In the last stage, data are extracted, converted, and synthesized from the relevant studies. The final scientific paper incorporates all this data with the addition of a meta-analysis of the included studies.

Current methods of creating an SLR are labor-intensive and time-consuming, involving large monetary costs. On average, an SLR costs approximately more than $140K and the time a scientist spends to complete it is 1.72 years. The most cost-intensive and time-consuming part when creating a systematic review is the screening process, i.e. the appraisal stage. On average, more than 5,500 documents are returned from the databases and less than 4.7% (3%) of them are relevant at the abstract (document) level.

Our Contribution

The primary contribution of this work is a novel method for TAR with state-of-the-art results. Our method implements a full pipeline from protocol toscreening papers with state-of-the-art results, assisting the researcher in three parts of the process of preparing an SLR: the preparation, the retrieval, and the appraisal. The key novel characteristics of our method are:

  • it relies solely on the protocol of a systematic review, overcoming the need for constructing a specialized Boolean query (preparation).
  • it combines an inter-review learning-to-rank method with an iterative intra-review relevance feedback mechanism (retrieval and appraisal)
  • it incorporates domain specialized features deriving from the latest advances of the NLP field (retrieval and appraisal).

As a secondary contribution, we publish an updated version of the Task II of the CLEF 2019 eHealth lab dataset. The updated dataset fixes previous format issues and provides an up-to-date version of the dataset that includes the latest revisions of the included SLRs.

Achievements

Our methods achieved the second best in result in Task II of CLEF eHealth lab in 2017 and 2018.

Publications

A. Anagnostou, A. Lagopoulos, G. Tsoumakas, I. Vlahavas (2017) Combining Inter-Review Learning-to-Rank and Intra-Review Incremental Training for Title and Abstract Screening in Systematic Reviews, eHealth Lab of the 8th Conference and Labs of the Evaluation Forum (CLEF)

A. Minas, A. Lagopoulos, G. Tsoumakas (2018) Aristotle University’s Approach to the Technologically Assisted Reviews in Empirical Medicine Task of the 2018 CLEF eHealth Lab, eHealth Lab of the 9th Conference and Labs of the Evaluation Forum (CLEF 2018)

A. Lagopoulos, A. Anagnostou, A. Minas, G. Tsoumakas (2018) Learning-to-Rank and Relevance Feedback for Literature Appraisal in Empirical Medicine, Conference and Labs of the Evaluation Forum (CLEF 2018)

A. Lagopoulos, & G. Tsoumakas (2020). From Protocol to Screening: A Hybrid Learning Approach for Technology-Assisted Systematic Literature Reviews. arXiv preprint arXiv:2011.09752.