Abstract

Summary

Accurate pathogen identification is crucial during outbreaks, especially with the emergence of new variants requiring frequent primer updates. However, resources for maintaining up-to-date verification of primer sequences are often limited, which poses challenges for reliable diagnosis and hinders potential monitoring efforts based on genome sequencing. To address this, we introduce ViralPrimer, a web server facilitating primer design, SARS-CoV-2 and Mpox variant monitoring, and adaptation to future threats. ViralPrimer aims to enhance diagnostic accuracy with its comprehensive primer database, mutation analysis assistance, and user primer upload feature. Its adaptable design allows monitoring of other rapidly mutating pathogens, contributing to broader public health protection efforts.

Availability and implementation

ViralPrimer is freely accessible and open to all users with no login requirement at https://viralprimer.elte.hu/. The application is hosted on a DJANGO v3.2.13 web server, with a PostgreSQL database, and the frontend was implemented using jQuery v3.6.0, vanilla JavaScript vES6, and Bootstrap v5.1.

1 Introduction

The importance of accurate diagnostics in comprehending and managing rapidly mutating viral pathogens has become increasingly evident, particularly in the context of the SARS-CoV-2 pandemic, which presents a challenge to the effectiveness of diagnostic techniques (Fernandes et al. 2022). Although reverse transcription-polymerase chain reaction (RT-PCR) is currently considered the gold standard for diagnosing SARS-CoV-2, other nucleic acid amplification tests (NAATs) such as loop-mediated isothermal amplification (LAMP) offer rapid, robust, and cost-effective alternatives (Kashir and Yaqinuddin 2020). Additionally, next-generation sequencing (NGS) can be used not only for detecting viral presence (Bloom et al. 2021) but also for tracking the evolution of SARS-CoV-2 to identify emerging variants (Rosenthal et al. 2022). Consequently, these diagnostic methods as well as sequencing technologies require continuous evaluation, and the tracking of mutations in the target regions of the primers is critical for the reliable and accurate detection and sequencing of the viral genome.

It was shown that mutations in primer target regions can impact binding efficiency, potentially leading to false negative diagnostic results (Ai et al. 2020, Chen et al. 2020, Li et al. 2020a,b, Mentes et al. 2022). A recent study estimates that up to 29% of tests could result in misdiagnosis, particularly in highly affected populations (disease prevalence up to 50%), highlighting the importance of careful primer design (Pecoraro et al. 2022). While diagnostic primers are crucial for avoiding misdiagnosis, sequencing primers play a vital role in obtaining high-quality genome assemblies for tracking viral evolution. However, the potential impact of primer failure differs between these methods: diagnostic errors have more immediate epidemiological consequences, while errors in sequencing may result in “dropout” or the loss of an amplicon, leading to incomplete genome sequences and lower-quality genomic data (Sanderson and Barrett 2021, Bei et al. 2022). The latter is exemplified by the revisions of the ARTIC network and Midnight sequencing primers in response to the emergence of new variants such as Delta or Omicron (Bei et al. 2022, Lambisia et al. 2022, Ulhuq et al. 2023).

Given the importance of accurate primer design to accommodate viral evolution, the lack of readily available and current resources poses a significant challenge. To address this, we propose ViralPrimer, a website to help with primer design, monitor SARS-CoV-2 variants, and adapt to future viral threats. The web server provides an extensive database of diagnostic, sequencing, and LAMP primers and facilitates analysis through visualizations of genomic regions. The variant data is obtained from GISAID (https://gisaid.org/) (Khare et al. 2021) and the COVID-19 Data Portal (http://www.covid19dataportal.org) (Cantelli et al. 2021), which can be filtered temporally, across various countries of sample origin and by mutational profiles based on Variants of Concern (VOC) lineages. Users can also upload their own primer sequences. The potential of this tool lies in the identification of mutations that may affect primer binding efficiency. Mutations in the primer target regions are classified as high-risk mutations if they occur at the 3ʹ end of the primer or is an indel, and as moderate-risk if they have a minor effect on binding, however, the presence of three or more moderate-risk mutations in a primer's target region can also disrupt binding, although careful detection thresholds and multiple primer systems can mitigate this issue.

ViralPrimer not only enhances the ability to respond effectively to the evolving SARS-CoV-2 landscape but also prepares for future viral threats. Its adaptable design allows monitoring of other rapidly mutating viruses, as demonstrated by the integration of Mpox data.

2 Server description

ViralPrimer utilizes DJANGO's third major release (version 3.2.13) for backend functionality. Information is stored in a consistent relational database under the PostgreSQL framework. Data is either fetched directly from the database or computed in real-time from available information, eliminating the need for external APIs. The frontend combines DJANGO template language, jQuery (version 3.6.0), vanilla JavaScript (ES6), and Bootstrap (version 5.1) for its design. The visual elements are predominantly created using jQuery and vanilla JavaScript, with Plotly.js (version 2.11) utilized for chart rendering. Despite its reliance on advanced web technologies, ViralPrimer ensures compatibility across all browsers that support HTML5 and WebP.

3 Web interface

ViralPrimer is accessible online at https://viralprimer.elte.hu/, offering an array of user-friendly options for data retrieval. The homepage presents a broad overview of the server's capabilities, including direct links to introductory guides and example use cases. Navigation is facilitated through eight primary tabs: Home, Search, Browse, Primer mapping, Mutation Analysis, Help/About, Statistics, and Tutorials. The Search tab enables users to investigate mutations in a selected reference genome region with advanced filtering options, including collection date (the date the sample was collected), variant (the viral variant present in the sample), country (the geographical location where the sample was collected), and methods (whether the primer is used for sequencing or diagnostic purposes). The Browse page provides direct links to related entries, along with their labels and unique identifiers and also presents tables grouping data based on specific attributes. The Primer Mapping tab enables users to upload their designed PCR primer sequences to check for mutations in a specific genomic region. The Mutation Analysis tab allows users to input multiple positions or mutations, either at the amino acid or nucleic acid level, and assess their potential impact on selected primers for a given pathogen. For more refined search, the advanced filtering offers similar options to those in the Primer Mapping tab. The Help/About, Statistics and Tutorials pages are designed to help users navigate the server by providing examples, data-related information and statistics.

We also present a visual analytics framework within the ViralPrimer that can help users to explore the relationship between primer design and genomic mutations. This framework comprises five key components: Header, Simple Genome Viewer, Detailed Region Viewer, Mutations in Primer Target Regions, and Summary Plots. The Header offers essential contextual information and interactive features for navigating genomic regions. The Simple Genome Viewer provides an overview with zoom and navigation functions, delivering detailed primer and gene information. The Detailed Region Viewer offers a magnified view of genomic regions, outlining genes, proteins, primers, and mutations. The Mutations in Primer Target Regions section identifies and classifies mutations within primer target regions, aiding primer design. The Summary Plots segment offers graphical summaries of mutation distributions, allowing temporal and geographic analyses of mutation trends of the selected genomic region.

4 Data source

SARS-CoV-2 mutation data was obtained from GISAID (https://gisaid.org/) (Khare et al. 2021) and the COVID-19 Data Portal (http://www.covid19dataportal.org) (Cantelli et al. 2021, Harrison et al. 2021, Rahman et al. 2024), using high-quality samples with confirmed mutations [ENA project: PRJEB43947], a process extensively detailed in our previous article (Mentes et al. 2022). The Kooplex collaboration platform (https://k8plex-veo.vo.elte.hu/hub/) (Visontai et al. 2019) was used to investigate variants and samples from systematically analysed raw reads. Briefly, to ensure high quality, human samples with an estimated N-content of no >10% were selected. Genomic positions with a sequencing depth below 100 and an alternate allele frequency below 0.5 were excluded from analysis. Mpox data was processed following a similar procedure, undergoing systematic analysis of raw read data accessible through the COVID-19 Data Portal to produce a standardized set of variant calls (in VCF format) and consensus sequences [ENA project: PRJEB55823].

The reference sequences and feature data for the virus were obtained from the GenBank file, using the NC_045512.2 reference (Wu et al. 2020) [identical to MN996528.1 used by GISAID as the reference (https://gisaid.org/wiv04/) (Zhou et al. 2020)] of SARS-CoV-2 and the NC_063383.1 reference (Mauldin et al. 2022) of the Mpox genomes. References and features data are available for download under the Genes tab of the Browse section on ViralPrimer.

In our previous investigation (Mentes et al. 2022), we collected numerous primers from reputable scientific literature, which were designed for both traditional and RT-qPCR methods to detect SARS-CoV-2. We have since expanded the repertoire of primers within the ViralPrimer web server by incorporating additional primer systems, including sequencing and LAMP techniques. Additionally, the server hosts diagnostic and sequencing primers designed for the detection of Mpox. Primer data are available for download at the Primers tab of the Browse section of ViralPrimer. Mutation data can be downloaded from the Download section on the Help/About page.

5 Result and discussion

The primary goal of our web server is to integrate genomic mutation data of prevalent global pandemic pathogens with up-to-date information, including VOC variants, geographic locations and date of collection, and sequence information of publicly available PCR, LAMP and sequencing primers (Fig. 1). Previously, various websites have been created to track SARS-CoV-2 variants and monitor mutations in primer binding regions (Primer-monitor, Campbell et al. 2021, CoV2ID, Carneiro et al. 2020, CoVrimer, Vural-Ozdeniz et al. 2021, AssayM, Naeem and Pain 2020). However, some of these platforms are no longer available a few years after the start of the COVID-19 pandemic, host outdated data or require a registration (such as PrimerChecker at GISAID.org). Therefore, a comprehensive, sustainable, and responsive platform is needed to address these challenges. This is crucial not only for efficiently tracking the evolution of pathogens like SARS-CoV-2 but also for swift adaptability to emerging pandemic pathogens to effectively monitor future viral threats in diagnostics.

Overview of the ViralPrimer project. (A) The efficiency of PCR tests is affected by the variability in viral genome. Mutations near the 3ʹ end of the primer target region (designated as high-risk) can lead to false-negative results, highlighting the significance of continuous primer assessment. (B) Our project uses data from a mutation database incorporating the analysis results of various public sample sets (GISAID, COVID-19 Data Portal), enabling the assessment of mutations and samples concerning primer binding sites. Classification relies on the genomic location, length, and co-occurrence of mutations in each primer target region. The main features of the server are the options of searching in the genome, browsing primers, and primer mapping. The ViralPrimer webserver offers visualization tools, such as region viewers and mutation statistics, to aid in obtaining results and selecting optimal primers for specific regions. This tool further addresses the ongoing need for rapid primer testing and ensures the accuracy of diagnostic results.
Figure 1.

Overview of the ViralPrimer project. (A) The efficiency of PCR tests is affected by the variability in viral genome. Mutations near the 3ʹ end of the primer target region (designated as high-risk) can lead to false-negative results, highlighting the significance of continuous primer assessment. (B) Our project uses data from a mutation database incorporating the analysis results of various public sample sets (GISAID, COVID-19 Data Portal), enabling the assessment of mutations and samples concerning primer binding sites. Classification relies on the genomic location, length, and co-occurrence of mutations in each primer target region. The main features of the server are the options of searching in the genome, browsing primers, and primer mapping. The ViralPrimer webserver offers visualization tools, such as region viewers and mutation statistics, to aid in obtaining results and selecting optimal primers for specific regions. This tool further addresses the ongoing need for rapid primer testing and ensures the accuracy of diagnostic results.

The ViralPrimer web server has a comprehensive collection of NAATs primers, along with their associated mutations. We collected a total of 1372 primer oligos from 50 primer sets (10 LAMP sets, 4 sequencing sets, 35 diagnostic PCR sets). For SARS-CoV-2, a total of 19 007 636 samples were processed from two databases (GISAID—16 160 967 and COVID-19 Data Portal—2 846 669), while for Mpox, 1574 samples were included. The features allow for quick verification of mutations that affect both public and custom primers. Case studies demonstrating the use of the ViralPrimer website are available on the Tutorials page: one investigates the impact of Omicron XBB.1.5 variant mutations (Use Case 1), another focuses on advances in SARS-CoV-2 whole genome sequencing and evaluates the effectiveness of long-range PCR primers (Kandel et al. 2024) (Use Case 2), and a third provides a guide to using the Mutation Analysis page with Mpox amino acid mutations as an example (Wang et al. 2023) (Use Case 3). The data in the web server is regularly updated to ensure that it contains the most current information. In the future, we plan to expand the ViralPrimer to include viruses that are of global concern.

The ViralPrimer could fill a critical gap in the ability to respond rapidly to emerging viral threats by providing up-to-date and user-friendly resources for primer design and tracking viral evolution.

Acknowledgements

The authors thank Ádám Dán for fruitful discussions on the topic and valuable suggestions for research objectives. We gratefully acknowledge all data contributors, i.e. the Authors and their Originating laboratories responsible for obtaining the specimens, and their Submitting laboratories for generating the genetic sequence and metadata and sharing via the GISAID Initiative, on which this research is based. The authors data available at this link: https://doi.org/10.55876/gis8.231212uv. We are also grateful to all those who uploaded and continue to upload raw read data to the EMBL-EBI European Nucleotide Archive. Support from ELIXIR Hungary (www.elixir-hungary.org) also acknowledged.

Author contributions

N.D. contributed to the development of the web server, the conception and design of the study, and the methodology. Zs.D. contributed to the conception of the study and the writing of the manuscript. I.Cs. contributed to the conception of the study, the writing of the manuscript, and funding. A.M.-H. contributed to the writing of the manuscript, data acquisition, methodology, data curation, and web server testing. O.A.P. contributed to the writing of the manuscript, data curation, and web server testing. J.S., K.P., D.V., and G.E. contributed to the acquisition of the data and methodology. A.M. contributed to the coordination of the study, the development of the webserver, data analysis, conception, and writing of the manuscript. All authors read and approved the final version of the manuscript.

Conflict of interest: None declared.

Funding

This work was supported by the European Union's Horizon 2020 research and innovation programme under grant agreements No. 874735 (VEO) and No. 101046203 (BY-COVID). The work of N.D. was supported by the ÚNKP-23-3 New National Excellence Program of the Ministry for Culture and Innovation from the source of the National Research, Development and Innovation Fund and DKOP-23 Doctoral Excellence Program of the Ministry for Culture and Innovation from the source of the National Research, Development and Innovation Fund.

Data availability

The data underlying this article are freely available on the ViralPrimer website at https://viralprimer.elte.hu/. Mutation data used on the website can be downloaded via https://viralprimer.elte.hu/help#download. Detailed NAATs' primer information is accessible at https://viralprimer.elte.hu/browse/primers. Primary data are sourced from the GISAID repository (https://gisaid.org/, with sample authors acknowledged) and the COVID-19 Data Portal (http://www.covid19dataportal.org, ENA projects: PRJEB43947, PRJEB55823). Reference sequences and feature data for SARS-CoV-2 and Mpox were obtained from GenBank (NC_045512.2 and NC_063383.1).

References

Ai
T
,
Yang
Z
,
Hou
H
et al.
Correlation of chest CT and RT-PCR testing for coronavirus disease 2019 (COVID-19) in China: a report of 1014 cases
.
Radiology
2020
;
296
:
E32
40
.

Bei
Y
,
Pinet
K
,
Vrtis
KB
et al.
Overcoming variant mutation-related impacts on viral sequencing and detection methodologies
.
Front Med (Lausanne)
2022
;
9
:
989913
.

Bloom
JS
,
Sathe
L
,
Munugala
C
et al.
Massively scaled-up testing for SARS-CoV-2 RNA via next-generation sequencing of pooled and barcoded nasal and saliva samples
.
Nat Biomed Eng
2021
;
5
:
657
65
.

Campbell
MA, Bei Y, Nichols NM
et al. Primer Monitor: An Online Tool to Track SARS-CoV-2 Variants that May Impact Primers Used in Diagnostic Assays. Ipswich, MA, USA: New England Biolabs Inc.,
2021
.

Cantelli
G
,
Cochrane
G
,
Brooksbank
C
et al.
The European Bioinformatics Institute: empowering cooperation in response to a global health crisis
.
Nucleic Acids Res
2021
;
49
:
D29
37
.

Carneiro
J
, Gomes C, Couto C et al. CoV2ID: Detection and Therapeutics Oligo Database for SARS-CoV-2. bioRxiv, ,
2020
, preprint: not peer reviewed.

Chen
Z
,
Li
Y
,
Wu
B
et al.
A patient with COVID-19 presenting a false-negative reverse transcriptase polymerase chain reaction result
.
Korean J Radiol
2020
;
21
:
623
4
.

Fernandes
Q
,
Inchakalody
VP
,
Merhi
M
et al.
Emerging COVID-19 variants and their impact on SARS-CoV-2 diagnosis, therapeutics and vaccines
.
Ann Med
2022
;
54
:
524
40
.

Harrison
PW
,
Lopez
R
,
Rahman
N
et al.
The COVID-19 data portal: accelerating SARS-CoV-2 and COVID-19 research through rapid open access data sharing
.
Nucleic Acids Res
2021
;
49
:
W619
23
.

Kandel
S
,
Hartzell
SL
,
Ingold
AK
et al.
Genomic surveillance of SARS-CoV-2 using long-range PCR primers
.
Front Microbiol
2024
;
15
:
1272972
.

Kashir
J
,
Yaqinuddin
A.
Loop mediated isothermal amplification (LAMP) assays as a rapid diagnostic for COVID-19
.
Med Hypotheses
2020
;
141
:
109786
.

Khare
S
,
Gurry
C
,
Freitas
L
et al.
GISAID’s role in pandemic response
.
China CDC Wkly
2021
;
3
:
1049
51
.

Lambisia
AW
,
Mohammed
KS
,
Makori
TO
et al.
Optimization of the SARS-CoV-2 ARTIC network V4 primers and whole genome sequencing protocol
.
Front Med
2022
;
9
:
836728
.

Li
D
,
Wang
D
,
Dong
J
et al.
False-negative results of real-time reverse-transcriptase polymerase chain reaction for severe acute respiratory syndrome coronavirus 2: role of deep-learning-based CT diagnosis and insights from two cases
.
Korean J Radiol
2020a
;
21
:
505
8
.

Li
Y
,
Yao
L
,
Li
J
et al.
Stability issues of RT‐PCR testing of SARS‐CoV‐2 for hospitalized patients clinically diagnosed with COVID‐19
.
J Med Virol
2020b
;
92
:
903
8
.

Mauldin
MR
,
McCollum
AM
,
Nakazawa
YJ
et al.
Exportation of monkeypox virus from the african continent
.
J Infect Dis
2022
;
225
:
1367
76
.

Mentes
A
,
Papp
K
,
Visontai
D
et al. ;
VEO Technical Working Group
.
Identification of mutations in SARS-CoV-2 PCR primer regions
.
Sci Rep
2022
;
12
:
18651
.

Naeem
R
,
Pain
A.
assayM: a web application to monitor mutations in COVID-19 diagnostic assays. bioRxiv, ,
2020
, preprint: not peer reviewed.

Pecoraro
V
,
Negro
A
,
Pirotti
T
et al.
Estimate false‐negative RT‐PCR rates for SARS‐CoV‐2. A systematic review and meta‐analysis
.
Eur J Clin Investigation
2022
;
52
:e13706.

Rahman
N
,
O'Cathail
C
,
Zyoud
A
et al.
Mobilisation and analyses of publicly available SARS-CoV-2 data for pandemic responses
.
Microb Genom
2024
;
10
:
001188
.

Rosenthal
SH
,
Gerasimova
A
,
Ruiz-Vega
R
et al.
Development and validation of a high throughput SARS-CoV-2 whole genome sequencing workflow in a clinical laboratory
.
Sci Rep
2022
;
12
:
2054
.

Sanderson
T
,
Barrett
JC.
Variation at spike position 142 in SARS-CoV-2 Delta genomes is a technical Artifact caused by dropout of a sequencing amplicon
.
Wellcome Open Res
2021
;
6
:
305
.

Ulhuq
FR
,
Barge
M
,
Falconer
K
et al.
Analysis of the ARTIC V4 and V4.1 SARS-CoV-2 primers and their impact on the detection of Omicron BA.1 and BA.2 lineage-defining mutations
.
Microb Genom
2023
;
9
:
000991
.

Visontai
D, Stéger J, Szalai-Gindl JM
et al.
Kooplex: collaborative data analytics portal for advancing sciences
. arXiv, arXiv:1911.09553,
2019
, preprint: not peer reviewed.

Vural-Ozdeniz
M
,
Akturk
A
,
Demirdizen
M
et al.
CoVrimer: a tool for aligning SARS-CoV-2 primer sequences and selection of conserved/degenerate primers
.
Genomics
2021
;
113
:
3174
84
.

Wang
L
,
Shang
J
,
Weng
S
et al.
Genomic annotation and molecular evolution of monkeypox virus outbreak in 2022
.
J Med Virol
2023
;
95
:
e28036
.

Wu
F
,
Zhao
S
,
Yu
B
et al.
A new coronavirus associated with human respiratory disease in China
.
Nature
2020
;
579
:
265
9
.

Zhou
P
,
Yang
X-L
,
Wang
X-G
et al.
A pneumonia outbreak associated with a new coronavirus of probable bat origin
.
Nature
2020
;
579
:
270
3
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Associate Editor: Christina Kendziorski
Christina Kendziorski
Associate Editor
Search for other works by this author on: