ORF3d is a gene found in SARS-CoV-2 (the virus that causes COVID-19) and at least one closely related coronavirus found in pangolins, though it is not found in other closely related viruses within the Sarbecovirus subgenus. It is 57 codons long and encodes a novel 57 amino acid residue protein of unknown function.[1] At least two isoforms have been described, of which the shorter 33-residue form, ORF3d-2, may be more highly expressed, or even the only form expressed.[1][2] It is reported to be antigenic and antibodies to the ORF3d protein occur in patients recovered from COVID-19.[3] There is no homolog in the genome of the otherwise closely related SARS-CoV (which causes the disease SARS).[1][4]

ORF3d
Identifiers
OrganismSARS-CoV-2
SymbolORF3d
UniProtP0DTG0
Search for
StructuresSwiss-model
DomainsInterPro

Nomenclature

edit

There has been significant confusion in the scientific literature around the nomenclature used for the accessory proteins of SARS-CoV-2, especially several overlapping genes with ORF3a.[4] Many scientific papers have referred to ORF3d and its protein product as ORF3b, due to confusion caused by differences in the length of ORF3b in SARS-CoV (about 155 codons) and SARS-CoV-2 (only 22 codons).[4] Exacerbating the confusion, both the 57-codon protein product[5] and the 22-codon protein product[6] have been described to have similar effects as interferon antagonists.[4]

The recommended nomenclature for SARS-CoV-2 uses the term ORF3b for the 22-codon gene homologous to the 5' end of ORF3b in SARS-CoV, and uses the term ORF3d for the 57-codon gene.[4]

Comparative genomics

edit

ORF3d is an overlapping gene whose open reading frame overlaps both ORF3a and ORF3c in the SARS-CoV-2 genome. This potentially represents a rare example of all three possible reading frames of the same sequence region encoding functional proteins.[1][4] ORF3d is not present in SARS-CoV or other related coronaviruses, except for a coronavirus found in pangolins.[1] SARS-CoV-2 genome sequences have been extensively sampled throughout the COVID-19 pandemic, and examples of SARS-CoV-2 variants with truncations in ORF3d due to the introduction of a stop codon have been identified with relatively high prevalence.[1][7]

Bioinformatics analysis of the ORF3d region suggests that the sequence of the predicted protein product is not well conserved and raises the possibility that the gene does not encode a functional protein, despite experimental evidence of protein expression.[8]

Expression

edit

The ORF3d protein has two isoforms, one 57 amino acid residues long and one 33 residues long, the latter of which is known as ORF3d-2.[1] There is experimental evidence from studies such as ribosome profiling for expression of at least ORF3d-2, without clear evidence for the full-length ORF3d.[1][2]

Function

edit

The function of the ORF3d protein is not known, and it is possible that the gene does not code for a protein with any functional role in the viral life cycle.[8] When expressed under experimental conditions in cell culture, the ORF3d protein appears to be an interferon antagonist.[5]

Robust antibody responses to peptides from ORF3d have been reported in patients recovered from COVID-19.[3]

References

edit
  1. ^ a b c d e f g h Nelson CW, Ardern Z, Goldberg TL, Meng C, Kuo CH, Ludwig C, et al. (October 2020). "Dynamically evolving novel overlapping gene as a factor in the SARS-CoV-2 pandemic". eLife. 9: e59633. doi:10.7554/eLife.59633. PMC 7655111. PMID 33001029.
  2. ^ a b Finkel Y, Mizrahi O, Nachshon A, Weingarten-Gabbay S, Morgenstern D, Yahalom-Ronen Y, et al. (January 2021). "The coding capacity of SARS-CoV-2" (PDF). Nature. 589 (7840): 125–130. Bibcode:2021Natur.589..125F. doi:10.1038/s41586-020-2739-1. PMID 32906143. S2CID 218582461.
  3. ^ a b Hachim A, Kavian N, Cohen CA, Chin AW, Chu DK, Mok CK, et al. (October 2020). "ORF8 and ORF3b antibodies are accurate serological markers of early and late SARS-CoV-2 infection". Nature Immunology. 21 (10): 1293–1301. doi:10.1038/s41590-020-0773-7. PMID 32807944. S2CID 221136730.
  4. ^ a b c d e f Jungreis I, Nelson CW, Ardern Z, Finkel Y, Krogan NJ, Sato K, et al. (June 2021). "Conflicting and ambiguous names of overlapping ORFs in the SARS-CoV-2 genome: A homology-based resolution". Virology. 558: 145–151. doi:10.1016/j.virol.2021.02.013. hdl:1721.1/130363. PMC 7967279. PMID 33774510.
  5. ^ a b Lu R, Zhao X, Li J, Niu P, Yang B, Wu H, et al. (February 2020). "Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding". Lancet. 395 (10224): 565–574. doi:10.1016/S0140-6736(20)30251-8. PMC 7159086. PMID 32007145.
  6. ^ Konno Y, Kimura I, Uriu K, Fukushi M, Irie T, Koyanagi Y, et al. (September 2020). "SARS-CoV-2 ORF3b Is a Potent Interferon Antagonist Whose Activity Is Increased by a Naturally Occurring Elongation Variant". Cell Reports. 32 (12): 108185. doi:10.1016/j.celrep.2020.108185. PMC 7473339. PMID 32941788.
  7. ^ Lam JY, Yuen CK, Ip JD, Wong WM, To KK, Yuen KY, Kok KH (December 2020). "Loss of orf3b in the circulating SARS-CoV-2 strains". Emerging Microbes & Infections. 9 (1): 2685–2696. doi:10.1080/22221751.2020.1852892. PMC 7782295. PMID 33205709.
  8. ^ a b Jungreis I, Sealfon R, Kellis M (May 2021). "SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes". Nature Communications. 12 (1): 2642. Bibcode:2021NatCo..12.2642J. doi:10.1038/s41467-021-22905-7. hdl:1721.1/130581. PMC 8113528. PMID 33976134.