Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/26553
Title: GENCODE: reference annotation for the human and mouse genomes in 2023
Authors: Frankish, A
Carbonell-Sala, S
Diekhans, M
Jungreis, I
Loveland, JE
Mudge, JM
Sisu, C
Wright, JC
Arnan, C
Barnes, I
Banerjee, A
Bennett, R
Berry, A
Bignell, A
Boix, C
Calvet, F
Cerdán-Vélez, D
Cunningham, F
Davidson, C
Donaldson, S
Dursun, C
Fatima, R
Giorgetti, S
Giron, CG
Gonzalez, JM
Hardy, M
Harrison, PW
Hourlier, T
Hollis, Z
Hunt, T
James, B
Jiang, Y
Johnson, R
Kay, M
Lagarde, J
Martin, FJ
Gómez, LM
Nair, S
Ni, P
Pozo, F
Ramalingam, V
Ruffier, M
Schmitt, BM
Schreiber, JM
Steed, E
Suner, M-M
Sumathipala, D
Sycheva, I
Uszczynska-Ratajczak, B
Wass, E
Yang, YT
Yates, A
Zafrulla, Z
Choudhary, JS
Gerstein, M
Guigo, R
Hubbard, TJP
Kellis, M
Kundaje, A
Paten, B
Tress, ML
Flicek, P
Issue Date: 24-Nov-2022
Publisher: Oxford University Press on behalf of Nucleic Acids Research
Citation: Frankish, A. et al. (2023) 'GENCODE: reference annotation for the human and mouse genomes in 2023', Nucleic Acids Research, 51 (D1), pp. D942-D949. doi: 10.1093/nar/gkac1071.
Abstract: Copyright © The Author(s) 2022. GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Here, we present an update on the annotation of human and mouse genes, including developments in the tools, data, analyses and major collaborations which underpin this progress. For example, we report the creation of a set of non-canonical ORFs identified in GENCODE transcripts, the LRGASP collaboration to assess the use of long transcriptomic data to build transcript models, the progress in collaborations with RefSeq and UniProt to increase convergence in the annotation of human and mouse protein-coding genes, the propagation of GENCODE across the human pan-genome and the development of new tools to support annotation of regulatory features by GENCODE. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.
Description: Data availability: No new data were generated or analysed in support of this research.
URI: https://bura.brunel.ac.uk/handle/2438/26553
DOI: https://doi.org/10.1093/nar/gkac1071
Other Identifiers: ORCID iDs: Adam Frankish https://orcid.org/0000-0002-4333-628X; Mark Diekhans https://orcid.org/0000-0002-0430-0989; Irwin Jungreis https://orcid.org/0000-0002-3197-5367; Jane E Loveland https://orcid.org/0000-0002-7669-2934; Cristina Sisu https://orcid.org/0000-0001-9371-0797; Carme Arnan https://orcid.org/0000-0002-7431-2088; Fiona Cunningham https://orcid.org/0000-0002-7445-2419; Carlos García Girón https://orcid.org/0000-0002-0935-7271; Peter W Harrison https://orcid.org/0000-0002-4007-2899; Thibaut Hourlier https://orcid.org/0000-0003-4894-7773; Rory Johnson https://orcid.org/0000-0003-4607-2782; Fergal J Martin https://orcid.org/0000-0002-1672-050X; Surag Nair https://orcid.org/0000-0002-6216-2457; Magali Ruffier https://orcid.org/0000-0002-8386-1580; Marie-Marthe Suner https://orcid.org/0000-0002-0380-7171; Andrew Yates https://orcid.org/0000-0002-8886-4772; Anshul Kundaje https://orcid.org/0000-0003-3084-2287; Benedict Paten https://orcid.org/0000-0001-8863-3539; Michael L Tress https://orcid.org/0000-0001-9046-6370; Paul Flicek https://orcid.org/0000-0002-3897-7955.
Appears in Collections: Dept of Life Sciences Research Papers

Files in This Item:
File: FullText.pdf
Description: Copyright © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Size: 1.52 MB
Format: Adobe PDF


This item is licensed under a Creative Commons License.