| Title: | Gene and Exon Data from Ensembl | 
| Version: | 1.0.2 | 
| Description: | Gene and exon information from Ensembl genome builds GRCh38.p13 (104) and GRCh37 (v40) to use with the 'topr' package. | 
| License: | LGPL (≥ 3) | 
| Encoding: | UTF-8 | 
| LazyData: | true | 
| LazyDataCompression: | xz | 
| RoxygenNote: | 7.2.0 | 
| Suggests: | testthat (≥ 3.0.0), knitr, rmarkdown, markdown | 
| Config/testthat/edition: | 3 | 
| Depends: | R (≥ 3.5.0) | 
| NeedsCompilation: | no | 
| Packaged: | 2023-02-10 10:56:27 UTC; thorhildur | 
| Author: | Thorhildur Juliusdottir [cph, aut, cre], Andri Stefansson [aut] | 
| Maintainer: | Thorhildur Juliusdottir <totajuliusd@gmail.com> | 
| Repository: | CRAN | 
| Date/Publication: | 2023-02-10 12:00:02 UTC | 
Ensembl exons build HG38-104-5-2
Description
Exon information from Ensembl release 104 (genome build GRCh38). Release notes: https://www.ensembl.info/2021/05/05/ensembl-104-has-been-released/
Usage
ENSEXONS
Format
A data frame with 40,122 rows and 7 variables:
- chrom
- Chromosome on build version 38 (GRCh38/hg38) 
- gene_start
- genetic position of gene start on build version 38 
- gene_end
- genetic position of gene end on build version 38 
- gene_symbol
- The name of the gene 
- exon_chromstart
- genetic positions of exon start 
- exon_chromend
- genetic position of exon end 
Details
exons on chrY and chrM were excluded from the exon dataset
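Examples
The sketch below shows one way to list the exons of a single gene from a data frame that has the columns documented above. It runs on a small made-up table rather than the real ENSEXONS object, and the helpers parse_positions and exons_for_gene are hypothetical names, not part of 'toprdata' or 'topr'. The source does not say whether exon_chromstart and exon_chromend hold one position per row or several, so the parser accepts either a single value or a comma-separated string; the "chr" prefix on chromosome names is likewise assumed.

```r
# Toy stand-in for ENSEXONS using the documented column names; values are made up.
exons <- data.frame(
  chrom           = c("chr1", "chr2"),
  gene_start      = c(1000, 5000),
  gene_end        = c(2000, 9000),
  gene_symbol     = c("GENE_A", "GENE_B"),
  exon_chromstart = c("1000,1500", "5000"),
  exon_chromend   = c("1200,2000", "5400"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: turn a position field into a numeric vector.
# Accepts single values as well as comma-separated strings (an assumption
# about the storage format, which the documentation does not specify).
parse_positions <- function(x) {
  as.numeric(unlist(strsplit(as.character(x), ",", fixed = TRUE)))
}

# Hypothetical helper: one output row per exon of the requested gene.
exons_for_gene <- function(exons, symbol) {
  rows   <- exons[exons$gene_symbol == symbol, , drop = FALSE]
  starts <- parse_positions(rows$exon_chromstart)
  ends   <- parse_positions(rows$exon_chromend)
  data.frame(
    gene_symbol = rep(symbol, length(starts)),
    exon_start  = starts,
    exon_end    = ends,
    stringsAsFactors = FALSE
  )
}

gene_a_exons <- exons_for_gene(exons, "GENE_A")
print(gene_a_exons)
```

If the package is installed, the same helper could be tried on ENSEXONS in place of the toy table, provided the real columns match the assumptions stated above.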
Ensembl exons build HG37 (GRCh37) v40 from gencode
Description
exons on chrY and chrM were excluded from the exon dataset
Usage
ENSEXONS_37
Format
A data frame with 62,739 rows and 7 variables:
- chrom
- Chromosome on build version 37 (GRCh37/hg19) 
- gene_start
- genetic position of gene start on build version 37 
- gene_end
- genetic position of gene end on build version 37 
- gene_symbol
- The name of the gene 
- exon_chromstart
- genetic positions of exon start 
- exon_chromend
- genetic position of exon end 
Ensembl genes build HG38.104-5-2
Description
Gene information from Ensembl release 104 (genome build GRCh38). Release notes: https://www.ensembl.info/2021/05/05/ensembl-104-has-been-released/
Usage
ENSGENES
Format
A data frame with 40,122 rows and 5 variables:
- chrom
- Chromosome on build version 38 (GRCh38/hg38) 
- gene_start
- genetic position of gene start on build version 38 
- gene_end
- genetic position of gene end on build version 38 
- gene_symbol
- The name of the gene 
- biotype
- the biotype of the gene 
Details
genes on chrY and chrM were excluded
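Examples
A common use of a gene table with these columns is to find the genes that overlap a genomic interval. The following is a minimal sketch on a made-up table with the documented column names; genes_in_region is a hypothetical helper, not a function from 'toprdata' or 'topr', and the "chr"-prefixed chromosome names and biotype values are assumptions for illustration.

```r
# Toy stand-in for ENSGENES using the documented column names; values are made up.
genes <- data.frame(
  chrom       = c("chr1", "chr1", "chr2"),
  gene_start  = c(1000, 5000, 1500),
  gene_end    = c(2000, 9000, 2500),
  gene_symbol = c("GENE_A", "GENE_B", "GENE_C"),
  biotype     = c("protein_coding", "lncRNA", "protein_coding"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: genes on `chrom` whose [gene_start, gene_end] span
# overlaps the closed interval [start, end].
genes_in_region <- function(genes, chrom, start, end) {
  overlaps <- genes$chrom == chrom &
    genes$gene_start <= end &
    genes$gene_end >= start
  genes[overlaps, , drop = FALSE]
}

hits <- genes_in_region(genes, "chr1", 1800, 6000)
print(hits$gene_symbol)
```

The overlap test keeps any gene that touches the interval, including genes that only partly fall inside it; a stricter containment test would compare gene_start >= start and gene_end <= end instead.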
Ensembl genes build HG37 (GRCh37) v40 from gencode
Description
genes on chrY and chrM were excluded
Usage
ENSGENES_37
Format
A data frame with 62,743 rows and 5 variables:
- chrom
- Chromosome on build version 37 (GRCh37/hg19) 
- gene_start
- genetic position of gene start on build version 37 
- gene_end
- genetic position of gene end on build version 37 
- gene_symbol
- The name of the gene 
- biotype
- the biotype of the gene 
toprdata
Description
Gene and exon datasets to use with the 'topr' package
Datasets
The four datasets are:
- ENSGENES: Gene information from genome build GRCh38
- ENSEXONS: Exon information from genome build GRCh38
- ENSGENES_37: Gene information from genome build GRCh37
- ENSEXONS_37: Exon information from genome build GRCh37
Examples
library(toprdata)
head(ENSGENES)
head(ENSEXONS)
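
Because the dataset names differ only by the _37 suffix, choosing between builds can be reduced to building the right name. The helper below, dataset_name, is a hypothetical convenience function and not part of the package; it only returns the name as a string, so it runs without 'toprdata' installed.

```r
# Hypothetical helper: map a genome build and feature type to a dataset name.
dataset_name <- function(build = c("38", "37"), type = c("genes", "exons")) {
  build <- match.arg(build)
  type  <- match.arg(type)
  base  <- if (type == "genes") "ENSGENES" else "ENSEXONS"
  if (build == "37") paste0(base, "_37") else base
}

print(dataset_name("38", "genes"))
print(dataset_name("37", "exons"))

# With the package attached, the name can be resolved to the data frame:
# library(toprdata)
# exons37 <- get(dataset_name("37", "exons"))
```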