-
Notifications
You must be signed in to change notification settings - Fork 7
TableInfo: protein_gene.tsv
This table must not be manually created. Users should skip this, and all other tables marked "Built by script" in this summary, preparing only the rest of their datapackage's TSV files (those marked "Prepared by submitter") for submission. Once the "Prepared by submitter" tables are ready, users should then use the C2M2 submission prep script to automatically generate this table (and the other "Built by script" tables) using the information in the "Prepared by submitter" tables.
Each row in this table is equivalent to the statement "protein X is known to be associated with gene Y", for one particular (protein X, gene Y) pair; contents are autoloaded where available from UniProtKB by the submission prep script, which adds one row for every gene term associated with every protein term used in collection_protein.tsv.
USERS PLEASE NOTE: the creation of this table is currently stubbed pending reevaluation of the benefits of automating its construction in this way. The submission prep script will generate a header-only version of the TSV for this table, independent of any included protein values. This is expected and will not invalidate your submission.
All associations expressed in this table have been predetermined by the UniProt curators: associations included in protein_gene.tsv
for a given submission will be those that contain protein terms used in submitter-prepared tables.
Field | Field Description | Required? | Field Value Type | Extra Info |
---|---|---|---|---|
protein | A UniProt Knowledgebase (UniProtKB) protein accession (AC) [part 1 of 2-component composite primary key] | Required | string | Example: Q6GZX4
|
gene | An Ensembl gene ID [part 2 of 2-component composite primary key] | Required | string | Example: ENSG00000010404
|
-
Tutorials
-
C2M2 Table Guide
-
Table Summary
- analysis_type.tsv
- anatomy.tsv
- assay_type.tsv
- biofluid.tsv
- biosample.tsv
- biosample_disease.tsv
- biosample_from_subject.tsv
- biosample_gene.tsv
- biosample_in_collection.tsv
- biosample_substance.tsv
- collection.tsv
- collection_anatomy.tsv
- collection_biofluid.tsv
- collection_compound.tsv
- collection_defined_by_project.tsv
- collection_disease.tsv
- collection_gene.tsv
- collection_in_collection.tsv
- collection_phenotype.tsv
- collection_protein.tsv
- collection_substance.tsv
- collection_taxonomy.tsv
- compound.tsv
- data_type.tsv
- dcc.tsv (formerly
primary_dcc_contact.tsv
- disease.tsv
- file.tsv
- file_describes_biosample.tsv
- file_describes_collection.tsv
- file_describes_subject.tsv
- file_format.tsv
- file_in_collection.tsv
- gene.tsv
- id_namespace.tsv
- ncbi_taxonomy.tsv
- phenotype.tsv
- phenotype_disease.tsv
- phenotype_gene.tsv
- project.tsv
- project_in_project.tsv
- protein.tsv
- protein_gene.tsv
- subject.tsv
- subject_disease.tsv
- subject_in_collection.tsv
- subject_phenotype.tsv
- subject_race.tsv
- subject_role_taxonomy.tsv
- subject_substance.tsv
- substance.tsv
- Reference Tables
-
Table Summary