Data Models

PyUniProt uses SQLAlchemy to store the data in the database. Use an instance of pyuniprot.manager.query.QueryManager to query the content of the database.

Entity–relationship model:

_images/all.png

Entry

class pyuniprot.manager.models.Entry(**kwargs)[source]

UniProt entry with relations to other models

Query Entry with pyuniprot.manager.query.QueryManager.entry()

Variables:

Table view

_images/sequence.png

Links

For more information on UniProt website:

Accession

class pyuniprot.manager.models.Accession(**kwargs)[source]

Provides a stable way of identifying UniProtKB entries.

Variables:

Table view

_images/accession.png

Links

More information about accession numbers on UniProt help about ‘Accession’

OtherGeneName

class pyuniprot.manager.models.OtherGeneName(**kwargs)[source]

All gene names which are not primary

Query OtherGeneName with pyuniprot.manager.query.QueryManager.other_gene_names()

Variables:

Table view

_images/other_gene_name.png

Links

For more information on UniProt website:

Sequence

class pyuniprot.manager.models.Sequence(**kwargs)[source]

Amino acid sequence

Variables:

Table view

_images/sequence.png

Links

For more information on UniProt website:

Disease

class pyuniprot.manager.models.Disease(**kwargs)[source]

Variables:
  • identifier (str) – Disease identifier
  • ref_id (str) – Disease reference identifier
  • ref_type (str) – Disease reference type
  • name (str) – Disease name
  • acronym (str) – Disease acronym
  • description (str) – Disease description
  • disease_comments (list) – 1:n relation to DiseaseComment

Table view

_images/disease.png
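
The 1:n relation between Disease and DiseaseComment can be illustrated with a minimal sketch using Python's built-in sqlite3 module. The table names, column names, and sample values below are assumptions made for illustration; they are not taken from the PyUniProt schema, and the helper comments_for is hypothetical.

```python
import sqlite3

# Hypothetical, simplified schema: names are illustrative only and are
# not copied from PyUniProt's actual models.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE disease (
        id INTEGER PRIMARY KEY,
        identifier TEXT,
        name TEXT,
        acronym TEXT
    );
    CREATE TABLE disease_comment (
        id INTEGER PRIMARY KEY,
        comment TEXT,
        disease_id INTEGER REFERENCES disease(id)
    );
""")

# One disease row (made-up values) ...
conn.execute(
    "INSERT INTO disease (id, identifier, name, acronym) VALUES (?, ?, ?, ?)",
    (1, "DI-EXAMPLE", "Example disease", "EXD"),
)
# ... referenced by several comment rows: the "n" side of the 1:n relation.
conn.executemany(
    "INSERT INTO disease_comment (comment, disease_id) VALUES (?, ?)",
    [("first comment", 1), ("second comment", 1)],
)


def comments_for(connection, disease_id):
    """Return all comment texts that point at the given disease row."""
    rows = connection.execute(
        "SELECT comment FROM disease_comment WHERE disease_id = ? ORDER BY id",
        (disease_id,),
    )
    return [row[0] for row in rows]


disease_comments = comments_for(conn, 1)
```

In an ORM such as SQLAlchemy, a relation of this kind is typically exposed as a list-like attribute on the parent object, which is what disease_comments represents here.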

DiseaseComment

class pyuniprot.manager.models.DiseaseComment(**kwargs)[source]

Disease and comment linked to an entry (protein)

Variables:

AlternativeFullName

class pyuniprot.manager.models.AlternativeFullName(**kwargs)[source]

Alternative full name

Variables:

Table view

_images/alternative_full_name.png

Links

More information about alternative names on UniProt help about ‘Protein names’

AlternativeShortName

class pyuniprot.manager.models.AlternativeShortName(**kwargs)[source]

Alternative short name

Variables:

Table view

_images/alternative_short_name.png

Links

More information about alternative names on UniProt help about ‘Protein names’

Pmid

class pyuniprot.manager.models.Pmid(**kwargs)[source]

PMID - The unique identifier assigned to a record when it enters PubMed.

Variables:
  • pmid (str) – PubMed identifier
  • first (str) – first page of publication
  • last (str) – last page of publication
  • volume (int) – Volume
  • name (str) – Name of Journal
  • date (datetime.datetime) – Publication date
  • title (str) – Title of publication

Table view

_images/pmid.png
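
To make the documented field types concrete, the following is a minimal sketch of a record with the same fields as listed above. PmidRecord and its citation method are hypothetical stand-ins, not the PyUniProt class, and all values are made up. Treating missing pages or volume as optional is also an assumption.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class PmidRecord:
    """Hypothetical stand-in mirroring the documented Pmid fields."""
    pmid: str              # PubMed identifier
    first: Optional[str]   # first page, kept as text
    last: Optional[str]    # last page, kept as text
    volume: Optional[int]  # journal volume
    name: str              # journal name
    date: datetime         # publication date
    title: str             # publication title

    def citation(self) -> str:
        """Format a short citation string from the stored fields."""
        pages = f"{self.first}-{self.last}" if self.first and self.last else "n/a"
        return f"{self.title}. {self.name} {self.volume}:{pages} ({self.date.year})"


record = PmidRecord(
    pmid="0000000", first="10", last="20", volume=5,
    name="Example Journal", date=datetime(2001, 1, 1), title="An example title",
)
citation = record.citation()
```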

Links

OrganismHost

class pyuniprot.manager.models.OrganismHost(**kwargs)[source]

NCBI taxonomy database identifier of the organism host

Variables:

Table view

_images/organism_host.png

Links

DbReference

class pyuniprot.manager.models.DbReference(**kwargs)[source]

Cross reference to other databases and information resources

Variables:
  • type_ (str) – Type of cross reference
  • identifier (str) – Unique identifier of cross reference
  • entry (Entry) – Entry object

Table view

_images/db_reference.png

Links

Feature

class pyuniprot.manager.models.Feature(**kwargs)[source]

Sequence annotations describe regions or sites of interest in the protein sequence, such as post-translational modifications, binding sites, enzyme active sites, local secondary structure or other characteristics reported in the cited references. At the moment, positions are not stored. If you need them, contact the PyUniProt team on GitHub.

Variables:
  • type_ (str) – Type of feature
  • identifier (str) – Feature identifier
  • description (str) – Feature description
  • entry (Entry) – Entry object

Table view

_images/feature.png

Links

Function

class pyuniprot.manager.models.Function(**kwargs)[source]

General description of the function(s) of a protein

Variables:

Table view

_images/function.png

Links

Keyword

class pyuniprot.manager.models.Keyword(**kwargs)[source]

UniProt keywords summarise the content of a UniProtKB entry and facilitate the search for proteins of interest.

Variables:
  • name (str) – Keyword
  • identifier (str) – Keyword identifier
  • entries (list) – list of Entry objects

Table view

_images/keyword.png
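
The entries attribute suggests that one keyword can belong to many entries and one entry can carry many keywords. A minimal sketch of such a many-to-many relation, using Python's built-in sqlite3 module, is shown below. The association table entry_keyword, all column names, the sample rows, and the helper entries_for_keyword are assumptions for illustration and are not taken from the PyUniProt schema.

```python
import sqlite3

# Hypothetical, simplified schema: an association table links entries
# and keywords. Names are illustrative, not PyUniProt's real tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE entry (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE keyword (id INTEGER PRIMARY KEY, identifier TEXT, name TEXT);
    CREATE TABLE entry_keyword (
        entry_id INTEGER REFERENCES entry(id),
        keyword_id INTEGER REFERENCES keyword(id),
        PRIMARY KEY (entry_id, keyword_id)
    );
""")

# Made-up sample rows.
conn.executemany(
    "INSERT INTO entry (id, name) VALUES (?, ?)",
    [(1, "ENTRY_A"), (2, "ENTRY_B")],
)
conn.executemany(
    "INSERT INTO keyword (id, identifier, name) VALUES (?, ?, ?)",
    [(1, "KW-A", "Example keyword A"), (2, "KW-B", "Example keyword B")],
)
# ENTRY_A has keyword A; ENTRY_B has keywords A and B.
conn.executemany(
    "INSERT INTO entry_keyword (entry_id, keyword_id) VALUES (?, ?)",
    [(1, 1), (2, 1), (2, 2)],
)


def entries_for_keyword(connection, keyword_name):
    """Return the names of all entries annotated with the given keyword."""
    rows = connection.execute(
        """
        SELECT e.name
        FROM entry AS e
        JOIN entry_keyword AS ek ON ek.entry_id = e.id
        JOIN keyword AS k ON k.id = ek.keyword_id
        WHERE k.name = ?
        ORDER BY e.name
        """,
        (keyword_name,),
    )
    return [row[0] for row in rows]


keyword_a_entries = entries_for_keyword(conn, "Example keyword A")
keyword_b_entries = entries_for_keyword(conn, "Example keyword B")
```

The same pattern would apply to the other models that expose an entries list, such as SubcellularLocation.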

Links

ECNumber

class pyuniprot.manager.models.ECNumber(**kwargs)[source]

The Enzyme Commission number (EC number) classifies enzymes by the reactions they catalyze

Variables:

Table view

_images/ec_number.png

Links

SubcellularLocation

class pyuniprot.manager.models.SubcellularLocation(**kwargs)[source]

Subcellular location of protein

Variables:
  • location (str) – Subcellular location
  • entries (list) – list of Entry objects

Table view

_images/subcellular_location.png

Links

TissueSpecificity

class pyuniprot.manager.models.TissueSpecificity(**kwargs)[source]

Description of the expression of a gene in different tissues

Variables:

Table view

_images/tissue_specificity.png

Links

TissueInReference

class pyuniprot.manager.models.TissueInReference(**kwargs)[source]

Tissue described in the reference

Variables:
  • entries (list) – list of Entry objects

Table view