DataCite Reader

This notebook shows how to read metadata from DataCite, using a DataCite DOI.

Fetch metadata

from commonmeta import Metadata

# Fetch metadata from a DataCite DOI
string = 'https://doi.org/10.5061/dryad.8515'
metadata = Metadata(string)

# Check that metadata was fetched successfully
print(metadata.state)
findable

Inspect the metadata

The following metadata are required for each DataCite resource:

  • id: the persistent identifier of the resource, typically a DOI
  • type: the type of the resource in commonmeta format, e.g. Dataset, Software, PhysicalObject or JournalArticle
  • titles: the title(s) of the resource
  • creators: the creator(s)/author(s) of the resource
  • publisher: the publisher of the resource
  • publication_year: the publication year of the resource

In addition, there are plenty of optional metadata. They are converted into the standardized commonmeta format used internally. This format is close to the metadata format used by DataCite.

commonmeta = metadata.write()
print(commonmeta)
{
    "id": "https://doi.org/10.5061/dryad.8515",
    "type": "Dataset",
    "url": "https://datadryad.org/stash/dataset/doi:10.5061/dryad.8515",
    "contributors": [
        {
            "type": "Person",
            "contributorRoles": [
                "Author"
            ],
            "givenName": "Benjamin",
            "familyName": "Ollomo",
            "affiliation": [
                {
                    "name": "Centre International de Recherches M\u00e9dicales de Franceville"
                }
            ]
        },
        {
            "type": "Person",
            "contributorRoles": [
                "Author"
            ],
            "givenName": "Patrick",
            "familyName": "Durand",
            "affiliation": [
                {
                    "name": "French National Centre for Scientific Research"
                }
            ]
        },
        {
            "type": "Person",
            "contributorRoles": [
                "Author"
            ],
            "givenName": "Franck",
            "familyName": "Prugnolle",
            "affiliation": [
                {
                    "name": "French National Centre for Scientific Research"
                }
            ]
        },
        {
            "type": "Person",
            "contributorRoles": [
                "Author"
            ],
            "givenName": "Emmanuel J. P.",
            "familyName": "Douzery"
        },
        {
            "type": "Person",
            "contributorRoles": [
                "Author"
            ],
            "givenName": "C\u00e9line",
            "familyName": "Arnathau",
            "affiliation": [
                {
                    "name": "French National Centre for Scientific Research"
                }
            ]
        },
        {
            "type": "Person",
            "contributorRoles": [
                "Author"
            ],
            "givenName": "Dieudonn\u00e9",
            "familyName": "Nkoghe",
            "affiliation": [
                {
                    "name": "Centre International de Recherches M\u00e9dicales de Franceville"
                }
            ]
        },
        {
            "type": "Person",
            "contributorRoles": [
                "Author"
            ],
            "givenName": "Eric",
            "familyName": "Leroy",
            "affiliation": [
                {
                    "name": "Centre International de Recherches M\u00e9dicales de Franceville"
                }
            ]
        },
        {
            "type": "Person",
            "contributorRoles": [
                "Author"
            ],
            "givenName": "Fran\u00e7ois",
            "familyName": "Renaud",
            "affiliation": [
                {
                    "name": "French National Centre for Scientific Research"
                }
            ]
        }
    ],
    "titles": [
        {
            "title": "Data from: A new malaria agent in African hominids."
        }
    ],
    "publisher": {
        "name": "Dryad"
    },
    "date": {
        "published": "2011-02-01T17:22:41Z",
        "available": "2011-02-01T17:22:41Z"
    },
    "subjects": [
        {
            "subject": "Plasmodium",
            "schemeUri": "https://github.com/PLOS/plos-thesaurus",
            "subjectScheme": "PLOS Subject Area Thesaurus"
        },
        {
            "subject": "Malaria",
            "schemeUri": "https://github.com/PLOS/plos-thesaurus",
            "subjectScheme": "PLOS Subject Area Thesaurus"
        },
        {
            "subject": "mitochondrial genome"
        },
        {
            "subject": "Parasites"
        }
    ],
    "language": "en",
    "sizes": [
        "107328 bytes"
    ],
    "version": "1",
    "license": {
        "id": "CC0-1.0",
        "url": "https://creativecommons.org/publicdomain/zero/1.0/legalcode"
    },
    "descriptions": [
        {
            "description": "Plasmodium falciparum is the major human malaria agent responsible for 200 to 300 million infections and one to three million deaths annually, mainly among African infants. The origin and evolution of this pathogen within the human lineage is still unresolved. A single species, P. reichenowi, which infects chimpanzees, is known to be a close sister lineage of P. falciparum. Here we report the discovery of a new Plasmodium species infecting Hominids. This new species has been isolated in two chimpanzees (Pan troglodytes) kept as pets by villagers in Gabon (Africa). Analysis of its complete mitochondrial genome (5529 nucleotides including Cyt b, Cox I and Cox III genes) reveals an older divergence of this lineage from the clade that includes P. falciparum and P. reichenowi (approximately 21+/-9 Myrs ago using Bayesian methods and considering that the divergence between P. falciparum and P. reichenowi occurred 4 to 7 million years ago as generally considered in the literature). This time frame would be congruent with the radiation of hominoids, suggesting that this Plasmodium lineage might have been present in early hominoids and that they may both have experienced a simultaneous diversification. Investigation of the nuclear genome of this new species will further the understanding of the genetic adaptations of P. falciparum to humans. The risk of transfer and emergence of this new species in humans must be now seriously considered given that it was found in two chimpanzees living in contact with humans and its close relatedness to the most virulent agent of malaria.",
            "descriptionType": "Abstract"
        },
        {
            "description": "Ollomo_PLoSPathog_2009Nucleotide alignment concatenation of 4 mitochondrial genes for 17 Plasmodium species and one outgroup.Ollomo_PLoSPathog_2009_PHYMLMaximum likelihood tree inferred from the 4-gene concatenation using PHYML.",
            "descriptionType": "Other"
        }
    ],
    "geo_locations": [
        {
            "geoLocationPlace": "Africa"
        }
    ],
    "provider": "DataCite"
}