Occurrence download formats
Data downloads are available from GBIF in three primary formats:
-
Simple. This format contains a selection of commonly used terms, after the data has been aligned to GBIF’s taxonomic and geographic indices and structured vocabularies
-
Downloads created on www.gbif.org or through the API using the format
SIMPLE_CSV
are produced in a tab-separated text format, suitable for use with spreadsheets and programming/scripting languages -
Occurrence data accessed through cloud services, or with the API format
SIMPLE_PARQUET
, are produced in Apache Parquet format. The fields are the same as for tab-separated text format.
-
-
Darwin Core Archive (API:
DWCA
). This is a compressed Zip file, containing data in tab-separated text format, and metadata in XML format.-
occurrence.txt
contains occurrence data after interpretation by GBIF’s systems. -
multimedia.txt
contains information on multimedia (images, audio, video) relating to the occurrences. -
verbatim.txt
contains the original, uninterpreted data, without modifications by GBIF’s systems.
-
-
Species List (API:
SPECIES_LIST
). This is a summary format containing the distinct list of species names returned by the filter.
The header row (first row) of all these files contain the short name of the terms they contain. Most of the terms are defined by the Darwin Core standard. For example, the column catalogNumber
contains data of the Darwin Core term http://rs.tdwg.org/dwc/terms/catalogNumber.
Simple download β Term definitions
The definitions marked with π¦ are from the Darwin Core standard.
The definitions marked with π are from GBIF, and may reflect the result of interpretation and data quality procedures applied by GBIF, or they may not be part of Darwin Core.
Column name | Data type | Nullable | Definition |
---|---|---|---|
String |
No |
π Unique GBIF key for the occurrence. We aim to keep these keys stable, but this is not possible in every case. |
|
String |
No |
π The UUID of the GBIF dataset containing this occurrence. |
|
String |
Yes |
π¦ An identifier for the Occurrence (as opposed to a particular digital record of the occurrence). In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will most closely make the occurrenceID globally unique. |
|
String |
Yes |
π The kingdom name (excluding authorship) for the kingdom from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The phylum name (excluding authorship) for the phylum from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The class name (excluding authorship) for the class from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The order name (excluding authorship) for the order from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The family name (excluding authorship) for the family from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The genus name (excluding authorship) for the genus from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The species name (excluding authorship) for the species from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The infraspecific name part of the species name from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The taxonomic rank of the most specific name in the scientificName. |
|
String |
Yes |
π The scientific name (including authorship) for the taxon from the GBIF backbone matched to this occurrence. This could be a synonym, see also |
|
String |
Yes |
||
String |
Yes |
π¦ The authorship information for the scientificName formatted according to the conventions of the applicable nomenclaturalCode. |
|
String |
Yes |
π The 2-letter country code (as per ISO-3166-1) of the country, territory or area in which the occurrence was recorded. |
|
String |
Yes |
||
String |
Yes |
π The name of the next-smaller administrative region than country (state, province, canton, department, region, etc.) in which the occurrence occurs. This value is unaltered by GBIF’s processing; see also the GADM fields. |
|
String |
Yes |
π A statement about the presence or absence of a Taxon at a Location. For definitions, see the GBIF occurrence status vocabulary. |
|
Integer |
Yes |
π The number of individuals present at the time of the Occurrence. |
|
String |
Yes |
π The UUID of the organization which publishes the dataset containing this occurrence. |
|
Double |
Yes |
π The geographic latitude (in decimal degrees, using the WGS84 datum) of the geographic centre of the location of the occurrence. |
|
Double |
Yes |
π The geographic longitude (in decimal degrees, using the WGS84 datum) of the geographic centre of the location of the occurrence. |
|
Double |
Yes |
π The horizontal distance (in metres) from the given decimalLatitude and decimalLongitude describing the smallest circle containing the whole of the Location. |
|
Double |
Yes |
π A decimal representation of the precision of the coordinates given in the decimalLatitude and decimalLongitude. |
|
Double |
Yes |
π Elevation (altitude) in metres above sea level. This is not a current Darwin Core term. |
|
Double |
Yes |
π The value of the potential error associated with the elevation. This is not a current Darwin Core term. |
|
Double |
Yes |
π Depth in metres below sea level. This is not a current Darwin Core term. |
|
Double |
Yes |
π The value of the potential error associated with the depth. This is not a current Darwin Core term. |
|
ISO 8601 Date |
Yes |
π The date-time during which an Event occurred. For occurrences, this is the date-time when the event was recorded. Not suitable for a time in a geological context. Note: This field is planned to expand to allow date ranges. See issue. |
|
Integer |
Yes |
π The integer day of the month on which the Event occurred. |
|
Integer |
Yes |
||
Integer |
Yes |
π The four-digit year in which the event occurred, according to the Common Era calendar. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the most specific (lowest rank) taxon for this occurrence. This could be a synonym, see |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the species of thisoccurrence. |
|
String |
Yes |
π The values of the Darwin Core term Basis of Record which can apply to occurrences. See GBIF’s Darwin Core Type Vocabulary for definitions. |
|
String |
Yes |
π¦ The name (or acronym) in use by the institution having custody of the object(s) or information referred to in the record. |
|
String |
Yes |
π¦ The name, acronym, coden, or initialism identifying the collection or data set from which the record was derived. |
|
String |
Yes |
π¦ An identifier (preferably unique) for the record within the data set or collection. |
|
String |
Yes |
π¦ An identifier given to the Occurrence at the time it was recorded. Often serves as a link between field notes and an Occurrence record, such as a specimen collector’s number. |
|
String array, delimited with |
Yes |
π A list (concatenated and separated) of names of people, groups, or organizations who assigned the Taxon to the occurrence. |
|
ISO 8601 Date |
Yes |
π The date on which the subject was determined as representing the Taxon. |
|
String |
Yes |
π A legal document giving official permission to do something with the occurrence. |
|
String |
Yes |
π¦ A person or organization owning or managing rights over the resource. |
|
String array, delimited with |
Yes |
π A person, group, or organization responsible for recording the original occurrence. |
|
String array, delimited with |
Yes |
π A list (concatenated and separated) of nomenclatural types (type status, typified scientific name, publication) applied to the occurrence. |
|
String structure |
Yes |
π Statement about whether an organism or organisms have been introduced to a given place and time through the direct or indirect activity of modern humans. Values are aligned to the GBIF EstablishmentMeans vocabulary,which is derived from the Darwin Core EstablishmentMeans vocabulary. |
|
ISO 8601 Date |
Yes |
π The time this occurrence was last processed by GBIF’s interpretation system βPipelinesβ. This is the time the record was last changed in GBIF, not the time the record was last changed by the publisher. Data is also reprocessed when we changed the taxonomic backbone, geographic data sources or other interpretation procedures. An earlier interpretation system distinguished between βparsingβ and βinterpretationβ, but in the current system there is only one process β the two dates will always be the same. |
|
String array, delimited with |
Yes |
π The media type given as Dublin Core type values, in particular StillImage, MovingImage or Sound. |
|
String array, delimited with |
Yes |
π A specific interpretation issue found during processing and interpretation of the record. See the list of occurrence issues and the OccurrenceIssue enumeration for possible values and definitions. |
DWCA downloads
Darwin Core Archive downloads from gbif.org contain the following files:
occurrence.txt
-
Occurrence data after interpretation by GBIF. Described in detail below.
multimedia.txt
-
Occurrence multimedia data after interpretation by GBIF. Described in detail below.
verbatim.txt
-
Occurrence data without interpretation by GBIF. Described in detail below.
meta.xml
-
The Darwin Core Archive metafile, describing the structure of the archive β the file formats, column names and their terms.
metadata.xml
-
Metadata about the download in Ecological Metadata Language (EML).
rights.txt
-
Licence information for all the datasets with occurrences in the download.
citations.txt
-
Citations for all the datasets with occurrences in the download.
dataset/*.xml
-
EML metadata for every dataset with occurrences in the download.
The data may be read without any special tools, including by spreadsheets such as Microsoft Excel and LibreOffice Calc (see the FAQ). The .txt
files are tab-delimited, and all files are in UTF-8 encoding with Unix-style (\n
) line endings.
There are libraries to read Darwin Core Archives in these programming languages:
-
Java β GBIF dwca-io
-
.NET β DwC-A_dotnet
-
Python β Python DWCA Reader
-
R β finch (NB: abandoned library)
-
Ruby β dwc-archive
Interpreted term definitions (occurrence.txt
)
This is the Darwin Core Archive core entity, with row type Occurrence. Values are tab-delimited and in UTF-8 encoding.
Column name | Data type | Nullable | Definition |
---|---|---|---|
String |
No |
π Unique GBIF key for the occurrence. We aim to keep these keys stable, but this is not possible in every case. |
|
String |
Yes |
π¦ Information about who can access the resource or an indication of its security status. |
|
String |
Yes |
π¦ A bibliographic reference for the resource as a statement indicating how this record should be cited (attributed) when used. |
|
String |
Yes |
||
String |
Yes |
π A legal document giving official permission to do something with the occurrence. |
|
ISO 8601 Date |
Yes |
π The most recent date-time on which the occurrence was changed, according to the publisher. |
|
String |
Yes |
||
String |
Yes |
π A related resource that is referenced, cited, or otherwise pointed to by the described resource. |
|
String |
Yes |
π¦ A person or organization owning or managing rights over the resource. |
|
String |
Yes |
||
String |
Yes |
π¦ An identifier for the institution having custody of the object(s) or information referred to in the record. |
|
String |
Yes |
π¦ An identifier for the collection or dataset from which the record was derived. |
|
String array, delimited with |
Yes |
π An identifier for the set of data. May be a global unique identifier or an identifier specific to a collection or institution. |
|
String |
Yes |
π¦ The name (or acronym) in use by the institution having custody of the object(s) or information referred to in the record. |
|
String |
Yes |
π¦ The name, acronym, coden, or initialism identifying the collection or data set from which the record was derived. |
|
String array, delimited with |
Yes |
π The name identifying the data set from which the record was derived. |
|
String |
Yes |
π¦ The name (or acronym) in use by the institution having ownership of the object(s) or information referred to in the record. |
|
String |
Yes |
π The values of the Darwin Core term Basis of Record which can apply to occurrences. See GBIF’s Darwin Core Type Vocabulary for definitions. |
|
String |
Yes |
π¦ Additional information that exists, but that has not been shared in the given record. |
|
String |
Yes |
π¦ Actions taken to make the shared data less specific or complete than in its original form. Suggests that alternative data of higher quality may be available on request. |
|
String |
Yes |
π¦ A list of additional measurements, facts, characteristics, or assertions about the record. Meant to provide a mechanism for structured content. |
|
String |
Yes |
π¦ An identifier for the Occurrence (as opposed to a particular digital record of the occurrence). In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will most closely make the occurrenceID globally unique. |
|
String |
Yes |
π¦ An identifier (preferably unique) for the record within the data set or collection. |
|
String |
Yes |
π¦ An identifier given to the Occurrence at the time it was recorded. Often serves as a link between field notes and an Occurrence record, such as a specimen collector’s number. |
|
String array, delimited with |
Yes |
π A person, group, or organization responsible for recording the original occurrence. |
|
String array, delimited with |
Yes |
π¦ A list (concatenated and separated) of the globally unique identifier for the person, people, groups, or organizations responsible for recording the original Occurrence. |
|
Integer |
Yes |
π The number of individuals present at the time of the Occurrence. |
|
String |
Yes |
π A number or enumeration value for the quantity of organisms. |
|
String |
Yes |
π The type of quantification system used for the quantity of organisms. |
|
String |
Yes |
π The sex of the biological individual(s) represented in the Occurrence. For definitions, see the GBIF sex vocabulary |
|
String structure |
Yes |
π The age class or life stage of the Organism(s) at the time the Occurrence was recorded. Values are aligned to the GBIF LifeStage vocabulary |
|
String |
Yes |
π¦ The reproductive condition of the biological individual(s) represented in the Occurrence. |
|
String |
Yes |
π¦ The behavior shown by the subject at the time the Occurrence was recorded. |
|
String structure |
Yes |
π Statement about whether an organism or organisms have been introduced to a given place and time through the direct or indirect activity of modern humans. Values are aligned to the GBIF EstablishmentMeans vocabulary,which is derived from the Darwin Core EstablishmentMeans vocabulary. |
|
String structure |
Yes |
π The degree to which an Organism survives, reproduces, and expands its range at the given place and time. Values are aligned to the GBIF DegreeOfEstablishment vocabulary,which is derived from the Darwin Core DegreeOfEstablishment vocabulary. |
|
String structure |
Yes |
π The process by which an Organism came to be in a given place at a given time. Values are aligned to the GBIF Pathway vocabulary,which is derived from the Darwin Core Pathway vocabulary. |
|
String |
Yes |
π¦ A categorical description of the extent to which the georeference has been verified to represent the best possible spatial description for the Location of the Occurrence. |
|
String |
Yes |
π A statement about the presence or absence of a Taxon at a Location. For definitions, see the GBIF occurrence status vocabulary. |
|
String array, delimited with |
Yes |
||
String |
Yes |
π¦ The current state of a specimen with respect to the collection identified in collectionCode or collectionID. |
|
String |
Yes |
π¦ A list (concatenated and separated) of identifiers of other Occurrence records and their associations to this Occurrence. |
|
String |
Yes |
π¦ A list (concatenated and separated) of identifiers (publication, bibliographic reference, global unique identifier, URI) of literature associated with the Occurrence. |
|
String |
Yes |
π¦ A list (concatenated and separated) of identifiers (publication, global unique identifier, URI) of genetic sequence information associated with the Occurrence. |
|
String |
Yes |
π¦ A list (concatenated and separated) of identifiers or names of taxa and the associations of this Occurrence to each of them. |
|
String array, delimited with |
Yes |
π A list (concatenated and separated) of previous or alternate fully qualified catalogue numbers or other human-used identifiers for the same occurrence, whether in the current or any other data set or collection. |
|
String |
Yes |
||
String |
Yes |
π¦ An identifier for the Organism instance (as opposed to a particular digital record of the Organism). May be a globally unique identifier or an identifier specific to the data set. |
|
String |
Yes |
π¦ A textual name or label assigned to an Organism instance. |
|
String |
Yes |
π¦ A description of the kind of Organism instance. Can be used to indicate whether the Organism instance represents a discrete organism or if it represents a particular type of aggregation. |
|
String |
Yes |
π¦ A list (concatenated and separated) of identifiers of other Organisms and the associations of this Organism to each of them. |
|
String |
Yes |
π¦ A list (concatenated and separated) of previous assignments of names to the Organism. |
|
String |
Yes |
||
String |
Yes |
π¦ An identifier for the MaterialSample (as opposed to a particular digital record of the material sample). In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will most closely make the materialSampleID globally unique. |
|
String |
Yes |
π¦ An identifier for the set of information associated with an Event (something that occurs at a place and time). May be a global unique identifier or an identifier specific to the data set. |
|
String |
Yes |
π¦ An identifier for the broader Event that groups this and potentially other Events. |
|
String |
Yes |
π¦ An identifier given to the event in the field. Often serves as a link between field notes and the Event. |
|
ISO 8601 Date |
Yes |
π The date-time during which an Event occurred. For occurrences, this is the date-time when the event was recorded. Not suitable for a time in a geological context. Note: This field is planned to expand to allow date ranges. See issue. |
|
String |
Yes |
||
String |
Yes |
π¦ The earliest integer day of the year on which the Event occurred (1 for January 1, 365 for December 31, except in a leap year, in which case it is 366). |
|
String |
Yes |
π¦ The latest integer day of the year on which the Event occurred (1 for January 1, 365 for December 31, except in a leap year, in which case it is 366). |
|
Integer |
Yes |
π The four-digit year in which the event occurred, according to the Common Era calendar. |
|
Integer |
Yes |
||
Integer |
Yes |
π The integer day of the month on which the Event occurred. |
|
String |
Yes |
π¦ The verbatim original representation of the date and time information for an Event. |
|
String |
Yes |
π¦ A category or description of the habitat in which the Event occurred. |
|
String array, delimited with |
Yes |
π The methods or protocols used during an Event, denoted by an IRI. |
|
String |
Yes |
π A numeric value for a measurement of the size (time duration, length, area, or volume) of a sample in a sampling event. |
|
String |
Yes |
π The unit of measurement of the size (time duration, length, area, or volume) of a sample in a sampling event. |
|
String |
Yes |
||
String |
Yes |
π¦ One of a) an indicator of the existence of, b) a reference to (publication, URI), or c) the text of notes taken in the field about the Event. |
|
String |
Yes |
||
String |
Yes |
π¦ An identifier for the set of location information (data associated with dcterms:Location). May be a global unique identifier or an identifier specific to the data set. |
|
String |
Yes |
π¦ An identifier for the geographic region within which the Location occurred. |
|
String |
Yes |
π¦ A list (concatenated and separated) of geographic names less specific than the information captured in the locality term. |
|
String |
Yes |
π The continent, based on a 7 continent model described on Wikipedia and the World Geographical Scheme for Recording Plant Distributions (WGSRPD). In particular this splits the Americas into North and South America with North America including the Caribbean (except Trinidad and Tobago) and reaching down and including Panama. See the GBIF Continents for the exact divisions. This is a geographical division. See |
|
String |
Yes |
π The name of the water body in which the Location occurs. |
|
String |
Yes |
π¦ The name of the island group in which the Location occurs. |
|
String |
Yes |
π¦ The name of the island on or near which the Location occurs. |
|
String |
Yes |
π The 2-letter country code (as per ISO-3166-1) of the country, territory or area in which the occurrence was recorded. |
|
String |
Yes |
π The name of the next-smaller administrative region than country (state, province, canton, department, region, etc.) in which the occurrence occurs. This value is unaltered by GBIF’s processing; see also the GADM fields. |
|
String |
Yes |
π¦ The full, unabbreviated name of the next smaller administrative region than stateProvince (county, shire, department, etc.) in which the Location occurs. |
|
String |
Yes |
π¦ The full, unabbreviated name of the next smaller administrative region than county (city, municipality, etc.) in which the Location occurs. Do not use this term for a nearby named place that does not contain the actual location. |
|
String |
Yes |
||
String |
Yes |
||
String |
Yes |
π¦ The original description of the elevation (altitude, usually above sea level) of the Location. |
|
String |
Yes |
π¦ The vertical datum used as the reference upon which the values in the elevation terms are based. |
|
String |
Yes |
π¦ The original description of the depth below the local surface. |
|
String |
Yes |
π¦ The lesser distance in a range of distance from a reference surface in the vertical direction, in meters. Use positive values for locations above the surface, negative values for locations below. If depth measures are given, the reference surface is the location given by the depth, otherwise the reference surface is the location given by the elevation. |
|
String |
Yes |
π¦ The greater distance in a range of distance from a reference surface in the vertical direction, in meters. Use positive values for locations above the surface, negative values for locations below. If depth measures are given, the reference surface is the location given by the depth, otherwise the reference surface is the location given by the elevation. |
|
String |
Yes |
π¦ Information about the source of this Location information. Could be a publication (gazetteer), institution, or team of individuals. |
|
String |
Yes |
||
Double |
Yes |
π The geographic latitude (in decimal degrees, using the WGS84 datum) of the geographic centre of the location of the occurrence. |
|
Double |
Yes |
π The geographic longitude (in decimal degrees, using the WGS84 datum) of the geographic centre of the location of the occurrence. |
|
Double |
Yes |
π The horizontal distance (in metres) from the given decimalLatitude and decimalLongitude describing the smallest circle containing the whole of the Location. |
|
Double |
Yes |
π A decimal representation of the precision of the coordinates given in the decimalLatitude and decimalLongitude. |
|
String |
Yes |
π¦ The ratio of the area of the point-radius (decimalLatitude, decimalLongitude, coordinateUncertaintyInMeters) to the area of the true (original, or most specific) spatial representation of the Location. Legal values are 0, greater than or equal to 1, or undefined. A value of 1 is an exact match or 100% overlap. A value of 0 should be used if the given point-radius does not completely contain the original representation. The pointRadiusSpatialFit is undefined (and should be left empty) if the original representation is a point without uncertainty and the given georeference is not that same point (without uncertainty). If both the original and the given georeference are the same point, the pointRadiusSpatialFit is 1. |
|
String |
Yes |
π¦ The coordinate format for the verbatimLatitude and verbatimLongitude or the verbatimCoordinates of the Location. |
|
String |
Yes |
π¦ The ellipsoid, geodetic datum, or spatial reference system (SRS) upon which coordinates given in verbatimLatitude and verbatimLongitude, or verbatimCoordinates are based. |
|
String |
Yes |
π¦ A Well-Known Text (WKT) representation of the shape (footprint, geometry) that defines the Location. A Location may have both a point-radius representation (see decimalLatitude) and a footprint representation, and they may differ from each other. |
|
String |
Yes |
π¦ The ellipsoid, geodetic datum, or spatial reference system (SRS) upon which the geometry given in footprintWKT is based. |
|
String |
Yes |
π¦ The ratio of the area of the footprint (footprintWKT) to the area of the true (original, or most specific) spatial representation of the Location. Legal values are 0, greater than or equal to 1, or undefined. A value of 1 is an exact match or 100% overlap. A value of 0 should be used if the given footprint does not completely contain the original representation. The footprintSpatialFit is undefined (and should be left empty) if the original representation is a point without uncertainty and the given georeference is not that same point (without uncertainty). If both the original and the given georeference are the same point, the footprintSpatialFit is 1. |
|
String |
Yes |
π¦ A list (concatenated and separated) of names of people, groups, or organizations who determined the georeference (spatial representation) for the Location. |
|
String |
Yes |
||
String |
Yes |
π¦ A description or reference to the methods used to determine the spatial footprint, coordinates, and uncertainties. |
|
String |
Yes |
π¦ A list (concatenated and separated) of maps, gazetteers, or other resources used to georeference the Location, described specifically enough to allow anyone in the future to use the same resources. |
|
String |
Yes |
π¦ Notes or comments about the spatial description determination, explaining assumptions made in addition or opposition to the those formalized in the method referred to in georeferenceProtocol. |
|
String |
Yes |
π¦ An identifier for the set of information associated with a GeologicalContext (the location within a geological context, such as stratigraphy). May be a global unique identifier or an identifier specific to the data set. |
|
String |
Yes |
π¦ The full name of the earliest possible geochronologic eon or lowest chrono-stratigraphic eonothem or the informal name ("Precambrian") attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the latest possible geochronologic eon or highest chrono-stratigraphic eonothem or the informal name ("Precambrian") attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the earliest possible geochronologic era or lowest chronostratigraphic erathem attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the latest possible geochronologic era or highest chronostratigraphic erathem attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the earliest possible geochronologic period or lowest chronostratigraphic system attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the latest possible geochronologic period or highest chronostratigraphic system attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the earliest possible geochronologic epoch or lowest chronostratigraphic series attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the latest possible geochronologic epoch or highest chronostratigraphic series attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the earliest possible geochronologic age or lowest chronostratigraphic stage attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the latest possible geochronologic age or highest chronostratigraphic stage attributable to the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the lowest possible geological biostratigraphic zone of the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the highest possible geological biostratigraphic zone of the stratigraphic horizon from which the cataloged item was collected. |
|
String |
Yes |
π¦ The combination of all litho-stratigraphic names for the rock from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the lithostratigraphic group from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the lithostratigraphic formation from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the lithostratigraphic member from which the cataloged item was collected. |
|
String |
Yes |
π¦ The full name of the lithostratigraphic bed from which the cataloged item was collected. |
|
String |
Yes |
π¦ An identifier for the Identification (the body of information associated with the assignment of a scientific name). May be a global unique identifier or an identifier specific to the data set. |
|
String |
Yes |
π¦ A string representing the taxonomic identification as it appeared in the original record. |
|
String |
Yes |
π¦ A brief phrase or a standard term ("cf.", "aff.") to express the determiner’s doubts about the Identification. |
|
String array, delimited with |
Yes |
π A list (concatenated and separated) of nomenclatural types (type status, typified scientific name, publication) applied to the occurrence. |
|
String array, delimited with |
Yes |
π A list (concatenated and separated) of names of people, groups, or organizations who assigned the Taxon to the occurrence. |
|
String array, delimited with |
Yes |
π¦ A list (concatenated and separated) of the globally unique identifier for the person, people, groups, or organizations responsible for assigning the Taxon to the subject. |
|
ISO 8601 Date |
Yes |
π The date on which the subject was determined as representing the Taxon. |
|
String |
Yes |
π¦ A list (concatenated and separated) of references (publication, global unique identifier, URI) used in the Identification. |
|
String |
Yes |
π¦ A categorical indicator of the extent to which the taxonomic identification has been verified to be correct. |
|
String |
Yes |
||
String |
Yes |
π¦ An identifier for the set of taxon information (data associated with the Taxon class). May be a global unique identifier or an identifier specific to the data set. |
|
String |
Yes |
π¦ An identifier for the nomenclatural (not taxonomic) details of a scientific name. |
|
String |
Yes |
π¦ An identifier for the name usage (documented meaning of the name according to a source) of the currently valid (zoological) or accepted (botanical) taxon. |
|
String |
Yes |
π¦ An identifier for the name usage (documented meaning of the name according to a source) of the direct, most proximate higher-rank parent taxon (in a classification) of the most specific element of the scientificName. |
|
String |
Yes |
π¦ An identifier for the name usage (documented meaning of the name according to a source) in which the terminal element of the scientificName was originally established under the rules of the associated nomenclaturalCode. |
|
String |
Yes |
π¦ An identifier for the source in which the specific taxon concept circumscription is defined or implied. See nameAccordingTo. |
|
String |
Yes |
π¦ An identifier for the publication in which the scientificName was originally established under the rules of the associated nomenclaturalCode. |
|
String |
Yes |
π¦ An identifier for the taxonomic concept to which the record refers - not for the nomenclatural details of a taxon. |
|
String |
Yes |
π The scientific name (including authorship) for the taxon from the GBIF backbone matched to this occurrence. This could be a synonym, see also |
|
String |
Yes |
π¦ The full name, with authorship and date information if known, of the currently valid (zoological) or accepted (botanical) taxon. |
|
String |
Yes |
π¦ The full name, with authorship and date information if known, of the direct, most proximate higher-rank parent taxon (in a classification) of the most specific element of the scientificName. |
|
String |
Yes |
π¦ The taxon name, with authorship and date information if known, as it originally appeared when first established under the rules of the associated nomenclaturalCode. The basionym (botany) or basonym (bacteriology) of the scientificName or the senior/earlier homonym for replaced names. |
|
String |
Yes |
π¦ The reference to the source in which the specific taxon concept circumscription is defined or implied - traditionally signified by the Latin "sensu" or "sec." (from secundum, meaning "according to"). For taxa that result from identifications, a reference to the keys, monographs, experts and other sources should be given. |
|
String |
Yes |
π¦ A reference for the publication in which the scientificName was originally established under the rules of the associated nomenclaturalCode. |
|
String |
Yes |
π¦ The four-digit year in which the scientificName was published. |
|
String |
Yes |
π¦ A list (concatenated and separated) of taxa names terminating at the rank immediately superior to the taxon referenced in the taxon record. |
|
String |
Yes |
π The kingdom name (excluding authorship) for the kingdom from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The phylum name (excluding authorship) for the phylum from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The class name (excluding authorship) for the class from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The order name (excluding authorship) for the order from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The family name (excluding authorship) for the family from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π¦ The full scientific name of the subfamily in which the taxon is classified. |
|
String |
Yes |
π The genus name (excluding authorship) for the genus from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The genus name part of the species name from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The subgenus name (excluding authorship) for the subgenus from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π¦ The infrageneric part of a binomial name at ranks above species but below genus. |
|
String |
Yes |
π The specific name part of the species name from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The infraspecific name part of the species name from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π¦ Part of the name of a cultivar, cultivar group or grex that follows the scientific name. |
|
String |
Yes |
π The taxonomic rank of the most specific name in the scientificName. |
|
String |
Yes |
π¦ The taxonomic rank of the most specific name in the scientificName as it appears in the original record. |
|
String |
Yes |
||
String |
Yes |
π¦ The nomenclatural code (or codes in the case of an ambiregnal name) under which the scientificName is constructed. |
|
String |
Yes |
π The status of the use of the scientificName as a label for a taxon. |
|
String |
Yes |
π¦ The status related to the original publication of the name and its conformance to the relevant rules of nomenclature. It is based essentially on an algorithm according to the business rules of the code. It requires no taxonomic opinion. |
|
String |
Yes |
||
String |
No |
π The UUID of the GBIF dataset containing this occurrence. |
|
String |
Yes |
π The country, territory or island based on ISO-3166 of the organization publishing the dataset containing this occurrence. |
|
ISO 8601 Date |
Yes |
π The time this occurrence was last processed by GBIF’s interpretation system βPipelinesβ. This is the time the record was last changed in GBIF, not the time the record was last changed by the publisher. Data is also reprocessed when we changed the taxonomic backbone, geographic data sources or other interpretation procedures. An earlier interpretation system distinguished between βparsingβ and βinterpretationβ, but in the current system there is only one process β the two dates will always be the same. |
|
Double |
Yes |
π Elevation (altitude) in metres above sea level. This is not a current Darwin Core term. |
|
Double |
Yes |
π The value of the potential error associated with the elevation. This is not a current Darwin Core term. |
|
Double |
Yes |
π Depth in metres below sea level. This is not a current Darwin Core term. |
|
Double |
Yes |
π The value of the potential error associated with the depth. This is not a current Darwin Core term. |
|
Double |
Yes |
π The distance in metres of the occurrence from a centroid known to be applied to occurrences during georeferencing. This can potentially indicate low-precision georeferencing, check the values of |
|
String array, delimited with |
Yes |
π A specific interpretation issue found during processing and interpretation of the record. See the list of occurrence issues and the OccurrenceIssue enumeration for possible values and definitions. |
|
String array, delimited with |
Yes |
π The media type given as Dublin Core type values, in particular StillImage, MovingImage or Sound. |
|
Boolean |
Yes |
π Boolean indicating that a valid latitude and longitude exists. |
|
Boolean |
Yes |
π Boolean indicating that some spatial validation rule has not passed. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the most specific (lowest rank) taxon for this occurrence. This could be a synonym, see |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the accepted taxon of this occurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the kingdom of thisoccurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the phylum of thisoccurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the class of thisoccurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the order of thisoccurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the family of thisoccurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the genus of thisoccurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the subgenus of thisoccurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the species of thisoccurrence. |
|
String |
Yes |
π The species name (excluding authorship) for the species from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
π The accepted scientific name (including authorship) for the taxon from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
||
String |
Yes |
π The scientific name that is based on the type specimen. This is not yet a Darwin Core term, see the proposal to add it. |
|
String |
Yes |
π The technical protocol by which this occurrence was retrieved from the publisher’s systems. |
|
ISO 8601 Date |
Yes |
π The time this occurrence was last processed by GBIF’s interpretation system βPipelinesβ. This is the time the record was last changed in GBIF, not the time the record was last changed by the publisher. Data is also reprocessed when we changed the taxonomic backbone, geographic data sources or other interpretation procedures. An earlier interpretation system distinguished between βparsingβ and βinterpretationβ, but in the current system there is only one process β the two dates will always be the same. |
|
ISO 8601 Date |
Yes |
π The time this occurrence was last retrieved from the publisher’s systems. |
|
String |
Yes |
π Boolean indicating if the publishing country is different to the location country. |
|
String |
Yes |
π The relative measurement of the quantity of the organism (i.e. without absolute units). |
|
String array, delimited with |
Yes |
||
String |
Yes |
π The identifier for the top-level division from the GADM database. This is usually a three-letter code from ISO 3166. |
|
String |
Yes |
π The English name for the top-level division from the GADM database. |
|
String |
Yes |
π The identifier for the first-level division from the GADM database. |
|
String |
Yes |
π The English name for the first-level division from the GADM database. |
|
String |
Yes |
π The identifier for the second-level division from the GADM database. |
|
String |
Yes |
π The English name for the second-level division from the GADM database. |
|
String |
Yes |
π The identifier for the third-level division from the GADM database. |
|
String |
Yes |
π The English name for the third-level division from the GADM database. |
|
String |
Yes |
π The IUCN Red List Category of the taxon of this occurrence. See the GBIF vocabulary for the values and their definitions, and the IUCN Red List of Threatened Species dataset in GBIF for the version of the Red List GBIF’s interpretation procedures are using. |
|
String structure |
Yes |
Multimedia term definitions (multimedia.txt
)
Column name | Data type | Nullable | Definition |
---|---|---|---|
String |
No |
π Unique GBIF key for the occurrence. We aim to keep these keys stable, but this is not possible in every case. |
|
String |
Yes |
||
String |
Yes |
π The format the image is exposed in. It is recommended to use a IANA registered media type, but known file suffices are permissible too. See http://www.iana.org/assignments/media-types/media-types.xhtml |
|
String |
Yes |
π The public URL that identifies and locates the media file directly, not the html page it might be shown on. It is highly recommended that a URL to a media file of good resolution is provided or at least dc:reference in cases no public URI exists. |
|
String |
Yes |
π A related resource that is referenced, cited, or otherwise pointed to by the described resource. |
|
String |
Yes |
π The media items title. Strongly recommended as in many cases this will be used as the hyperlink text, and should be used accrodingly. |
|
String |
Yes |
||
String |
Yes |
π If the media item was derived or taken from another source this is the reference to that resource. For example a book from which an image was scanned or the original provider of a photo/graphic, such as photography agencies. |
|
String |
Yes |
π A class or description for whom the image is intended or useful |
|
String |
Yes |
||
String |
Yes |
π The person that took the image, recorded the video or sound |
|
String |
Yes |
π Any contributor in addition to the creator that helped in recording the media item |
|
String |
Yes |
||
String |
Yes |
π A legal document giving official permission to do something with the occurrence. |
|
String |
Yes |
π¦ A person or organization owning or managing rights over the resource. |
Verbatim term definitions (verbatim.txt
)
Data in this table is not modified by GBIF interpretation processes, except for conversion to Unicode and possible changes to whitespace (spaces, tabs, newlines etc).
Species list downloads β Term definitions
Column name | Data type | Nullable | Definition |
---|---|---|---|
Integer |
No |
π A taxon key from the GBIF backbone for the most specific (lowest rank) taxon for this occurrence. This could be a synonym, see |
|
String |
Yes |
π The scientific name (including authorship) for the taxon from the GBIF backbone matched to this occurrence. This could be a synonym, see also |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the accepted taxon of this occurrence. |
|
String |
Yes |
π The accepted scientific name (including authorship) for the taxon from the GBIF backbone matched to this occurrence. |
|
String |
Yes |
||
String |
Yes |
π The taxonomic rank of the most specific name in the scientificName. |
|
String |
Yes |
π The status of the use of the scientificName as a label for a taxon. |
|
String |
Yes |
π The kingdom name (excluding authorship) for the kingdom from the GBIF backbone matched to this occurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the kingdom of thisoccurrence. |
|
String |
Yes |
π The phylum name (excluding authorship) for the phylum from the GBIF backbone matched to this occurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the phylum of thisoccurrence. |
|
String |
Yes |
π The class name (excluding authorship) for the class from the GBIF backbone matched to this occurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the class of thisoccurrence. |
|
String |
Yes |
π The order name (excluding authorship) for the order from the GBIF backbone matched to this occurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the order of thisoccurrence. |
|
String |
Yes |
π The family name (excluding authorship) for the family from the GBIF backbone matched to this occurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the family of thisoccurrence. |
|
String |
Yes |
π The genus name (excluding authorship) for the genus from the GBIF backbone matched to this occurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the genus of thisoccurrence. |
|
String |
Yes |
π The species name (excluding authorship) for the species from the GBIF backbone matched to this occurrence. |
|
Integer |
Yes |
π A taxon key from the GBIF backbone for the species of thisoccurrence. |
|
String |
Yes |
π The IUCN Red List Category of the taxon of this occurrence. See the GBIF vocabulary for the values and their definitions, and the IUCN Red List of Threatened Species dataset in GBIF for the version of the Red List GBIF’s interpretation procedures are using. |