Data warehouse fields
This document describes some key fields of the Data Warehouse data model and fields that may require further clarification. A full model for responses, containing all possible fields, can be found from api.laji.fi OpenApi (swagger) specification.
Document
| documentId | URI | Id of the Document |
| collectionId | URI | Id of the Collection |
| sourceId | URI | Id of the Source |
| secureLevel | Enum | NONE, 1KM-100KM, HIGHEST |
| secureReasons | Array(Enum) | Reason the document was secured |
| partial | boolean | Is the document in it's original form or has something been detached from it/has this document been detached from some other document, because of data concealment. |
| editorUserIds | Array(String) | Owners of the document |
| linkings.editors | Array(Person) | FinBIF Persons that the user id's have been attached to |
| keywords | Array(String) | Alternative ids, project names, etc. (Exists also |
| createdDate | Date | Created at the original source (not always known) |
| modifiedDate | Date | Last modified at the original source (not always known) |
| loadDate | Date | Last loaded to FinBIF DW |
| firstLoadDate | Date | First time loaded to FinBIF DW |
| facts | Array(Fact) | Document facts |
| media | Array(Media) | Document media (specimen labels, etc.) |
| quality.issue | Quality | General issues about the document, marked by DW or the original source. |
| annotations | Array(Annotation) | Annotations (quality control markings) made about the document. |
Gathering
| gatheringId | URI | Gathering id |
| team | Array(String) | Names of the observers |
| observerUserIds | Array(String) | Observer user ids |
| linkings.observers | Array(Person) | FinBIF Persons that the user id's have been attached to |
| eventDate.begin | Date | Begin date |
| eventDate.end | Date | End date |
| hourBegin/End | Int | Time of day, hour: begin/end. Stored as reported. Assume the observer has used the local timezone at the time of making the observation, |
| minutesBegin/end | Int | Time of day, minutes: begin/end. |
| displayDateTime | String | Date and times formatted to sortable and human readable format. |
| higherGeography | String | Verbatim from the source (continent etc.) |
| country | String | Verbatim country from the source |
| biogeographicalProvince | String | Verbatim biogeographical province from the source |
| province | String | Verbatim administrative province from the source |
| municipality | String | Verbatim municipality from the source |
| locality | String | Verbatim locality from the source (village, river, etc.) |
| media | Array(Media) | Gathering media (habitat images, trip photos, etc.) |
| quality.issue | Quality | General issues about the gathering event (not time, geo). Marked by DW or the original source. |
| quality.timeIssue | Quality | Issuses about dates, times. Marked by DW or the original source. |
| quality.locationIssue | Quality | Issues about coordinates, other geo-information. Marked by DW or the original source. |
| facts | Array(Fact) | Gathering facts |
| interpretations | Interpretations | Geo related interpretations (see bellow) |
| conversions | Conversions | Coordinate and time conversions (see bellow) |
Gathering.interpretations
| coordinateAccuracy | Int | DW interprets accuracy of coordinates in meters based on geography, coordinates or coordintesVerbatim. The figure is indicative. |
| sourceOfCoordinates | Enum | How the DW resolved coordinates (that can be found from conversions-object) |
| country | URI | Interpreted country id (/area endpoint). |
| sourceOfCountry | Enum | How the country was interpreted |
| biogeographicalProvince | URI | Interpreted biogeographical province id (/area). This is a single province (for example using coordinate center point). |
| sourceOfBiogeographicalProvince | Enum | How the bio-province was interpreted |
| biogeographicalProvinces | Array(URI) | Ids all all the bio-provinces that touch the coordinate area |
| finnishMunicipality | URI | Interpreted Finnish municipality id (/area). This is a single municipality (for example using coordinate center point). |
| sourceOfFinnishMunicipality | Enum | How the municipality was interpreted |
| finnishMunicipalities | Array(URI) | Ids all all the municipalities that touch the coordinate area |
| countryDisplayname | String | Name of the interpreted country in Finnish |
| biogeographicalProvinceDisplayname | String | Name of the interpreted bio-province in Finnish + abbreviation in Latin |
| municipalityDisplayname | String | Name of the interpreted municipality in Finnish |
In case the coordinates have been concealed, coordinate accuracy indicates the accuracy of the coarsed coordinates, not the accuracy of the original coordinates.
Gathering.conversions
| century | Int | If time span within one century |
| decade | Int | If time span within one decade |
| year | Int | If time span within one year |
| month | Int | If time span within one month) |
| day | Int | If the time span is a single day |
| dayOfYearBegin/End | Int | |
| seasonBegin/End | Int | MMDD, for example 0323 |
| euref.latMin,latMax,lonMin,lonMax | Int | Bounding box, ETRS-TM35FIN coordinates |
| wgs84.latMin,latMax,lonMin,lonMax | Double | Bounding box, WGS84 coordinates |
| ykj.latMin,latMax,lonMin,lonMax | Int | Bounding box, uniform grid (YKJ, yhtenäiskoordinaatit) |
| wgs84CenterPoint.lat,lon | Double | Bounding box center point, WGS84 |
| wgs84Grid1.lat,lon | Double | Center point in 1.0 degree accuracy, WGS84 |
| wgs84Grid05.lat,lon | Double | 0.5 degree accuracy |
| wgs84Grid01.lat,lon | Double | 0.1 degree accuracy |
| wgs84Grid005.lat,lon | Double | 0.05 degree accuracy |
| ykj100km.lat,lon | Int | 100km x 100km; left lower corner of the grid; only if area entirely inside the gird |
| ykj50km.lat,lon | Int | 50km x 50km; only if area entirely inside the gird |
| ykj10km.lat,lon | Int | 10km x 10km; only if area entirely inside the gird |
| ykj1km.lat,lon | Int | 1km x 1km; only if area entirely inside the gird |
| ykj100kmCenter.lat,lon | Int | 100km x 100km; left lower corner of the grid; based on coordinate center point even if area not entirely inside the grid |
| ykj50kmCenter.lat,lon | Int | 50km x 50km; even if area not entirely inside the grid |
| ykj10kmCenter.lat,lon | Int | 10km x 10km; even if area not entirely inside the grid |
| ykj1kmCenter.lat,lon | Int | 1km x 1km; even if area not entirely inside the grid |
| eurefWKT | String(WKT) | Original geography or bounding box/point in WKT format, ETRS-TM35FIN coordinates |
| ykjWKT | String(WKT) | Original geography or bounding box/point in WKT format, YKJ, yhtenäiskoordinaatit |
| wgs84WKT | String(WKT) | Original geography or bounding box/point in WKT format, WGS84 coordinates |
| boundingBoxAreaInSquareMeters | Int | Area of the bounding box in square meters |
Unit
| unitId | URI | Unit id |
| taxonVerbatim | String | Species/taxon scientific, vernacular or other name as reported by the original data source |
| author | String | Scientific name author, reported by the original data source. Not used at the moment for anything by the DW. |
| reportedTaxonId | URI | FinBIF taxon id, reported by the original source. Either taxonVerbatim or reportedTaxonId must be given. If both are given, reportedTaxonId is used (for taxon linking). |
| interpretations.annotatedTaxonId | URI | FinBIF taxon id, resolved by the DW based on annotation data (quality markings of taxon experts, users). |
| linkings.taxon | Taxon | FinBIF taxon, if the occurrence could be linked to taxonomy (based on taxonVerbatim or reportedTaxonId, annotations). Limited taxon model; use /taxa endpoint to get full data model of the taxon if more fields needed. |
| linkings.originalTaxon | Taxon | Same as above, but will ignore annotation data. |
| abundanceString | String | As reported by the original source |
| interpretations.individualCount | Int | Individual count interpreted from the abundanceString (for example "n. 12-15" = 12, "1m2f" = 3) |
| recordBasis | Enum | (specimen, song, footprint, etc) |
| superRecordBasis | Enum | Derived from recordBasis: observation, specimen machine obsevation |
| lifeStage | Enum | Life stage (combines plant, animal life stages) |
| sex | Enum | |
| wild | Boolean | Did the occurrence exist in the wild? Default is null = not known. Based on the original data source. |
| det | String | Name of person who made the identification (as reported by the original data source). |
| media | Array(Media) | Media about the occurrence |
| reportedTaxonConfidence | Enum | As reported by the original data source / original observer |
| quality.issue | Quality | General issues (non identification, reliability related) with the occurrence, marked by the original data source or the DW. |
| quality.taxon | TaxonQuality | Interpreted quality of the occurrence, by the DW |
| annotations | Aray(Annotation) | Annotations (quality control markings) made about the occurrence. |
| facts | Array(Fact) | Unit facts |
Coordinates
| latMin,latMax,lonMin,lonMax | Double | Bounding box |
| type | Enum | Coordinate system |
| accuracyInMeters | Int | As reported by the original source. (For example source reports a WGS84 point and accuracy 1000m) |
Media
| mediaType | Enum | IMAGE|AUDIO|VIDEO |
| author | String | |
| copyrightOwner | String | |
| caption | String | |
| licenseId | URI | Intellectual rights license (metadata/range/ MY.intellectualRightsEnum endpoint) |
| licenseAbbreviation | String | Other license abbreviation if not found among FinBIF metadata |
| fullURL | URI | URI of the full image |
| thumbnailURL | URI | URI of a thumbnail (not required but highly recommended) |
| squareThumbnailURL | URI | URI of a square thumbnail (not required) |
Fact
| fact | String | Name / id of the variable, as reported by the original source |
| value | String | Value, as reported by the original source |
| integerValue | Int | Derived from value |
| decimalValue | Double | Derived from value |
Person
| id | URI | FinBIF person id |
| userId | String | User id in original data source |
| fullName | String | Person full name in FinBIF |
Quality
Occurrences/gatherings/documents that have issues are not shown by default.
| issue | Enum | Type of issue |
| source | Enum | Source of issue (original source or DW) |
| message | String | Notes |
TaxonReliability
| reliability | Enum | Reliability of the occurrence as interpreted by the DW |
| source | Enum | Basis of the reliability |
| message | String | Notes (usually none) |