Contents
Species Analysis Terms
- SDM
- Species Distribution Modeling (SDM) is also known by
several other names, including environmental niche modeling, ecological niche
modeling, and habitat modeling. SDM refers to the process of creating
mathematical formulas (models) to predict the geographic distribution of
species based on where they have been found and the environmental conditions
in those locations.
- Algorithm
- An algorithm is a procedure or formula for solving a problem. There are
multiple algorithms for computing Species Distribution Models (SDM) which
define the relationship between a set of points and the environmental values
at those points. BiotaPhy and Lifemapper provide 12 algorithms
- Presence-Absence Matrix (PAM)
- A binary matrix containing species distributions of 0/1 indicating presence or
non-presence in each grid cell of a region. The matrix may be thought of as a
three-dimensional cube, of binary maps, with one layer per species. The 3-dimensional
matrix is flattened into 2 dimensions, with rows representing sites with an x,y
coordinate for the center of a gridcell on a map, and columns representing
species.
- Phylogenetic Tree
- A data structure containing species names or identifiers for analyzing
evolutionary patterns. BiotaPhy uses phylogenetic trees matching species
data in a gridset to correlate evolutionary patterns with species
distributions and landscape features. API documentation
is at Tree.
- Biogeographic Hypotheses
- Spatial layers for testing the influence of geographic elements, such as
geology, drainage basins, etc, on the biodiversity of a landscape. These
can be in the form of raster or vector files. API documentation
is at BioGeo.
- MCPA
- Meta-Community Phylogenetic Analysis, briefly explained at
MCPA. MCPA is defined by Pedro Peres-Neto in
https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1461-0248.2010.01523.x
- Ultrametric Tree
- A tree whose total branch length from the root to each tip in the tree is the same.
- Helmert Contrast
- An encoding of data such that sites with values of 1 indicate that they fall on one
side of a hypothesis, values of -1 indicate that they belong to the opposite side of
the same hypothesis, and values of zero indicate that the cell does not fall on
either side of the hypothesis.
BiotaPhy-specific Data and Parameter Terms
- Gridset
- The organizing data structure for a single, or group of analyses. A
package may be downloaded for visualization within the BiotaPhy web
client, or further analyses in other software.
- LM Library
- Existing public data in a BiotaPhy installation, including input data,
such as species points, environmental data layers, and computed SDMs.
- Occurrence Layer
- Point data representing specimens collected for a single species or taxa. Data
contains a location, x and y, in some known geographic spatial reference system.
Public data in BiotaPhy installations are in the ‘Geographic’ spatial
reference system, latitude and longitude in decimal degrees. API documentation
is at Occurrence Layer
- Environmental Layer
- Raster data representing environmental values for cells in a map. Data
may be numeric or categorical, with only one value per cell.
Public data in BiotaPhy installations are in the ‘Geographic’ spatial
reference system, latitude and longitude in decimal degrees. API documentation
is at Environmental Layer
- Scenario
- Scenarios consist of a set of environmental layers (i.e. elevation,
precipitation, temperature, soil, etc). For Species Distribution Modeling,
researchers often choose a set of ‘present day’ layers (“modeling scenario”) as an input
along with ‘present day’ species points. The environmental data may be
global, or regional. The resulting models can be
projected onto the same environmental dataset and/or one predicted for a different time
period, or one for another region (“projection scenarios”).
An example of predicted environmental data available in
the BiotaPhy archive is climate data computed for the
International Panel on Climate Change (IPCC) for its Fifth Assessment
Report (AR5, 2013). API documentation is at
Scenario.
- Scenario Package
- A Scenario Package consists of a set of scenarios with the same type of
layers in each. Scenarios within a package can be used together, so species
data may be modeled with one scenario and projected onto another scenario.
Generally, “current day” species data is modeled with current day
environmental data, then may be projected onto environmental data predicted
for the past or future, or onto a different region. API documentation is at
Scenario Package.
- SDM Projection
- Computed SDM models may be applied, or projected back onto the same, or
matching Scenarios. A map created from the projection of this model onto
a Scenario is called an SDM Projection, and is a file of geospatial data in
raster format. Different algorithms produce projections with different values.
The Maxent algorithm produces projections with values denoting the predicted
presence as a value between 0 and 1. Other algorithms produce raster files
with only the values 1 (predicted present) or 0 (not predicted present). API
documentation is at SDM Projection.
- Shapegrid
- A grid encompassing the area of interest in a Gridset for multi-species
analyses. In a gridset, different data layers are intersected with a Shapegrid
to produce Matrices for analyses. Species layers are intersected to
create a PAM, environmental layers are intersected to create an Environmental
Matrix (GRIM) and Biogeographic Hypotheses are intersected to create a BioGeo
Matrix. Intersection parameters define how values are computed for gridcells
from the values in data layers. API documentation is at
Shapegrid
- Global PAM
- A Presence-Absence Matrix (PAM) as described above, containing intersected
data for all species in a gridset. A Global PAM usually refers to a very
large PAM meant to be subsetted for further analysis. API
documentation is at Global PAM.
- Environmental Matrix (GRIM)
- A matrix of values indicating the mean value of each of multiple environmental variables
in a regular grid. The structure is the same of the PAM, but values are not binary.
GRIM is an acronym for Geographic Reference Information Matrix.
- Biogeographic Hypotheses and BioGeo Matrix
- Spatial layers for testing the influence of geographic elements, such as
geology, drainage basins, etc, on the biodiversity of a landscape. These
can be in the form of raster or vector files. Biogeographic Hypotheses may
be intersected with the Shapegrid to produce a BioGeo Matrix, used in MCPA
computations. API documentation is at BioGeo.