| Title: | Resolve Mammal Names with the Mammal Diversity Database |
|---|---|
| Description: | Download, inspect, reconcile, and summarize mammal taxonomic names with the Mammal Diversity Database (MDD). Supports accepted names, synonyms, original combinations, distribution summaries, and mapped outputs derived from packaged MDD releases. Designed for reproducible mammal name resolution workflows in 'R'. |
| Authors: | Paul E. Santos Andrade [aut, cre, cph] |
| Maintainer: | Paul E. Santos Andrade <[email protected]> |
| License: | MIT + file LICENSE |
| Version: | 0.0.0.1 |
| Built: | 2026-05-17 06:23:36 UTC |
| Source: | https://github.com/PaulESantos/rmdd |
Convert mdd_taxon_info to list
## S3 method for class 'mdd_taxon_info' as.list(x, ...)## S3 method for class 'mdd_taxon_info' as.list(x, ...)
x |
An object of class |
... |
Additional arguments. |
A list representation of the mdd_taxon_info object.
This function is experimental and may change in future releases.
Construct a species-level reconciliation backbone that combines accepted MDD checklist names with synonym names and links every matched name to its accepted species context.
build_mdd_match_backbone(checklist = NULL, synonyms = NULL)build_mdd_match_backbone(checklist = NULL, synonyms = NULL)
checklist |
A data frame like |
synonyms |
Optional data frame like |
A tibble used internally by mdd_matching().
bb <- build_mdd_match_backbone( checklist = dplyr::slice(mdd_checklist, 1:10), synonyms = dplyr::slice(mdd_synonyms, 1:20) ) bbbb <- build_mdd_match_backbone( checklist = dplyr::slice(mdd_checklist, 1:10), synonyms = dplyr::slice(mdd_synonyms, 1:20) ) bb
This function is experimental and may change in future releases.
Parse mammal scientific names into genus, optional subgenus, species, optional subspecies, and trailing author text. The parser is intentionally conservative and targets species-level MDD reconciliation workflows.
classify_mammal_names(splist)classify_mammal_names(splist)
splist |
Character vector of scientific names. |
A tibble with one row per input name and standardized columns for species-level reconciliation.
classify_mammal_names(c( "Puma concolor", "Capromys (Pygmaeocapromys) angelcabrerai" ))classify_mammal_names(c( "Puma concolor", "Capromys (Pygmaeocapromys) angelcabrerai" ))
This function is stable and its interface is expected to remain compatible.
mdd_cache_dir(path = NULL)mdd_cache_dir(path = NULL)
path |
Optional path. If |
A scalar character path.
mdd_cache_dir(tempdir())mdd_cache_dir(tempdir())
Current species-level checklist from the Mammal Diversity Database (MDD).
mdd_checklistmdd_checklist
A tibble with 6,871 rows and 52 variables in the MDD v2.4 release,
with source column names normalized to snake_case during data import.
Variables and their original camelCase equivalents:
(sciName) Unique genus–epithet key joined by an
underscore; derived programmatically from genus and
specific_epithet.
(id) Unique integer identifier used for indexing and
permalinking. Initial batch (14 Sep 2020) was numbered from
1,000,001 upward when sorted by phylosort; subsequent additions
start at 1,006,485.
(phylosort) Numeric sort key ordering the 27
extant mammal orders according to the phylogenetic hierarchy in
Figure 1 of the Illustrated Checklist of the Mammals of the
World (2020).
(mainCommonName) Primary vernacular name
following the Handbook of the Mammals of the World style
conventions: all words capitalised except pre-hyphen elements;
"and" constructions use hyphens (e.g. Black-and-white Ruffed
Lemur); directional modifiers are not hyphenated (e.g.
Southwestern Myotis).
(otherCommonNames) Pipe-separated (|)
list of additional vernacular names documented in the literature,
following the same formatting rules as main_common_name;
primarily English, but widely used names from other languages are
occasionally included.
(subclass) Taxonomic subclass; NA when not
applicable.
(infraclass) Taxonomic infraclass; NA when
not applicable.
(magnorder) Taxonomic magnorder; NA when not
applicable.
(superorder) Taxonomic superorder; NA when
not applicable.
(order) Taxonomic order; present for all taxa.
(suborder) Taxonomic suborder; NA when not
applicable; "incertae sedis" for taxa unassigned within an
order that uses suborders.
(infraorder) Taxonomic infraorder; NA when
not applicable; "incertae sedis" when applicable but
unassigned.
(parvorder) Taxonomic parvorder; NA when not
applicable; "incertae sedis" when applicable but unassigned.
(superfamily) Taxonomic superfamily; NA when
not applicable; "incertae sedis" when applicable but
unassigned.
(family) Taxonomic family; present for all taxa.
(subfamily) Taxonomic subfamily; NA when not
applicable; "incertae sedis" when applicable but unassigned.
(tribe) Taxonomic tribe; NA when not applicable;
"incertae sedis" when applicable but unassigned.
(subtribe) Taxonomic subtribe; NA when not
applicable; "incertae sedis" when applicable but unassigned.
(genus) Taxonomic genus; present for all taxa.
(subgenus) Taxonomic subgenus; NA when not
applicable; "incertae sedis" when applicable but unassigned.
(specificEpithet) Taxonomic species
epithet.
(authoritySpeciesAuthor) Author
surname(s) of the original species description, sourced from
Species_Syn_Current; all authors on the author line are
included, with an Oxford comma before the last name when three or
more authors are present; an "in" statement is added when the
work appears in a volume with different editors; shared surnames
are disambiguated by initials or full middle names; Chinese,
Korean, and Indochinese names are written out in full with surname
first and hyphens removed.
(authoritySpeciesYear) Year of the
original species description, sourced from
Species_Syn_Current.
(authorityParentheses) Parenthesis
flag: 0 = no parentheses; 1 = authority in parentheses
(indicating the species was originally described under a
different genus).
(originalNameCombination) Name
combination exactly as it appears in the original description,
sourced from Species_Syn_Current.
(authoritySpeciesCitation) Full
or abbreviated literature citation for the authority publication;
APA format when verified by PDF or physical copy, abbreviated
otherwise.
(authoritySpeciesLink) URL to the
authority publication or abstract page; Biodiversity Heritage
Library page-level links are preferred when available; DOIs are
used for recent publications.
(typeVoucher) Museum catalogue number(s) of
the type series (holotype, syntypes, lectotype, or neotype);
blank when type material has not been verified; multiple syntypes
are listed when applicable.
(typeKind) Category of type specimen listed in
type_voucher: one of "holotype", "syntypes",
"lectotype", "neotype", or "nonexistent" (the last value
is used when the MDD team has confirmed no type material exists);
blank when existence of type material has not been verified.
(typeVoucherURIs) Pipe-separated links
to type material records in external museum collection
databases.
(typeLocality) Geographic locality where the
holotype was collected or observed; edited place names follow CMW
2020 and are updated to current taxonomy; format may not match
Species_Syn_Current as standardisation is ongoing.
(typeLocalityLatitude) Latitude of
the type locality in decimal degrees; sourced from the original
description or via georeferencing (web search or GeoLocate).
(typeLocalityLongitude) Longitude
of the type locality in decimal degrees; sourced as for
type_locality_latitude.
(nominalNames) Pipe-separated list of all
available and unavailable specific epithets subsumed under the
current species concept, each with authority and year; names
originally described in a different genus are shown in
parentheses; reasons for unavailability are noted in brackets;
gender changes are also noted.
(taxonomyNotes) Semicolon-separated notes
by MDD staff documenting taxonomic changes.
(taxonomyNotesCitation) Pipe-
separated APA citations supporting the changes described in
taxonomy_notes.
(distributionNotes) Detailed
distributional narrative including notes on recently introduced
populations; uses abbreviations: Mt/Mts, I/Is, N, S, E, W, C,
and combinations (e.g. NW). Not currently curated — reserved
for future use.
(distributionNotesCitation)
Pipe-separated citations supporting distribution_notes.
Not currently curated — reserved for future use.
(subregionDistribution) Pipe-
separated (|) list of countries with native or reintroduced
extant populations; subnational regions (currently US states only)
are appended in parentheses and comma-separated; uncertain or
possibly extinct occurrences are marked with ?.
(countryDistribution) Pipe-separated
list of countries where the species has a native or ancient
(pre-1500 CE) introduced distribution; recent introductions are
excluded; uncertain presences marked with ?; domesticated
species are listed as "Domesticated"; extinct species reflect
their post-1500 CE range; marine species list coastal or
riverine countries.
(continentDistribution) Pipe-
separated list of continents following the same rules as
country_distribution; recognised values: Africa, Antarctica,
Asia, Europe, North America, Oceania, South America,
Domesticated; continental boundaries follow MDD-specific
definitions (e.g. Asia–Europe split at Ural/Caucasus Mts and
Ural River; Oceania east of Weber's Line).
(biogeographicRealm) Pipe-separated
list of biogeographic realms where the species occurs, following
the same inclusion rules as country_distribution; realm
boundaries follow the WWF schema
(https://en.wikipedia.org/wiki/Biogeographic_realm).
(iucnStatus) IUCN Red List status matched to
MDD species and updated to current taxonomy (IUCN 2025-2
assessment); "NE" = not yet evaluated; standard IUCN acronyms
otherwise; 19 domestic species are also not evaluated.
(extinct) Extinction flag: 0 = extant;
1 = extinct after 1500 CE (following the IUCN criterion).
(domestic) Domestication flag: 0 = wild;
1 = domesticated (19 species including Homo sapiens);
follows the nomenclatural guidelines of Gentry, Clutton-Brock &
Groves (2004, J. Archaeol. Sci. 31(5): 645–651).
(flagged) Taxonomic quality flag: 0 = valid;
1 = flagged as taxonomically questionable or actively debated
in the literature.
(CMW_sciName) Genus–epithet key from the
Illustrated Checklist of the Mammals of the World (CMW),
formatted identically to sci_name.
(diffSinceCMW) CMW comparison flag:
0 = species present in CMW 2020 taxonomy;
1 = species new since CMW 2020.
(MSW3_matchtype) Method by which the taxon
was linked to MSW3: one of "matched", "unmatched", or
"manual".
(MSW3_sciName) Scientific name as matched
to the MSW3 taxonomy.
(diffSinceMSW3) MSW3 comparison flag:
0 = species present in MSW3 (~2004 cutoff);
1 = species new since MSW3.
The checklist is the primary tabular output of the MDD and covers all
extant and recently extinct (post-1500 CE) mammal species recognised
by the database curators. Column names have been normalised from the
original camelCase (sciName, mainCommonName, etc.) to snake_case
during data import; the original names are shown in parentheses in
each \item above for cross-reference with the upstream spreadsheet.
Distribution fields operate on three nested spatial scales:
subregion_distribution (subnational units, currently US states),
country_distribution, and continent_distribution. Multiple values
within each field are pipe-separated (|). Only native ranges and
ancient introductions (before 1500 CE) are included; ? marks
uncertain or possibly extirpated occurrences.
The distribution_notes and distribution_notes_citation columns
are present in the data but are not currently curated and are
reserved for future use.
Cross-release comparison is provided by three column pairs:
cmw_sci_name / diff_since_cmw (vs. CMW 2020) and
msw3_sci_name / msw3_matchtype / diff_since_msw3
(vs. MSW3, ~2004 cutoff).
Mammal Diversity Database release archive
(https://www.mammaldiversity.org). Field definitions are derived
from the META_v2.4.csv file and the column-level annotations in the
MDD_Current sheet of the official MDD spreadsheet.
This function is stable and its interface is expected to remain compatible.
mdd_checklist_records(checklist = NULL)mdd_checklist_records(checklist = NULL)
checklist |
Optional checklist data frame. Defaults to |
A tibble.
mdd_checklist_records()mdd_checklist_records()
This function is experimental and may change in future releases.
Resolve a mammal name with mdd_matching() and map its
country_distribution values against an rnaturalearth countries layer.
The function preserves the package's exact vs partial input validation by
reporting whether the taxon input was matched directly or through a fuzzy
stage before plotting the accepted taxon distribution. Spatial polygons come
from Natural Earth through rnaturalearth::ne_countries().
mdd_distribution_map( name, checklist = NULL, synonyms = NULL, target_df = NULL, atlas = NULL, max_dist = 1, method = "osa", zoom = c("world", "auto", "manual"), xlim = NULL, ylim = NULL, quiet = FALSE, title = NULL, base_fill = "#E8E4DB", dist_fill = "#111111", border_color = "#111111", country_color = "#8B887F", ocean_fill = "#F7F4EE", plot_fill = "#F7F4EE", graticule_color = "#BBB5A8", graticule_linewidth = 0.18, graticule_alpha = 0.55, country_linewidth = 0.22, dist_linewidth = 0.34, title_color = "#111111", title_size = 16, title_face = "bold" )mdd_distribution_map( name, checklist = NULL, synonyms = NULL, target_df = NULL, atlas = NULL, max_dist = 1, method = "osa", zoom = c("world", "auto", "manual"), xlim = NULL, ylim = NULL, quiet = FALSE, title = NULL, base_fill = "#E8E4DB", dist_fill = "#111111", border_color = "#111111", country_color = "#8B887F", ocean_fill = "#F7F4EE", plot_fill = "#F7F4EE", graticule_color = "#BBB5A8", graticule_linewidth = 0.18, graticule_alpha = 0.55, country_linewidth = 0.22, dist_linewidth = 0.34, title_color = "#111111", title_size = 16, title_face = "bold" )
name |
A single scientific name. |
checklist |
Optional checklist data frame. Defaults to |
synonyms |
Optional synonym data frame. Defaults to |
target_df |
Optional reconciliation backbone from
|
atlas |
Optional |
max_dist |
Maximum string distance used if the input name needs fuzzy reconciliation. |
method |
Distance method passed to |
zoom |
Zoom mode. Use |
xlim |
Optional longitude limits used when |
ylim |
Optional latitude limits used when |
quiet |
Logical. If |
title |
Optional plot title. Defaults to the accepted taxon name. |
base_fill |
Fill color for non-selected countries. |
dist_fill |
Fill color for mapped distribution units. |
border_color |
Border color for mapped country outlines. |
country_color |
Border color for non-selected country outlines. |
ocean_fill |
Background color for the map panel. |
plot_fill |
Background color for the full plot area. |
graticule_color |
Color for graticule lines. |
graticule_linewidth |
Line width for graticule lines. |
graticule_alpha |
Alpha level for graticule lines. |
country_linewidth |
Line width for non-selected country outlines. |
dist_linewidth |
Line width for mapped country outlines. |
title_color |
Color for the plot title. |
title_size |
Size of the plot title. |
title_face |
Font face for the plot title. |
A ggplot2 map object. Additional metadata are attached in the
"mdd_distribution_info" attribute.
if (interactive()) { mdd_distribution_map("Lama vicugna", quiet = TRUE) }if (interactive()) { mdd_distribution_map("Lama vicugna", quiet = TRUE) }
This function is experimental and may change in future releases.
Wrapper around mdd_distribution_summary_raw() that optionally excludes
domesticated species and species considered widespread at the requested
geographic level.
mdd_distribution_summary( level = c("country", "continent", "subregion"), checklist = NULL, exclude_domesticated = FALSE, exclude_widespread = FALSE, widespread_threshold = NULL )mdd_distribution_summary( level = c("country", "continent", "subregion"), checklist = NULL, exclude_domesticated = FALSE, exclude_widespread = FALSE, widespread_threshold = NULL )
level |
Geographic level to summarize. Use |
checklist |
Optional checklist data frame. Defaults to
|
exclude_domesticated |
Logical. If |
exclude_widespread |
Logical. If |
widespread_threshold |
Optional threshold used to define widespread
species. If |
A tibble with one row per geographic unit and the columns region,
orders, families, genera, living_species, extinct_species, and
total_species.
mdd_distribution_summary( level = "country", checklist = dplyr::slice(mdd_checklist, 1:50) )mdd_distribution_summary( level = "country", checklist = dplyr::slice(mdd_checklist, 1:50) )
This function is experimental and may change in future releases.
Expand MDD distribution fields and compute diversity summaries by geographic unit. The summary reports counts of distinct orders, families, genera, living species, and extinct species for each unit listed in the selected MDD distribution column.
mdd_distribution_summary_raw( level = c("country", "continent", "subregion"), checklist = NULL )mdd_distribution_summary_raw( level = c("country", "continent", "subregion"), checklist = NULL )
level |
Geographic level to summarize. Use |
checklist |
Optional checklist data frame. Defaults to
|
A tibble with one row per geographic unit and the columns region,
orders, families, genera, living_species, extinct_species, and
total_species.
mdd_distribution_summary_raw( level = "country", checklist = dplyr::slice(mdd_checklist, 1:50) )mdd_distribution_summary_raw( level = "country", checklist = dplyr::slice(mdd_checklist, 1:50) )
This function is stable and its interface is expected to remain compatible.
mdd_download(url, dest_dir = mdd_cache_dir(), dest_file = NULL, mode = "wb")mdd_download(url, dest_dir = mdd_cache_dir(), dest_file = NULL, mode = "wb")
url |
URL to the source file. |
dest_dir |
Destination directory for the downloaded file. |
dest_file |
Optional output filename. |
mode |
Transfer mode passed to |
The downloaded file path, invisibly.
if (interactive()) { mdd_download( url = "https://www.mammaldiversity.org/robots.txt", dest_dir = tempdir() ) }if (interactive()) { mdd_download( url = "https://www.mammaldiversity.org/robots.txt", dest_dir = tempdir() ) }
This function is stable and its interface is expected to remain compatible.
mdd_load(path)mdd_load(path)
path |
Path to a local CSV export from MDD. |
A tibble.
sample_path <- system.file("extdata", "mdd_sample.csv", package = "rmdd") mdd_load(sample_path)sample_path <- system.file("extdata", "mdd_sample.csv", package = "rmdd") mdd_load(sample_path)
This function is experimental and may change in future releases.
Run a staged species-level reconciliation pipeline against an MDD backbone. The workflow prioritizes exact evidence first, then progressively relaxes criteria through exact genus, fuzzy genus, exact species-within-genus, and fuzzy species-within-genus matching.
mdd_matching( x, target_df = NULL, prefilter_genus = TRUE, allow_duplicates = FALSE, max_dist = 1, method = "osa" )mdd_matching( x, target_df = NULL, prefilter_genus = TRUE, allow_duplicates = FALSE, max_dist = 1, method = "osa" )
x |
Character vector of names or a data frame with parsed name columns. |
target_df |
Optional backbone produced by |
prefilter_genus |
Logical. If |
allow_duplicates |
Logical. If |
max_dist |
Maximum string distance used in fuzzy stages. |
method |
Distance method passed to |
A tibble with row-level traceability, pathway flags, matched name context, accepted-name context, and fuzzy distance columns.
checklist <- tibble::tibble( id = c("1", "2"), sci_name = c("Puma_concolor", "Vicugna_vicugna"), genus = c("Puma", "Vicugna"), specific_epithet = c("concolor", "vicugna"), authority_species_author = c("Linnaeus", "Molina") ) synonyms <- tibble::tibble( mdd_syn_id = c("1001", "1002"), mdd_species_id = c("1", "2"), mdd_author = c("Linnaeus", "Molina"), mdd_original_combination = c("Felis concolor", "Auchenia vicugna") ) backbone <- build_mdd_match_backbone(checklist, synonyms) mdd_matching( c("Puma concolor", "Felis concolor", "Pumma concolor"), target_df = backbone )checklist <- tibble::tibble( id = c("1", "2"), sci_name = c("Puma_concolor", "Vicugna_vicugna"), genus = c("Puma", "Vicugna"), specific_epithet = c("concolor", "vicugna"), authority_species_author = c("Linnaeus", "Molina") ) synonyms <- tibble::tibble( mdd_syn_id = c("1001", "1002"), mdd_species_id = c("1", "2"), mdd_author = c("Linnaeus", "Molina"), mdd_original_combination = c("Felis concolor", "Auchenia vicugna") ) backbone <- build_mdd_match_backbone(checklist, synonyms) mdd_matching( c("Puma concolor", "Felis concolor", "Pumma concolor"), target_df = backbone )
This function is experimental and may change in future releases.
Create a reusable name index that combines accepted checklist names, checklist original combinations, and synonym names while preserving the accepted-species linkage for each row.
mdd_name_index(checklist = NULL, synonyms = NULL)mdd_name_index(checklist = NULL, synonyms = NULL)
checklist |
Optional checklist data frame. Defaults to |
synonyms |
Optional synonym data frame. Defaults to |
A tibble with one row per queryable accepted name or synonym.
idx <- mdd_name_index( checklist = dplyr::slice(mdd_checklist, 1:10), synonyms = dplyr::slice(mdd_synonyms, 1:20) ) idxidx <- mdd_name_index( checklist = dplyr::slice(mdd_checklist, 1:10), synonyms = dplyr::slice(mdd_synonyms, 1:20) ) idx
This function is stable and its interface is expected to remain compatible.
Return the recommended citation for the current MDD Zenodo release and, optionally, format a citation for a specific MDD taxon entry.
mdd_reference(taxon_id = NULL, taxon_name = NULL)mdd_reference(taxon_id = NULL, taxon_name = NULL)
taxon_id |
Optional MDD taxon identifier for a specific entry. |
taxon_name |
Optional scientific name to use in the entry citation.
If |
An object of class mdd_reference containing dataset-level and,
when requested, entry-level citation strings.
mdd_reference() mdd_reference(taxon_id = "1001892", taxon_name = "Dipodomys deserti")mdd_reference() mdd_reference(taxon_id = "1001892", taxon_name = "Dipodomys deserti")
This function is stable and its interface is expected to remain compatible.
mdd_release_records( checklist = NULL, synonyms = NULL, type_specimen_metadata = NULL )mdd_release_records( checklist = NULL, synonyms = NULL, type_specimen_metadata = NULL )
checklist |
Optional checklist data frame. |
synonyms |
Optional synonym data frame. |
type_specimen_metadata |
Optional type specimen metadata table. |
A named list of tibbles.
x <- mdd_release_records() names(x)x <- mdd_release_records() names(x)
This function is stable and its interface is expected to remain compatible.
mdd_synonym_records(synonyms = NULL)mdd_synonym_records(synonyms = NULL)
synonyms |
Optional synonym data frame. Defaults to |
A tibble.
mdd_synonym_records()mdd_synonym_records()
Synonymy and nomenclatural table from the Mammal Diversity Database (MDD), covering all names applicable to species- and subspecies-level mammal taxa within Class Mammalia. Designed to align with the online nomenclature database Hesperomys (https://hesperomys.com).
mdd_synonymsmdd_synonyms
A tibble with 44 variables in the MDD v2.4 release, with source
column names normalized to snake_case during data import.
Variables and their original naming equivalents:
(MDD_syn_ID) Unique MDD synonym identification
number assigned to each synonym, starting from 100000001 and
incrementing as new synonyms are added.
(MDD_species) Genus and specific epithet of the
accepted species to which this synonym is assigned, with a space
between the two elements; listed as "genus incertae_sedis" or
"incertae_sedis incertae_sedis" for nomina dubia, nomina
inquirenda, and names based on composite or hybrid type material.
(MDD_species_id) Identification number of the
accepted species as listed in mdd_checklist (the MDD_Current
sheet).
(Hesp_id) Identification number of the matching entry
in the Hesperomys nomenclature database
(https://hesperomys.com), used to link and synchronise data
between MDD and Hesperomys.
(MDD_root_name) Specific epithet used as the
root to form a valid taxon name; spelling is adjusted to match
generic gender when the epithet is an adjective, or Latinised when
originally written in non-Latin characters.
(MDD_author) Author surname(s) of the original
description, following the same formatting conventions as
authority_species_author in mdd_checklist: all authors on the
author line are included; Oxford comma before the last name when
three or more authors are present; an "in" statement is added
when the work appears in a volume with different editors; shared
surnames are disambiguated by initials or full middle names; Chinese,
Korean, and Indochinese names are written with surname first and
hyphens removed.
(MDD_year) Year of the original description as given
in the original publication.
(MDD_authority_parentheses)
Parenthesis flag: 0 = no parentheses; 1 = authority in
parentheses (indicating the species was originally described under a
different genus).
(MDD_nomenclature_status)
Nomenclatural status of the name under the ICZN Code; indicates
whether the name is available, a spelling variant, a name
combination, or unavailable (with the specific reason given); see
Nomenclature_Taxonomy_Metadata in the MDD spreadsheet for full
value definitions.
(MDD_validity) Taxonomic status of the name
under the ICZN Code and primary literature; indicates whether the
name is a valid species or subspecies, or is a nomen dubium, nomen
inquirendum, or based on composite or hybrid type material; see
Nomenclature_Taxonomy_Metadata for full value definitions.
(MDD_original_combination) Name
combination exactly as it appears in the original description; all
parts of the scientific name are written in full even if abbreviated
in the source.
(MDD_original_rank) Taxonomic rank at
which the name was first described; one of "species",
"subspecies", "form", "variety", "infrasubspecific",
"unranked", "synonym", or "other".
(MDD_authority_citation) Full APA
citation of the original description, including specific month and
day of publication when known; populated only when the MDD team has
verified the citation by obtaining a PDF or physical copy.
(MDD_unchecked_authority_citation)
Unverified original description citation in non-standardised format,
often abbreviated from regional or global taxonomic compendia; set
to NA once mdd_authority_citation is filled.
(MDD_sourced_unverified_citations)
Same as mdd_unchecked_authority_citation but includes the sources
from which the citation was obtained; retained as an internal
reference for the MDD team.
(MDD_citation_group) Journal name or city
of publication (for books) of the original description; used
internally by the MDD team when locating citation materials.
(MDD_citation_kind) Whether the MDD team
holds a physical, electronic, or no copy of the original description;
used internally by the MDD team.
(MDD_authority_page) Page number where the
scientific name first appears in the original description; special
cases (footnotes, unnumbered pages, figures, plates) are noted in
parentheses.
(MDD_authority_link) URL to an online copy
of the original description; older publications link to the
Biodiversity Heritage Library or other archives; newer publications
use a DOI.
(MDD_authority_page_link) URL to the
exact first page where the name appears in an online copy; primarily
applicable to records in the Biodiversity Heritage Library,
HathiTrust, Gallica, or the Internet Archive.
(MDD_unchecked_authority_page_link)
Candidate page links identified programmatically from the
Biodiversity Heritage Library but not yet manually verified for
inclusion of the name; multiple links separated by |.
(MDD_old_type_locality) Type locality
as it appeared in the MDD synonym sheet before harmonisation with
Hesperomys; retained for internal reference only — not for general
use.
(MDD_original_type_locality)
Verbatim type locality transcribed from the original description and
verified by the MDD team; blank when a type locality is not
applicable.
(MDD_unchecked_type_locality)
Verbatim type locality from non-original sources, with source
attribution appended; multiple entries from different sources
separated by |; blank when not applicable.
(MDD_emended_type_locality)
Verbatim emended type locality from sources making a restriction,
emendation, or declaration, followed by the reference; currently
empty — reserved for future curation; blank when not applicable.
(MDD_type_latitude) Latitude of the type
locality in decimal degrees; georeferenced from the original
description or via web search / GeoLocate; blank when not
applicable.
(MDD_type_longitude) Longitude of the
type locality in decimal degrees; sourced as for
mdd_type_latitude; blank when not applicable.
(MDD_type_country) Modern country
containing the type locality; see Distribution_List in the MDD
spreadsheet for the full list of country names used; blank when not
applicable.
(MDD_type_subregion) First-level
subnational unit containing the type locality (state, province,
territory, or island group for larger countries); blank when not
applicable.
(MDD_type_subregion2) Second-level
subnational unit within mdd_type_subregion; primarily US counties
and individual islands within archipelagoes; blank when not
applicable.
(MDD_holotype) Museum catalogue number(s) of
the type series (holotype, syntypes, lectotype, or neotype); multiple
syntypes are comma-separated; blank when type material has not been
verified.
(MDD_type_kind) Category of type specimen
listed in mdd_holotype: one of "holotype", "syntypes",
"lectotype", "neotype", or "nonexistent" (confirmed absence
of type material); blank when existence of type material has not
been verified.
(MDD_type_specimen_link) URL(s) to
type material records in external museum collection databases.
(MDD_order) Taxonomic order; "incertae_sedis"
when not assigned to an order.
(MDD_family) Taxonomic family; "incertae_sedis"
when not assigned to a family.
(MDD_genus) Taxonomic genus; "incertae_sedis"
when not assigned to a genus.
(MDD_specificEpithet) Specific epithet
of the accepted species; "incertae_sedis" when not assigned to a
species.
(MDD_subspecificEpithet) Subspecific
epithet of the accepted subspecies; blank when no subspecies are
recognised within the species. Note: this field currently
contains a tentative, unvetted subspecies list — assignments are
incomplete and subject to revision.
(MDD_variant_of) For spelling variants and
name combinations, the name they are a variant of, given with its
original name combination, authority, and MDD synonym ID.
(MDD_senior_homonym) For preoccupied
names (primary or secondary homonymy), the senior homonym that
preoccupies the name, given with its original name combination,
authority, and MDD synonym ID.
(MDD_name_usages) Later citations using the
exact spelling of the name as given in mdd_original_combination;
each citation is abbreviated to authors, year, and page number.
(MDD_comments) Free-text comments on the
nomenclature or taxonomy of the name.
This table covers every name — valid, synonymised, or otherwise — that
has been applied to a species- or subspecies-level mammal taxon and is
recognised by the MDD. It serves as the primary nomenclatural backbone
for the checklist in mdd_checklist and is designed to interoperate
with the Hesperomys database (https://hesperomys.com) via
hesp_id.
Column names have been normalised from the original mixed-case convention
(MDD_syn_ID, MDD_specificEpithet, etc.) to snake_case during data
import; the original names are shown in parentheses in each \item above
for cross-reference with the upstream spreadsheet.
Citation verification follows a two-tier structure: mdd_authority_citation
holds APA-formatted, MDD-verified citations, while
mdd_unchecked_authority_citation holds unverified or abbreviated
citations from secondary sources and is set to NA once the verified
field is populated.
Several fields are retained for internal MDD team reference and should
not be used for analysis: mdd_old_type_locality,
mdd_sourced_unverified_citations, mdd_citation_group, and
mdd_citation_kind. The mdd_emended_type_locality and
mdd_subspecific_epithet fields are present but not yet fully
curated.
Mammal Diversity Database release archive
(https://www.mammaldiversity.org). Field definitions are derived
from the META_v2.4.csv file and the column-level annotations in the
Species_Syn_Current sheet of the official MDD spreadsheet.
This function is experimental and may change in future releases.
Resolve a mammal name against the local MDD backbone and return a structured
object built from mdd_checklist and mdd_synonyms, including the accepted
taxon record, synonym records, and grouped sections that mirror the main
information shown in MDD species pages.
mdd_taxon_info( name, checklist = NULL, synonyms = NULL, target_df = NULL, max_dist = 1, method = "osa" )mdd_taxon_info( name, checklist = NULL, synonyms = NULL, target_df = NULL, max_dist = 1, method = "osa" )
name |
A single scientific name. |
checklist |
Optional checklist data frame. Defaults to |
synonyms |
Optional synonym data frame. Defaults to |
target_df |
Optional reconciliation backbone from
|
max_dist |
Maximum string distance used if the input name needs fuzzy reconciliation. |
method |
Distance method passed to |
An object of class mdd_taxon_info.
checklist <- tibble::tibble( id = "1", sci_name = "Puma_concolor", genus = "Puma", specific_epithet = "concolor", authority_species_author = "Linnaeus", authority_species_year = "1758", order = "Carnivora", family = "Felidae", main_common_name = "Puma" ) synonyms <- tibble::tibble( mdd_syn_id = "1001", mdd_species_id = "1", mdd_author = "Linnaeus", mdd_original_combination = "Felis concolor", mdd_validity = "synonym", mdd_nomenclature_status = "available" ) x <- mdd_taxon_info( "Felis concolor", checklist = checklist, synonyms = synonyms ) x as.list(x)checklist <- tibble::tibble( id = "1", sci_name = "Puma_concolor", genus = "Puma", specific_epithet = "concolor", authority_species_author = "Linnaeus", authority_species_year = "1758", order = "Carnivora", family = "Felidae", main_common_name = "Puma" ) synonyms <- tibble::tibble( mdd_syn_id = "1001", mdd_species_id = "1", mdd_author = "Linnaeus", mdd_original_combination = "Felis concolor", mdd_validity = "synonym", mdd_nomenclature_status = "available" ) x <- mdd_taxon_info( "Felis concolor", checklist = checklist, synonyms = synonyms ) x as.list(x)
This function is experimental and may change in future releases.
Resolve a taxon by accepted MDD identifier or by scientific name against the local MDD name index. The returned object contains the match result, accepted taxon row, linked synonym rows, and the canonical MDD taxon URL.
mdd_taxon_record( name = NULL, taxon_id = NULL, checklist = NULL, synonyms = NULL, target_df = NULL, max_dist = 1, method = "osa" )mdd_taxon_record( name = NULL, taxon_id = NULL, checklist = NULL, synonyms = NULL, target_df = NULL, max_dist = 1, method = "osa" )
name |
Optional scientific name. |
taxon_id |
Optional accepted MDD taxon identifier. |
checklist |
Optional checklist data frame. |
synonyms |
Optional synonym data frame. |
target_df |
Optional reconciliation backbone or name index. |
max_dist |
Maximum string distance used if |
method |
Distance method passed to |
A list of class mdd_taxon_record.
checklist <- tibble::tibble( id = "1", sci_name = "Puma_concolor", genus = "Puma", specific_epithet = "concolor", authority_species_author = "Linnaeus" ) synonyms <- tibble::tibble( mdd_syn_id = "1001", mdd_species_id = "1", mdd_author = "Linnaeus", mdd_original_combination = "Felis concolor" ) mdd_taxon_record( name = "Felis concolor", checklist = checklist, synonyms = synonyms )checklist <- tibble::tibble( id = "1", sci_name = "Puma_concolor", genus = "Puma", specific_epithet = "concolor", authority_species_author = "Linnaeus" ) synonyms <- tibble::tibble( mdd_syn_id = "1001", mdd_species_id = "1", mdd_author = "Linnaeus", mdd_original_combination = "Felis concolor" ) mdd_taxon_record( name = "Felis concolor", checklist = checklist, synonyms = synonyms )
Auxiliary reference table of natural history museum collection metadata
distributed with an MDD release. Each row corresponds to one institution
that holds or has held type material cited in the MDD synonym table
(mdd_synonyms).
mdd_type_specimen_metadatamdd_type_specimen_metadata
A tibble with 138 rows and 5 variables, with source column names
normalized to snake_case during data import:
Standard acronym or abbreviation used to identify
the collection in the MDD and broader taxonomic literature (e.g.
"AMNH", "MNHN", "USNM"); corresponds to the collection codes
cited in the mdd_holotype and mdd_type_specimen_link fields of
mdd_synonyms.
Full official name of the institution or collection
(e.g. "American Museum of Natural History",
"Museum National d'Histoire Naturelle").
City and country where the institution is
located, formatted as "City, Country" (e.g.
"New York, United States of America").
Alternative or historical names by which the
collection has been known in the synonymic literature (e.g.
"British Museum (Natural History)" for BM,
"United States National Museum" for USNM); NA when no
alternative name is recorded.
URL to the institution's
online specimen database or collection portal, when publicly
available; NA when no online resource has been identified.
This table serves as a lookup reference linking collection abbreviations
used throughout mdd_synonyms to their full institutional names,
geographic locations, and online portals. It covers 138 institutions
spanning all major zoogeographic regions, from large international
collections such as AMNH, MNHN, and USNM to regional and national
natural history museums in Latin America, Asia, Africa, and Oceania.
The synonyms_notes column documents historical or colloquial collection
names that may appear in older taxonomic literature, aiding users who
encounter non-standard abbreviations in pre-MDD sources. Notable examples
include BM (formerly British Museum (Natural History)), MCZ
(Museum of Comparative Zoology, Harvard), and ZMA (Zoölogisch Museum,
Amsterdam, merged with RMNH in 2010).
URLs in online_website_database_if_available link directly to
mammalogy or specimen-search pages where possible, and to general
institutional pages otherwise. Link availability and validity may
change over time.
Mammal Diversity Database release archive
(https://www.mammaldiversity.org), distributed as
TypeSpecimenMetadata_...csv within each versioned release.
Print mdd_reference object
## S3 method for class 'mdd_reference' print(x, ...)## S3 method for class 'mdd_reference' print(x, ...)
x |
An object of class |
... |
Additional arguments. |
The mdd_reference object, invisibly.
Print mdd_taxon_info object
## S3 method for class 'mdd_taxon_info' print(x, ...)## S3 method for class 'mdd_taxon_info' print(x, ...)
x |
An object of class |
... |
Additional arguments. |
The mdd_taxon_info object, invisibly.