• About
  • Documentation

  • More Universes
  • Recent Updates
  • Leader board

  • All repositories
  • All packages
  • All articles
  • All datasets
  • All system Libraries
paulesantos
  • Builds
  • Packages
  • Articles
  • Datasets
  • Contribution
  • Badges
  • API
  • Feed

Links topaulesantos

fuzzystring - Fast Fuzzy String Joins for Data Frames

Perform fuzzy joins on data frames using approximate string matching. Implements inner, left, right, full, semi, and anti joins with string distance metrics from the 'stringdist' package, including Optimal String Alignment, Levenshtein, Damerau-Levenshtein, Jaro-Winkler, q-gram, cosine, Jaccard, and Soundex. Uses a 'data.table' backend plus compiled 'C++' result assembly to reduce overhead in large joins, while adaptive candidate planning avoids unnecessary distance evaluations in single-column string joins. Suitable for reconciling misspellings, inconsistent labels, and other near-match identifiers while optionally returning the computed distance for each match. Bibliographic references include Van der Loo, M. P. J. (2014) <https://CRAN.R-project.org/package=stringdist> and Robinson, D. (2015) <https://github.com/dgrtwo/fuzzyjoin>.

Last updated

cpp

5.48 score 5 stars 499 downloads

wcvpmatch - Taxonomic Name Reconciliation Against the 'WCVP' Backbone

Standardizes and reconciles scientific plant names against a World Checklist of Vascular Plants ('WCVP')-style taxonomic backbone. The package parses names into taxonomic components and applies staged exact and fuzzy matching for binomial and trinomial inputs, including infraspecific rank-aware checks. It also returns accepted-name context and row-level matching flags to support reproducible, auditable preprocessing for downstream biodiversity, spatial, and trait analyses. A user-supplied backbone can be passed through 'target_df'; when the optional companion package 'wcvpdata' is installed, its default checklist can also be used.

Last updated

plantwcvp

5.20 score 4 stars 451 downloads

avesperu - Access to the List of Bird Species of Peru

Provides access to the species checklist published in 'List of the Birds of Peru' by Plenge, M. A. and Angulo, F. (version 23-03-2026) <https://sites.google.com/site/boletinunop/checklist>. The package exposes the current Peru bird checklist as an R dataset and includes tools for species lookup, taxonomic reconciliation, and fuzzy matching of scientific names. These features help streamline taxonomic validation for researchers and conservationists.

Last updated

avesavesperubirdperu

5.13 score 3 stars 8 scripts 573 downloads

perumammals - Taxonomic Backbone and Name Validation Tools for Mammals of Peru

Provides a curated taxonomic backbone of mammal species from Peru based on Pacheco et al. (2021) "Lista actualizada de la diversidad de los mamíferos del Perú y una propuesta para su actualización" <doi:10.15381/rpb.v28i4.21019>. The package includes standardized species data, occurrence by ecoregions, endemism status, and tools for validating and matching scientific names through exact and fuzzy procedures. It is designed as a lightweight and dependable reference for ecological, environmental, biogeographic, and conservation workflows that require reliable species information for Peruvian mammals.

Last updated

4.78 score 1 stars 193 downloads

peruflorads43 - Check Threatened Plant Species Status Against Peru's Supreme Decree 043-2006-AG

Provides tools to match plant species names against the official threatened species list of Peru (Supreme Decree 043-2006-AG, 2006). Implements a hierarchical matching pipeline with exact, fuzzy, and suffix matching algorithms to handle naming variations and taxonomic changes. Supports both the original 2006 nomenclature and updated taxonomic names, allowing users to check protection status regardless of nomenclatural changes since the decree's publication. Threat categories follow International Union for Conservation of Nature standards (Critically Endangered, Endangered, Vulnerable, Near Threatened).

Last updated

ds043-2006-agfloraperu

4.54 score 7 scripts 194 downloads

rmdd - Resolve Mammal Names with the Mammal Diversity Database

Download, inspect, reconcile, and summarize mammal taxonomic names with the Mammal Diversity Database (MDD). Supports accepted names, synonyms, original combinations, distribution summaries, and mapped outputs derived from packaged MDD releases. Designed for reproducible mammal name resolution workflows in 'R'.

Last updated

diversitymammals

4.30 score 500 downloads

ppendemic - A Glimpse at the Diversity of Peru's Endemic Plants

Introducing a novel and updated database showcasing Peru's endemic plants. This meticulously compiled and revised botanical collection encompasses a remarkable assemblage of over 7,898 distinct species. The data for this resource was sourced from the work of Govaerts, R., Nic Lughadha, E., Black, N. et al., titled 'The World Checklist of Vascular Plants: A continuously updated resource for exploring global plant diversity', published in Sci Data 8, 215 (2021) <doi:10.1038/s41597-021-00997-6>.

Last updated

endemicperuplants

4.13 score 9 stars 8 scripts 253 downloads

reptiledbr - Interface to the Reptile Database for Querying and Retrieving Taxonomic Data

Provides tools to search, access, and format taxonomic information from the Reptile Database (<http://reptile-database.org>) directly within R. Users can retrieve species-level data, distribution, etymology, synonyms, common names, and other relevant information for reptiles. Designed for taxonomists, ecologists, and biodiversity researchers.

Last updated

3.95 score 1 stars 7 scripts 5 downloads

rsdot - Programmatic Access to Spatial Data from 'SDOT-PCM' Peru

Provides programmatic access to official geographic information from the Secretariat of Demarcation and Territorial Organization ('SDOT') of the Presidency of the Council of Ministers ('PCM') of Peru. Facilitates the download, reading, and manipulation in 'R' of vector layers on geographic modeling, National Institute of Statistics and Informatics ('INEI') census limits, population centers, infrastructure, risks, and hazards. IMPORTANT: 'INEI' census limit information is referential and does not constitute official territorial limits nor has legal demarcatory effect.

Last updated

3.65 score

tidyttmoment - Functional Trait Moment Calculation

Calculates the community four 'moments' (mean, variance, skewness, and kurtosis) of a given trait based on the moments described in Wieczynski et al. (2019) <doi:10.1073/pnas.1813723116>. These functional metrics are extremely useful in characterizing the distribution of traits in a plant community. It also provides tidyverse-friendly wrappers to seamlessly calculate advanced functional diversity indices (e.g., FDis, Rao's Q) using 'fundiversity' (Grenie et al. 2023 <doi:10.1111/ecog.06585>) and functional rarity indices using 'funrar' (Grenie et al. 2017 <doi:10.1111/ddi.12629>). Evaluating these community-weighted moments and diversity metrics allows researchers to evaluate shifts in optimal phenotypes and understand ecological filtering with exactness.

Last updated

3.00 score 155 downloads

redbookperu - Access and Analyze Data from the Red Book of Endemic Plants of Peru

Provides access to and analysis of data from "The Red Book of Endemic Plants of Peru" (León, B., Roque, J., Ulloa, C., Jorgensen, P.M., Pitman, N., Cano, A. 2006) <doi:10.15381/rpb.v13i2.1782>. This package offers comprehensive taxonomic, geographic, and conservation information about Peru's endemic plant species. It includes functions to verify species inclusion, obtain updated taxonomic details, and explore the dataset.

Last updated

2.70 score 1 stars 5 scripts 163 downloads

wcvpdata - World Checklist of Vascular Plants Data

Provides a local interface to the 'World Checklist of Vascular Plants' (WCVP) <https://powo.science.kew.org/about-wcvp> (Version 15, January 2026), sourced from 'Plants of the World Online' <https://powo.science.kew.org/>. Bundles standardized snapshots of global taxonomic and geographic distribution data for vascular plants, designed to support large-scale botanical research and biodiversity analysis directly in 'R'.

Last updated

plantwcvp

2.18 score