CRAN release: 2022-12-07
All grouped resampling functions (including group_bootstraps()) now support stratification. Strata must be constant within each group (@mikemahoney218, #317, #360, #363, #364, #365).
Added a new function, clustering_cv(), for blocked cross-validation in various predictor spaces. This is a very flexible function with arguments including cluster_function, allowing it to be used for spatial clustering as well as potentially phylogenetic and other forms of clustering (@mikemahoney218, #351).
The dependency on the ellipsis package has been removed (#393).
Removed an overly strict test in preparation for dplyr 1.1.0 (#380).
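As a rough illustration of the clustering_cv() entry above, the sketch below clusters the rows of the built-in mtcars data and holds out one cluster per fold. The data set, the three columns, and v = 3 are arbitrary choices made for this example and are not taken from the release notes.

```r
library(rsample)

set.seed(123)  # the default k-means clustering is randomly initialised

# Assumption: cluster on three numeric columns of mtcars, chosen arbitrarily.
folds <- clustering_cv(mtcars, vars = c(mpg, disp, hp), v = 3)

# Each fold holds out one cluster, so assessment sets can differ in size.
sapply(folds$splits, function(s) nrow(assessment(s)))
```

Because folds follow cluster membership rather than random assignment, fold sizes are generally unequal.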
CRAN release: 2022-08-08
rset objects now include all parameters used to create them as attributes (#329).
Objects returned by sliding functions now have an index attribute, where appropriate, containing the column name used as an index (#329).
pool as attributes to all functions which support stratification (#329).
Changed the “strata” attribute on rset objects so that it is now either a character vector identifying the column used to stratify the data or not present (set to NULL) if stratification was not used (#329).
Added arguments to control how group_vfold_cv() combines groups. Use balance = "groups" to assign (roughly) the same number of groups to each fold, or balance = "observations" to assign (roughly) the same number of observations to each fold.
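The two balance options for group_vfold_cv() can be compared with a small sketch. The toy data frame below is hypothetical: six groups of deliberately unequal size, so that the two strategies are likely to produce different fold sizes.

```r
library(rsample)

set.seed(123)

# Hypothetical toy data: six groups of very unequal size.
toy <- data.frame(
  id = rep(letters[1:6], times = c(2, 2, 2, 2, 10, 10)),
  x = rnorm(28)
)

by_groups <- group_vfold_cv(toy, group = id, v = 3, balance = "groups")
by_obs <- group_vfold_cv(toy, group = id, v = 3, balance = "observations")

# Assessment-set sizes per fold under each strategy.
held_out <- function(rset) sapply(rset$splits, function(s) nrow(assessment(s)))
held_out(by_groups)
held_out(by_obs)
```

Under either option, every group is kept whole within a single fold; only the way groups are distributed across folds changes.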
CRAN release: 2022-06-24
Note: Using an external vector in selections is ambiguous. i Use `all_of(strata)` instead of `strata` to silence this message. i See <https://tidyselect.r-lib.org/reference/faq-external-vector.html>.
- Added better printing methods for initial split objects.
CRAN release: 2021-11-08
Updated documentation on stratified sampling (#245).
make_splits() to an S3 generic, with the original functionality a method for list and a new method for data frames that allows users to create a split from existing analysis and assessment sets (@LiamBlake, #246).
Changed resample “fingerprint” to hash the indices only rather than the entire resample result (including the data object). This is much faster and will still ensure the same resample for the same original data object (#259).
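The data frame method for make_splits() described in this release can be sketched as follows. The split of mtcars into the first 24 and last 8 rows is an arbitrary assumption standing in for analysis and assessment sets that already exist.

```r
library(rsample)

# Assumption: two pre-existing data frames with identical columns.
train <- mtcars[1:24, ]
test <- mtcars[25:32, ]

# Build a single rsplit object from the two sets.
split <- make_splits(train, assessment = test)

nrow(analysis(split))
nrow(assessment(split))
```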
CRAN release: 2021-05-08
prop argument to first compute the assessment indices, rather than the analysis indices. This is a minor but breaking change in some situations; the previous implementation could cause an inconsistency in the sizes of the generated analysis and assessment sets when compared to how prop is documented to function (#217, @issactoast).
Re-licensed package from GPL-2 to MIT. See consent from copyright holders here.
Attempts to stratify on a Surv object now error more informatively (#230).
CRAN release: 2021-02-17
rset objects now contain a “fingerprint” attribute that can be used to check whether the same object uses the same resamples.
A few internal functions were exported so that rsample-adjacent packages can use the same underlying code.
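One way the “fingerprint” attribute might be used is to confirm that two rset objects hold the same resamples. The sketch below reads the attribute directly with attr(); the seeds and the use of vfold_cv() on mtcars are assumptions made for illustration.

```r
library(rsample)

set.seed(1)
folds_a <- vfold_cv(mtcars, v = 5)
set.seed(1)
folds_b <- vfold_cv(mtcars, v = 5)
set.seed(2)
folds_c <- vfold_cv(mtcars, v = 5)

# Same seed and data, so the fingerprints should match.
identical(attr(folds_a, "fingerprint"), attr(folds_b, "fingerprint"))

# A different seed should give different resamples and a different fingerprint.
identical(attr(folds_a, "fingerprint"), attr(folds_c, "fingerprint"))
```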
CRAN release: 2020-09-23
manual_rset() for constructing rset objects manually from custom rsplits (tidymodels/tune#273).
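A minimal sketch of manual_rset(), assuming two hand-built splits over a ten-row data frame. The data, the row indices, and the id labels are all hypothetical.

```r
library(rsample)

df <- data.frame(x = 1:10)

# Two custom rsplit objects defined by explicit row indices.
split_a <- make_splits(list(analysis = 1:7, assessment = 8:10), data = df)
split_b <- make_splits(list(analysis = 4:10, assessment = 1:3), data = df)

# Combine them into an rset with user-chosen identifiers.
custom <- manual_rset(list(split_a, split_b), ids = c("Split A", "Split B"))
custom
```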
CRAN release: 2020-06-04
Lower threshold for pooling strata to 10% (from 15%) (#149).
val_split objects were adjusted to show
two_class_dat data sets were removed. They are in the
CRAN release: 2020-03-31
validation_set() for making a single resample.
Correct the tidy method for bootstraps (#115).
Changes for the upcoming `tibble` release.
Exported constructors for
CRAN release: 2019-07-12
- Added three functions to compute different bootstrap confidence intervals.
- A new function (add_resample_id()) augments a data frame with columns for the resampling identifier.
group_vfold_cv() to use tidyselect on the stratification variable.
breaks parameter that specifies the number of bins to stratify by for a numeric stratification variable.
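The add_resample_id() function mentioned in this release can be sketched as follows, assuming a 4-fold cross-validation of mtcars chosen purely for illustration.

```r
library(rsample)

set.seed(363)
folds <- vfold_cv(mtcars, v = 4)
first_split <- folds$splits[[1]]

# Attach the resample label to the analysis rows of the first split.
labelled <- add_resample_id(analysis(first_split), first_split)

# Every row carries the identifier of the split it came from.
unique(labelled$id)
```

This can be handy when analysis sets from several splits are stacked into one data frame and the originating resample needs to stay visible.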
CRAN release: 2019-01-07
Small maintenance release.
fill() was removed per the deprecation warning.
- Small changes were made for the new version of
CRAN release: 2018-11-20
- Added function initial_time_split() for ordered initial sampling appropriate for time series data.
Changed the R version requirement to be R >= 3.1 instead of 3.3.3.
prepper function was moved to the recipes package. This makes the rsample install footprint much smaller.
rsplit objects are shown differently inside of a tibble.
Moved from the broom package to the
CRAN release: 2017-11-12
testing were added to do training/testing splits prior to resampling.
- Another resampling method, group_vfold_cv, was added.
rset objects to those used by
- A function called form_pred can be used to determine the original names of the predictors in a formula or
- A vignette and a function (prepper) were included to facilitate using the
gather method was added for
labels method was added for rsplit objects. This can help identify which resample is being used even when the whole rset object is not available.
- A variety of dplyr methods were added (e.g. mutate, etc.) that work without dropping classes or attributes of the