Updates to documentation (#109) thanks to Emil Hvitfeldt. Add new tokenizers for tweets, Penn Treebank to unnest_tokens(). Better error message (#111) and code styling. Declare dependency for tests
Change how NA values are handled in unnest_tokens so they no longer cause other columns to become NA...
Added documentation for n-grams, skip n-grams, and regex Added codecov and appveyor Added tidiers fo...
WordTokenizers v0.5.5 Diff since v0.5.4 Merged pull requests: Update paper.bib (#47) (@kthyng) Upda...
hunspell is now a suggested dependency, thanks to @MichaelChirico (#221) Added stm() tidiers for hig...
Updates to documentation (#102), README, and vignettes. Add tokenizing by character shingles thanks ...
Wrapper tokenization functions for n-grams, characters, sentences, tweets, and more, thanks to @Coli...
get_sentiments now works regardless of whether tidytext has been loaded or not (#50). unnest_tokens ...
Fix tidier for quanteda dictionary for correct class (#71). Add a pkgdown site. Convert NSE from und...
unnest_tokens can now unnest a data frame with a list column (which formerly threw the error unnest_...
Use vdiffr conditionally Bug fix/breaking change for collapse argument to unnest_functions(). This a...
reorder_within() now handles multiple variables, thanks to @tmastny (#170) Move stopwords to Suggest...
scale_x/y_reordered() now uses a function labels as its main input (#200) Fixed how to_lower is pass...
Improvements to documentation (#117) Fix for NSE thanks to @lepennec (#122). Tidier for estimated re...
Check for installation of stopwords more gracefully Update tidiers and casters for new version of qu...
Fix bug in augment() function for stm topic model. Warn when tf-idf is negative, thanks to @EmilHvit...
Change how NA values are handled in unnest_tokens so they no longer cause other columns to become NA...
Added documentation for n-grams, skip n-grams, and regex Added codecov and appveyor Added tidiers fo...
WordTokenizers v0.5.5 Diff since v0.5.4 Merged pull requests: Update paper.bib (#47) (@kthyng) Upda...
hunspell is now a suggested dependency, thanks to @MichaelChirico (#221) Added stm() tidiers for hig...
Updates to documentation (#102), README, and vignettes. Add tokenizing by character shingles thanks ...
Wrapper tokenization functions for n-grams, characters, sentences, tweets, and more, thanks to @Coli...
get_sentiments now works regardless of whether tidytext has been loaded or not (#50). unnest_tokens ...
Fix tidier for quanteda dictionary for correct class (#71). Add a pkgdown site. Convert NSE from und...
unnest_tokens can now unnest a data frame with a list column (which formerly threw the error unnest_...
Use vdiffr conditionally Bug fix/breaking change for collapse argument to unnest_functions(). This a...
reorder_within() now handles multiple variables, thanks to @tmastny (#170) Move stopwords to Suggest...
scale_x/y_reordered() now uses a function labels as its main input (#200) Fixed how to_lower is pass...
Improvements to documentation (#117) Fix for NSE thanks to @lepennec (#122). Tidier for estimated re...
Check for installation of stopwords more gracefully Update tidiers and casters for new version of qu...
Fix bug in augment() function for stm topic model. Warn when tf-idf is negative, thanks to @EmilHvit...
Change how NA values are handled in unnest_tokens so they no longer cause other columns to become NA...
Added documentation for n-grams, skip n-grams, and regex Added codecov and appveyor Added tidiers fo...
WordTokenizers v0.5.5 Diff since v0.5.4 Merged pull requests: Update paper.bib (#47) (@kthyng) Upda...