kgrams 0.2.1
- Added Suggests dependency from tibble (#32).
kgrams 0.2.0
Breaking changes
- tknz_sent()and- preprocess()now have a
different implementation on Windows and UNIX OSs, respectively (since
the previous C++ implementation has impredictable behaviour on Windows,
see #30). This fix also included minor changes in the- tknz_sent()output, in some corner cases
(e.g.- tknz_sent("")now returns- character(0),
wheareas it used to return- "").
New features
- perplexity()gets a new argument- expthat
allows to return the cross-entropy per word, rather than perplexity (its
exponential).
- perplexity.character()gets a new argument- detailedthat allows to return, alongside with the total
perplexity of the input document, also the cross-entropies and word
lengths of individual sentences. Closes #28.
Improvements
- Minor documentation improvements.
kgrams 0.1.5
- Removed “Tools for…” at the beginning of package DESCRIPTION, as per
CRAN’s request.
- Simplified examples in ?kgram_freqs.
kgrams 0.1.4
kgrams 0.1.3
- Remove dependency from external online sources in vignette.
kgrams 0.1.2
Overall Software
Improvements
- The package’s test suite has been greatly extended.
- Improved error/warning conditions for wrong arguments.
- Re-enabled compiler diagnostics as per CRAN policy (#19)
API Changes
- verbosearguments now default to- FALSE.
- probability(),- perplexity()and- sample_sentences()are restricted to accept only- language_modelclass objects as their- modelargument.
New features
- as_dictionary(NULL)now returns an empty- dictionary.
Bug Fixes
- Fixed bug causing .preprocessand.tknz_sentarguments to be ignored inprocess_sentences().
- Fixed previously wrong defaults for max_linesandbatch_sizearguments inkgram_freqs.connection().
- Added print method for class dictionary.
- Fixed bug causing invalid results in dictionary()with
batch processing and non-trivial size constraints on vocabulary
size.
Other
- Maintainer’s email updated