Title: | R Interface to Pullword Service |
Description: | R Interface to Pullword Service for natural language processing in Chinese. It enables users to extract valuable words from text by deep learning models. For more details please visit the official site (in Chinese) http://www.pullword.com/. |
Version: | 0.3 |
Author: | Tong He <hetong007@gmail.com> |
Maintainer: | Tong He <hetong007@gmail.com> |
BugReports: | https://github.com/hetong007/pullword/issues |
Depends: | R (≥ 3.0.2), RCurl |
License: | LGPL-2 | LGPL-2.1 | LGPL-3 [expanded from: LGPL] |
NeedsCompilation: | yes |
Packaged: | 2021-07-13 07:40:34 UTC; ubuntu |
Repository: | CRAN |
Date/Publication: | 2021-07-13 09:10:02 UTC |
R Interface of PullWord service
Description
This function handles communication with the server. The result is parsed into a vector or a matrix, depending on whether probabilities are shown.
pullword
Usage
pullword(input = NULL, file = NULL, threshold = 0, showProb = FALSE)
Arguments
input |
The input text. |
file |
The input file. |
threshold |
The minimum probability for the words appearing in the result. Should be a real value between 0 and 1. |
showProb |
logical. Whether to show the probability of each word. The return value is a vector or a matrix, depending on this setting. |
Examples
library(pullword)
# Requires network access, since the function communicates with the server.
pullword('Replace this field with a Chinese sentence.', threshold = 0, showProb = TRUE)
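Because `threshold` is documented as a real value between 0 and 1 and every call goes to a remote server, it can help to check arguments locally first. The following is a minimal sketch under that assumption. The wrapper `extract_words` and its `show_prob` argument are hypothetical names and not part of the package; only the final `pullword()` call uses the documented signature.

```r
# Hypothetical wrapper (not part of the pullword package): validate arguments
# locally before contacting the remote Pullword service.
extract_words <- function(text, threshold = 0, show_prob = FALSE) {
  # threshold is documented as a real value between 0 and 1
  if (!is.numeric(threshold) || length(threshold) != 1L ||
      is.na(threshold) || threshold < 0 || threshold > 1) {
    stop("threshold must be a single number between 0 and 1")
  }
  # Accept exactly one non-missing, non-empty string
  if (!is.character(text) || length(text) != 1L ||
      is.na(text) || !nzchar(text)) {
    stop("text must be a single non-empty string")
  }
  if (!requireNamespace("pullword", quietly = TRUE)) {
    stop("the pullword package is not installed")
  }
  # Requires network access; this call fails if the service is unreachable.
  pullword::pullword(input = text, threshold = threshold, showProb = show_prob)
}
```

A call such as `extract_words(my_sentence, threshold = 0.5, show_prob = TRUE)` would then stop early on an invalid threshold instead of sending a request.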
Indicate whether the encoding of input string is BIG5.
Description
Indicates whether the input string is encoded in BIG5.
Usage
isBIG5(string, combine = FALSE)
Arguments
string |
A character vector. |
combine |
Whether to combine all the strings. |
Value
Logical value.
Author(s)
Jian Li <rweibo@sina.com>
Indicate whether the encoding of input string is GB18030.
Description
Indicates whether the input string is encoded in GB18030.
Usage
isGB18030(string, combine = FALSE)
Arguments
string |
A character vector. |
combine |
Whether to combine all the strings. |
Value
Logical value.
Author(s)
Jian Li <rweibo@sina.com>
Indicate whether the encoding of input string is GB2312.
Description
Indicates whether the input string is encoded in GB2312.
Usage
isGB2312(string, combine = FALSE)
Arguments
string |
A character vector. |
combine |
Whether to combine all the strings. |
Value
Logical value.
Author(s)
Jian Li <rweibo@sina.com>
Indicate whether the encoding of input string is GBK.
Description
Indicates whether the input string is encoded in GBK.
Usage
isGBK(string, combine = FALSE)
Arguments
string |
A character vector. |
combine |
Whether to combine all the strings. |
Value
Logical value.
Author(s)
Jian Li <rweibo@sina.com>
Indicate whether the encoding of input string is UTF-8.
Description
Indicates whether the input string is encoded in UTF-8.
Usage
isUTF8(string, combine = FALSE)
Arguments
string |
A character vector. |
combine |
Whether to combine all the strings. |
Value
Logical value.
Author(s)
Jian Li <rweibo@sina.com>
Convert encoding of Chinese string to UTF-8.
Description
Converts the encoding of Chinese strings to UTF-8.
Usage
toUTF8(cnstring)
Arguments
cnstring |
A character vector of Chinese strings. |
Value
The converted character vector.
Author(s)
Jian Li <rweibo@sina.com>
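To illustrate what the encoding checks and the UTF-8 conversion above operate on, here is a minimal base R sketch. It does not call this package's functions and does not claim to mirror their implementation. `iconv()` and `validUTF8()` are base R (the latter needs R 3.3.0 or later), and GBK support in `iconv()` is assumed to be available on the platform.

```r
# Base R only; none of this calls the pullword package.
x_utf8 <- "\u4e2d\u6587"   # two Chinese characters, written as Unicode escapes

# Re-encode the same text as GBK bytes (GBK support in iconv() is platform dependent).
x_gbk <- iconv(x_utf8, from = "UTF-8", to = "GBK")

# validUTF8() checks whether the bytes of each element form valid UTF-8.
is_utf8_before <- validUTF8(x_gbk)

# Convert back to UTF-8, declaring the source encoding explicitly.
x_back <- iconv(x_gbk, from = "GBK", to = "UTF-8")
is_utf8_after <- validUTF8(x_back)

# The round trip should reproduce the original text.
round_trip_ok <- isTRUE(x_back == x_utf8)
```

The conversion only succeeds here because the source encoding is known, which is why a detection step before conversion is useful when the encoding of the input is uncertain.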