EVOLUTION-MANAGER
Edit File: word-counting.html
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Count words, sentences, characters</title> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <link rel="stylesheet" type="text/css" href="R.css" /> </head><body> <table width="100%" summary="page for count_words {tokenizers}"><tr><td>count_words {tokenizers}</td><td style="text-align: right;">R Documentation</td></tr></table> <h2>Count words, sentences, characters</h2> <h3>Description</h3> <p>Count words, sentences, and characters in input texts. These functions use the <code>stringi</code> package, so they handle the counting of Unicode strings (e.g., characters with diacritical marks) in a way that makes sense to people counting characters. </p> <h3>Usage</h3> <pre> count_words(x) count_characters(x) count_sentences(x) </pre> <h3>Arguments</h3> <table summary="R argblock"> <tr valign="top"><td><code>x</code></td> <td> <p>A character vector or a list of character vectors. If <code>x</code> is a character vector, it can be of any length, and each element will be tokenized separately. If <code>x</code> is a list of character vectors, each element of the list should have a length of 1.</p> </td></tr> </table> <h3>Value</h3> <p>An integer vector containing the counted elements. If the input vector or list has names, they will be preserved. </p> <h3>Examples</h3> <pre> count_words(mobydick) count_sentences(mobydick) count_characters(mobydick) </pre> <hr /><div style="text-align: center;">[Package <em>tokenizers</em> version 0.2.3 <a href="00Index.html">Index</a>]</div> </body></html>