EVOLUTION-MANAGER
Edit File: readHTMLList.html
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Read data in an HTML list or all lists in a document</title> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <link rel="stylesheet" type="text/css" href="R.css" /> </head><body> <table width="100%" summary="page for readHTMLList {XML}"><tr><td>readHTMLList {XML}</td><td style="text-align: right;">R Documentation</td></tr></table> <h2>Read data in an HTML list or all lists in a document</h2> <h3>Description</h3> <p>This function and its methods are somewhat similar to <code><a href="readHTMLTable.html">readHTMLTable</a></code> but read the contents of lists in an HTML document. We can specify the URL of the document or an already parsed document or an individual node within the document. </p> <h3>Usage</h3> <pre> readHTMLList(doc, trim = TRUE, elFun = xmlValue, which = integer(), ...) </pre> <h3>Arguments</h3> <table summary="R argblock"> <tr valign="top"><td><code>doc</code></td> <td> <p>the URL of the document or the parsed HTML document or an individual node.</p> </td></tr> <tr valign="top"><td><code>trim</code></td> <td> <p>a logical value indicating whether we should remove leading and trailing white space in each list item when returning it</p> </td></tr> <tr valign="top"><td><code>elFun</code></td> <td> <p>a function that is used to process each list item node (<code>li</code>). This provides an opportunity to customize how each node is processed, for example accessing attributes on the list item or on its contents such as links in the items.</p> </td></tr> <tr valign="top"><td><code>which</code></td> <td> <p>an index or name which or vector of same which identifies which list nodes to process in the overall document. This is for subsetting particular lists rather than processing them all.</p> </td></tr> <tr valign="top"><td><code>...</code></td> <td> <p>additional arguments passed to <code><a href="xmlTreeParse.html">htmlParse</a></code> and for the specific methods.</p> </td></tr> </table> <h3>Value</h3> <p>A list of character vectors or lists, with one element for each list in the document. If only one list is being read (by specifying <code>which</code> as a single identifier), that is returned as is. </p> <h3>Author(s)</h3> <p>Duncan Temple Lang </p> <h3>See Also</h3> <p><code><a href="readHTMLTable.html">readHTMLTable</a></code> </p> <h3>Examples</h3> <pre> readHTMLList("http://www.omegahat.net") </pre> <hr /><div style="text-align: center;">[Package <em>XML</em> version 3.99-0.3 <a href="00Index.html">Index</a>]</div> </body></html>