MDT 2004 John Fleck | 4c3bb7d | 2004-08-25 02:51:27 +0000 | [diff] [blame] | 1 | <html><head><meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Retrieving Element Content</title><meta name="generator" content="DocBook XSL Stylesheets V1.61.2"><link rel="home" href="index.html" title="Libxml Tutorial"><link rel="up" href="index.html" title="Libxml Tutorial"><link rel="previous" href="ar01s03.html" title="Parsing the file"><link rel="next" href="ar01s05.html" title="Using XPath to Retrieve Element Content"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table width="100%" summary="Navigation header"><tr><th colspan="3" align="center">Retrieving Element Content</th></tr><tr><td width="20%" align="left"><a accesskey="p" href="ar01s03.html">Prev</a> </td><th width="60%" align="center"> </th><td width="20%" align="right"> <a accesskey="n" href="ar01s05.html">Next</a></td></tr></table><hr></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="xmltutorialgettext"></a>Retrieving Element Content</h2></div></div><div></div></div><p><a class="indexterm" name="id2525439"></a> |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 2 | Retrieving the content of an element involves traversing the document |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 3 | tree until you find what you are looking for. In this case, we are looking |
MDT 2003 John Fleck | 63f3a47 | 2003-07-24 21:48:30 +0000 | [diff] [blame] | 4 | for an element called "keyword" contained within element called "story". The |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 5 | process to find the node we are interested in involves tediously walking the |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 6 | tree. We assume you already have an xmlDocPtr called <tt class="varname">doc</tt> |
| 7 | and an xmlNodPtr called <tt class="varname">cur</tt>.</p><p> |
| 8 | </p><pre class="programlisting"> |
MST 2002 John Fleck | 7c67a83 | 2002-12-16 13:38:06 +0000 | [diff] [blame] | 9 | <a name="getchildnode"></a><img src="images/callouts/1.png" alt="1" border="0">cur = cur->xmlChildrenNode; |
| 10 | <a name="huntstoryinfo"></a><img src="images/callouts/2.png" alt="2" border="0">while (cur != NULL) { |
MDT 2003 John Fleck | 63f3a47 | 2003-07-24 21:48:30 +0000 | [diff] [blame] | 11 | if ((!xmlStrcmp(cur->name, (const xmlChar *)"storyinfo"))){ |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 12 | parseStory (doc, cur); |
| 13 | } |
| 14 | |
| 15 | cur = cur->next; |
| 16 | } |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 17 | </pre><p> |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 18 | |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 19 | </p><div class="calloutlist"><table border="0" summary="Callout list"><tr><td width="5%" valign="top" align="left"><a href="#getchildnode"><img src="images/callouts/1.png" alt="1" border="0"></a> </td><td valign="top" align="left"><p>Get the first child node of <tt class="varname">cur</tt>. At this |
| 20 | point, <tt class="varname">cur</tt> points at the document root, which is |
MDT 2003 John Fleck | 63f3a47 | 2003-07-24 21:48:30 +0000 | [diff] [blame] | 21 | the element "story".</p></td></tr><tr><td width="5%" valign="top" align="left"><a href="#huntstoryinfo"><img src="images/callouts/2.png" alt="2" border="0"></a> </td><td valign="top" align="left"><p>This loop iterates through the elements that are children of |
| 22 | "story", looking for one called "storyinfo". That |
| 23 | is the element that will contain the "keywords" we are |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 24 | looking for. It uses the <span class="application">libxml</span> string |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 25 | comparison |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 26 | function, <tt class="function"><a href="http://xmlsoft.org/html/libxml-parser.html#XMLSTRCMP" target="_top">xmlStrcmp</a></tt>. If there is a match, it calls the function <tt class="function">parseStory</tt>.</p></td></tr></table></div><p> |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 27 | </p><p> |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 28 | </p><pre class="programlisting"> |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 29 | void |
| 30 | parseStory (xmlDocPtr doc, xmlNodePtr cur) { |
| 31 | |
MST 2002 John Fleck | 7c67a83 | 2002-12-16 13:38:06 +0000 | [diff] [blame] | 32 | xmlChar *key; |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 33 | <a name="anothergetchild"></a><img src="images/callouts/1.png" alt="1" border="0"> cur = cur->xmlChildrenNode; |
| 34 | <a name="findkeyword"></a><img src="images/callouts/2.png" alt="2" border="0"> while (cur != NULL) { |
MDT 2003 John Fleck | 63f3a47 | 2003-07-24 21:48:30 +0000 | [diff] [blame] | 35 | if ((!xmlStrcmp(cur->name, (const xmlChar *)"keyword"))) { |
MST 2002 John Fleck | 7c67a83 | 2002-12-16 13:38:06 +0000 | [diff] [blame] | 36 | <a name="foundkeyword"></a><img src="images/callouts/3.png" alt="3" border="0"> key = xmlNodeListGetString(doc, cur->xmlChildrenNode, 1); |
MDT 2003 John Fleck | 63f3a47 | 2003-07-24 21:48:30 +0000 | [diff] [blame] | 37 | printf("keyword: %s\n", key); |
MST 2002 John Fleck | 7c67a83 | 2002-12-16 13:38:06 +0000 | [diff] [blame] | 38 | xmlFree(key); |
| 39 | } |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 40 | cur = cur->next; |
| 41 | } |
| 42 | return; |
| 43 | } |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 44 | </pre><p> |
MST 2002 John Fleck | bd3b4fd | 2002-11-11 03:41:11 +0000 | [diff] [blame] | 45 | </p><div class="calloutlist"><table border="0" summary="Callout list"><tr><td width="5%" valign="top" align="left"><a href="#anothergetchild"><img src="images/callouts/1.png" alt="1" border="0"></a> </td><td valign="top" align="left"><p>Again we get the first child node.</p></td></tr><tr><td width="5%" valign="top" align="left"><a href="#findkeyword"><img src="images/callouts/2.png" alt="2" border="0"></a> </td><td valign="top" align="left"><p>Like the loop above, we then iterate through the nodes, looking |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 46 | for one that matches the element we're interested in, in this case |
MDT 2003 John Fleck | 63f3a47 | 2003-07-24 21:48:30 +0000 | [diff] [blame] | 47 | "keyword".</p></td></tr><tr><td width="5%" valign="top" align="left"><a href="#foundkeyword"><img src="images/callouts/3.png" alt="3" border="0"></a> </td><td valign="top" align="left"><p>When we find the "keyword" element, we need to print |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 48 | its contents. Remember that in <span class="acronym">XML</span>, the text |
MDT 2002 John Fleck | 598f6eb | 2002-06-04 15:10:36 +0000 | [diff] [blame] | 49 | contained within an element is a child node of that element, so we |
MST 2003 John Fleck | 731967e | 2003-01-27 00:39:50 +0000 | [diff] [blame] | 50 | turn to <tt class="varname">cur->xmlChildrenNode</tt>. To retrieve it, we |
MST 2004 John Fleck | d14bccc | 2004-02-15 01:57:42 +0000 | [diff] [blame] | 51 | use the function <tt class="function"><a href="http://xmlsoft.org/html/libxml-tree.html#XMLNODELISTGETSTRING" target="_top">xmlNodeListGetString</a></tt>, which also takes the <tt class="varname">doc</tt> pointer as an argument. In this case, we just print it out.</p><div class="note" style="margin-left: 0.5in; margin-right: 0.5in;"><table border="0" summary="Note"><tr><td rowspan="2" align="center" valign="top" width="25"><img alt="[Note]" src="images/note.png"></td><th align="left">Note</th></tr><tr><td colspan="2" align="left" valign="top"><p>Because <tt class="function">xmlNodeListGetString</tt> allocates |
MST 2002 John Fleck | 7c67a83 | 2002-12-16 13:38:06 +0000 | [diff] [blame] | 52 | memory for the string it returns, you must use |
MST 2004 John Fleck | d14bccc | 2004-02-15 01:57:42 +0000 | [diff] [blame] | 53 | <tt class="function">xmlFree</tt> to free it.</p></td></tr></table></div></td></tr></table></div><p> |
MDT 2003 John Fleck | 63f3a47 | 2003-07-24 21:48:30 +0000 | [diff] [blame] | 54 | </p></div><div class="navfooter"><hr><table width="100%" summary="Navigation footer"><tr><td width="40%" align="left"><a accesskey="p" href="ar01s03.html">Prev</a> </td><td width="20%" align="center"><a accesskey="u" href="index.html">Up</a></td><td width="40%" align="right"> <a accesskey="n" href="ar01s05.html">Next</a></td></tr><tr><td width="40%" align="left" valign="top">Parsing the file </td><td width="20%" align="center"><a accesskey="h" href="index.html">Home</a></td><td width="40%" align="right" valign="top"> Using XPath to Retrieve Element Content</td></tr></table></div></body></html> |