3.XPath

使用XPath可以在不遍历xml文档的情况下选择具体节点。

转自https://www.cnblogs.com/vaevvaev/p/6928201.html

XPath可以快速定位到Xml中的节点或者属性。XPath语法很简单，但是强大够用，它也是使用xslt的基础知识。
示例Xml：

<?xml version="1.0" encoding="utf-8" ?>

<pets>

  <cat color="black" weight="10">

    <price>100</price>

    <desc>this is a black cat</desc>

  </cat>

  <cat color="white" weight="9">

    <price>80</price>

    <desc>this is a white cat</desc>

  </cat>

  <cat color="yellow" weight="15">

    <price>80</price>

    <desc>this is a yellow cat</desc>

  </cat>

  <dog color="black" weight="10">

    <price>100</price>

    <desc>this is a black dog</desc>

  </dog>

  <dog color="white" weight="9">

    <price>80</price>

    <desc>this is a white dog</desc>

  </dog>

  <dog color="yellow" weight="15">

    <price>80</price>

    <desc>this is a yellow dog</desc>

  </dog>

</pets>

XPath的语法：
1. XPath中的符号

符号	说明	示例	示例说明
/	表示从根节点开始选择	/pets	选择根节点pets
/	表示节点和子节点之间的间隔符	/pets/dog	选择pets节点下的dog节点
//xx	表示从整个xml文档中查找，而不考虑当前节点位置	//price	选择文档中所有的price节点
.	单个英文半角句点表示选择当前节点	/pets/.	选择pets节点
..	双点，表示选择父节点	/pets/dog[0]/..	表示pets节点，也就是第一个dog节点的父节点
@xx	表示选择属性	//dog/@color	表示选择所有dog节点的color属性集合
[…]	中括号表示选择条件，括号内为条件	//dog[@color=’white’]	所有color为white的dog节点
	中括号表示选择条件，括号内为条件	//dog[/price<100]	所有price字节点值小于100的dog节点
	中括号内数字为节点索引，类似c#等语言中的数组,数组下标是从1开始的	//dog[1]	第1个dog节点
	中括号内数字为节点索引，类似c#等语言中的数组,数组下标是从1开始的	//dog[last()]	最后一个dog节点，last()是xPath内置函数
\|	单竖杠表示合并节点结合	//dog[@color=’white’] \| //cat[@color=’white’]	color属性为white的dog节点和color属性为white的cat节点
*	星号表示任何名字的节点或者属性	//dog/*	表示dog节点的所有子节点
*	星号表示任何名字的节点或者属性	//dog/@*	表示dog节点的所有属性节点

2. XPath数学运算符
+ 加号表示加
- 表示数字相减
* 表示乘以
div 表示除以，这里数学上的除号/已经被用作节点之间分隔符了
mod 表示取余
3. XPath逻辑运算符
= 等于，相当于c#中的 ==
!= 不等于
> 大于
>= 大于等于
< 小于
<= 小于等于
and 并且与关系
or 或者或关系
4. XPath Axes 从字面翻译这个是XPath轴的意思，但根据我的理解这个翻译成XPath节点关系运算关键字更合适，就是一组关键字加上::双冒号表示和当前节点有关系的一个或者一组节点.
使用语法： axisname::nodetest[predicate] 即轴名字::节点名字[取节点条件]
具体说明如下：

关键字	说明	示例	示例说明
ancestor	当前节点的父祖节点	ancestor::pig	当前节点的祖先节点中的pig节点
ancestor-or-self	当前节点以及其父祖节点	ancestor::pig
attribute	当前节点的所有属性	attribute::weight	相当于@weight，attribute::和@是等价的
child	当前节点的所有字节点	child::*[name()!=’price’]	选择名字不是price的子节点
descendant	子孙节点	descendant::[@]	有属性的子孙节点
descendant-or-self	子孙节点以及当前节点	descendant-or-self::*
following	Xml文档中当前节点之后的所有节点	following::*
following-sibling	当前节点的同父弟弟节点	following-sibling::
preceding	Xml文档中当前节点之前的所有节点	preceding::*
namespace	选取当前节点的所有命名空间节点	namespace::*
parent	当前节点的父节点	parent::	相当于双点..
preceding-sibling	当前节点之后的同父兄节点	preceding-sibling::*
self	当前节点	self::*	相当于单点.

5. 常用的XPath函数介绍：
在XPath表达式中常用的函数有下面两个：
position() 表示节点的序号例如 //cat[position() = 2] 表示取序号为2的dog节点
last() 表示取最后一个节点 //cat[last()]
name() 表示当前节点名字 /pets/*[name() != 'pig'] 表示/pets下名字不是pig的子节点

selectNodes() 方法用一个 XPath 查询选择节点，返回值是包含了匹配查询的节点的一个 NodeList，这个 selectNodes() 方法只用于 XML 文档节点，不用于 HTML 文档节点

selectSingleNode()方法与selectNodes() 方法基本相同，不同之处只是selectSingleNode()返回的是查询的单个节点，不是节点列表

XPath的函数还有很多，包括字符串函数，数字函数和时间函数等，具体可以参考w3的网站。
以上是XPath的语法，下面我们看下如何在.Net中使用XPath
在.Net中可以通过XPathDocument或者XmlDocument类使用XPath。XPathDocument是只读的方式定位Xml节点或者属性文本等，而XmlDocument则是可读写的。
如下代码示例展示了如何使用XPathDocument和XmlDocument。

using System;

using System.Collections.Generic;

using System.Linq;

using System.Text;

using System.Xml.XPath;

using System.Xml;

namespace UseXPathDotNet

{

    class Program

    {

        static void Main(string[] args)

        {

            UseXPathWithXPathDocument();

            UseXPathWithXmlDocument();

            Console.Read();

        }

        static void UseXPathWithXmlDocument()

        {

            XmlDocument doc = new XmlDocument();

            doc.Load("http://www.cnblogs.com/yukaizhao/rss");

            //使用xPath选择需要的节点

            XmlNodeList nodes = doc.SelectNodes("/rss/channel/item[position()<=10]");

            foreach (XmlNode item in nodes)

            {

                string title = item.SelectSingleNode("title").InnerText;

                string url = item.SelectSingleNode("link").InnerText;

                Console.WriteLine("{0} = {1}", title, url);

            }

        }

        static void UseXPathWithXPathDocument()

        {

            XPathDocument doc = new XPathDocument("http://www.cnblogs.com/yukaizhao/rss");

            XPathNavigator xPathNav = doc.CreateNavigator();

            //使用xPath取rss中最新的10条随笔

            XPathNodeIterator nodeIterator = xPathNav.Select("/rss/channel/item[position()<=10]");

            while (nodeIterator.MoveNext())

            {

                XPathNavigator itemNav = nodeIterator.Current;

                string title = itemNav.SelectSingleNode("title").Value;

                string url = itemNav.SelectSingleNode("link").Value;

                Console.WriteLine("{0} = {1}",title,url);

            }

        }

    }

}

XPath使用示例，请看下面的代码注释

using System;

using System.Collections.Generic;

using System.Linq;

using System.Text;

using System.IO;

using System.Xml;

namespace UseXPath1

{

    class Program

    {

        static void Main(string[] args)

        {

            string xml = @"<?xml version=""1.0"" encoding=""utf-8"" ?>

<pets>

  <cat color=""black"" weight=""10"" count=""4"">

    <price>100</price>

    <desc>this is a black cat</desc>

  </cat>

  <cat color=""white"" weight=""9"" count=""5"">

    <price>80</price>

    <desc>this is a white cat</desc>

  </cat>

  <cat color=""yellow"" weight=""15"" count=""1"">

    <price>110</price>

    <desc>this is a yellow cat</desc>

  </cat>

  <dog color=""black"" weight=""10"" count=""7"">

    <price>114</price>

    <desc>this is a black dog</desc>

  </dog>

  <dog color=""white"" weight=""9"" count=""4"">

    <price>80</price>

    <desc>this is a white dog</desc>

  </dog>

  <dog color=""yellow"" weight=""15"" count=""15"">

    <price>80</price>

    <desc>this is a yellow dog</desc>

  </dog>

    <pig color=""white"" weight=""100"" count=""2"">

    <price>8000</price>

    <desc>this is a white pig</desc>

    </pig>

</pets>";

            using (StringReader rdr = new StringReader(xml))

            {

                XmlDocument doc = new XmlDocument();

                doc.Load(rdr);

                //取所有pets节点下的dog字节点

                XmlNodeList nodeListAllDog = doc.SelectNodes("/pets/dog");

                //所有的price节点

                XmlNodeList allPriceNodes = doc.SelectNodes("//price");

                //取最后一个price节点

                XmlNode lastPriceNode = doc.SelectSingleNode("//price[last()]");

                //用双点号取price节点的父节点

                XmlNode lastPriceParentNode = lastPriceNode.SelectSingleNode("..");

                //选择weight*count=40的所有动物，使用通配符*

                XmlNodeList nodeList = doc.SelectNodes("/pets/*[@weight*@count=40]");

                //选择除了pig之外的所有动物,使用name()函数返回节点名字

                XmlNodeList animalsExceptPigNodes = doc.SelectNodes("/pets/*[name() != 'pig']");

                //选择价格大于100而不是pig的动物

                XmlNodeList priceGreaterThan100s = doc.SelectNodes("/pets/*[price div @weight >10 and name() != 'pig']");

                foreach (XmlNode item in priceGreaterThan100s)

                {

                    Console.WriteLine(item.OuterXml);

                }

                //选择第二个dog节点

                XmlNode theSecondDogNode = doc.SelectSingleNode("//dog[position() = 2]");

                //使用xpath ，axes 的 parent 取父节点

                XmlNode parentNode = theSecondDogNode.SelectSingleNode("parent::*");

                //使用xPath选择第二个dog节点前面的所有dog节点

                XmlNodeList dogPresibling = theSecondDogNode.SelectNodes("preceding::dog");

                //取文档的所有子孙节点price

                XmlNodeList childrenNodes = doc.SelectNodes("descendant::price");

            }

            Console.Read();

        }

    }

}

3.XPath的更多相关文章

xpath提取多个标签下的text
title: xpath提取多个标签下的text author: 青南 date: 2015-01-17 16:01:07 categories: [Python] tags: [xpath,Pyth ...
C#+HtmlAgilityPack+XPath带你采集数据(以采集天气数据为例子)
第一次接触HtmlAgilityPack是在5年前,一些意外,让我从技术部门临时调到销售部门,负责建立一些流程和寻找潜在客户,最后在阿里巴巴找到了很多客户信息,非常全面,刚开始是手动复制到Excel, ...
在Java中使用xpath对xml解析
xpath是一门在xml文档中查找信息的语言.xpath用于在XML文档中通过元素和属性进行导航.它的返回值可能是节点,节点集合,文本,以及节点和文本的混合等.在学习本文档之前应该对XML的节点,元素 ...
XPath 学习二：语法
XPath 使用路径表达式来选取 XML 文档中的节点或节点集.节点是通过沿着路径 (path) 或者步 (steps) 来选取的. 下面列出了最有用的路径表达式: 表达式描述 nodename 选 ...
xpath 学习一：节点
xpath 中,有七种类型的节点: 元素.属性.文本.命名空间.处理指令.注释.以及根节点树的根成为文档节点或者根节点. 节点关系: Parent, Children, sibling(同胞), A ...
Python爬虫利器三之Xpath语法与lxml库的用法
前面我们介绍了 BeautifulSoup 的用法,这个已经是非常强大的库了,不过还有一些比较流行的解析库,例如 lxml,使用的是 Xpath 语法,同样是效率比较高的解析方法.如果大家对 Beau ...
使用python+xpath 获取https://pypi.python.org/pypi/lxml/2.3/的下载链接
使用python+xpath 获取https://pypi.python.org/pypi/lxml/2.3/的下载链接: 使用requests获取html后,分析html中的标签发现所需要的链接在& ...
关于robotframework,app,appium的xpath定位问题及常用方法
关于类似的帖子好像很多,但是没有找到具体能帮我解决问题的办法.还是自己深究了好久才基本知道app上面的xpath定位和web上的不同点: 先放一个图: A,先说说不用xpath的场景,一般是用于存在i ...
Selenium Xpath Tutorials - Identifying xpath for element with examples to use in selenium
Xpath in selenium is close to must required. XPath is element locator and you need to provide xpath ...
xpath定位中starts-with、contains和text()的用法
starts-with 顾名思义,匹配一个属性开始位置的关键字 contains 匹配一个属性值中包含的字符串 text() 匹配的是显示文本信息,此处也可以用来做定位用 eg //input[sta ...

随机推荐

docker笔记2--镜像容器基本使用
1 docker的安装系统:centos7 (1)配置好yum (2)yum -y install docker (3)查看状态 systemctl status docker 2 docker镜像 ...
用python写一个简单的文件上传
用Pycharm创建一个django项目.目录如下: <!DOCTYPE html> <html lang="en"> <head> <m ...
VC6.0- C语言-winsocket-警告warning C4761
错误介绍操作系统:windows10 IDE:VC6.0 语言:C语言项目内容简介:编写一个双人网络海战棋对战游戏警告类型:警告warning C4761 integral size misma ...
xorm实例-创建xorm，映射
创建xorm引擎 //在xorm里面,可以同时存在多个Orm引擎,一个Orm引擎称为Engine, //一个Engine一般只对应一个数据库. //Engine通过调用`xorm.NewEngine` ...
max_prepared_stmt_count参数
MySQL报错[mysqld-5.5.17-log]Can't create more than max_prepared_stmt_count statements (current value: ...
java 简易日历表
在页面上输出1900年以后任意一年的简易日历表 package text3; import java.util.Scanner; public class MyCalendar { public st ...
centos jira wiki 开机自启
启动jira 和 wiki 需要java环境变量操作步骤: 在 /etc/rc.d/rc.local 文件中[ 注意要有可执行权限 ] export JAVA_HOME=xxxxxxxx expor ...
ubuntu 18.04安装RTX 2060 显卡驱动
第一:安装ppa的显卡驱动源 sudo add-apt-repository ppa:graphics-drivers/ppa sudo apt update 第二:检查显卡和推荐驱动 ubuntu- ...
Java开发环境搭建（二）：环境变量配置
如果不配置环境变量,java 命令就只能在 bin 目录下才能使用,而且很多Java软件也需要在配置JAVA_HOME和PATH的状态下才能运行.为了在任何目录下都可以使用 java 命令.保证程序正 ...
openwrt luci web分析
openwrt luci web分析来源 https://www.jianshu.com/p/596485f95cf2 www/cbi-bin/luci #!/usr/bin/lua --cgi的执 ...

3.XPath

3.XPath的更多相关文章

随机推荐

热门专题