Python3使用xml.dom.minidom和xml.etree模块儿解析xml文件，封装函数

总结了一下使用Python对xml文件的解析，用到的模块儿如下：

分别从xml字符串和xml文件转换为xml对象，然后解析xml内容，查询指定信息字段。

from xml.dom.minidom import parse, parseString

from xml.etree import ElementTree

import xml.dom.minidom

"""

Get XML String info 查询属性值

    response:xml string

    tag:xml tag

    element:xml attribute

"""

def get_xml_info(response, element):

    DOMTree = xml.dom.minidom.parseString(response)

    return DOMTree.documentElement.getAttribute(element)

"""

Get XML String info 查询制定名称的特定标签id

    xmlstring:xml str

    return config id

"""

def get_config_id_from_xml(xmlstring, scan):

    root = ElementTree.fromstring(xmlstring)

    configs = root.findall('config')

    for config in configs:

        config_name = config.find('name').text

        if config_name == scan:

            return config.attrib['id']

"""

Get XML String info 查询指定id

    xmlstring:xml str

    return report id

"""

def get_report_id_from_xml(xmlstring):

    root = ElementTree.fromstring(xmlstring)

    report_id = root.find('report_id').text

    return report_id

"""

Get XML String info

    xmlstring:xml str

    return progress

"""

def get_progress_from_xml(xmlstring):

    root = ElementTree.fromstring(xmlstring)

    task = root.find('task')

    progress = float(task.find('progress').text)

    if progress < 0:

        return 100.0

    else:

        return progress

"""

Get XML Report info 从xml文件查询

    file_path : report path

"""

def get_xml_report(file_path):

    report = {}

    result_dicts = {}

    resultsList = []

    try:

        root = ElementTree.parse(file_path)

    except:

        return {}

    if root is not None:

        creation_time = root.find("creation_time")

        if creation_time is not None:

            report[creation_time.tag] = creation_time.text

        if root.find("report") is not None:

            scan_start = root.find("report").find("scan_start")

            if scan_start is not None:

                if scan_start.text:

                    report[scan_start.tag] = scan_start.text

        results = root.getiterator("result")

        if results is not None:

            for result in results:

                if result.find("threat") is not None:

                    if result.find("threat").text != "Log":

                        resultsList.append(getResults(result))

    report["Results"] = resultsList

    return report

Python3使用xml.dom.minidom和xml.etree模块儿解析xml文件，封装函数的更多相关文章

xml dom minidom
一. xml相关术语: 1.Document(文档): 对应一个xml文件 2.Declaration(声明): <?xml version="1.0" encoding=& ...
python XML文件解析：用xml.dom.minidom来解析xml文件
python解析XML常见的有三种方法: 一是xml.dom.*模块,是W3C DOM API的实现,若需要处理DOM API则该模块很合适, 二是xml.sax.*模块,它是SAX API的实现,这 ...
python 应用xml.dom.minidom读xml
xml文件 <?xml version="1.0" encoding="utf-8"?> <city> <name>上海&l ...
python模块：xml.dom.minidom
"""Simple implementation of the Level 1 DOM. Namespaces and other minor Level 2 featu ...
python 之模块之 xml.dom.minidom解析xml
# -*- coding: cp936 -*- #python 27 #xiaodeng #python 之模块之 xml.dom.minidom解析xml #http://www.cnblogs.c ...
python-minidom模块【解析xml】
1,xml的文档结构 1.1,XML文档包括XML头信息和XML信息体 1.1.1,XML文档头信息 <?xml version="1.0" encoding="u ...
nodejs模块xml2js解析xml的坑
在一个项目中,用到nodejs模块xml2js解析xml,xml的数据如下: <xml> <MsgId>6197906553041859764</MsgId> &l ...
[java开发篇][dom模块] 遍历解析xml
http://blog.csdn.net/andie_guo/article/details/24844351 XML DOM节点树 XML DOM将XML文档作为树结构,树结构称为一个节点树.所有的 ...
java解析xml汇总（转自倾城幻影-Java解析xml汇总，链接：http://www.cnblogs.com/jiugehuanying/archive/2012/01/12/2320058.html）
[引言] 目前在Java中用于解析XML的技术很多,主流的有DOM.SAX.JDOM.DOM4j,下文主要介绍这4种解析XML文档技术的使用.优缺点及性能测试. [一.基础知识--扫盲] sax.do ...

随机推荐

CLR内部异常(中)
不捕捉某一个异常常常有这种情况,代码不需要捕捉异常,但需要执行一些清理或者修正操作.虽然不总是,支持物(holders)经常用在这种场景里.在支持物(holders)不适用的情况里,CLR提供了两个 ...
(浙江金华)Day 1 组合数计数
目录 Day 1 组合计数 1.组合数 (1).C(n,m) 读作n选m,二项式系数 : (2).n个东西里选m个的方案数不关心选的顺序: (3).二项式系数--->多项式系数: 2.组合数计 ...
CDH 6.0.1 版本默认配置下 HUE | happybase 无法访问 Hbase 的问题
第一个问题 HUE 无法直接连接到 HBase 在默认配置下 CDH 6.0.1 版本下的 HBase2.0 使用了默认配置 hbase.regionserver.thrift.compact = T ...
推荐一款阿里开源的 Java 诊断工具，好用到爆！
Arthas是什么鬼? Arthas是一款阿里巴巴开源的 Java 线上诊断工具,功能非常强大,可以解决很多线上不方便解决的问题. Arthas诊断使用的是命令行交互模式,支持JDK6+,Linux. ...
C# System.Net.Mail.MailMessage 发邮件
C# System.Net.Mail.MailMessage 发邮件上篇文化在哪个可以看到使用 System.Web.Mail.MailMessage 发邮件时会提示 ,提供用于构造电子邮件的属性和 ...
Grid数字或金额千分位或保留两位小数
formatter: 'number', formatoptions: { thousandsSeparator: "", decimalPlaces: 2 }
C++ Java throw goto
throw goto - 国内版 Binghttps://cn.bing.com/search?FORM=U227DF&PC=U227&q=throw+goto C++ throw 代 ...
利用lsof命令查找已经删除的文件来释放磁盘空间
测试环境一台服务器/目录空间使用率达到97%,但是通过du -sh *发现实际空间没用到那么多,初步怀疑,之前删除的文件,有运行中的进程一直占用,导致空间没有释放,如图通过du -sh *发现共实际使 ...
ubuntu18 maven
user1@user1-ThinkPad-W540:~$ user1@user1-ThinkPad-W540:~$ sudo mkdir /opt/maven[sudo] password for u ...
[译]为什么在__new __（）后总是调用__init __（）？
原文来源: https://stackoverflow.com/questions/674304/why-is-init-always-called-after-new 需要控制新实例的创建时,请使用 ...

Python3使用xml.dom.minidom和xml.etree模块儿解析xml文件，封装函数

Python3使用xml.dom.minidom和xml.etree模块儿解析xml文件，封装函数的更多相关文章

随机推荐

热门专题