Python – XMLファイルを読み込む(DOMの例)
この例では、XMLファイルを読み込み、その値をPythonの `xml.dom.minidom`を介して出力する方法を示します。
1. XMLファイル
単純なXMLファイルで、後でPythonの `minidom`で解析します。
staff.xml
<?xml version="1.0"?>
<company>
<name>Mkyong Enterprise</name>
<staff id="1001">
<nickname>mkyong</nickname>
<salary>100,000</salary>
</staff>
<staff id="1002">
<nickname>yflow</nickname>
<salary>200,000</salary>
</staff>
<staff id="1003">
<nickname>alex</nickname>
<salary>20,000</salary>
</staff>
</company>
2. DOMの例1
単純なPythonの `minidom`の例です。
dom-example.py
from xml.dom import minidom
doc = minidom.parse("staff.xml")
# doc.getElementsByTagName returns NodeList
name = doc.getElementsByTagName("name")[0]print(name.firstChild.data)
staffs = doc.getElementsByTagName("staff")
for staff in staffs:
sid = staff.getAttribute("id")
nickname = staff.getElementsByTagName("nickname")[0] salary = staff.getElementsByTagName("salary")[0] print("id:%s, nickname:%s, salary:%s" %
(sid, nickname.firstChild.data, salary.firstChild.data))
出力
Mkyong Enterprise id:1001, nickname:mkyong, salary:100,000 id:1002, nickname:yflow, salary:200,000 id:1003, nickname:alex, salary:20,000
3. DOMの例2
さらに別の `minidom`の例です。
dom-example2.py
from xml.dom import minidom
doc = minidom.parse("staff.xml")
def getNodeText(node):
nodelist = node.childNodes
result =[] for node in nodelist:
if node.nodeType == node.TEXT__NODE:
result.append(node.data)
return ''.join(result)
name = doc.getElementsByTagName("name")[0]print("Node Name : %s" % name.nodeName)
print("Node Value : %s \n" % getNodeText(name))
staffs = doc.getElementsByTagName("staff")
for staff in staffs:
sid = staff.getAttribute("id")
nickname = staff.getElementsByTagName("nickname")[0] salary = staff.getElementsByTagName("salary")[0] print("id:%s, nickname:%s, salary:%s" %
(sid, getNodeText(nickname), getNodeText(salary)))
出力
Node Name : name Node Value : Mkyong Enterprise id:1001, nickname:mkyong, salary:100,000 id:1002, nickname:yflow, salary:200,000 id:1003, nickname:alex, salary:20,000
4.プロジェクトデモ
参考文献
: xml.dom.minidom].
https://docs.python.org/3.4/library/xml.dom.html#dom-nodelist-objects
[Python
: Objects in the DOM]