0% found this document useful (0 votes)

61 views

XML Sem 3

The document describes how to parse and extract information from an XML file using the xml.etree.ElementTree module in Python. It provides an example XML file containing food item data, then demonstrates how to parse the file, extract the root element and child elements, retrieve attribute values, and extract text from elements. The document also shows how to write XML data to a file using the ElementTree module.

Uploaded by

prashanth kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

61 views

XML Sem 3

Uploaded by

prashanth kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Reading XML File :-

<?xml version="1.0" encoding="UTF-8"?>

<metadata>
<food>
<item name="breakfast">Idly</item>
<price>$2.5</price>
<description>
Two idly's with chutney
</description>
<calories>553</calories>
</food>
<food>
<item name="breakfast">Paper Dosa</item>
<price>$2.7</price>
<description>
Plain paper dosa with chutney
</description>
<calories>700</calories>
</food>
<food>
<item name="breakfast">Upma</item>
<price>$3.65</price>
<description>
Ravaupma with bajji
</description>
<calories>600</calories>
</food>
<food>
<item name="breakfast">BisiBele Bath</item>
<price>$4.50</price>
<description>
BisiBele Bath with sev
</description>
<calories>400</calories>
</food>
<food>
<item name="breakfast">Kesari Bath</item>
<price>$1.95</price>
<description>
Sweet rava with saffron
</description>
<calories>950</calories>
</food>
</metadata>

The above example shows the contents of a file which I have named as ‘Sample.xml’

Python XML Parsing Modules

Python allows parsing these XML documents using two modules namely, the
xml.etree.ElementTree module and Minidom (Minimal DOM Implementation). Parsing means to
read information from a file and split it into pieces by identifying parts of that particular XML file.
xml.etree.ElementTree Module:
This module helps us format XML data in a tree structure which is the most natural
representation of hierarchical data. Element type allows storage of hierarchical data structures in
memory and has the following properties:

Property Description

It is a string representing the type of data

Tag
being stored

Consists of a number of attributes stored as

Attributes
dictionaries

A text string having information that needs

Text String
to be displayed

Tail String Can also have tail strings if necessary

Consists of a number of child elements

Child Elements
stored as sequences

ElementTree is a class that wraps the element structure and allows conversion to and from XML.
Let us now try to parse the above XML file using python module.

There are two ways to parse the file using ‘ElementTree’ module. The first is by using the parse()
function and the second is fromstring() function. The parse () function parses XML document
which is supplied as a file whereas,fromstring parses XML when supplied as a string i.e within
triple quotes.

Using parse() function:-

As mentioned earlier, this function takes XML in file format to parse it. Take a look at the following
example:

importxml.etree.ElementTree as ET
mytree = ET.parse('Sample.xml')
myroot = mytree.getroot()
print(myroot)
As you can see, The first thing you will need to do is to import the xml.etree.ElementTree module.
Then, the parse() method parses the ‘Sample.xml’ file. The getroot() method returns the root
element of ‘Sample.xml’.

To check for the root element, you can simply use the print statement as follows:

OUTPUT:

<Element ‘metadata’ at 0x033589F0>

The above output indicates that the root element in our XML document is ‘metadata’.
Using fromstring() function:
You can also use fromstring() function to parse your string data. In case you want to do this, pass
your XML as a string within triple quotes as follows:

importxml.etree.ElementTree as ET
data='''<?xml version="1.0" encoding="UTF-8"?>
<metadata>
<food>
<item name="breakfast">Idly</item>
<price>$2.5</price>
<description>
Two idly's with chutney
</description>
<calories>553</calories>
</food>
</metadata>
'''
myroot = ET.fromstring(data)
#print(myroot)
print(myroot.tag)

You can also slice the tag string output by just specifying which part of the string you want to see
in your output.

EXAMPLE:

print(myroot.tag[0:4])

OUTPUT:

Finding Elements of Interest:

The root consists of child tags as well. To retrieve the child of the root tag, you can use the
following:

print(myroot[0].tag)

OUTPUT: food
Now, if you want to retrieve all first-child tags of the root, you can iterate over it using the for loop
as follows:

for x in myroot[0]:

print(x.tag, x.attrib

OUTPUT:

item {‘name’: ‘breakfast’}

price {}
description {}
calories {}

All the items returned are the child attributes and tags of food.

To separate out the text from XML using ElementTree, you can make use of the text attribute. For
example, in case I want to retrieve all the information about the first food item, I should use the
following piece of code:

for x in myroot[0]:

print(x.text)

OUTPUT:

Idly
$2.5
Two idly’s with chutney
553

As you can see, the text information of the first item has been returned as the output. Now if you
want to display all the items with their particular price, you can make use of the get() method.
This method accesses the element’s attributes.

EXAMPLE:

for x in myroot.findall('food'):
item =x.find('item').text
price = x.find('price').text
print(item, price)
OUTPUT:

Idly $2.5
Paper Dosa $2.7
Upma $3.65
BisiBele Bath $4.50
Kesari Bath $1.95

The above output shows all the required items along with the price of each of them. Using
ElementTree, you can also modify the XML files.
Writing XML Documents:-

Using ElementTree

ElementTree is also great for writing data to XML files. The code below shows how to create an
XML file with the same structure as the file we used in the previous examples.

The steps are:

1. Create an element, which will act as our root element. In our case the tag for this element is "data".

2. Once we have our root element, we can create sub-elements by using the SubElement function. This
function has the syntax:

SubElement(parent, tag, attrib={}, **extra)

Here parent is the parent node to connect to, attrib is a dictionary containing the element
attributes, and extra are additional keyword arguments. This function returns an element
to us, which can be used to attach other sub-elements, as we do in the following lines by
passing items to the SubElement constructor.

3. Although we can add our attributes with the SubElement function, we can also use the
set() function, as we do in the following code. The element text is created with the text
property of the Element object.

4. In the last 3 lines of the code below we create a string out of the XML tree, and we write
that data to a file we open.
Example code:

Import xml.etree.cElementTree as ET
root = ET.Element("data")
doc = ET.SubElement(root,"food")

ET.SubElement(doc, "item", name="breakfast").text = "idly"

ET.SubElement(doc, "price").text = "25"
ET.SubElement(doc, "description").text = "Two idly's with chutney"

doc = ET.SubElement(root,"food")
ET.SubElement(doc, "item", name="breakfast").text = "Dosa"
ET.SubElement(doc, "price").text = "35"
ET.SubElement(doc, "description").text = "one dosa with chutney"

tree = ET.ElementTree(root)
tree.write("FILE3.xml")
Executing this code will result in a new file, "FILE3.xml", which
should be equivalent to the original "Sample.xml" file, at least
in terms of the XML data structure. You'll probably notice that
it the resulting string is only one line and contains no
indentation,

Grasshopper Getting Started Guide v.1.1 PDF
No ratings yet
Grasshopper Getting Started Guide v.1.1 PDF
10 pages
Python Language 581 860
No ratings yet
Python Language 581 860
280 pages
Introduction to XML
No ratings yet
Introduction to XML
44 pages
Python XML Processing With LXML
No ratings yet
Python XML Processing With LXML
56 pages
Lecture 6 - Semi-Structured Data
No ratings yet
Lecture 6 - Semi-Structured Data
67 pages
Use of Xpath in PHP Fahmida Yesmin Haniwriter
No ratings yet
Use of Xpath in PHP Fahmida Yesmin Haniwriter
8 pages
Lab05 2024 Boustil DSS
No ratings yet
Lab05 2024 Boustil DSS
2 pages
Xpath: Supplier "Mother" Id "1"
No ratings yet
Xpath: Supplier "Mother" Id "1"
3 pages
Python XML Processing With LXML
No ratings yet
Python XML Processing With LXML
52 pages
unit 2
No ratings yet
unit 2
50 pages
Pythonlearn-13-WebServices Python
No ratings yet
Pythonlearn-13-WebServices Python
54 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Data Structures and Algorithm
From Everand
Data Structures and Algorithm
Knowledge Flow
No ratings yet
Elementtree XML Api: Library Version: Library Scope: Named Arguments
No ratings yet
Elementtree XML Api: Library Version: Library Scope: Named Arguments
18 pages
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
dsaa49
No ratings yet
dsaa49
1 page
Document Type Definitions XML Schema
No ratings yet
Document Type Definitions XML Schema
44 pages
10 xml
No ratings yet
10 xml
44 pages
W3schools Xpath PDF
No ratings yet
W3schools Xpath PDF
32 pages
XPath Tutorial
No ratings yet
XPath Tutorial
11 pages
Path - Expressions: XML Code, Lemonade2.xml
No ratings yet
Path - Expressions: XML Code, Lemonade2.xml
2 pages
The XML Example Document: View The "Books - XML" File in Your Browser
No ratings yet
The XML Example Document: View The "Books - XML" File in Your Browser
4 pages
Pylxml
No ratings yet
Pylxml
56 pages
XML Parser - XML Viewer - XML Editor PDF
No ratings yet
XML Parser - XML Viewer - XML Editor PDF
5 pages
Xquery and Xpath 2
No ratings yet
Xquery and Xpath 2
25 pages
Urdf
No ratings yet
Urdf
18 pages
DA Unit 4
No ratings yet
DA Unit 4
46 pages
XSL Primer
From Everand
XSL Primer
Stephen Cote
No ratings yet
XML Document Rule, XML Structuring, XML Presentation Technologies
No ratings yet
XML Document Rule, XML Structuring, XML Presentation Technologies
53 pages
XML Document Rule, XML Structuring, XML Presentation Technologies
No ratings yet
XML Document Rule, XML Structuring, XML Presentation Technologies
53 pages
Using XML With PHP (Part2)
No ratings yet
Using XML With PHP (Part2)
18 pages
Lecture 17 XML and XPATH and XQUERY
No ratings yet
Lecture 17 XML and XPATH and XQUERY
93 pages
LXML
No ratings yet
LXML
488 pages
Xquery Tutorial
No ratings yet
Xquery Tutorial
20 pages
Write Down The Syntax Rules For XML Declaration
No ratings yet
Write Down The Syntax Rules For XML Declaration
35 pages
INFO-3138 Tutorial 9 - XPath in C Sharp
No ratings yet
INFO-3138 Tutorial 9 - XPath in C Sharp
6 pages
IT3031-L04-XMLDB
No ratings yet
IT3031-L04-XMLDB
11 pages
Xquery Tutorial: What You Should Already Know
No ratings yet
Xquery Tutorial: What You Should Already Know
21 pages
Unit 7 XML
No ratings yet
Unit 7 XML
15 pages
Simplifying Data Science With Python
From Everand
Simplifying Data Science With Python
Billy David millican
No ratings yet
10 Lessons in Front-end
From Everand
10 Lessons in Front-end
Krasimir Tsonev
2/5 (1)
SAX Parsing With Python
No ratings yet
SAX Parsing With Python
3 pages
Java Programming Tutorial With Screen Shots & Many Code Example
From Everand
Java Programming Tutorial With Screen Shots & Many Code Example
Desmond Ohwofosirai
No ratings yet
DWV_UNIT_II
No ratings yet
DWV_UNIT_II
37 pages
Web Scraping
No ratings yet
Web Scraping
11 pages
Unit 3xml
No ratings yet
Unit 3xml
19 pages
Tree-Based Parsers
No ratings yet
Tree-Based Parsers
15 pages
Xpath Tutorial and Reference
No ratings yet
Xpath Tutorial and Reference
5 pages
XSLT and XPath
100% (6)
XSLT and XPath
15 pages
Engineering The Web - My Notes
No ratings yet
Engineering The Web - My Notes
60 pages
Xmlschema PDF
No ratings yet
Xmlschema PDF
42 pages
Unit 5 (Web)
No ratings yet
Unit 5 (Web)
7 pages
Handout 3
No ratings yet
Handout 3
7 pages
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
From Everand
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
Charlie Masterson
No ratings yet
Ian Talks JS A-Z: WebDevAtoZ, #1
From Everand
Ian Talks JS A-Z: WebDevAtoZ, #1
Ian Eress
No ratings yet
Web Technologies Notes Unit 3
No ratings yet
Web Technologies Notes Unit 3
18 pages
Beginner Tutorial - 10
No ratings yet
Beginner Tutorial - 10
14 pages
Semistructured Data Extensible Markup Language Document Type Definitions
No ratings yet
Semistructured Data Extensible Markup Language Document Type Definitions
34 pages
XPath Introduction
No ratings yet
XPath Introduction
12 pages
Cse2045Y Web Application Development: XML (Extensible Markup Language)
No ratings yet
Cse2045Y Web Application Development: XML (Extensible Markup Language)
18 pages
PYTHON Data Science Internal
No ratings yet
PYTHON Data Science Internal
2 pages
Fit Model Paper 1
100% (1)
Fit Model Paper 1
1 page
Machine Learning Unit Wise Important Questions
100% (2)
Machine Learning Unit Wise Important Questions
2 pages
Important Questions Sem1 Physics
100% (1)
Important Questions Sem1 Physics
2 pages
Physics 3rd Year Pre Final
No ratings yet
Physics 3rd Year Pre Final
1 page
Data Eng With Python Internal
No ratings yet
Data Eng With Python Internal
2 pages
IVY Professional School: Program: KPO Training Module: Introduction To Session: 1 & 2
No ratings yet
IVY Professional School: Program: KPO Training Module: Introduction To Session: 1 & 2
21 pages
Operating Systems: History
No ratings yet
Operating Systems: History
9 pages
Qoriq Ls1028A Reference Design Board Reference Manual: Supports Ls1028Ardb Revision C
No ratings yet
Qoriq Ls1028A Reference Design Board Reference Manual: Supports Ls1028Ardb Revision C
107 pages
IP Version 3.3 Update Notes
No ratings yet
IP Version 3.3 Update Notes
9 pages
Freescale Sabre Lite User Manaul V1.3
No ratings yet
Freescale Sabre Lite User Manaul V1.3
49 pages
Mymsinfo Rezultata
No ratings yet
Mymsinfo Rezultata
101 pages
Exploiting Vulnerbilities in ESXi - Preauth RCE and Sandbox Escape.
No ratings yet
Exploiting Vulnerbilities in ESXi - Preauth RCE and Sandbox Escape.
68 pages
Basic Elements of C++ PDF
No ratings yet
Basic Elements of C++ PDF
12 pages
Unix
No ratings yet
Unix
47 pages
William Stallings Computer Organization and Architecture 10 Edition
No ratings yet
William Stallings Computer Organization and Architecture 10 Edition
34 pages
DFT Strategy For Arm Cores
No ratings yet
DFT Strategy For Arm Cores
6 pages
Microservices On Aws
No ratings yet
Microservices On Aws
35 pages
Roland VS-880EX Digital Studio Service Manual
No ratings yet
Roland VS-880EX Digital Studio Service Manual
20 pages
Mpi - Lab No - 1
No ratings yet
Mpi - Lab No - 1
7 pages
Fanuc 6m
No ratings yet
Fanuc 6m
3 pages
Using Python Libraries
No ratings yet
Using Python Libraries
18 pages
3 - XYZ Network Design and Presentation
No ratings yet
3 - XYZ Network Design and Presentation
7 pages
DC Poweredge R740 Server (Dell (TM) Poweredge (TM) R740 Rack Mount Server - Non DB DC DR)
No ratings yet
DC Poweredge R740 Server (Dell (TM) Poweredge (TM) R740 Rack Mount Server - Non DB DC DR)
32 pages
Icom Cloning and Data Cables
No ratings yet
Icom Cloning and Data Cables
2 pages
Hybrid Converter: Instruction Manual
No ratings yet
Hybrid Converter: Instruction Manual
27 pages
Xinorbis6 User Manual
No ratings yet
Xinorbis6 User Manual
73 pages
Don - Bosco Automation 1
No ratings yet
Don - Bosco Automation 1
36 pages
05b-EPAS Components Overview-C5 Technical Overview
No ratings yet
05b-EPAS Components Overview-C5 Technical Overview
69 pages
CISCO Lista de Precios DS3 COMUNICACIONES
100% (1)
CISCO Lista de Precios DS3 COMUNICACIONES
17 pages
Compal Confidential: QML70 Schematics Document
No ratings yet
Compal Confidential: QML70 Schematics Document
53 pages
Assignment #2 Question 1: SWOT Analysis For ACER Strengths Weaknesses
No ratings yet
Assignment #2 Question 1: SWOT Analysis For ACER Strengths Weaknesses
5 pages
OpenScape Voice V7 - Service Manual - Installation and Upgrades - Installation Guide - Issue 16 PDF
100% (3)
OpenScape Voice V7 - Service Manual - Installation and Upgrades - Installation Guide - Issue 16 PDF
837 pages
Air Mouse Android SEO Template
No ratings yet
Air Mouse Android SEO Template
4 pages
Java CTS Dumps 2
No ratings yet
Java CTS Dumps 2
28 pages

XML Sem 3

Uploaded by

XML Sem 3

Uploaded by

Reading XML File :-

<?xml version="1.0" encoding="UTF-8"?>

Python XML Parsing Modules

It is a string representing the type of data

Consists of a number of attributes stored as

A text string having information that needs

Tail String Can also have tail strings if necessary

Consists of a number of child elements

Using parse() function:-

<Element ‘metadata’ at 0x033589F0>

Finding Elements of Interest:

item {‘name’: ‘breakfast’}

The steps are:

SubElement(parent, tag, attrib={}, **extra)

ET.SubElement(doc, "item", name="breakfast").text = "idly"

You might also like