14

I'm writing some XML with element tree.

I'm giving the code an empty template file that starts with the XML declaration:<?xml version= "1.0"?> when ET has finished making its changes and writes the completed XML its stripping out the declarion and starting with the root tag. How can I stop this?

Write call:

ET.ElementTree(root).write(noteFile)

2 Answers 2

26

According to the documentation:

write(file, encoding="us-ascii", xml_declaration=None, method="xml")

Writes the element tree to a file, as XML. file is a file name, or a file object opened for writing. encoding 1 is the output encoding (default is US-ASCII). xml_declaration controls if an XML declaration should be added to the file. Use False for never, True for always, None for only if not US-ASCII or UTF-8 (default is None). method is either "xml", "html" or "text" (default is "xml"). Returns an encoded string.

So, write(noteFile) is explicitly telling it to write an XML declaration only if the encoding is not US-ASCII or UTF-8, and that the encoding is US-ASCII; therefore, you get no declaration.

I'm guessing if you didn't read this much, your next question is going to be "why is my Unicode broken", so let's fix both at once:

ET.ElementTree(root).write(noteFile, encoding="utf-8", xml_declaration=True)
Sign up to request clarification or add additional context in comments.

3 Comments

Ah! OK, I didn't realise it was completely reforming the document - thank you, that's very useful to know.
Well, it's up to the ElementTree implementation to decide whether to generate the entire XML from scratch or to reuse existing pieces. I believe lxml has some pretty smart caching, so if you just read in a small file and write it back out, it reuses the entire root node. But the declaration isn't part of the root node, so there's really no way it could get around that.
If above accepted solution don't work for you, please check the answer of @Olli as his little tweak saved my day
6

There are different versions of ElementTree. Some of them accept the xml_declaration argument, some do not.

The one I happen to have does not. It emits the declaration if and only if encoding != 'utf-8'. So, to get the declaration, I call write(filename, encoding='UTF-8').

Comments

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.