Remove all empty html elements using PHP: DOMDocument

Question

Is there any way to remove all empty elements from an HTML document without using regex?

I did this with DOMXPath:

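// Parse the fragment without adding implied <html>/<body> wrappers or a doctype.
$this->dom->loadHTML($document, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$xpath = new \DOMXPath($this->dom);
// Match elements with no child elements, no attributes, and no non-whitespace text.
// Re-run the query until nothing matches, since removing a node can leave its parent empty.
while (($node_list = $xpath->query('//*[not(*) and not(@*) and not(text()[normalize-space()])]')) && $node_list->length) {
    foreach ($node_list as $node) {
        $node->parentNode->removeChild($node);
    }
}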
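
The snippet above depends on a surrounding class (`$this->dom`, `$document`). As a minimal standalone sketch of the same approach, assuming a hypothetical helper named `removeEmptyElements()` and sample input with a single root element, it could look like this:

```php
<?php
// Hypothetical helper; the name and signature are assumptions, not part of the original post.
function removeEmptyElements(string $html): string
{
    $dom = new DOMDocument();

    // Collect libxml parse warnings internally instead of emitting PHP warnings.
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);
    // No child elements, no attributes, no non-whitespace text.
    $query = '//*[not(*) and not(@*) and not(text()[normalize-space()])]';

    // Repeat until a pass finds nothing left to remove.
    while (($nodes = $xpath->query($query)) && $nodes->length) {
        foreach ($nodes as $node) {
            $node->parentNode->removeChild($node);
        }
    }

    return trim($dom->saveHTML());
}

$result = removeEmptyElements('<div><p>Keep me</p><p> </p><span></span></div>');
echo $result, PHP_EOL;
```

Note that elements such as `<br>` or `<hr>` written without attributes also satisfy this XPath expression, so they are removed as well, which may or may not be what you want.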

You should be able to use DOMDocument, as you tagged. Do you have some code you are having issues with?
– chris85, Commented Nov 1, 2016 at 19:19

Simbiat · Accepted Answer · 2021-03-29 12:11:32Z


It may not be obvious that the author has already answered the question by editing the post, and I can't comment to ask for the question to be closed appropriately, so I am copying the same code here as an actual answer.
One important point: the comments refer to another topic, but the solution there works only for flat documents, while the OP's solution also works with deep trees. It helped me quite a bit.

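// Parse the fragment without adding implied <html>/<body> wrappers or a doctype.
$this->dom->loadHTML($document, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$xpath = new \DOMXPath($this->dom);
// Match elements with no child elements, no attributes, and no non-whitespace text.
// Re-run the query until nothing matches, since removing a node can leave its parent empty.
while (($node_list = $xpath->query('//*[not(*) and not(@*) and not(text()[normalize-space()])]')) && $node_list->length) {
    foreach ($node_list as $node) {
        $node->parentNode->removeChild($node);
    }
}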
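
To illustrate the point about deep trees, here is a small sketch that counts how many times the query has to run before nothing is left to remove. The sample markup and the `$passes` counter are made up for illustration; the XPath expression and the loop are the ones from the code above.

```php
<?php
// Illustrative markup: an empty <span> nested three levels deep.
$html = '<div><ul><li><span></span></li></ul><p>Text</p></div>';

$dom = new DOMDocument();
$dom->loadHTML($html, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$xpath = new DOMXPath($dom);
$query = '//*[not(*) and not(@*) and not(text()[normalize-space()])]';

$passes = 0; // Number of passes that actually removed something.
while (($nodes = $xpath->query($query)) && $nodes->length) {
    foreach ($nodes as $node) {
        $node->parentNode->removeChild($node);
    }
    $passes++;
}

// A single pass would only remove <span>; <li> and <ul> become empty
// only after their child has been removed, so they need later passes.
echo $passes, PHP_EOL;
echo trim($dom->saveHTML()), PHP_EOL;
```

Each pass only sees elements that are already empty at query time, which is why a single query followed by one removal loop is not enough for nested markup.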

