Skip to content
  • Daniel Veillard's avatar
    Keep non-significant blanks node in HTML parser · f933c898
    Daniel Veillard authored
    For https://bugzilla.gnome.org/show_bug.cgi?id=681822
    
    Regardless if the option HTML_PARSE_NOBLANKS is set or not, blank nodes
    are removed from a HTML document, for example:
    
    <html>
      <head>
        <title>This is a test.</title>
      </head>
      <body>
        <p>This is a test.</p>
      </body>
    </html>
    
    is read as:
    
    <html><head><title>This is a test.</title></head><body>
        <p>This is a test.</p>
      </body></html>
    
    This changes the default behaviour but the old behaviour is available
    as expected when using the parser flag HTML_PARSE_NOBLANKS
    
    Based on original patch from Igor Ignatyuk <igor_ignatiouk@hotmail.com>
    
    * HTMLparser.c: change various places in the parser where ignorable_space
      SAX callback was called without checking for the parser flag preference
    * xmllint.c: make sure we use the new flag even for HTML parsing
    * result/HTML/*: this modifies the output of a number of tests
    f933c898