admin管理员组

文章数量:1530028

from lxml import etree
doc = etree.parse('1.html') 

报错lxml.etree.XMLSyntaxError: Input is not proper UTF-8, indicate encoding !

 

把代码修改一下即可:

par = etree.HTMLParser(encoding="utf-8")
doc = etree.parse('1.html', parser=par)

本文标签: 报错etreelxmlhtmlXMLSyntaxError