I am trying to do a grab everything after the "</html>" tag and delete it, but my code doesn't seem to be doing anything. Does .replace()
not support regex?
z.write(article.replace('</html>.+', '</html>'))
No. Regular expressions in Python are handled by the re
module.
article = re.sub(r'(?is)</html>.+', '</html>', article)