Even more text-parser updates

  • Fixed many small bugs.
  • Pages can no longer be too wide to fit on screen.
  • Very long words can no longer overflow the right edge.
  • Adjusted styling on blockquotes to italicize those that are relatively short. This helps distinguish multiple short blockquotes from the surrounding text.
  • Adjusted styling on list elements to avoid indenting entire articles when the article’s container is itself a list element.
  • Added support for IDN hostnames.
  • Improved filter accuracy in a lot of small ways.
  • Made the filter less aggressive to significantly reduce the chances of ever stripping legitimate body text. I’d rather do it this way, even though it lets in more extraneous content.

I actually reached the bottom of the pile of text-parser reports for the first time. Thanks for the continued reports. You guys are awesome.

Related Posts