Kurt McKee

lessons learned in production

Articles tagged with feedparser

feedparser 5.2.0 | 24 April 2015

I'm pleased to announce the release of feedparser 5.2.0!

It's available only on the Python Package Index (PyPI) as Google Code is shutting down its services. The update incorporates over two years of work and patches from multiple authors.

Some of the release highlights include what will hopefully …

Twelve days later | 19 April 2015

It's been twelve days since my shoulder surgery so I thought I'd post a summary of what I've been up to.

In the first 24 hours there was an issue that cropped up at work. The problem was caused by an update to some software released by another group in …

The state of feedparser | 3 January 2015

feedparser is near and dear to my heart but I haven't touched it in quite a while. There are a number of personal and professional reasons for this which I won't bore you with here, but those same forces are now allowing and compelling me back. Personally, I'm needing to …

Date parsing | 17 December 2012

I have lost patience with the RFC 822 date parsing in both feedparser and listparser. Back in 2009 when I started writing listparser I decided to use regular expressions to turn RFC 822 date strings into Python datetime objects. Earlier this year when I discovered that feedparser's RFC 822 parser …

Announcing feedparser 5.1.3 | 9 December 2012

Howdy everybody!

I'm pleased to announce the release of feedparser 5.1.3! This version features a number of crash fixes, minor documentation fixes, and improved date parsing and character encoding handling. You'll also find that two HTML5 element attributes are no longer stripped by the HTML sanitizer. You can …

Getting back into it (part 6) | 29 November 2012

Finally! I discovered what the problem was with Ubuntu and old versions of Python (I had to create symbolic links for libz.so.1 and libreadline.so.5 from /lib/x86_64-linux-gnu to /lib). That resolved all of the problems I was experiencing with the older versions' interactive shells. I also …

Getting back into it (part 5) | 21 November 2012

A day or two ago I reviewed almost every bug report that's accumulated in the feedparser issue tracker. Several of them I've marked NeedInfo, but almost all of the rest of them are valid and will be addressed.

I'm expecting that there will be an opportunity to begin fetching and …

Getting back into it (part 4) | 5 November 2012

I made time this weekend to continue reading Pro Git, and I paid special attention to dealing with remote branches. I learned quite a bit, and I think I'm nearly to the point that I can competently deal with the pull requests I've received. I also created a new SSH …

Getting back into it (part 3) | 27 October 2012

This post is about software development, but I'm disappointed to say it's not about feedparser or listparser development.

I'm back to working 14+ hour days, and much of my time has been spent writing automation scripts in a custom scripting language that can be interpreted by Tera Term. It has …

Getting back into it (part 2) | 16 October 2012

I was invited to participate in the gpodder hackfest last Saturday by joining the group on IRC. I had a blast talking with the group! In the invitation it was suggested that I could join and talk about my plans for feedparser. I've thought a lot about serious architectural issues …

After losing so, so much in a catastrophic hard drive failure, I've found it difficult to get my development environment back up and running. Time was, I could open a terminal and everything was ready to go: command completion, test suite, every version of Python, BeautifulSoup, lxml, Sphinx...I had …

Howdy everybody,

I'm pleased to announce that feedparser 5.1.2 is available for immediate download, and I want to stress the word "immediate"! This is a security release, and all users and developers are strongly encouraged to upgrade immediately.

Security fix

Yesterday while working on a character encoding bug …

Announcing feedparser 5.1.1 | 20 March 2012

I'm pleased to announce the release of feedparser 5.1.1!

This release contains a number of important changes that range from fixed crasher bugs to improved date and time string parsing. There are also some bug fixes that affect how various elements are parsed.

Unit test fixes

Several Linux …

Announcing feedparser 5.1 | 3 December 2011

I'm pleased to announce the release of feedparser 5.1! This release includes a boatload of bug fixes, support for Python 3.2 and partial support for Jython, extensive unit test refactoring, and a massive documentation conversion to ReST format. You can download the release at one of the following …

Announcing feedparser 5.0.1 | 20 February 2011

I'm pleased to announce that feedparser 5.0.1 is now available for download!

This is a security release, and I encourage everyone to upgrade as soon as possible. Three sanitizer-related issues were fixed in this release:

  • Issue 91 (unexpected characters in XML declarations cause crashes)
  • Issue 254 (sanitization can …

Announcing feedparser 5.0 | 25 January 2011

I'm pleased to announce that feedparser 5.0 is now available for download!

New features

It's been five years since feedparser 4.1 was released, and there's been a lot of work put into it.

  • HTML5, XHTML, SVG, MathML, and CSS can now be sanitized. Sam Ruby's added a lot …

The problem with Mercurial | 17 January 2011

There's been some talk on the feedparser mailing list about moving off of Subversion, and I think that's great! I used Subversion back when I was contributing to another open source project in 2006 and it was excruciating. Subversion's utter inability to merge made contributing a grueling chore, and I'll …

Feedparser has a bunch of unit tests...somewhere in the vicinity of 4400 (tells you when the last release was when the homepage says there are only 3000!). My concern is that the vast majority are duplicates that differ in one key aspect: one version is wellformed XML and the …

There's been a lot of discussion going on in an already-closed feature request for feedparser regarding how best to support a URL request timeout, and I thought it would be good to summarize the issue and what options are available.

The problem

The problem revolves around Python's default request timeout …

After going through many, many feedparser bugs (over 100 closed since I started working on the project, with more waiting to be reviewed!) I've seen a lot of reports asking for additional support for various namespaces. Off the top of my head the list includes GeoRSS, the iTunes Music Store …

Feedparser porting status | 23 November 2010

I spent some time this weekend working on porting feedparser to Python 3, and found that it will be difficult because there are two separate parsers included (a strict parser and a loose parser), and while each works differently, both use the same core machinery in feedparser.

With the strict …

Porting feedparser to Python 3 | 19 November 2010

Today I'm announcing I'm throwing my hat in the ring to port feedparser to Python 3. There's a ticket open regarding this at the feedparser bug tracker, but the person who's working on porting it appears to be writing Python 3 code instead of writing Python 2 code that the …