Hmm I didn't know about dbpedia but it sounds like it could definitely be useful. Anything that provides an interface and supports queries is preferable to web scraping. I'm not settled on MusicBrainz as I haven't done much research into the range of music databases available - I just figured it must be pretty decent if last.fm relies on it. I'll definitely look into foo_discogs too.
This is pretty ambitious, I hope I can get started on it soon. And all suggestions/recommendations are welcome