
SLOG of Carl Heaton Posts

Parsing www.parliament.uk using PHP Simple HTML DOM Parser

I needed a list of the websites of all UK members of parliament, generated in a way that can be re-run so the list stays reasonably up to date.

http://www.parliament.uk/mps-lords-and-offices/mps/ lists all the current members, so I wrote a simple PHP script using http://simplehtmldom.sourceforge.net/ to parse the list of members, fetch each member's profile and extract the name, website and Twitter handle.

This script, in all its hackiness, may be found below (best run via php-cli to minimise timeout or memory issues), with example output at http://slog.carlheaton.co.uk/list-of-all-uk-members-of-parliament-name-website-twitter-handle/
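As a rough illustration of the profile-parsing step (not the original script), here is a minimal sketch using Simple HTML DOM. The sample markup, the selectors (`h1`, `a.website`, `a.twitter`) and the function name `parse_member_profile` are all assumptions made up for the example; the real www.parliament.uk pages will need their own selectors. It assumes `simple_html_dom.php` has been downloaded next to the script.

```php
<?php
// Sketch only: the markup and selectors below are invented for illustration
// and will not match the real www.parliament.uk pages.
require_once 'simple_html_dom.php'; // from http://simplehtmldom.sourceforge.net/

// Hypothetical profile markup standing in for a fetched member page.
$sampleProfile = <<<HTML
<div id="profile">
  <h1>Jane Example MP</h1>
  <a class="website" href="http://www.example.org/">Website</a>
  <a class="twitter" href="https://twitter.com/janeexample">Twitter</a>
</div>
HTML;

// Hypothetical helper: pull name, website and Twitter handle out of one profile.
function parse_member_profile($markup)
{
    $html = str_get_html($markup);
    if (!$html) {
        return null; // empty or unparseable input
    }

    // find($selector, 0) returns the first match, or null if there is none.
    $name    = $html->find('h1', 0);
    $website = $html->find('a.website', 0);
    $twitter = $html->find('a.twitter', 0);

    $member = array(
        'name'    => $name ? trim($name->plaintext) : '',
        'website' => $website ? $website->href : '',
        // Keep only the last path segment of the Twitter URL as the handle.
        'twitter' => $twitter ? basename(parse_url($twitter->href, PHP_URL_PATH)) : '',
    );

    $html->clear(); // free the DOM; helps with memory when looping over many pages
    return $member;
}

print_r(parse_member_profile($sampleProfile));
```

For live pages, the same function could be fed the markup of each profile URL collected from the member list, for example by loading the list with `file_get_html()` and looping over the links it contains. Calling `clear()` after each page is one way to keep memory use down over several hundred profiles.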
