³ÉÈËÂÛ̳

Archives for April 2010

Data Art on Infosthetics

Post categories:

Ian Forrester Ian Forrester | 17:51 UK time, Thursday, 22 April 2010

Comments (3)

VisualCommunication_1271955005739.png

Read the rest of this entry

When is a dataset not a dataset? The hackday project that crowdsourced data.gov.uk

Post categories: ,Ìý,Ìý,Ìý

Dr Ian McDonald Dr Ian McDonald | 12:31 UK time, Thursday, 22 April 2010

Comments

Tom Morris and other participants at the end of the hackday

When is a dataset not a dataset? How many of the now 3241 datasets listed as part of are easy to open up and play with? How many are tables for computers to analyse, instead of PDF reports for people to read?

Ìý

The Ìýfilled a Channel 4 office with journalists and developers on the final Friday in January. Our aim was to tell new stories with open data. Attendees already hadÌýform - the ³ÉÈËÂÛ̳'s Open Secrets blogger Martin Rosenbaum, and data journalism teams from the Times, the Guardian, and the FT. judged our attempts in his role as head of hosts , alongsideÌýMy Society boss Tom Steinberg. They to my team's analysis of Tory candidates. But another project promised to shed light on public data in the UK.

Ìý

was part of a team that looked into the quality of data.gov.uk. Although data.gov.uk advertises itself as a database of open datasets, many of the entries are . He built a prototype format checker that invites people to go through datasets and record the file format.ÌýYou can listen to him explaining the checker to me and to the hackday, or reuse under the .

Ìý

In order to see this content you need to have both Javascript enabled and Flash installed. Visit ³ÉÈËÂÛ̳ Webwise for full instructions. If you're reading via RSS, you'll need to visit the blog to access this content.

Ìý

On Wednesday February 3rd, he put a completed quality checker online. On that Thursday, the crowd had gone through data.gov.uk and marked up all of the datasets.

Ìý

Tom posted his initial breakdown to the data.gov.uk community on March 20th:

HTML -252
XML -5
Word - 4
RTF - 1
OpenOffice -1
Something odd - 85
JSON - 9
Nothing there! - 190
CSV - 12
Multiple formats - 1211
PDF - 468
RDF - 10
Excel - 408
TOTAL - 2656
Sadly, this is over-optimistic. I've manually checked some of the data that has been categorised as JSON and RDF. Most of it is not actually correctly categorised - either people clicked, say, 'RDF' when they meant to click 'PDF', or they have seen an RSS or Atom feed and categorised it as RDF. What this admittedly imperfect dataset is basically saying is that the vast majority of the 'data' on data.gov.uk is not actually machine-readable data but human-readable documents.

He will be at the this weekend, where he will speak about and might do the analysis, which he told me was the most important part. When done, it will be very interesting indeed to read it.

³ÉÈËÂÛ̳ Backstage: five year retrospective

Post categories: ,Ìý

Ian Forrester Ian Forrester | 21:45 UK time, Thursday, 15 April 2010

Comments (2)

Backstage, is approaching five years old believe it or not. So to celebrate I have asked Social technologist of the popular blog to put together a retrospective. Don't worry there will be data and mashups but we also want you all to share with us your stories and memories of the last five years

So the first project is image-based: We are looking for your favourite photos and images of Backstage and the stories behind them. The images might be a photo from a Backstage event that you really enjoyed, or a screenshot of a prototype you developed or a visualisation of ³ÉÈËÂÛ̳ data that you put together. We don't mind what type of image it is, just so long as it's online and you can tell us a bit about it.

The second project is map-based: We'd like you to tell us what your favourite experiences of Backstage were. Perhaps a prototype you put together, an event you went to, or something else completely. We'd also like to know where you are based (at whatever level of detail you feel comfortable) so that we can see how far Backstage reached. When Backstage first launched it was mainly for the UK only but the internationalisation of Backstage was overwelming, so it would be great to see how far we're really talking.

Both mash-ups are based on Google Docs so the two forms are embedded in the page after the last link, or you can go straight to the pages directly by following... or . In both cases, if you add info to the spreadsheets we take that to mean that you're happy for us to reuse your contribution.

Read the rest of this entry

Prototype: ³ÉÈËÂÛ̳ + Data.gov.uk mashup

Post categories:

Ian Forrester Ian Forrester | 17:41 UK time, Wednesday, 7 April 2010

Comments

The mashup on bbc news

Our friends at Rewired State, recently had a hackday where which,
Publishes links to relevant data.gov.uk datasets next to news articles on the ³ÉÈËÂÛ̳ website. Provides important context for those articles and increased visibility for the datasets. Implemented as a simple greasemonkey Firefox script connecting to a simple search service built with Google's ajax search api.
Not content with that Ben's already thinking about packaged it as a firefox toolbar rather than a greasemonkey plug-in. Moving away from reliant on google's search apis. and of course, if it supported more websites. There's also a potential to add crowd-sourced citations too.

Prototype: ³ÉÈËÂÛ̳ Archiver

Post categories:

Ian Forrester Ian Forrester | 17:20 UK time, Wednesday, 7 April 2010

Comments (2)

³ÉÈËÂÛ̳ Homepage archived

There is something amazing about looking at stacks of data over a period of time, and ³ÉÈËÂÛ̳ Archiver does exactly that. Some of you might even remember something like it called the ³ÉÈËÂÛ̳ home page archiveÌý but James Holden's latest project snapshots the whole page and allows you to view the changes in animated way.

The is here and the .

James explains how it works,

The project is running on a C# app I wrote to correctly screen capture the page (harder than you'd of thought) and then using a local webserver it FTP's (via PHP) the resulting 3 images (thumb, medium and large) to the live server.Ìý The comparison tool (a link at the bottom of each image which is easily missed at the moment) Ìýis written and runs on the live server to compare the visual changes, written in PHP/GD.Ìý Obviously I haven't spent any time on the front-end site so that would be the next logical step.

In my head I'd see this as being the ultimate tool for archive.org.Ìý If you go "way back" using their tool you can see that resources are missing and indeed as the browser changes rapidly the result you see in newer browsers doesn't represent the look and feel the user got at the time which is an important point if you trying to look back at the way it was.Ìý Loading Netscape.com in Mosaic back in the mid 90's would have been an altogether different experience than in today's "Chrome's and Firefox's"

Fantastic prototype, which we hope he can keep running for a long time to come.

More from this blog...

Categories

These are some of the popular topics this blog covers.

³ÉÈËÂÛ̳ iD

³ÉÈËÂÛ̳ navigation

³ÉÈËÂÛ̳ © 2014 The ³ÉÈËÂÛ̳ is not responsible for the content of external sites. Read more.

This page is best viewed in an up-to-date web browser with style sheets (CSS) enabled. While you will be able to view the content of this page in your current browser, you will not be able to get the full visual experience. Please consider upgrading your browser software or enabling style sheets (CSS) if you are able to do so.