All datasets generated from the experimental crowdsourcing projects hosted on this platform are made available under a CC0 license.

There are three types of dataset available, all of which can be downloaded in JSON or CSV format:

  • Tasks: The initial task input.
  • Task Runs: All contributions made so far.
  • Results: The final results data, following any analysis of the completed tasks.

We are keen for these datasets to be used in innovative ways, perhaps to further research into new technologies. For instance, the digitised playbills alongside the final results might prove useful for testing pattern recognition applications, such as those using OCR or NER technologies.

A report containing some user and contribution statistics is also available for each project.

Head over to the forum, email or contact @LibCrowds to let us and others know how you have made use of the data, or if you have any further enquiries.


Download the project data.

There are no records to show