I am currently sitting at the dining table of Peter Murray-Rust with Egon Willighagen opposite me talking to Jean-Claude Bradley. We pulling together sets of data from Jean-Claude’s UsefulChem project into CML to make it more semantically rich and do a bunch of cool stuff. Jean-Claude has a recently published preprint on Nature Precedings of a paper that has been submitted to JoVE. Egon was able to grab the InChiKeys from the relevant UsefulChem pages and passing those to CDK via a script that he wrote on the spot (which he has also just blogged) generated CML react for those molecules.
...research already in progress is opened up to allow labs anywhere in the world to contribute experiments. The deeply networked nature of modern laboratories, and the brief down-time that all labs have between projects, make this concept quite feasible.
"Open Data Excuse" Bingo
We might want to use it in a paper People may misinterpret the data Thieves will use it There's no API
I don't mind, but someone else might Lawyers want a custom License It's too complicated We will get too many enquiries
It's too big Terrorists will use it Poor Quality There's already a project to...
What if we want to sell it later It's not very interesting Data Protection We'll get spam
For open data teams; print out a copy and put it on your office wall. Cross out each excuse people give you. There are no prizes, but you can tweet "bingo! #openDataExcuses" if you think it might make you feel better
Generate your own bingo grids at http://data.dev8d.org/devbingo/
OpenRefine (formerly Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
T. Heck, I. Blümel, S. Fahrer, D. Lohner, J. Schneider, and L. Visser. (April 2020)German title: Öffene Praktiken in Wissenschaft und Lehre – eine Diskussion mit den Anwendern*innen aus der Praxis" The file includes a German version..