iPhylo: Creative Commons

Roderic D. M. Page

Showing posts with label Creative Commons. Show all posts

Wednesday, June 24, 2015

Visualising Geophylogenies in Web Maps Using GeoJSON

Fig3 GoogleMaps CC BY no logo 300x205 I've published a short note on my work on geophylogenies and GeoJSON in PLoS Currents Tree of Life:

Page R. Visualising Geophylogenies in Web Maps Using GeoJSON. PLOS Currents Tree of Life. 2015 Jun 23 . Edition 1. doi:10.1371/currents.tol.8f3c6526c49b136b98ec28e00b570a1e.

At the time of writing the DOI hasn't registered, so the direct link is here. There is a GitHub repository for the manuscript and code.

I chose PLoS Currents Tree of Life because it is (supposedly) quick and cheap. Unfortunately a perfect storm of delays in reviewing together with licensing issues resulted in the paper taking nearly three months to appear. The licensing issues were a headache. PLoS uses the Creative Commons CC-BY license for all its content. Unfortunately, the original submission included maps from Google Maps and Open Street Map (OSM), to show that the GeoJSON produced by my tool could work with either. Google Maps tile imagery is not freely available, so I had to replace that in order for PLoS to be able to publish my figures. At first I used simply replaced the tiles Google Maps displays with ones from OSM, but those tiles are CC-BY-SA, which is incompatible with PLoS's use of CC-BY. Argh! I got stroppy about this on Twitter:

FFS. So it appears I can't use either Google Maps or Open Street Map in a @PLOSCurrents article. Open licensing somehow feels worse than ©
— Roderic Page (@rdmpage) June 16, 2015

Eventually I discovered maps from CartoDB that have CC-BY licenses, and so could be used in the PLoS Currents article. After replacing Google's and OSM tiles with these maps (and trimming off the "Google" logo) the figures were acceptable to PLoS. Increasingly I think Creative Commons has resulted in a mess of mutually incompatible licenses that make mashing up things hard. The idea was great ("skip the intermediaries" by declaring that your content can be used), but the outcome is messy and frustrating.

But, enough grumbling. The article is out, the code is in GitHib. Now to think about how to use it.

Monday, August 05, 2013

GBIF and open biodiversity data: what license should GBIF use?

GBIF is asking for views on how it should license of data in the GBIF network. The full consultation document is available from Google Drive and DropBox. GBIF is:

...seeking input from all GBIF Participants and stakeholders on the following questions:
Do you have any comments on the plan to associate all GBIF-mediated data with a machine readable licence?
Do you have an opinion on the relative merits of Creative Commons, Open Data Commons or other licence types in the context of the GBIF network?
Which of the two options described in section 8 of this document should GBIF pursue? If you support “Option 2”, would your position be modified if it resulted in a significant decrease in data published to the GBIF network?

The two options referred to above are:

Option 1 – Support restrictions on commercial use

Option 2 – Only support fully free-and-open data

If you have opinions on licensing biodiversity data, please read the consultation document and send your thoughts send to licensing@gbif.org by 5 September 2013.

Tuesday, January 11, 2011

Why won't The Plant List won't let me do this?

In my last post I discussed why I thought the decision of The Plant List to use a restrictive license (CC-BY-NC-ND) was such a poor choice. CC-BY-NC-ND states that

You may not alter, transform, or build upon this work.

To make this point more concrete, I've created this site:

Experiments with The Plant List

to show the kinds of things that The Plant List's choice of license prevents the taxonomic community from doing. As a first step I'm exploring linking the names in the list to the primary scientific literature, as this video demonstrates:

The Plant List from Roderic Page on Vimeo.

For example, we can take a name like Begonia zhengyiana Y.M.Shui, parse the bibliographic citation provided by The Plant List (via IPNI), and locate the actual paper online, in this case it's freely available as a PDF:

Now we can see a drawing of the plant, and instead of simply trusting that the compilers of The Plant List have correctly interpreted this paper, we can see for ourselves. Down the track, we could imagine mining this paper for details about the plant, such as its morphology and geographic distribution. This requires the link to the original literature, which The Plant List lacks.

A good chunk of the recent plant taxonomic literature has DOIs, for example journals such as the Kew Bulletin and Novon. Playing with some scripts I've managed to associate nearly 9000 accepted names with a DOI, and that's by looking at only a few journals. There are lots more DOIs to be found, but because of the way botanical nomenclators record references (see my post Nomenclators + digitised literature = fail) it can be something of a challenge to find them. This task isn't helped by the fairly lax way some publishers enter data in CrossRef (Cambridge University Press I'm looking at you). The other obvious source of digitised literature is, of course, BHL, and that's next on the list of resources to play with.

Experiments with The Plant List is very crude, and I've barely scratched the surface of linking names to primary literature. That said, given that there are exactly zero links between names and digital literature in The Plant List, I'd argue that my site adds value to the data in that The Plant List. And that's my point — by making data available for others to play with, you enable others to add value to that data. By choosing a CC-BY-NC-ND license, The Plant List has killed that possibility.

So, my question for The Plant List is "why did you do that?"

Wednesday, December 29, 2010

The Plant List: nice data, shame it's not open

The Plant List (http://www.theplantlist.org/) has been released today, complete with glowing press releases. The list includes some 1,040,426 names. I eagerly looked for the Download button, but none is to be found. You can grab download individual search results (say, at family level), but not the whole data set.

OK, so that makes getting the complete data set a little tedious (there are 620 plant families in the data set), but we can still do it without too much hassle (in fact, I've grabbed the complete data set while writing this blog post). Then I see that the data is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs (CC BY-NC-ND) license. Creative Commons is good, right? In this case, not so much. The CC BY-NC-ND license includes the clause:

You may not alter, transform, or build upon this work.

So, you can look but not touch. You can't take this data (properly attributed, or course) and build your own list, for example with references linked to DOIs, or to the Biodiversity Heritage Library (which is, of course, exactly what I plan to do). That's a derivative work, and the creators of the Plant List don't want you to do that. Despite this, the Plant List want us to use the data:

Use of the content (such as the classification, synonymised species checklist, and scientific names) for publications and databases by individuals and organizations for not-for-profit usage is encouraged, on condition that full and precise credit is given to The Plant List and the conditions of the Creative Commons Licence are observed.

Great, but you've pretty much killed that by using BY-NC-ND. Then there's this:

If you wish to use the content on a public portal or webpage you are required to contact The Plant List editors at editors@theplantlist.org to request written permission and to ensure that credits are properly made.

Really? The whole point of Creative Commons is that the permissions are explicit in the license. So, actually I don't need your permission to use the data on a public portal, CC BY-NC-ND gives me permission (but with the crippling limitation that I can't make a derivative work).

So, instead of writing a post congratulating the Royal Botanic Gardens, Kew and Missouri Botanical Garden (MOBOT) for releasing this data, I'm left spluttering in disbelief that they would hamstring its use through such a poor choice of license. Kew and MOBOT could have made the Plant List available as open data using one of the licenses listed on the Open Definition web site, such as putting the data in the public domain (for example, or using a Creative Commons CC0 license). Instead, they've chosen a restrictive license which makes the data closed, effectively killing the possibility for people to build upon the effort they've put into creating the list. Why do biodiversity data providers seem determined to cling to data for dear life, rather than open it up and let people realise its potential?

Monday, February 02, 2009

Thoughts on the Wellcome Interactive Tree of Life

Last night BBC One aired David Attenborough's Charles Darwin and the Tree of Life, which featured a lovely "fly through" the tree of life:

In conjunction with the TV show, the Wellcome Trust has launched the Interactive Tree of Life, a Flash-based view of the tree of life. There's also a blog about the project. Here's a demo of the tree:

The tree looks very nice, and a lot of work has gone into it, but I am somewhat underwhelmed. The tree itself is tiny, and does a poor job of conveying the relative diversity of life (e.g., no plants, bacteria, few arthropods, etc.). It displays the tree on a 2D plane, and the user can move relative to that plane. I'm not convinced this is the best way to display large trees. Something modelled on Perceptive Pixel's demo might be more useful. I blogged about this last year, but the video host service has disappeared. You can see the tree display 50 seconds in to the video below:

There also seems to be some confusion over the license used. The disclaimer states that it is Creative Commons Attribution-Noncommercial 2.5 Generic, but the link is to Attribution-Noncommercial-No Derivative Works 3.0. There's a big difference. The disclaimer encourages us to remix it, the linked license says that we cannot.

Out of curiosity I grabbed the code from the web site (a 1.5Gb file) and had a quick look. The bulk of the files are media, such as images, movies, and 3D Maya models. There's some nice stuff here. The actual tree itself is there in New Hampshire eXtended format. Here it is displayed in TreeView X:

Here's the NHX tree itself.

((((((((((((((((((((((((((((((Antelope:25.38857142856932 [&&NHX:id=51129:tol=Y ], Sheep:25.38857142856932 [&&NHX:id=51093:tol=Y ]):4.231428571428751 [&&NHX:id=51154 ], Cow:29.619999999998072 [&&NHX:id=51150 ])Bovidae:33.26333333333336 [&&NHX:id=50878:tol=Y ], Pig:62.883333333331485 [&&NHX:id=51287:tol=Y ]):0.978333333333334, Hippopotamus:63.86166666666491 [&&NHX:id=30366 ]):2.445833333333335, Camel:66.30749999999811 [&&NHX:id=30350:tol=Y ])Artiodactyla:7.337500000000006 [&&NHX:id=15976:tol=Y ], Humpback_whale:73.64499999999816 [&&NHX:id=16054 ]):9.663854166666676 [&&NHX:id=15975 ], (((Horse:72.14374999999836 [&&NHX:id=16255 ], Rhino:72.14374999999836 [&&NHX:id=16262 ])Perissodactyla:5.153125000000017 [&&NHX:id=15980:tol=Y ], Elephant:77.29687499999827 [&&NHX:id=22667:tol=Y ]):4.294270833333347 [&&NHX:id=15979 ], Aardvark:81.59114583333152 [&&NHX:id=16881 ]):1.717708333333339):15.761701388888907 [&&NHX:id=15974 ], ((((Brown_bear:40.2599999999984 [&&NHX:id=123666:tol=Y ], Polar_bear:40.2599999999984 [&&NHX:id=123667:tol=Y ])Ursidae:11.18333333333332 [&&NHX:id=16015:tol=Y ], Seal:51.443333333331715 [&&NHX:id=16020 ]):4.473333333333327, Dog:55.91666666666514 [&&NHX:id=16013 ])Caniformia:11.18333333333332 [&&NHX:id=16011 ], (Cat:40.259999999998215 [&&NHX:id=123531:tol=Y ], Lion:40.259999999998215 [&&NHX:id=123566:tol=Y ])Felidae:26.839999999999964 [&&NHX:id=16006:tol=Y ])Carnivora:31.9705555555556 [&&NHX:id=15971:tol=Y ]):0.15222222222222132, ((Mole:86.06453703703531 [&&NHX:id=16213 ], Shrew:86.06453703703531 [&&NHX:id=16223 ]):3.759497354497354, Hedgehog:89.82403439153265 [&&NHX:id=16211 ])Insectivora:9.398743386243385 [&&NHX:id=15968:tol=Y ]):0.3805555555555533, (((Chimpanzee:9.143333333331611 [&&NHX:id=26565 ], Human:9.143333333331611 [&&NHX:id=16421:tol=Y ]):82.69666666666664 [&&NHX:id=16412 ], Tree_shrew:91.83999999999833 [&&NHX:id=50808 ]):4.1099999999999755 [&&NHX:id=15962 ], Fruit_bat:95.94999999999834 [&&NHX:id=16076 ]):3.6533333333333116 [&&NHX:id=15961 ]):1.14166666666666, ((Hamster:56.967999999998256 [&&NHX:id=16546 ], Rat:56.96799999999821 [&&NHX:id=50732 ])Eumuroida:33.211999999999804 [&&NHX:id=16528 ], Rabbit:90.17999999999788 [&&NHX:id=16227 ])Glires:10.565000000000104 [&&NHX:id=15957 ]):16.484999999999623 [&&NHX:id=15955 ], Kangaroo:117.22999999999797 [&&NHX:id=16248 ]):31.38499999999999 [&&NHX:id=15993 ], Platypus:148.61499999999816 [&&NHX:id=16253:tol=Y ]):39.50999999999999 [&&NHX:id=15990 ], Megazostrodon:4.0625 [&&NHX:id=15032:nsz=2:ncol=red:ext=Y ])Cynodontia:32.5 [&&NHX:id=15030 ], Dimetrodon:4.0625 [&&NHX:id=14972:nsz=2:ncol=red:ext=Y ])Sphenacodontoidea:73.125 [&&NHX:id=14971 ], (((((((((((((Flamingo:106.30859375000043 [&&NHX:id=89474:tol=Y ], Toucan:106.3085937500002 [&&NHX:id=93330:tol=Y ]):1.4296874999999905, Penguin:107.73828125000037 [&&NHX:id=57223:tol=Y ])Neoaves:3.5742187499999765 [&&NHX:id=26305:tol=Y ], Duck:111.31250000000024 [&&NHX:id=89298:tol=Y ])Neognathae:4.289062499999972 [&&NHX:id=26291 ], Ostrich:115.60156250000016 [&&NHX:id=26289:tol=Y ])Neornithes:4.289062499999972 [&&NHX:id=15834:tol=Y ], Ichthyornis:4.386242378048764 [&&NHX:id=15833:nsz=2:ncol=red:ext=Y ])Euornithes_true_birds:12.867187499999915 [&&NHX:id=15829 ], Archaeopteryx:2.144531249999986 [&&NHX:id=15824:nsz=2:ncol=red:ext=Y ])Aves:12.867187499999915 [&&NHX:id=15721:tol=Y ], Tyrannosaurus_Rex:79.47061157226744 [&&NHX:id=15889:nsz=2:ncol=red:ext=Y ]):74.99999999999773 [&&NHX:id=15713 ], Diplodocus:157.62263488769486 [&&NHX:id=15756:nsz=2:ncol=red:ext=Y ])Saurischia:9.374999999999716 [&&NHX:id=15724 ], Iguanadon:165.44959259033112 [&&NHX:id=15740:nsz=2:ncol=red:ext=Y ])Dinosauria:20.400000000000002 [&&NHX:id=14883:tol=Y ], Crocodile:250.39999999999782 [&&NHX:id=14868 ])Archosauria:22.950000000000003 [&&NHX:id=14900:tol=Y ], Snake:273.34999999999803 [&&NHX:id=17563 ])Sauria:12.75 [&&NHX:id=14913 ], Brouffia:1.275 [&&NHX:id=14865:nsz=2:ncol=red:ext=Y ]):5.1 [&&NHX:id=14864 ], Tortoise:291.1999999999979 [&&NHX:id=17631 ])Reptilia:2.55 [&&NHX:id=14846 ])Amniota:56.25000000000091 [&&NHX:id=14990:tol=Y ], (Amphibians:317.3333333333312 [&&NHX:id=14940 ], Newt:317.3333333333311 [&&NHX:id=82771 ])Living_amphibians:32.66666666666758 [&&NHX:id=14997:tol=Y ])Tetrapoda:9.00000000000009 [&&NHX:id=14987 ], Seymouriamorpha:14.610465116279109 [&&NHX:id=17554:nsz=2:ncol=red:ext=Y ]):40.87500000000041 [&&NHX:id=14985 ], (Acanthostega:1.8750000000000189 [&&NHX:id=15016:tol=Y:nsz=2:ncol=red:ext=Y ], Icthyostega:1.8750000000000189 [&&NHX:id=15015:tol=Y:nsz=2:ncol=red:ext=Y ]):0.7500000000000075):13.125000000000131 [&&NHX:id=14976 ], Panderichthys:2.2500000000000226 [&&NHX:id=14951:nsz=2:ncol=red:ext=Y ]):4.500000000000045 [&&NHX:id=14950 ], Eusthenopteron:2.2500000000000226 [&&NHX:id=14949:nsz=2:ncol=red:ext=Y ]):9.00000000000009 [&&NHX:id=14948 ], ((Paddle_fish:363.6363636363626 [&&NHX:id=68750 ], Sturgeon:363.6363636363626 [&&NHX:id=68749 ])Acipenseriformes:24.242424242424242 [&&NHX:id=68726:tol=Y ], Clown_fish:387.8787878787866 [&&NHX:id=52149 ]):38.621212121213034 [&&NHX:id=68709 ])Osteichthyes:9.00000000000009 [&&NHX:id=14921 ], Shark:435.5 [&&NHX:id=14925 ])Node_1:13.944444444444489 [&&NHX:id=14919 ], Cephalaspidida:19.261904761904795 [&&NHX:id=16894:nsz=2:ncol=red:ext=Y ])Node_3:28.333333333333336 [&&NHX:id=14840 ], Pteraspis:46.45061728395067 [&&NHX:id=16929:nsz=2:ncol=red:ext=Y ])Node_1:9.444444444444445 [&&NHX:id=14833 ], Lamprey:487.22222222222223 [&&NHX:id=15919 ])Vertebrata:18.88888888888889 [&&NHX:id=14829:tol=Y ], Pikaia:506.1111111111113 [&&NHX:id=14824 ]):158.00000000000003 [&&NHX:id=14822 ], ((((((((Fruit_fly:339.4285714285717 [&&NHX:id=10610 ], Peacock_butterfly:339.42857142857144 [&&NHX:id=94054 ]):7.071428571428555 [&&NHX:id=8224 ], Wasp:346.5000000000002 [&&NHX:id=11244 ]):28.28571428571422 [&&NHX:id=8223 ], Cockroach:374.7857142857142 [&&NHX:id=8544 ])Neoptera:7.071428571428555 [&&NHX:id=8267:tol=Y ], Giant_dragonfly:10.607142857142833 [&&NHX:id=13265:nsz=2:ncol=red:ext=Y ])Pterygota:98.80952380952378 [&&NHX:id=8210:tol=Y ], Brine_shrimp:480.6666666666665 [&&NHX:id=6387 ]):43.81073446327684 [&&NHX:id=2527 ], ((((Harevest_mite:496.40677966101725 [&&NHX:id=2612 ], Spider:496.4067796610171 [&&NHX:id=2788:tol=Y ]):8.864406779661008 [&&NHX:id=2542 ], Harvestman:505.27118644067804 [&&NHX:id=2556 ])Arachnida:8.864406779661008 [&&NHX:id=2536:tol=Y ], Sea_Scorpion:39.88983050847454 [&&NHX:id=8174:nsz=2:ncol=red:ext=Y ])Chelicerata:7.387005649717507 [&&NHX:id=2535 ], Millipede:521.5225988700563 [&&NHX:id=52849 ]):2.954802259887003)Arthropoda:40.855932203389834 [&&NHX:id=2469:tol=Y ], (Opabinia:555.9111111111108 [&&NHX:id=20357 ], Velvet_worm:555.9111111111108 [&&NHX:id=20356 ])Onychophora:9.422222222222217 [&&NHX:id=2470:tol=Y ]):70.55555555555556 [&&NHX:id=2468 ], Acoel:635.8888888888889 [&&NHX:id=20383 ]):28.222222222222225)Bilateria:85.88888888888889 [&&NHX:id=2459:tol=Y ], Sponge:750.0 [&&NHX:id=20438 ])Animals:2250.0 [&&NHX:id=2374:tol=Y ], E._coli:3000.0 [&&NHX:id=2306 ])Life_on_earth;

It's great to see creative people tackling the challenge of displaying the tree of life. I just not convinced that this is the best way to do it.

Thursday, December 11, 2008

Yes We Can - "scientists are the ultimate remixers"

The Science Commons has released a short video by Jesse Dylan, who made the Yes We Can video.

Monday, October 27, 2008

A Shared Culture

The A Shared Culture video from the Creative Commons web site.