Jisc Observatory report on Ebooks in Education

The joint CETIS and UKOLN Observatory has just published a report “Preparing for Effective Adoption and Use of Ebooks in Education” written by James Clay. My CETIS colleague Li and I wrote the foreword for this report, which I’ve reproduced here but really you would be better going to the observatory and downloading the whole report.

Foreword
Ebooks have been around for many years: their history can be traced back to initiatives such as Project Gutenberg in the 1970s or formats such as PDF released in the mid 1990s. Handheld ebook reader hardware has been available from the late 1990s. For much of that time, ebooks arguably had little impact outside a few areas of niche interest.

In the last few years, however, there has been an increasing stream of stories about ebooks outselling printed books by some measure or another. For example, in August 2012 it was widely reported that Amazon in the UK had sold more ebooks than hard- and paperbacks combined[1]. Although there is of course often an element of advertising hype in many of these stories, they do reflect a real shift in the popularity of ebooks.

Various visions of ebooks
This shift has largely been prompted by two developments in ebook readers: the Amazon Kindle and Apple iPad, with associated apps. In technical capabilities neither is unique. Arguably, they are not even innovative as they both do no more than bring together pre-existing technologies. Nevertheless they do represent major initiatives that demonstrate their technical potential. Interestingly, they also represent somewhat opposing visions of what an ebook is.

The Amazon Kindle is by far the most successful representative of a range of devices that adopt the approach of trying to deliver the same content as a book, in as convenient a manner as possible while maintaining the ease of reading. The emphasis is on cheap, lightweight e-readers that allow owners to carry all they could desire to read without too much thought or effort. The minimal size and weight allow the Kindle to be no more of a burden to carry than a small paperback. The screen is designed so that the reading experience is similar to that of paper, rather than that of a computer screen. The memory capacity and network connectivity are designed so that owners need never run out of material to read and need never worry about syncing content with their computers. The battery life typically extends to days of use, so owners need rarely be worried about charging the device. The emphasis is on the text, so that design elements such as font, colour, and layout may be lost. This represents one of the potential drawbacks of the Kindle approach in education since the layout of many textbooks is carefully designed to enhance the explanation being presented through choice of colour, boxed explanations of concepts as asides, or (especially in technical subjects) complex tables and equations. It is important to recognise that many of the news stories concerning the explosion of ebooks relate to the Amazon Kindle and to linear texts (such as novels) read for pleasure, rather than complex material designed for study.

Not surprisingly ( given Apple’s history of an emphasis on good-looking, well-designed products and a target market of customers who appreciate such things), Apple’s ebooks designed for the iPad (iBooks) place greater value on the appearance of the printed page. Apple’s iBooks can support full-colour, high-fidelity representations of a printed original. The iPad is capable of displaying the original look of historical manuscripts such as mediaeval bestiaries or the original handwritten and illustrated copy of Alice in Wonderland (and other rare books held by the British Library). While maintaining some of the convenience of the Kindle approach, Apple’s approach requires greater computing power, at a cost of increased weight and price — and decreased battery life. With this greater computing power also comes the opportunity toChallenges of ebooks in academic contexts go beyond what can be displayed on the printed page. An image on a printed page has to be static, whereas hardware such as the iPad allows moving images and interactive 3D models to be displayed, offering a potentially rich educational experience.

Challenges of ebooks in academic contexts
One of the challenges facing Higher and Further Education is how to respond to these possibilities. Does interactive content that can be brought into the classroom by students change the role of the course textbook? Does the facility of even the modest Kindle for sharing comments and annotations among readers allow new ways of discussing a text? Are there deep-seated human factors surrounding the way that students study, which cannot be satisfactorily replicated by ebooks and could even impede learning when using them? For example, such factors include: making notes, annotations and bookmarks; jumping around a textbook rather than reading it sequentially cover-to-cover; having several books open at one time; or just the plain familiarity of the paper-based format as compared to software navigation that has to be learnt. It does seem clear from studies so far that students in general will not welcome ebooks unless there is some clear advantage to be gained by their use[2].

There are other challenges. The change in publishing brought about by ebooks represents a challenge to publishers. It is noticeable that none of the developments in ebooks (from Project Gutenberg through to the Kindle and the iPad) have come from publishers. They are challenged by the change in publishing that ebooks represent. Typically, publishers are challenged by the difficulty of producing content for novel and varied platforms. Such interoperability issues are accentuated by the desire of some to push the interactive capabilities of ebooks as far possible (and these capabilities are important to education). Publishers are also challenged by the way that digital content changes the way in which material can be distributed and copied. This is a problem that they pass on to libraries: in essence an ebook may be “lent” out by a library numerous times without degradation or loss of availability to others, whereas a paper book can only be lent out to one person at a time and will eventually fall apart. As a result, in order to protect their income, publishers seek to limit what libraries can do with books by limiting the rights that libraries buy when they purchase a book, and by enforcing those rights through Digital Rights Management technology. Alternatively, publishers will need to redefine what libraries purchase, moving from a transfer of ownership of a copy to something more akin to rental or subscription to a service. These changes impinge on libraries’ scope for action in Higher Education.

As with any other technology in education, there are still many barriers and challenges that exist and these need to be overcome for ebooks to be adopted more widely in Higher and Further Education. This report introduces some key concepts related to ebooks in general and discusses the technical, cultural and legal challenges that need to be addressed for the successful adoption of ebooks in education. Furthermore, it also offers scenarios showing effective use of ebooks in libraries and in teaching and learning across institutions. It provides us with useful insights into the future directions of ebooks development.

footnotes
1. See, for example, this August 2012 report in The Guardian of Amazon’s announcement that “for every 100 hardback and paperback books it sells on its UK site, 114 ebooks are downloaded”.

2. For commentary on recent research into university student attitudes toward and usage of etextbooks, see this August 2012 article: “Students Find E-Textbooks ‘Clumsy’ and Don’t Use Their Interactive Features” The Chronicle of Higher Education, online . For more in-depth summary of this research, see also:Internet2 eTextbook Spring 2012 Pilot Final Project Report (1 August 2012).

Phil Barker & Li Yuan, JISC CETIS, September 2012

At the end of the JLeRN experiment

The JLeRN experiment was a toe dipped in the learning registry, a trial at different approach to sharing information about learning resources and how they are used that focusses on getting the information out there and not on worrying over the schemas and formats in which the information is conveyed. That experiment (JLeRN, not the Learning Registry as a whole) is drawing to a close, so we had a meeting earlier this week to review what had been done, what had been learnt and what was left to do and learn.

Sarah Currier had arranged for projects that had worked with JLeRN blog something about what they had done before the meeting, here’s the email with a summary of them, if you haven’t come across JLeRN before you might want to have a look through them before reading on. What I want to describe here is my own understanding of where the Learning Registry is and to report some of the issues about it raised at the meeting.

The Learning Registry: Nodes or a network?

The learning registry as a network from a presentation by Dan Rehak and others.. © Copyright 2011 US Advanced Distributed Learning Initiative: CC-BY-3.0.

The learning registry as a network from a presentation by Dan Rehak and others.. © Copyright 2011 US Advanced Distributed Learning Initiative: CC-BY-3.0.

From the outset the Learning Registry was conceived as a network, the software created would be nodes that connected together to share data about resources. Some of the details have been put on the back burner since those early descriptions, for example the ideas of communities and gateway nodes haven’t been much developed.

The community map on the Learning Registry website shows three nodes (the red pins), including the JLeRN node; Steve Midgely told us via email “There are a few development nodes out there that we know of: Agilix, Illinois Dept of Commerce and California Dept of Ed. To my knowledge there are no production nodes beyond the ones we currently run. Several companies have expressed interest in taking over our production nodes including Dell, Cisco and Amazon.” To that tally I can add the EngRich node at Liverpool. Steve adds that the only network he knows of is the LR public network. Now, I’m not sure about the other nodes, but I do know that the JLeRN and EngRich nodes haven’t interacted with the public network in any meaningful way (yet).

So I think we have to say that, to date, there isn’t really much to prove the concept of the Learning Registry as a network. There are, however some developments in the works that I think will change that, for example the Learning Registry Index, see below.

Services
The other aspect of the development of the Learning Registry against the vision shown in the diagram above is that of services being built to interact with the data in the nodes (these are shown as square in the diagram above). This is crucial since the Learning Registry is no more than plumbing to shift data around, it does nothing with that data that would interest a teacher or learner. It is left to others to develop services that meet user needs–Pat Lockley summed this up quite nicely in his presentation showing how the learning registry was targeted at developers and promoted relationships between developers, service managers and users more than was the case with traditional repository software.

“I think the major point of my slides was to suggest the learning registry is a “developer’s repository” – not that you need a developer to use it, more that you develop services around a node. Also, I feel there is a greater role for the developer in the ecosystems around a node than around a repository – the services on offer, and the scope of services you create seem richer – partially as any data can be stored.”

Well, there are some services for getting data in, there is the OAI-PMH to Learning Registry Publish Utility, and there is Pat’s RSS importer, Ramanathan, and his Google analytics data importer, Pliny. Also at least two projects–Scott Wilson’s SPAWS and Liverpool University’s EngRich–had involved the submission of data to Learning Registry nodes as part of the services they created.

But putting data in is meeting a service manager’s needs, it’s no good in itself since it doesn’t meet any user needs. There are a few user oriented services built off data in the Learning Registry. Pat showed us a couple of Chrome plugins, demos here and here. These are great as proofs of concept, and really important as such, they help show non-technical people what the learning registry is for. But there then follows some expectation management while you explain the limitations of the demonstrators. Other projects had embedded means of getting data out of the Learning Registry nodes into their project outputs, for example EngRich have an iLike widget for the Liverpool student portal that shows what resources students on specific courses have recommended based on data in their Learning Registry node.

Steve Midgely provided us with some very promising information, “the Gates foundation is funding several groups to build index and search services on top of Learning Registry (called Learning Registry Index) and that will require running nodes of some kind.”

Does it work?
One message that I picked up during the meeting and elsewhere is that the Learning Registry, as software, works. The people who set up nodes seem to have done so quickly, the people who used the APIs didn’t report problems in doing so. That’s a good place to be starting.

At a deeper level I guess we need to wait until there are more services built off the data in the Learning Registry to find out whether the Learning Registry works as a concept. Some known problems have been deliberately pushed out of scope in the development of the Learning Registry, one key one is not worrying about what formats and schemas for the data that goes in. This is good if you are submitting data, but unless some level of agreement is reached it does place the onus for making sense of the data on the people who are creating services that use the data. So far, the extent to which this (reaching agreement or making sense of arbitrary data) is possible in the context of the Learning Registry is untested.

Other questions remain over how the learning registry will function as a network, for example how duplicate and complementary records about the same resource will be dealt with when many people might be providing information about the same resource.

Why use it?
Owen Stephens and David Kay were at the meeting asking some very pertinent questions. Neither are particularly caught up in the education technology world, with more of a background in information systems for libraries, where of course there are different approaches to solving similar problems. So, why use the Learning Registry rather than raw couchDB, or some other schemaless, NoSQL, document store (e.g. MongoDB, which is popular for research data management), or free text indexing and search software such as Lucene/Solr, or RDF triple stores, or just a traditional relational database with SQL? To some extent the aim at the moment is to try and answer some of those questions: we won’t know if we don’t try it. But it’s valid to ask how far have we got to answering them, and here is my appraisal.

RDF?
Schemaless sharing of data still appeals to me because I don’t think we know what schema we want to use to share some of the interesting information about the use of resources for teaching and learning. I think the RDF approach will influence the data that is submitted, for example there is interest in using the Learning Registry to store LRMI style metadata. LRMI is adding properties to schema.org so that educational characteristics of resources can be described, and schema.org is only a step or two away from semantic web approaches such as RDF. But some influences of RDF we don’t want. For example there is a tendency at times for RDF approaches to fixate on ontologies. That would stall us. So, for example in LRMI it is possible to say that a resource “aligns” with some point in an educational framework: i.e. it is useful for teaching some topic in a standard curriculum, or assessing some skill required by a competency framework. That’s really useful, but the vocabulary for the nature of the alignment has had to be left open (“teaches” and “assess” are two suggested terms, others are that the resource has a certain “text complexity” or requires a “reading level” or other “educational level”)–the understanding of what education is about varies so much over the world and between settings that agreement on a closed ontology seems unattainable. Still, you could use RDF if you didn’t specify and ontology, and if you could make sense of the RDF without one.

Another weakness of RDF in this context, as I understand it, is its ability to deal with subjective opinions. As soon as a teacher or learner sees an assertion that resource X is good for teaching topic Y (to continue the example used above) they should be asking “says who”. Engineering students at Liverpool are more interested in what other Engineering students find useful, especially those at Liverpool, than they are in the opinions of physics students. Yes, you can have named graphs in RDF and provide information about who asserted which triples, but it goes beyond what is usual, whereas in it is built in from the start in the Learning Registry concept of paradata.

All of that is somewhat conjectural though, because as yet there is little in the Learning Registry that is not metadata that could be expressed in some standard schema such as LOM XML or DC RDF.

Other schemaless data stores
Why not use just CouchDB, without the Learning Registry API, or MongoDB, or Lucene? All of these would make sense for single instance data stores, which is pretty much what we have now with single more-or-less isolated nodes rather than a network. And, yes, I am sure that some way of sharing data between them could be worked up if that is what you wanted. So again any advantages of the Learning Registry is still putative at this stage.

One advantage of the Learning Registry is that, as I mentioned above, it does seem to work: it does seem to come out of the package as a functional way of storing and sharing data that is tailored to education. So as an introduction to No SQL databases it’s not a bad place for the education community to start.

In summary
In a post about the end of the JLeRN project David Kay has quoted Simon Schama on his not being sure whether the French Revolution was over. I’ll quote what Chairmain Mao supposedly said when asked what he thought of the French Revolution; “it’s too early to tell”. The things to look out for are a functioning network of nodes and user-facing services being delivered from data in those nodes. Then we can ask whether that data could be shared in any other way. For the time being I think that the main achievement of JLeRN and the UK’s involvement in the Learning Registry is that it has started people thinking about alternatives to relational databases and they have taken first steps into working with these. Too often, I think, data has been squeezed into an relational data where the benefits of doing so are simply that it is what the developer happens to be familiar with. If all you have is a hammer then you can have real problems dealing with screws.

[updated to correct an attribution error as to who was comparing JLeRN to the French revolution]