Future of Interoperability Standards – factors contributing to reuse

Posted on October 4, 2010 by Simon Grant

Reuse requires awareness: to support that, how?

As my inputs to the CETIS FIS meeting on 24th September were partly to do with extensibility and reuse, I facilitated a small but select group initially charged with talking about extensibility and reuse. It included Alan Paull, a colleague very active in the XCRI and HEAR work, and two others who I did not know previously, one (Roger) from a large computer business, and one (Neil) ploughing his own furrow. And rather than talking about the mechanics of extensibility and reuse, we found ourselves pulled back to more human issues.

A key emerging point, that perhaps deserves more attention, is that before anyone can reuse or extend another specification or standard, they have to know of it, and then know about it. How do people actually get to know about specs that they might find reuseful, or extendable? You can make a standard perfectly extendable, or reusable, but if the appropriate people do not know about it, it will not be reused or extended. What can we do about that?

It was suggested:

publish case studies of good practice
support the community of practice
maintain a standards map, functional and technical
signpost related standards

We do much of this already, of course, in CETIS, but it is still notable that many people (including Roger and Neil) only came across this meeting by accident. In HE, maybe CETIS is not unknown or unreachable, but outside, how do we reach people?

The next major point we came up with was that XML is not the best vehicle for extendability and reuse. There is a tendency for people to be lulled into writing their own XML schemas – a practice that CETIS has warned against for some time – and it is very easy to create XML schemas in a way that is hard to extend or reuse.

To address this, Roger indicated that his big company was already very interested in Semantic Web ideas. The underlying structure of RDF (not RDF/XML!) naturally lends itself to decomposing complex structures into nodes and links. The problem of extension largely disappears, but the problem of reuse remains, in that to get reuse of Semantic Web information, people have either to use the same URIs (both for subjects and properties) or to set up and use links to indicate equivalence. owl:sameAs is of course useful (see sameAs.org) but not a panacea. I have been saying for a long time that we need to be using something like skos:exactMatch and skos:closeMatch. So, perhaps we need to focus on

tools to help people put in the links for the linked data
helping people define the links in the first place
understanding other difficulties that seem to be present, and overcoming them
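
As a minimal sketch of what link-following tools of this kind might do: two sources describe the same skill under different URIs, and a single skos:exactMatch link lets a consumer merge what is said about it. Triples are held as plain Python tuples, and every example.org and example.com URI is hypothetical; only the SKOS and OWL property URIs are real.

```python
# Sketch only: all example.org / example.com URIs below are hypothetical.
SKOS_EXACT_MATCH = "http://www.w3.org/2004/02/skos/core#exactMatch"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
EQUIVALENCE_PROPERTIES = {SKOS_EXACT_MATCH, OWL_SAME_AS}

ORG_SKILL = "http://example.org/skills/teamwork"
COM_SKILL = "http://example.com/competences/42"

triples = [
    (ORG_SKILL, "http://example.org/terms/label", "Team working"),
    (COM_SKILL, "http://example.com/vocab/level", "3"),
    # The one link that somebody has to put in by hand or with a tool:
    (ORG_SKILL, SKOS_EXACT_MATCH, COM_SKILL),
]


def equivalents(uri, triples):
    """Return every URI reachable from `uri` through equivalence links, in either direction."""
    seen = {uri}
    pending = [uri]
    while pending:
        current = pending.pop()
        for s, p, o in triples:
            if p not in EQUIVALENCE_PROPERTIES:
                continue
            if s == current:
                other = o
            elif o == current:
                other = s
            else:
                continue
            if other not in seen:
                seen.add(other)
                pending.append(other)
    return seen


def describe(uri, triples):
    """Collect (property, value) pairs stated about `uri` or any equivalent URI."""
    same = equivalents(uri, triples)
    return {
        (p, o)
        for s, p, o in triples
        if s in same and p not in EQUIVALENCE_PROPERTIES
    }
```

Asking for a description of either URI returns both the label and the level. Without the exactMatch triple, each source's information stays separate, which is the reuse problem in miniature. A skos:closeMatch link would deliberately not be followed transitively in this way.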

Another point that I drew from the discussion was that the more that any data is used, the more motivation people have for keeping it up to date. Thus, the more that information about people is consolidated into a single copy that is used many times, rather than kept as several copies each of which is used less often, the more motivation there is to keep it current. We need to keep pushing to kick-start the virtuous circle of using standards to help information to be consolidated, and further motivating people to consolidate it – and that naturally means to link it, probably in a linked data kind of way.

Motivation also depends on the economics and politics. What if changing the way that things are done (inevitably, along with the improvements we are suggesting) shifts costs from one party to another? It may be that costs are cut overall, but what if many costs are cut, but a few costs, of key players, are raised? We will have to keep aware of this happening, and think how to solve it when it arises.

Perhaps at a tangent to our main topic, we noted that XCRI-CAP is not a completely satisfying whole, and needs to be extended to cope with other areas of course-related information.

And the “ecosystem” that is the world of standards and specifications needs to take into account the motivation for standardisation in the first place. Perhaps CETIS could be a bit more ambitious about the niche we carve out for ourselves?

Future of interoperability standards – small points

Posted on September 23, 2010 by Simon Grant

This is a rather ephemeral statement of position-of-the-month on the future of interoperability standards, for the CETIS meeting on 24th September. I have just three things to note: two issues from helping to create the EuroLMAI CEN Workshop Agreement (moving towards an EN European Standard) and one issue from Leap2A.

1. Keep pressing for those URIs.

For EuroLMAI, we want URIs for our classes and properties, so that we can be good citizens of the Semantic Web. How hard is that? Well, first, whose domain are they going to be in? As this is a prospective CEN standard, one would have thought they would be keen to help by providing suitable URIs. Maybe they are, and maybe they will provide them, but, being a European institution, it does seem to take time, and plenty of it! It looks like we will have to use a PURL server like purl.org instead, at least for the time being. That is sort of OK, but there is a time penalty for accessing things through a PURL server, so it does slow things down and have the potential for increased frustration. And it doesn’t look half as official: there is some PR cost.
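
The indirection that a PURL server provides can be sketched as a lookup table from persistent URIs to wherever the definitions currently live. This is an illustration of the idea only: the table, the purl.example.org paths and the target hosts are all hypothetical, and a real PURL server answers with an HTTP redirect rather than a Python function call (the 302 status here is chosen for illustration).

```python
# Hypothetical mapping: persistent URI -> current location of the definition.
PURL_TABLE = {
    "http://purl.example.org/eurolmai/Achievement":
        "http://specs.example.org/eurolmai/v1#Achievement",
}


def resolve(persistent_uri, table=PURL_TABLE):
    """Return (status, location) roughly as a redirecting server might."""
    target = table.get(persistent_uri)
    if target is None:
        return 404, None
    return 302, target


def rehost(table, old_prefix, new_prefix):
    """Repoint targets when the definitions move; the persistent URIs stay unchanged."""
    moved = {}
    for purl, target in table.items():
        if target.startswith(old_prefix):
            target = new_prefix + target[len(old_prefix):]
        moved[purl] = target
    return moved
```

The extra lookup is the time penalty mentioned above. The benefit is that if an official host later offers to serve the definitions, only the table changes and every published URI keeps working.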

2. Do keep a clear conceptual model, as it helps later on as well.

In the EuroLM work, I was always keen on, and played a large part in, getting a good conceptual model with good definitions, meant to serve as a relatively firm foundation on which to build the specifications and standards. Recent experience suggests that not only is this useful in the initial work, but it is also useful to have the conceptual model to hand when checking the detail of the spec. My own experience reflects what may be obvious, that it is easy, when revising a draft much later on, to forget why something was done in a certain way. A little doubt in the mind, and it is too easy to edit something back to what looks like a common-sense position, but actually represents something that you carefully argued against on the basis of having taken the pains to build that clear and agreed conceptual model. (The problem being that we all habitually take our own personal cognitive short cuts, which may seem like common sense, and too often these end up being represented in formal structures when they shouldn’t be.)

3. Prepare better for people building on your spec.

OK, so your new spec is really gaining ground. You’ve done a fair job of capturing requirements and representing structures that everyone can relate to. You’ve not built a monster, but something that covers more or less just what it needs to cover, coherently. So now you shouldn’t be surprised that people want to take your spec and adapt it to their needs. Perhaps they will need to add a class or two of their own, perhaps some of their own properties, perhaps some categories or vocabularies, which may overlap with the default ones you have provided with the spec. How are you going to recommend that they proceed, in each case? This is a real question that is taxing me with Leap2A at the moment, and is a learning experience, as I find I am not as well prepared as I would have liked to be. I’d like to be able to document a page on “Building on Leap2A”, which might perhaps refer to the DCMI “Singapore Framework”.

ISKO Linked Data event

Posted on September 15, 2010 by Simon Grant

The full but mixed audience meant that this event was partly introductory, giving good revision, but going on to some interesting ideas around open linked data. What I was most looking for, leads on linked personal data, wasn’t covered, but it was useful nevertheless.

Nigel Shadbolt, the first keynote speaker, has the co-distinction with Sir Tim B-L of advising for data.gov.uk, and naturally he talked about government linked data. It is great that so much information is being exposed from government sources. I asked him about the National Occupational Standards maintained by Sector Skills Councils, coordinated by the UK Commission for Employment and Skills, and I hope he will be able to advise on leverage points, as even the first steps of the linked data ladder, giving things dereferenceable URIs, would be highly significant for skills and competences for use in conjunction with learning outcomes, job role competence specifications, and matching outcomes of learning to skills wanted for employment. (UKCES is sponsored by several government departments, though BIS is the lead sponsor and therefore would probably be our best point of contact.)

Crown Copyright information is to have a new, more open, licence, assumed and designed to help reuse. Nigel introduced two sites, enakting.org and sameAs.org, which featured in later presentations as well (both useful and new to me).

Antoine Isaac gave a good introduction to SKOS. I asked him later about applying SKOS to skill definitions, and he seemed to agree that some specialisation of skos:broader and skos:narrower was in order. He also encouraged me to bring the topic up on the SKOS mailing list, which I will do when ready. He seemed to (and Nigel Shadbolt certainly did) imply that linked data meant using RDF/XML as the vehicle — somewhat daunting if not actually dispiriting — but at the end it became apparent that Antoine at least regarded RDFa as equivalent to RDF/XML. The more popular- and commercial-minded participants and presenters seemed to favour RDFa, which left me wondering how in touch RDF/XML proponents are. Probably not that many people are aware that RDFa is currently being developed to be more friendly to people who have started with microformats, so some existing reading on RDFa might not yet be as persuasive as it could be. However, it was good to note that no one at this conference was advocating microformats. Microdata, on the other hand, seems to be an entirely unknown quantity. (Current discussions within the RDFa community suggest a possible cross-mapping.)

Richard Wallis brought the Birmingham origins of Talis (co-sponsors of the event) into a generally informative presentation reinforcing some points already made with interesting examples. His presentation is on slideshare under his id of “rjw”.

Steve Dale told us about the local government “Knowledge Hub”, a “big, bold and ambitious” project going live in February 2011. Again it is about public sector information, though this time perhaps as much for local government workers themselves (who may not even be aware of all the information held) as for members of the public. Needless to say, none of this involves information about individual members of the public, though I did engage in some discussion around this. It seems that people still shy away from the area. My view would be that individuals have more to gain than to lose by having the infrastructure available for them easily to access information held about them from various sources, particularly in the public sector.

After a pleasant and plenteous lunch, Martin Hepp introduced the GoodRelations ontology, designed for representing the semantics of e-commerce, thus enabling much faster and more accurate matches of offers and requests. He reckons that a very large proportion of GDP — perhaps over 50% — can be accounted for as involved with commercial matchmaking, which becomes quite plausible when you consider that it must include marketing, advertising, etc. Hence it is clear that improvements here can have a huge positive effect on an economy. Martin was one of the explicit advocates of RDFa, and the systems he helps to facilitate use RDFa.

Then came the well-known-to-us Andy Powell (one of the very few I knew there) telling a well-illustrated “long and winding road” story of how Dublin Core has related to RDF, in the process trying to balance the enthusiasm of the Semantic Web evangelists against the cataloguing librarians who were not at all so sure. He introduced the amusing Southampton blog post describing a new Batman antihero, “the Modeller”, which I hadn’t seen before…

Challenges that he pointed out include the fact that modelling is hard, and that models have to gain recognition and consensus within a community before becoming useful. This fits in well with my recent emphasis on the processes supporting consensus in conceptual modelling, as a precursor to standardisation.

John Goodwin went into more detail about the Ordnance Survey’s “OpenData”, exposing for free the small-scale map geographical data of the country, though keeping the large-scale detail to sell. New to me. But some fascinating challenges came up for discussion. How does one relate, and keep track, of geographical entities that may both change their names, and have subtly different meanings in different contexts? “Hampshire” was used as an example (does it include the Isle of Wight, Bournemouth, or even Southampton?) Even more interestingly, he is looking at building up a vernacular gazetteer, for example to help emergency services locate places referred to by local people under the names they actually use.

The other co-sponsor of the event was punkt. netServices from Austria. Andreas Blumauer demonstrated their “PoolParty” system, which certainly looked clever enough, and includes a “corporate ontology” similar to the idea I was advocating for CETIS a while back, in connection with the topics that we have on our web site and blogs. Is it really that easy, I wondered?

The most esoteric presentation was reserved to the final spot. Bernard Vatant of Mondeca explained how there is more than the concept-centric SKOS to his ideal of linking data. Not just the Semantic Web, but the Semiotic Web… He would like to complement the representation of concepts with an explicit representation also of terms themselves. Give the terms their own URIs, make statements about them, don’t just include them as bare literals. Why exactly, I wondered, other than theoretical rigour, or the motive to include the discourse of semiotics (etc.)? If I had a few hours with him some time, I’d really like to bottom this out in conversation, partly to follow my bent towards relating to as many different conceptual starting points as I can.
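
The difference between a bare literal and a term with its own URI can be sketched with plain tuples. The SKOS-XL vocabulary is one existing way to give labels their own identity; whether it matches what Bernard had in mind is an assumption here, and the vocab.example.org URIs and the "usedIn" property are hypothetical.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
SKOSXL = "http://www.w3.org/2008/05/skos-xl#"

# Hypothetical URIs for one concept and one term that names it.
concept = "http://vocab.example.org/concept/stream"
term = "http://vocab.example.org/term/beck"

# Concept-centric style: the term is only a literal string.
literal_style = [
    (concept, SKOS + "prefLabel", "beck"),
]

# Term-centric style: the term is a resource, so statements can be made about it.
term_style = [
    (concept, SKOSXL + "prefLabel", term),
    (term, SKOSXL + "literalForm", "beck"),
    (term, "http://vocab.example.org/prop/usedIn", "Northern England"),  # hypothetical property
]


def statements_about(subject, triples):
    """Return the (property, value) pairs whose subject is `subject`."""
    return [(p, o) for s, p, o in triples if s == subject]
```

In the first style nothing can be said about "beck" itself, because a literal cannot be the subject of a triple. In the second, the term carries its own statements, such as where it is used.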

The networking was valuable. As well as querying Nigel Shadbolt and Antoine Isaac, I caught up with some people I came across some time ago from Metataxis, asked some of the many BBC people there about skills and competences, and at least made one contact interested in linking personal data. (Colleagues are of course very welcome to ask me more while the memories are fresh.)

E-portfolio Scotland

Posted on September 10, 2010 by Simon Grant

The Scottish e-portfolio scene seems to have comparatively many colleges, many of which use or are interested in Mahara. It may be even more promising than England for exploring company e-portfolio use, and we should try to ensure Scots are represented in any work on skills frameworks for e-portfolio tools.

That was the most interesting conclusion for me in a generally interesting day conference, e-Portfolio Scotland at Queen Margaret University on Friday (2010-09-10). I was given a plenary spot about Leap2A, and the audience responded well to my participative overtures – which is where I gathered this valuable information – and asked some intelligent questions. Mahara and PebblePad are well used, with Blackboard’s offering less so. Reassuringly, Leap2A came up in the presentations / demonstrations of Mahara and PebblePad, and in the final plenary by Gordon Joyes, so the audience would not be in doubt about how central Leap2A is. (We just have to carry on following through and delivering!)

It was interesting to meet so many new faces. Apart from Gordon, there was Derrin Kent, and Susi Peacock on her home ground, but I didn’t know any of the others well. There seemed to be a roughly even split between HE and FE, with a very few from professions and schools. Perhaps I ought to spend more e-portfolio time in Scotland…

The vendors present included Calibrand, who I first met at the EIfEL conference this summer, and Taskstream, who have been represented in many e-portfolio conferences over several years. I suggested to the latter that they really need to take on Leap2A to get more into the UK market. A Manchester-based company, OneFile, sells a “Portfolio Assessment Solution” that I had not come across at all before, and their location has obvious potential for future discussion. But perhaps the most interesting vendor there, also giving a presentation, was Linda Steedman, MD of eCom Scotland. Their company has got beyond being a micro-business and offers an “Enterprise Skills Management” tool called SkillsLocker. I was impressed by her presentation, ranging across accreditation of prior learning, work-based learning, and what is now fashionably called “talent management” rather than HR. It seems they are well-connected, with AlphaPlus among others; also that they have done some valuable work cross mapping different skill definitions — I intend to follow this up.

Though such commercially-based concerns are perhaps not quite so central to JISC as those working in the HE sector, we still need to find some way of supporting the adoption of Leap2A-friendly portfolio tools in them. Work- and skills-based learning and training is a natural successor to HE-based PDP and skills development, and we really need to link in to it to make HE portfolio use more universally motivating.

One big remaining challenge was broadly acknowledged: dealing with these skill and competence representation issues that we do have on our agenda. The vision I was putting around, with no dissenting voices, was to decouple portfolio tools from any particular skills framework, and to have the frameworks published with proper URIs (in good Linked Data style). Then any tool should be able to work with any skills framework, and Leap2A information would include the relevant URIs. Though there remains the problem with HE that they tend to define skills at a different level to industry demands, FE is comparatively much closer to their employers, and they have common reference points in National Occupational Standards. So, among other things, any help we can get to persuade Sector Skills Councils to give proper URIs and structure to their NOSs will be most welcome, and maybe the Scottish e-portfolio community can help with this, and with defining the needed structures?
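
The decoupling can be sketched as follows, assuming that each framework publishes its skill definitions under its own URIs and that a portfolio item records only those URIs. The frameworks, URIs and labels are hypothetical, and the dictionary layout is not Leap2A syntax; it only illustrates the separation of concerns.

```python
# Hypothetical skills frameworks, each published under its own URI space.
FRAMEWORKS = {
    "http://skills.example.org/nos/": {
        "http://skills.example.org/nos/customer-service-1": "Deal with customer queries",
    },
    "http://skills.example.ac.uk/graduate/": {
        "http://skills.example.ac.uk/graduate/teamwork": "Work effectively in a team",
    },
}

# A portfolio item refers to skills by URI only; it embeds no framework.
portfolio_item = {
    "title": "Summer placement reflection",
    "skills": [
        "http://skills.example.org/nos/customer-service-1",
        "http://skills.example.ac.uk/graduate/teamwork",
        "http://skills.example.net/unknown/99",
    ],
}


def skill_labels(item, frameworks):
    """Look up a label for each skill URI; URIs from unknown frameworks are kept as they are."""
    known = {}
    for definitions in frameworks.values():
        known.update(definitions)
    return [known.get(uri, uri) for uri in item["skills"]]
```

Because the tool only ever handles URIs, the same item can mix an occupational framework with an institutional one, and a framework the tool has never seen degrades gracefully to a bare link instead of breaking the import.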

What’s in a standard name?

Posted on August 29, 2010 by Simon Grant

“Confusion” and “names” go together rather too easily. How might we take forward the representation of names in specifications and standards?

In June I noticed some RS3G work on defining person metadata, which I took an interest in, having grappled with this for Leap2A. Then last month I discovered somewhat to my surprise that I was a co-author of the rather oddly titled “Conformance Guidelines – The Path to a pan-European Asset (final draft) By example of the natural person” from SEMIC, the Semantic Interoperability Centre Europe. This document also deals with the representation of names, though differently.

In subsequent discussion, I have been drawn in to the peculiarities of Spanish surnames (they officially must have exactly two) and how other naming systems in the world are sometimes radically different from the EU passport convention of just “surname” and “given names”, echoed by UK birth certificates, driving licences, etc. Wikipedia has an article on Arabic names, for example. One thing is clear: it is not easy, and it matters to people how their name is rendered.

The RS3G idea refers to CETIS’s HEAR work, which in turn was based on some fields from MIAP (now the Learning Records Service) Common Data Definitions. However, CDDs look like they have been quietly superseded, and what looks like the current GovTalk version of name schemas is now at the Cabinet Office site. In particular, CDDs dealt with name order by having a boolean field to show if the family name was first, to cope with e.g. some Chinese practice. This does not seem to be in the GovTalk version, and indeed doesn’t seem to have caught on more widely either. Instead, people seem to be relying on a full name field to represent the ordering of the structured name fields. Strangely, GovTalk keeps titles and suffixes with their own separate fields, rather as they are in vCard.
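
The ordering flag can be sketched like this. The field names are hypothetical stand-ins, not the actual CDD or GovTalk element names, and the first example name is made up.

```python
from dataclasses import dataclass


@dataclass
class StructuredName:
    given_names: str
    family_name: str
    # CDD-style ordering flag; the attribute name here is hypothetical.
    family_name_first: bool = False

    def display(self):
        """Join the structured parts in the order the flag indicates."""
        if self.family_name_first:
            parts = [self.family_name, self.given_names]
        else:
            parts = [self.given_names, self.family_name]
        return " ".join(part for part in parts if part)


chinese_style = StructuredName("Wei", "Zhang", family_name_first=True)
western_style = StructuredName("Simon", "Grant")
```

Here `chinese_style.display()` gives "Zhang Wei" and `western_style.display()` gives "Simon Grant". The alternative that seems to have won out stores the already-ordered full name as a separate string, which avoids the flag but duplicates information held in the structured fields.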

Most people seem to agree these days that one or more full name fields are useful, giving the name as complete as someone wants it. This full name is not necessarily a full legal name. It may have parts at the beginning or end, e.g. titles or qualifiers, and these may or may not have legal standing. Thinking of the Arabic example, it may have a full rendition of a name that will not fit in to the patterns familiar to us. It may or may not be unique.

For use in the UK or the EU, there is likely to be one field for given names or forenames, and one field for family name or surname. Obviously, this tallies with usage in birth certificates, driving licences, passports and other official documents. This may suffice where the official legal name is all that is required, but it leaves many questions unanswered, and for a fuller picture we need to go beyond the common ground.

The reality looks to me more like this. In our dealings with different people and different bodies, we typically have identifiers, unique to that body or to those people, and we also have names that we are known as (by people) in those contexts. These may or may not overlap, and I suspect that it is the overlap which often causes problems.

My own case is a very common case in point. When asked to give my “name”, I will usually say “Simon” or “Simon Grant” if it might appear likely that there is another Simon in the context. However, I have learned not to put my name on airline reservation forms like this, as I get questioned. They want my official name, to tie in with my passport, where my Given Names field is “Andrew Simon”. The airline is interested, not in what I am called by my friends, but in my official identity, so they can check against the passport, which in turn can be checked against police records, etc. But it’s curious, isn’t it – I could probably pass my ticket on to anyone called Andrew Grant and they would be able to fly with it, but not anyone with a different name. Perhaps they should really ask for my passport number, though that is not as easy to check at a glance. At least some people get it right: on-line purchase forms ask for my name “as on the card”, which is clear, and in other contexts the practice of context-specific names is well-established: I think of stage names, noms-de-plume, and other professional names.

The most reasonable view seems to me that every name field in a specification or standard should at least have an explanation of the context for that name, and be adequate for representing appropriate name information for everyone whose data is to be transferred using that spec or standard. If a standard or specification deals with transferring information about people who are all likely to have EU passports, it is reasonable to specify “Surname” and “Given names” as two fields, as on a passport. If the use cases of a different spec all involve Spanish citizens, then it is reasonable to have two fields, one for the first surname and another for the second surname. It would not, however, be reasonable for such a spec to have those fields as mandatory if it was going to be used for information about people of Arabic birth.

This discussion underlines my reservations about vCard and related specs, and is vital to setting an agenda for reviving the discussions about future specifications for names.

In Leap2A, what we have done is roughly to follow MIAP’s Common Data Definitions, as mentioned above. At least that gives the option of noting whether a name is a legal name or a preferred name. But, while it may suffice for the time being, it does not look good in the long term, and I hope that whenever Leap2A next changes, in a couple of years perhaps, we will move on to something more universal.

Achievement information documents, e.g. degree certificates and transcripts, also need names. We are just finalising the draft of the EuroLMAI (European Learner Mobility Achievement Information) European Standard, which should be highly influential on technical solutions adopted for the UK HEAR (Higher Education Achievement Report). I would like to see a better approach to names for these standards. What?

Given that the scope of EuroLMAI is Europe (perhaps the EU and a little beyond) it does seem reasonable to have one field for given names and one for surname, or pair of surnames in the Spanish case, as on the standard passport or other official identity documents. But what is most important after that? Two things seem to me to be vital. First, and most important, the name by which the student appears or appeared in the institutional student records. Is this always the same as the “official” legal name? If you print out a certificate for someone of Arab birth, what is more appropriate for the name printed on the certificate? I would have thought, a name as complete as the learner wants it to be. This would point towards a full name, like the UK govtalk Person Requested Name, as long as this was agreed with the student.

The second thing that might well be useful, though less important, would be the name that the student was actually known by to teaching staff and to peers. This would be closer to the “nickname” concept in other specifications. This would be particularly useful if it was not the same as the official name(s), when seeking feedback from staff and from peers, or when following up to check things after graduation.

So perhaps a wider range of potentially useful name information could be represented in four fields. Leaving any of these out from a specification or standard seems to me to risk being unable adequately to cover the use cases of this kind of achievement document.

Official (passport) Given names
Official (passport) Surname(s)
Full formatted name as shown on certificates
Personal name known by in face-to-face interactions
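
As a rough sketch, the four fields could be carried in a record like the one below, with each consumer asking for the name that suits its context. The attribute and context names are hypothetical, and the certificate name in the example is illustrative only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AchievementName:
    official_given_names: str        # as on the passport
    official_surnames: str           # as on the passport; may hold two surnames
    certificate_name: str            # full formatted name, agreed with the learner
    known_as: Optional[str] = None   # name used face to face, if different

    def name_for(self, context):
        """Pick the name appropriate to a (hypothetical) context label."""
        if context == "identity-check":
            return f"{self.official_given_names} {self.official_surnames}"
        if context == "certificate":
            return self.certificate_name
        if context == "informal":
            # Fall back to the certificate name when no personal name is recorded.
            return self.known_as or self.certificate_name
        raise ValueError(f"unknown context: {context!r}")


learner = AchievementName(
    official_given_names="Andrew Simon",
    official_surnames="Grant",
    certificate_name="Andrew Simon Grant",  # illustrative value
    known_as="Simon",
)
```

With this record, an identity check gets "Andrew Simon Grant", while a request for feedback from teaching staff can use "Simon". Making the context explicit is the point; the particular contexts listed are only examples.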

These can only be represented by vCard with a strain. Surname is OK, but a single field of all given names seems different from a vCard “Given”. Full formatted name could be vCard’s “FN”, though vCard’s FN is presumably meant for the main name on a “business” card, which might well not be the same. Is the personal name indicated above the same as vCard’s “NICKNAME” or not? I don’t know. Unfortunately, there is only slightly less strain fitting these into the UK GovTalk fields.

Where do we go from here? Call me “confused”…

Addendum, 2010-08-31

By happy coincidence, Paul Heald of Sigma Systems distributed a very interesting document yesterday to various people including Scott Wilson and other colleagues. It is called “Student Identity Defined: A Comparison of the Data Elements of Four Higher-Education Standards” revised August 30th 2010. It does indeed suggest that the discussion is ready to be taken forward to another level. And I find it is helping, gradually, to relieve my confusion.

Overhauling universities

Posted on June 14, 2010 by Simon Grant

Timely article from the BBC, “Universities need radical overhaul, says David Willetts” (2010-06-10) might provoke a positive response from people like us in CETIS, Bolton’s IEC, and the Centre for Recording Achievement (CRA). The BBC indicates that Willetts thinks universities faced “tough times” and needed to find cheaper and more flexible ways to teach. To which I’d add, more relevant and effective, perhaps?

What inefficiencies might be identified in higher education at present? For now, here are just a few first ideas, along with the kind of responses that CETIS, IEC or CRA could contribute to (though not in any order of impact, significance, importance, or difficulty, all of which need consideration).

1. Cost of producing learning materials and resources.
Response: greater use of open educational resources — see the CETIS OER topic.

2. Cost of staff.
Response: make greater use of peer support and assessment, perhaps starting with IEC’s IDIBL approach.

3. Irrelevance to employment and the economy.
Response: at the behest of learners themselves (see below) make more learning work-based, again like IDIBL, and let HEIs focus on Employer Engagement, as in the HE5P project undertaken by the CRA for HEFCE.

4. High student drop-out.
Response: ensure that students know what they want, are well motivated, know what they can do already, and have supportive PDP processes in place. Again, the CRA specialise in PDP and e-portfolio tools, and I have a particular interest in e-portfolio tools that are well-adapted to help good practice. This relates to ethical development that I have written about before. To counter the interminable arguments about the ideal aims of higher education, let properly-prepared learners choose. If they want employment-centric education, let them have it, not some poor ineffective attempt at such. If they want liberal arts with no requirement for consequent employment, again, let them have it. It’s not ultimately up to you or me or anyone to preach about what education should be for. And, surely, good preparation and real choice of objective should lead to more commitment?

5. Ineffective technology
OK, but several tools won’t perform as required to enable these efficiency gains. VLEs in silos, which you can’t extract information from, are a case in point. But the kind of cross-linking, enabling technology that CETIS people work on is surely well-placed. Look at Wookie, for example, allowing different applications to exist as widgets within web pages. Or look at the mobile technology work, brought together in a meeting about which many people twittered… For a full vision of an overhauled university, we would probably need to do more along the e-admin line, which isn’t perhaps appealing at first sight, but could make so much difference to the institutional overheads.

6. Lack of interoperability
Last but not least (in relevance to CETIS) we could list the inefficiencies due to lack of interoperability within the technology. For tasks that have to be done, this leads to inefficiencies such as rekeying; for tasks that are still very valuable but not absolutely necessary (such as many portfolio tasks) this probably leads to good things not being done at all, and consequent ineffectiveness. Not only does CETIS contribute very significantly to interoperability initiatives, as our name suggests, but we are also maintaining a forward-looking discussion about the future of interoperability.

As I hinted at the beginning, the people I work with know about these things. It might be both very impressive, and very helpful to the likes of David Willetts, to bring these points together in a coherent vision of a university aimed at learner-centered effectiveness as well as efficiency.

Portfolios need verifiability

Posted on April 19, 2010 by Simon Grant

Having verified information included in learner-owned portfolios looks attractive to employers and others, but perhaps it would be better to think in terms of verifiable information, and processes that can arrange verification on demand.

Along with Scott Wilson and others, I was at a meeting recently with a JISC-funded project about doing electronic certificates, somewhat differently from the way that Digitary do them. Now, the best approach to certifying portfolio information is far from obvious. But Higher Education is interested in providing information to various people about activities and results of those who have attended their institution, and employers and others are keen to know what can be officially certified. When people start by imagining an electronic transcript in terms of their understanding of a paper transcript, inevitably the question of how to make it “secure” will come up, echoing questions of how to prevent forgery of paper certificates.

Lately, I have been giving people my opinion that portfolio information and institutionally (“primary source”) verified information are different, and don’t need to interact too closely. Portfolio holders may write what they like, just as they do in CVs, and if certificates or verification are needed, perhaps the unverified portfolio information can provide a link to a verified electronic certificate of achievement information (like the HEAR, the UK Higher Education Achievement Report, under development). This meeting moved my understanding forward from this fairly simple view, but there are still substantial gaps, so I’ll try to set out what I do understand, and ask readers for kind suggestions about what I don’t.

As Scott could tell you much more ably than me, there are plenty of problems with providing digitally signed certificates for graduates to keep in their own storage. I won’t go into those, just to say that the problem is a little like banknotes: you can introduce a new clever technology that is harder to forge, but sooner or later the crooks will catch up with you, and you have to move on to ever more complex and sophisticated techniques. So, in what perhaps I may call “our” view, it seems normally preferable to keep original verified information at source, or at some trusted service provider delegated by the primary source. There are then several ways in which this information can be shown to the people who wish to rely on its verified authority. In principle, these are set out in Scott’s page on possible architectures for the HEAR. But in detail, again at the meeting I realised something I hadn’t figured out before.

We have already proposed in outline a way in which each component part of an achievement document could have its own URI, so that links could be made to particular parts, and differential permissions given to each part. (See e.g. the CEN EuroLMAI PDF.) If each part of an achievement document is separately referenceable, the person to whom the document refers (let’s call this person the holder again) could allow different people to view different parts, for different times, etc., providing that achievement information servers can store that permission information alongside the structured achievement information itself.
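As a minimal sketch of what storing that permission information alongside the achievement information might involve, here is some plain Python. The class names, URIs and viewer identifiers are all hypothetical, invented for illustration; none of them come from the CEN proposal.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class Grant:
    part_uri: str      # URI of one part of the achievement document
    viewer: str        # who the holder allows to see that part
    expires: datetime  # the permission lapses at this moment


class PermissionStore:
    """Grants set up by the holder, kept next to the achievement record."""

    def __init__(self):
        self._grants = []

    def allow(self, part_uri, viewer, expires):
        self._grants.append(Grant(part_uri, viewer, expires))

    def may_view(self, part_uri, viewer, now):
        # A part is visible only under a matching, unexpired grant.
        return any(
            g.part_uri == part_uri and g.viewer == viewer and now < g.expires
            for g in self._grants
        )


# Hypothetical part URIs for one holder's achievement document.
RESULT_URI = "https://hear.example.org/holder42#degree-result"
MODULES_URI = "https://hear.example.org/holder42#module-marks"

store = PermissionStore()
store.allow(RESULT_URI, "employer-a", datetime(2010, 6, 30, tzinfo=timezone.utc))

in_may = datetime(2010, 5, 1, tzinfo=timezone.utc)
in_july = datetime(2010, 7, 1, tzinfo=timezone.utc)
```

Here the holder has let one employer see the degree result until the end of June, and nothing else. Anything not explicitly granted stays hidden, which seems the safer default.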

Another interesting technical approach, possible at least in PebblePad (Shane Sutherland was helpfully contributing to the meeting), is transparently to include information from other servers, to view and manage in your portfolio tool. The portfolio holder would directly see what he or she was making available for others to view. The portfolio system itself might have general permission to access any information on the achievement information server, with the onward permissions managed by the portfolio system. Two potential issues might arise.

What does giving general permission to an e-portfolio system mean for security? Would this be too much like leaving an open door into the achievement information server?
As the information is presented by the portfolio server, how would the viewer know that the information really comes from the issuer’s server, and is thus validated? A simple mark may not be convincing.

A potential solution to the second point might start with the generation of a permission token on the issuer’s server whenever a new view is put together on the portfolio system. Then the viewer could request a certificate that combined just the information that was presented in that view. But, surely, there must be other more general solutions?
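To make the token idea a little more concrete, here is a rough Python sketch under my own assumptions. The `IssuerServer` class, its method names and the `urn:example` identifiers are hypothetical, and a real service would also need authentication, expiry and signing, none of which is attempted here.

```python
import secrets


class IssuerServer:
    """Toy issuer: holds verified statements and remembers each view."""

    def __init__(self, parts):
        self._parts = dict(parts)  # part URI -> verified statement
        self._views = {}           # token -> part URIs shown in that view

    def issue_token(self, part_uris):
        # Called when the portfolio system assembles a new view.
        unknown = [u for u in part_uris if u not in self._parts]
        if unknown:
            raise KeyError(f"no verified record for: {unknown}")
        token = secrets.token_urlsafe(16)
        self._views[token] = tuple(part_uris)
        return token

    def certificate(self, token):
        # The viewer presents the token and gets back only those parts.
        uris = self._views.get(token)
        if uris is None:
            return None
        return {u: self._parts[u] for u in uris}


# Hypothetical record with three parts; the view uses only two of them.
issuer = IssuerServer({
    "urn:example:hear:42:award": "BSc (Hons) Physics",
    "urn:example:hear:42:prize": "Departmental prize",
    "urn:example:hear:42:resit": "One module resat",
})
view_token = issuer.issue_token(
    ["urn:example:hear:42:award", "urn:example:hear:42:prize"]
)
cert = issuer.certificate(view_token)
```

The certificate covers exactly what the view showed, so the viewer learns nothing about the parts the holder left out.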

The approach outlined above might be satisfactory just for one achievement information server, but if the verified information covering a portfolio is distributed across several such servers, the process might be rather cumbersome, confusing even, as several part certificates would have to be shown. Better to deal with such certificates only as part of a one-off verification process, perhaps as part of induction to a new opportunity. Instead, if the holder were able to point from a piece of information to the one or more parts of the primary records that backed it up, and then to set permissions within the portfolio system for the viewer to be able to follow that link, the viewer could be given the permission to see the verified information behind any particular piece of information. Stepping back a little, it might look like this. Each piece of information in a portfolio presentation or system is part of a web of evidence. Some of that evidence is provided by other items in the portfolio, but some refers to primary trustable sources. The method of verification can be provided, at the discretion of the portfolio holder, for permitted viewers to follow, for each piece of information.

One last sidestep: the nice thing about electronic information is that it is very easy to duplicate exactly. If there is a piece of information on a trusted server, belonging to a portfolio holder, it is in principle easy for the holder to reproduce that piece of information in the holder’s own personal portfolio system. Given this one-to-one correspondence, for that piece of information there is exactly one primary source of verification, which is the achievement information server’s version of just that piece of information. The information in the portfolio can be marked as “verifiable”, and associated with its means of verification. A big advantage of this is that one can query a trusted server in the least revealing way possible: simply to say, does this person have this information associated with them? The answer would be, “yes”, or “no”, or “not telling” (if the viewer is not permitted to see that information).
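The three-way answer can be sketched in a few lines of Python. This is only an illustration under my own assumptions: the `AchievementServer` class and its methods are hypothetical, and permission is modelled very simply as the holder allowing a named viewer to ask about one exact statement.

```python
from enum import Enum


class Answer(Enum):
    YES = "yes"
    NO = "no"
    NOT_TELLING = "not telling"


class AchievementServer:
    """Toy trusted server answering verification queries."""

    def __init__(self):
        self._records = {}       # holder -> set of verified statements
        self._permitted = set()  # (holder, viewer, statement) triples

    def record(self, holder, statement):
        self._records.setdefault(holder, set()).add(statement)

    def permit(self, holder, viewer, statement):
        self._permitted.add((holder, viewer, statement))

    def verify(self, holder, viewer, statement):
        # Check permission first, so that a refusal reveals nothing
        # about whether the statement is on record.
        if (holder, viewer, statement) not in self._permitted:
            return Answer.NOT_TELLING
        if statement in self._records.get(holder, set()):
            return Answer.YES
        return Answer.NO


# Hypothetical holder, viewer and statements.
server = AchievementServer()
server.record("holder42", "BSc (Hons) Physics, 2:1, 2009")
server.permit("holder42", "employer-a", "BSc (Hons) Physics, 2:1, 2009")
server.permit("holder42", "employer-a", "BSc (Hons) Physics, First, 2009")
```

Because the portfolio copy is an exact duplicate, an exact match is all the server needs to test, and the viewer never sees more than one of the three answers.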

Stepping back again, we no longer need any emphasis on representing “verified” information within a portfolio itself, but instead the emphasis is on representing “verifiable” information. The task of looking after this information then becomes one of making sure that the verification queries are successful just when they should be. What does this entail? These are the main things that I am unclear about in this vision, and would be grateful to know. How do we use and transform personal information while retaining its verifiability? What is required to maintain that verifiability?

Education and employment

Posted on March 30, 2010 by Simon Grant

Rather worrying to read a recent post from the CIPD, pointing out the great discrepancy between what people have studied recently and the jobs they get (or don’t get). Significant enough to get other people quoting it. These facts might reasonably lead one to the conclusion that we ought to have:

effective personal development planning as the norm, including good employment-oriented “information, advice and guidance”, more reliably joined to educational opportunities, and including clear advice on things that are not usually “learned”, but are more often aspects of personal style and values;
more transparent connections between the actual skills and competence in demand from employers, and the intended learning outcomes of courses that purport to prepare people for employment;
far more widespread, transparent and effective systems for labour market matching between job-seekers and openings, taking into account what really makes the difference between “just a job” and genuine employee engagement, satisfaction and development.

The learning technology we support and promote needs to take that into account as well. Great technology for learning tools or learning design, great open learning resources on ever-so-well managed repositories, are only really valuable when truly suitable individuals take learning opportunities both that fit them, and that do what can be done to prepare them for whatever can be reliably predicted about their future occupations. I don’t think we are clueless about the technology that supports the latter objectives, but I’d say it is harder to do it well.

Perhaps it is a question of balance. If the PDP, the IAG, the skills development, tracking and matching were done relatively well, it would be a good reason to invest more in the tools, the resources, and the methods, which are perhaps not so challenging in principle, and easier to show supposed benefits from, until confronted with the stark reminders mentioned at the beginning.

linked portfolios?

Posted on March 23, 2010 by Simon Grant

There’s been continued development of interest within CETIS around the issue of linked data. Most people seem to start from the assumption that linked data is public data, and of course that isn’t going to work in e-portfolio land. (See e.g. this W3C guide in construction.) I see it as a creative challenge for CETIS to get hold of the question of linking personal data, and the issues it involves, perhaps leading on to initial guidance for others implementing systems. This is perhaps needed to make progress with Leap2R.

Wilbert Kraan was in the Bolton office today, and I had a brief chat that opened up some of these issues to me. (He is a CETIS Semantic Web authority.) We could approach linked personal data in at least two ways:

named graphs with permissions attached;
security policies for particular URIs.

The named graph approach would seem to fit well with the way that e-portfolio systems make information available. Mahara has “views”, PebblePad has “webfolios”, which are somewhat similar in structure. They are both the means for presenting subsets of one’s information to particular audiences. So, if an e-portfolio had a SPARQL query facility attached, it would have to give no information by default, but only information derived from the graphs specifically named in the query. It is, I am assured, quite possible to restrict permission to access particular named graphs in a way very similar to restricting access to any web document.
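A minimal sketch of that default-deny behaviour, in plain Python rather than a real triple store. The `PortfolioStore` class, the graph names and the agents are hypothetical; in an actual SPARQL endpoint the named graphs would be selected with `FROM NAMED` and `GRAPH`, and access control would be handled by the server.

```python
class PortfolioStore:
    """Toy store: triples live in named graphs, each with its own readers."""

    def __init__(self):
        self._graphs = {}   # graph name -> set of (s, p, o) triples
        self._readers = {}  # graph name -> agents allowed to read it

    def add(self, graph, triple):
        self._graphs.setdefault(graph, set()).add(triple)

    def share(self, graph, agent):
        self._readers.setdefault(graph, set()).add(agent)

    def query(self, agent, graphs=()):
        # No graph named means no information. A named graph contributes
        # only if this agent has been given permission to read it.
        result = set()
        for name in graphs:
            if agent in self._readers.get(name, set()):
                result |= self._graphs.get(name, set())
        return result


# Hypothetical views, in the Mahara or PebblePad sense.
JOB_VIEW = "view:job-application"
JOURNAL = "view:private-journal"
skill = ("ex:alice", "ex:hasSkill", "ex:project-management")
note = ("ex:alice", "ex:wrote", "ex:reflection-17")

portfolio = PortfolioStore()
portfolio.add(JOB_VIEW, skill)
portfolio.add(JOURNAL, note)
portfolio.share(JOB_VIEW, "employer-a")
```

Each named graph behaves much like a web document with its own access list, which is why this approach maps so directly onto existing views and webfolios.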

But does that give too little to those who want to write really interesting SPARQL queries involving personal information? Or would the necessary permission processes be too cumbersome? What if an individual could create permissions, or an access regime, for individual bits of his or her information? That might be more in keeping with the spirit of the Semantic Web. In which case, perhaps we could envisage two strengths of control:

filtering triples output from a SPARQL query to ensure that they only contained restricted URIs if the querying agent had permission to have those URIs;
filtering the inferencing process so that triples containing restricted URIs were only used in the inferencing process if the querying agent had permission to use them.

We would need to look into what the effects of these might be. Maybe we might conclude that the latter was an appropriate way of keeping sensitive data really private, while the former might be OK for personal information that was not sensitive? That is no more than a guess. If this approach proved to be feasible, it might provide not only a principled way of permitting the use of particular personal information, but also a really effective approach to keeping data private while still allowing it to be linked where allowed.
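The difference between the two strengths of control can be shown with a toy example in Python. Everything here is an assumption made for illustration: the `ex:` URIs, the single inference rule and the function names are invented, and a real reasoner would be far more general.

```python
def infer(triples):
    """One illustrative rule: completed(x, m) and partOf(m, p)
    together give participatedIn(x, p)."""
    derived = set(triples)
    for (x, p1, m) in triples:
        if p1 != "ex:completed":
            continue
        for (m2, p2, programme) in triples:
            if p2 == "ex:partOf" and m2 == m:
                derived.add((x, "ex:participatedIn", programme))
    return derived


def mentions(triple, blocked):
    return any(term in blocked for term in triple)


def filter_output(triples, restricted, allowed):
    # Weaker control: infer over everything, then hide any triple
    # that mentions a URI the agent may not have.
    blocked = restricted - allowed
    return {t for t in infer(triples) if not mentions(t, blocked)}


def filter_inference(triples, restricted, allowed):
    # Stronger control: restricted triples never reach the inferencing.
    blocked = restricted - allowed
    usable = {t for t in triples if not mentions(t, blocked)}
    return infer(usable)


# Hypothetical data: the module itself is the restricted URI.
RESTRICTED = {"ex:moduleX"}
DATA = {
    ("ex:alice", "ex:name", "Alice"),
    ("ex:alice", "ex:completed", "ex:moduleX"),
    ("ex:moduleX", "ex:partOf", "ex:resitProgramme"),
}
DERIVED = ("ex:alice", "ex:participatedIn", "ex:resitProgramme")

weak = filter_output(DATA, RESTRICTED, allowed=set())
strong = filter_inference(DATA, RESTRICTED, allowed=set())
```

In this example the weaker filter still returns the derived triple, because that triple does not itself mention the restricted URI, so something about the hidden data leaks out. The stronger filter returns only the name. That is at least consistent with the guess that filtering the inferencing is the one to use for sensitive data.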

The point here is just to open up the agenda. If we are to take the future of linked data and the Semantic Web seriously, in any case we need to think through what we do to link personal information. Just assuming that no one will want to link personal data is very unlikely to work in the long run.

PLE, e-p, or what?

Posted on February 18, 2010 by Simon Grant

The concept of the personal learning environment could helpfully be more related to the e-portfolio (e-p), as both can help informal learning of skills, competence, etc., whether these abilities are formally defined or not.

Several people at CETIS/IEC here in Bolton had a wide-ranging discussion this Thursday morning (2010-02-18), focused around the concept of the “personal learning environment” or PLE. It’s a concept that CETIS people helped develop, from the Colloquia system, around 1996, and that Bill Olivier and Oleg Liber formulated in a paper in 2001 (see http://is.gd/8DWpQ). The idea is definitely related to an e-portfolio, in that an e-p can store information related to this personal learning, and the idea is generally to have portfolio information continue “life-long” across different episodes of learning.

As Scott Wilson pointed out, it may be that the PLE concept overreached itself. Even to conceive of “a” system that supports personal learning in general is hazardous, as it invites people to design a “big” system in their own mind. Inevitably, such a “big” system is impractical, and the work on PLEs that was done between, say, 2000 and 2005 has now been taken forward in different ways — Scott’s work on widgets is a good example of enabling tools with a more limited scope, but which can be joined together as needed.

We’ve seen parallel developments in the e-portfolio world. I think back to LUSID, from 1997, where the emphasis was on individuals auditing and developing their transferable / employability skills. Then increasingly we saw the emergence of portfolio tools that included more functionality: presentation to others (through the web); “social” communication and collaboration tools. Just as widgets can be seen as the dethroning of the concept of monolithic learning technology in general, so the “thin portfolio” concept (borrowing from the prior “personal information aggregation and distribution service” concept) represents the idea that you don’t need that portfolio information in one server; but that it is very helpful to have one place where one can access all “your” information, and set permissions for others to view it. This concept is only beginning to be implemented. The current PIOP 3 work plans to lay down more of the web services groundwork for this, but perhaps we should be looking over at the widgets work.

Skills and competences have long been connected with portfolio tools. Back in 1997 LUSID had a framework structure for employability skills. But what is new is the recent greatly enlarged extent of interest in learning outcomes, abilities, skills and competencies. Recent reading for eCOTOOL has revealed that the ECVET approach, as well as being firmly based on “outcomes” (on which ICOPER also focuses), also recognises non-formal and informal learning as central. Thus ECVET credit is not attached only to vocational courses, but also to the accreditation of prior learning by institutions that are prepared to validate the outcomes involved. Can we, perhaps, connect with this European policy, and develop tools that are aimed at helping to implement it? It takes far-sighted institutions to give up the short-term gain of students enrolled on courses and instead to assess their prior learning and validate their existing abilities. But surely it makes sense in the long run, as long as standards are maintained?

If we are to have learning technology (and it really doesn’t matter if you call them PLEs, e-portfolios or whatever) that supports the acquisition or improvement of skills and competence by individuals in their own diverse ways, then surely a central organising principle within those tools needs to be the skills, competencies or whatever that the individual wants to acquire or improve. Can we, perhaps, draw on the insights of PLE and related work, put them together with e-portfolio work, and focus on tools to manage the components of competence? In the IEC, we have all our experience on the TENCompetence project that has finished, as well as ICOPER that is underway and eCOTOOL that is starting. Then we expect there will be work associated with PIOP 3 that brings in frameworks of skill and competence. Few people can be in a better position to do this work than we are in CETIS/IEC.

In part, I would formulate this as providing technology and tools to help people recognise their existing (uncertificated) skills, evidence them (the portfolio part) and then help them, and the institutions they attend, to assess this “prior learning” (APL) and bring it in to the world of formal recognition, and qualifications.

But I think there is another very important aspect to the technology connected with the PLE concept, and that is to provide the guidance that learners need to ensure they get on the “right” course. At the meeting, we discussed how employers often do not want the very graduates whose studies have titles that seem to relate directly to the job. What has gone wrong? It’s all very well treating students like customers (“the customer is always right”), but what happens when a learner wants to take a course aimed at something one believes they are not going to be successful at? Perhaps the right intervention is to start earlier, helping learners clarify their values before their goals, understand who they are before deciding what they should do. This would be “personal learning” in the sense of learning about oneself. Perhaps the PDP part of the e-portfolio community, and those who come from careers guidance, know more about this, but even they sometimes seem not to know what to do for the best. To me, this self-knowledge requires a social dimension (with the related existing tools), and is something that needs to be able to draw on many aspects of a learner’s life (“lifewide” portfolio perhaps).

So, to reconstruct PLE ideas, not as monolithic systems, but as parts, there are two key parts in my view.

The first would be a tool for bringing together evidence residing in different systems, and organising it to provide material for reflection on, and evidence of, skills and competence across different areas of life, and integrating with institutional systems for recognising what has already been learned, as well as slotting people in to suitable learning opportunities. This would play a natural part in continuous professional development, and in meeting the relatively short-term learning, education and training needs that we can recognise from our existing working perspective, and thus in the kind of workplace learning that many are predicting will need to grow.

The second may perhaps be not a tool but several tools to help people understand themselves, their values, their motives, their real goals, and the activities and employment that they would actually find satisfying, rather than what they might falsely imagine. Without this function, any learning, education or training risks being wasted. Doing this seems much more challenging, but also much more deeply interesting to me.