Tag Archives: workflow

Libraries of the future – insights from a talk by Lorcan Dempsey

June 6, 2015 | Uncategorized | collections, Digital Science, Elsevier, figshare, Identity management, Libraries, Libraries of the Future, library, Lorcan Dempsey, Mendeley, Nature, publishers, repository, Symplectic, workflow | Office of Scholarly Communication

There is no argument even from traditionalists that the library role is changing. But there is a great deal of confusion and sometimes fear about what that means, and what the future might look like.

On 3 June, Lorcan Dempsey* came to speak to staff at Cambridge University Library about how the role and purpose of libraries are changing. The slides from his talk are available on Slideshare.

The one sentence headline from the talk was that research libraries are moving from licensing published content to managing workflow and research outputs – which means the print collection needs to be managed down to free up resources for the new roles. The subhead is – if we don’t do this, the publishers are waiting in the wings to take over.

Modern libraries in research environments

The Library role is distilling from owned materials to facilitating access to many things. Changing focus to ‘discovery’ in the collection means there must be a loss of some of the items.

Lorcan noted that his sense is that there is still a low uptake of this concept. As someone who has been working in scholarly communications for over a decade, I agree.

The collection as a means to an end rather than an end in itself – in some ways this is obvious but in others it is a huge psychological shift.

In a print world, researchers and learners organised their workflow around the library. The library had a limited interaction with the full process.
In a digital world the Library needs to organise itself around the workflows of researchers and learners. Workflows generate and consume information resources.

In libraries there is a separation of the discovery and the collection – library users are on the global level. The library will make some available and own some of those.

Change of focus

The research endeavour has moved from a focus on outcomes to begin to think about a range of activities around the process and the aftermath.

The traditional role of a library – outside of special collections and manuscripts – deals with outcomes like the books and journals. In this model students and researchers interact with the books and journals and then turn it into classically published works that come back into the library.

But we now live in an online world and the Library is interacting with the content in many different ways. There is interest in the process of research – methods, evidence and research data. There is also interest in the discussion around research through pre-prints, working papers and a variety of prepublication activity. This involves revision, derivative works and reuse. Copyright is important in these cases to let people know how things will be used.

This means that ‘collections’ from a library perspective now include the process, methods, discussions and outputs as well as books and journals.

From curation to creation

Mediated access to licensed material is becoming more streamlined, and other items becoming more available. Libraries are supporting creation, not just consumption.

Libraries need to be seen as a source for collaboration. There needs to be a partnership between the Library and the Faculty. The library is a partner in terms of the creation activity. The mediating role will continue.

Managing this transition is often done opportunistically – when people retire they are replaced with new skill sets.

The repository was seen by the Libraries initially as something in relation to artefacts but now it is seen as part of the workflow of the research lifecycle. There is less attention on what is coming in and more focus on sharing material back out.

Show me the money

There is a question around how much of this activity will be supported by the institution, and how the shift in resources is occurring in the libraries.

Lorcan said that while libraries talk about a growing interest in special collections and archives, there is no evidence from a budget perspective that this is being supported.

Publishers are trying to muscle in

Managing online identities

There is considerable interest amongst researchers in having a carefully tended online presence. This is time consuming, and would appear to be important to the researchers. This process is becoming intimately tied to publication – it is where people are announcing their publication.

Lorcan mentioned a study in Nature which was a survey of 3,000 scientists and engineers. They found 6% used Google Scholar, but more than half were using ResearchGate more regularly than LinkedIn. Not surprisingly this behaviour can be broken down by discipline. The social sciences tend to use Google Scholar, and Academia.edu has low use by engineering and sciences. There are many workflow solutions to help researchers. Many of these will go away, but some are quite heavily used.

Thinking historically, a catalogue covers the material a library owns. The library has a discovery layer and a license. However this activity will have to shift to support creation. We have repositories, Google Scholar, ResearchGate etc. The incentive to use the repository is very low compared with these services.

A gap in the market?

Workflow is the new content – managing identity is where a lot of the focus is. Publishers are trying to position themselves as the service provider in this space.

Many libraries do not see their role as managing evolving scholarly records – the research and learning material. The curation of identity for researcher profiles is a big interest. This is often currently managed by research offices.

However this is a space into which publishers are moving. Several big publishers are now trying to be part of the full cycle for researchers.

For example, Elsevier has two products – Pure is a content management system for research reporting and Mendeley is an academic social network. It is no coincidence that the word ‘solution’ is in the url thread. Similarly, Macmillan (publishers of Nature) recently bought Digital Science, the company that created the equivalent products Symplectic and figshare. Digital Science was not included in the Macmillan Springer merger, possibly because they still need substantial investment. Lorcan noted that people see them as ‘plucky start-ups’ but they are owned by big publishers. There has been a big take-up of these services.

Lorcan showed a quote from Annette Thomas, CEO of Macmillan Publishers, about ‘A publisher’s new job description’.

Her view is that publishers are here to make the scientific research process more effective by helping them keep up to date, find colleagues, plan experiments, and then share their results. After they have published, the process continues with gaining a reputation, obtaining funds, finding collaborators, and even finding a new job. What can we as publishers do to address some of the scientists’ pain points?

As Lorcan observed – you can take out the word ‘publisher’ and replace with the word ‘library’.

Managing down collections

Libraries are increasingly wanting to organise their space around the student experience not around collections. Lorcan used a grid to illustrate the changing focus. The two distinctions were:

Whether items are in many collections or are rare or unique.
Whether items require stewardship. Items that are high stewardship items are looked after and resources are spent on them. Items that are low stewardship don’t get looked after in Libraries.

At one extreme is licensed materials which are high stewardship/many collections. The opposite corner is research materials which are only available in a few collections and are low stewardship.

Lorcan said he thinks in the future there will be a focus on distinctive collections. There needs to be a lot more money to do this. So licensed purchased material will be more streamlined. Management attention 15 years ago was on highly managed, licensed items, but now the focus has shifted to items of low stewardship.

Inside out library

Market materials: licensed/purchased stuff. Library as broker and telling users that these things are available in a special way.

Distinctive collections: Library is provider and want to maximise discoverability. Want other people to know about faculty expertise, and research data. Putting into own discovery layer doesn’t help there. Think about metadata and which aggregators are important. Been slow to realise that discoverability is vital.

These have very different dynamics. We want to share material held within the library with the rest of the world. The licensed stuff is external and libraries bring it in to share internally. This is inside out.

Traditionally libraries deal with published, purchased material (including special collections). However there is a shift away from acquisitions to demand. This means that libraries need to redirect their resources towards research support. One way of doing this is to manage down the print collection.

There was an explosion of publishing after the Second World War. In the same way that baby boomers are all retiring at the same time, we are now faced with the challenge of managing these collections down.

Challenges for identity

The managing down of print collections coincides with the push to repurpose space in libraries. There are many discussions with architects – managing down print means there must be refurbishment.

One of the issues emerging for libraries is: Without the books, does the campus see this as the Library? Is the space needed for the Library – could it be replaced by learning commons or the student union?

We can see the identity discussion about libraries emerging now. If we are managing down collections, what is the space for? What are the new services we offer? Lorcan mentioned media stories where librarians are being attacked by historians who see this as managerial, technocratic activity.

Lorcan described some of the shared collection activities happening in the USA.

Conclusion

We used to think of the Library as a collection. Now we need to think of the Library in terms of the user and their workflows.

We must move to more facilitated access to items, also move to the management and disclosure of curated materials. The print and digital scholarly record needs curation and co-ordination at a conscious national level.

The job is about restructuring the means but we need to make decisions about moving resources or bets on the future. Libraries must shift from an organisation where the end was known to one where we must take some risk.

Published 6 June 2015
Written by Dr Danny Kingsley

* Lorcan coordinates strategic planning and oversees Research, Membership and Community Relations at OCLC. He has worked for library and educational organizations in Ireland, the UK and the US. His influence on national policy and library directions is widely recognized. He is on Twitter – @LorcanID

FORCE2015 observations & notes

March 18, 2015 | Uncategorized | altmetrics, Author identification, Author tools, Credit models, Crowd Sourcing, data citation, data management, Libraries of the Future, open access, open data, open scholarship, Publishing software, Training, workflow | Office of Scholarly Communication

This blog first appeared on the FORCE2015 website on the 14 January 2015

First a disclaimer. This blog is not an attempt to summarise everything that happened at FORCE2015 – I’ll leave that to others. The Twitter feed using #FORCE2015 contains an interesting side discussion, and the event was livestreamed with individual sessions live in two weeks here – so you can always check bits out for yourself.

So this is a blog about the things that I, as a researcher in scholarly communication working in university administration (with a nod to my previous life as a science communicator), found interesting. This is a small sample of the whole.

This was my first FORCE event. The conference has occurred annually since the first event, FORCE11, which happened in August 2011 after a “Beyond the pdf” workshop in January that year. It was nice to have such a broad group of attendees. There were researchers and innovators (and often people were both), research funders, publishers, geeks and administrators all sharing their ideas. Interestingly there were only a few librarians – this in itself makes this conference stand out. Sarah Thomas, Vice President of Harvard Library, observed this, noting she is shocked that there are usually only librarians at the table at these sorts of events.

To give an idea of the group – when the question was asked about who had received a grant from the various funding bodies, I was in a small minority by not putting up my hand. These are actively engaged researchers.

I am going to explore some of the themes of the conference here, including:

Library issues
The data challenge
New types of publishing
Wider scholarly communication issues, and
The impenetrability of scientific literature

Bigger questions about effecting change

Responsibility

Whose responsibility is it to effect change in the scholarly communication space? Funders say they are looking to the community for direction. Publishers are saying they are looking to authors and editors for direction. Authors are saying they are looking to find out what they are supposed to do. We are all responsible. Funding is not the domain of the funders, it is interdependent.

What is old is still old

The Journal Incubator team asked the editorial board members of the new journal “Culture of Community” to identify what they thought would attract people to their journal. None mentioned the modern and open technology of their publishing practices. All the points they identified were traditional, such as peer review, high indexing, pdf formatting etc. Take home message – Authors are not interested in the back-end technology of a journal, they just want the thing to work. This underlines the need to ENABLE not ENGAGE.

The way forward

The way forward is three fold, and incorporates: Community – Policy – Infrastructure. Moving forward we will require initiatives focused on: Sustainability, Collaboration and Training.

Library issues

Future library

Sarah Thomas, the Vice President of the Harvard Library spoke about “Libraries at Scale or Dinosaurs Disrupted”. She had some very interesting things to say about the role of the library into the future:

Traditional libraries are not sustainable. Acquisition, cataloguing and storage of publications doesn’t scale.
We need to operate at scale, and focus on this century’s challenges, not the last century’s, by developing new priorities and reallocating resources to them. Use approaches that dramatically increase outputs.
There is very little outreach of the libraries into the community – we are not engaging broadly except to say “we are the experts and you come to us and we will tell you what to do”.
We must let go of our outdated systems – such as insufficiently automated practices, redundant effort, ‘just in case coverage’.
We must let go of our outdated practices – a competitive, proprietary approach. We need to engage collaborators to advance goals.
Open up hidden collections and maximise access to what we have.
Start doing research into what we have and illuminate the contents in ways we never could in a manual world, using visualization and digital tools.

Future library workforce

There was also some discussion about the skills a future library workforce needs to have:

We need an agile workforce – skills training, data science social media etc – help promote the knowledge of quality to work. Put it in performance goals.
We need to invest in 21st century skillsets. The workforce we should be hiring includes:
- Metadata librarian
- Online learning librarians
- Bioinformatics librarians
- GIS specialist
- Visualization librarian
- Copyright advisor
- Senior data research specialist
- Data curation experts
- Scholarly communications librarian
- Quantitative data specialist
- Faculty technology specialist
- Subject specialist

Possible solution?

The Council on Library and Information Resources offers PostDoc Fellowships: CLIR Postdoctoral Fellows work on projects that forge and strengthen connections among library collections, educational technologies and current research. The program offers recent PhD graduates the chance to help develop research tools, resources, and services while exploring new career opportunities.

Possible opportunity to observe change?

In summing up the conference Phil Bourne said there is an upcoming major opportunity point – both the European Bioinformatics Institute in EU and the National Library of Medicine in US will soon assume new leadership. They are receiving recommendations on what the institution of the future should look like.

The library has a tradition of supporting the community, being an infrastructure to maintain knowledge, and in the case of National Library of Medicine to set policy. If they are going to reinvent this institution we need to watch what it will look like in the future.

The future library (or whatever it will be called) should curate, catalog, preserve and disseminate the complete digital research lifecycle. This is something we need to move towards. The fact that there is an institution that might move towards this is very exciting.

The data challenge

Data was discussed at many points during the conference, with some data solutions/innovations showcased:

Harvard has the Harvard Dataverse Network – a repository to share data. “Data Management at Harvard” – Harvard Guidelines and Policies cranking up investment in managing data.
The Resource Identification Initiative is designed to help researchers sufficiently cite the key resources used to produce the scientific findings reported in the biomedical literature.
bioCADDIE is trying to do for data what PubMed Central has done for publications using a Data Discovery Index. The goal of this project is to engage a broad community of stakeholders in the development of a biomedical and healthCAre Data Discovery and Indexing Ecosystem (bioCADDIE).

The National Science Foundation data policy

Amy Friedlander spoke about The Long View. She posed some questions:

Must managing data be co-located with storing the data?
Who gets access to what and when?
Who and what can I trust?
What do we store it in? Where do we put things, where do they need to be?

The NSF doesn’t make a policy for each institution; it makes one NSF Data Sharing Policy that works more or less well across all disciplines. There is a diversity of sciences with heterogeneous research results, as well as a range of institutions, professional societies, stewardship institutions and publishers, and multiple funding streams.

There are two contact points – when the grant is awarded, and when the researchers report. If we focus on publications we can develop the architecture to extend to other kinds of research products. The internal systems should be integrated within the enterprise architecture to minimise the burden on investigators and program staff.

Take home message: The great future utopia (my word) is: We want to upload once to use many times. We want an environment in which all publications are linked to the underlying evidence (data), analytical tools and software.

New types of publishing

There were several publishing innovations showcased.

Journal Incubator

The University of Lethbridge has a ‘journal incubator’ which was developed with the goal of sustaining scholarly communication and open and digital access. It allows the incubator to train graduate students in the task of journal editorships so the journal can be provided completely free of charge.

F1000 Research Ltd – ‘living figures’

Research is an ongoing activity, but you wouldn’t think so from the way we publish, which is still very much built around the static print object. F1000 has the idea that data is embedded in the article, and has developed a tool that allows you to see what is on the article.

Many figures don’t need to exist – you need the underlying data. Hence living figures in the paper. Research labs can submit data directly on top of the figure – to see if it was reproducible or not. This provides interesting potential opportunities – bringing static research figures to life as a “living collection”. Different labs can have articles around that data. The tools and technologies are out there already.

Collabra – giving back to the community

A new University of California Press open access journal, Collabra, will share a proportion of the APC with researchers and reviewers. Of the $875 APC, $250 goes into the fund. Editors and reviewers get money into the fund, and there is a payout to the research community – they can decide what to do with it. Choices are to:

Receive it electronically
Pay it forward to pay APCs in future
Pay it forward to institution’s OA fund.

This is a journal where reviewers get paid – or can elect to pay themselves. See if everyone can benefit from the journal. No lock-in – benefit through partnerships.
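To make the split concrete, here is a rough back-of-the-envelope sketch using only the two figures quoted above ($875 APC, $250 into the fund). The function names and the article count of 100 are my own illustrative assumptions, not anything Collabra has published.

```python
from fractions import Fraction

APC_USD = 875         # article processing charge quoted above
FUND_SHARE_USD = 250  # portion of each APC that goes into the fund


def fund_fraction(apc: int = APC_USD, fund: int = FUND_SHARE_USD) -> Fraction:
    """Share of each APC that goes into the fund, as an exact fraction."""
    return Fraction(fund, apc)


def fund_total(n_articles: int, fund: int = FUND_SHARE_USD) -> int:
    """Total paid into the fund after n_articles (a hypothetical count)."""
    return n_articles * fund


if __name__ == "__main__":
    share = fund_fraction()
    print(f"Share of each APC going to the fund: {float(share):.1%}")
    # 100 articles is an illustrative number only
    print(f"Fund after 100 articles: ${fund_total(100):,}")
```

In other words, two sevenths of every APC (a little under 29%) would be returned to editors and reviewers to receive or pay forward.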

Future publishing – a single XML file

Rather than replicating old publishing processes electronically, the dream is to have one single XML file in the cloud. There is role-based access to modify the work (by editors, reviewers etc.), and the version that exists at the end is the one that gets published. Everything is in the XML, with automatic conversion at the end. References become completely structured XML at the click of a button, with tags shown in colour. Changes can be tracked. The journal editor gets a deep link pointing to whatever needs attention and can accept or reject it. The XML can be converted to a PDF with high-level typography, fully automatically.
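As a minimal sketch of what “one XML file with role-based access” might look like, assuming a very simplified model: the role names, the permission map, the element names and the functions below are all hypothetical, and this is not the system that was demonstrated at the conference.

```python
import xml.etree.ElementTree as ET

# Hypothetical role -> permitted actions map (an assumption for illustration).
PERMISSIONS = {
    "author": {"edit_body"},
    "reviewer": {"comment"},
    "editor": {"edit_body", "comment", "decide"},
}


def new_article(title: str, body: str) -> ET.Element:
    """Create the single XML document that every role works on."""
    article = ET.Element("article", status="draft")
    ET.SubElement(article, "title").text = title
    ET.SubElement(article, "body").text = body
    ET.SubElement(article, "comments")
    return article


def apply_action(article: ET.Element, role: str, action: str, text: str) -> None:
    """Apply an action to the shared document if the role permits it."""
    if action not in PERMISSIONS.get(role, set()):
        raise PermissionError(f"{role} may not {action}")
    if action == "edit_body":
        article.find("body").text = text
    elif action == "comment":
        ET.SubElement(article.find("comments"), "comment", role=role).text = text
    elif action == "decide":
        article.set("status", text)  # e.g. "accepted" or "rejected"


def publish(article: ET.Element) -> str:
    """Serialise the final version; only accepted articles are published."""
    if article.get("status") != "accepted":
        raise ValueError("only accepted articles can be published")
    return ET.tostring(article, encoding="unicode")


if __name__ == "__main__":
    doc = new_article("Living figures", "First draft of the text.")
    apply_action(doc, "reviewer", "comment", "Please check reference 3.")
    apply_action(doc, "editor", "decide", "accepted")
    print(publish(doc))
```

The point of the sketch is that there is only ever one document: authors, reviewers and editors all act on the same tree, and the published output is simply a conversion of its final state.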

Wider scholarly communication issues

This year is the 350th anniversary of the first scientific journal*, Philosophical Transactions of the Royal Society. Oxford holds a couple of copies of this journal and there was an excursion for those interested in seeing it.

It is a good time to look at the future.

Does reproducibility matter?

Something that was widely discussed was the question of whether research should be reproducible, which raised the following points:

The idea of a single well defined scientific method and thus an incremental, cumulative, scientific process is debatable.
Reproducibility and robustness are slightly different. Robustness of the data may be key.
There are no standards with a computational result that can ensure we have comparable experiments.

Possible solution?

Later in the conference a new service that tries to address the diversity of existing lab software was showcased – Riffyn. It is a cloud based software platform – a CAD for experiments. The researcher has a unified experimental view of all their processes and their data. Researchers can update it themselves – not reliant on IT staff.

Credit where credit is due

I found the reproducibility discussion very interesting, as was the discussion about authorship and attribution which posed the following:

If it is an acknowledgement system everyone should be on it
Authorship is a proxy for scientific responsibility. We are using the wrong word.
When crediting research we don’t make distinctions between contributions. Citation is not the problem, contribution is.
Which building blocks of a research project do we not give credit for? And which ones only get indirect credit? How many skills would we expect one person to have?
The problem with software credit is we are not acknowledging the contributors, so we are breaking the reward mechanism
Of researchers in research-intensive universities, 92% are using software. Of those 69% say their work would be impossible without software. Yet 71% of researchers have no formal software development training. We need standard research computer training.
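A quick worked calculation shows what the software figures imply when combined. This assumes, as the wording “of those” suggests, that the 69% is a share of the 92% who use software rather than of all researchers; the function name is my own. The 71% figure is left out because the text does not say how it overlaps with the other two.

```python
USES_SOFTWARE = 0.92       # researchers who use software
IMPOSSIBLE_WITHOUT = 0.69  # of those users, share whose work would be impossible without it


def dependent_share(uses: float = USES_SOFTWARE,
                    impossible_given_uses: float = IMPOSSIBLE_WITHOUT) -> float:
    """Share of ALL researchers whose work would be impossible without software,
    assuming the second figure is conditional on the first."""
    return uses * impossible_given_uses


if __name__ == "__main__":
    print(f"Share of all researchers dependent on software: {dependent_share():.1%}")
```

Under that reading, roughly 63% of all researchers in research-intensive universities could not do their work without software.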

Possible solutions

The Open Science Framework – provides documentation for the whole research process. This therefore determines how credit should be apportioned.
Project CRediT has come up with a taxonomy of terms. It proposes taking advantage of an infrastructure that already exists, using Mozilla OpenBadges – if you hear or see the word ‘badges’ think ‘Digital Credit’.

The impenetrability of scientific literature

Astrophysicist Chris Lintott discussed citizen science, specifically the phenomenally successful program Galaxy Zoo, which taps into a massive group of interested amateur astronomers to help classify galaxies in terms of their shape. This is something that humans do better than machines.

What was interesting was the challenge that Chris identified – amateur astronomers become ‘expert’ amateurs quickly and the system has built ways for them to communicate with each other and with scientists. The problem is that the astronomical literature is simply impenetrable to these (informed) citizens.

The scientific literature is the ‘threshold fear’ for the public. This raises some interesting questions about access – and the need for some form of moderator. One suggestion is some form of lay summary of the research paper – PLOS Medicine have an editor’s summary for papers. (Nature do this for some papers, and BMJ are pretty strong on this too).

Take home message – By designing a set of scholarly communication tools for citizen scientists we improve the communication for all scientists. This is an interesting way to think about how we want to browse scholarly papers as researchers ourselves.

*Yes I know that the French Journal des sçavans was published before this, but it was broader in focus, hence the qualifier ‘first scientific journal’.

Published 18 March 2015
Written by Dr Danny Kingsley

Unlocking Research

Open Research at Cambridge
