All posts by Office of Scholarly Communication

Changing roles and changing needs for academic librarians

The Office of Scholarly Communication (OSC) has joined the Centre for Evidence Based Library and Information Practice (C-EBLIP) Research Network, and as part of this commitment has prepared the following blog which is a literature review of papers published addressing the changing training needs for academic librarians. This work feeds into research currently being carried out by the OSC into the educational background of those working in scholarly communication. The piece concludes with a discussion of this research and potential next steps.

Changing roles

There is no doubt that libraries are experiencing another dramatic change as a result of developments in digital technologies. Twenty years ago in their paper addressing the education of library and information science professionals, Van House and Sutton note that “libraries are only one part of the information industry and for many segments of the society they are not the most important part”.

There is an argument that “as user habits take a digital turn, the library as place and public services in the form of reference, collection development and organisation of library resources for use, all have diminishing value to researchers”. Librarians need to adapt and move beyond these roles to one where they play a greater part in the research process.

To this end scholarly communication is becoming an increasingly established area in many academic libraries. New roles are being created and advertised in order to better support researchers as they face increasing pressure to share their work. Indeed a 2012 analysis into new activities and changing roles for health science librarians identified ‘Scholarly communications librarians’ as a new role for health sciences librarians based on job announcements whilst in their 2015 paper on scholarly communication coaching Todd, Brantley and Duffin argue that: “To successfully address the current needs of a forward-thinking faculty, the academic library needs to place scholarly communication competencies in the toolkit of every librarian who has a role interacting with subject faculty.”

Which skill sets are needed?

Much of the literature is in agreement about the specific skill set librarians need to work in scholarly communication. “Reskilling for Research”identified nine areas of skill which would have increasing importance including knowledge about data management and curation. Familiarity with data is an area mentioned repeatedly and acknowledged as something librarians will be familiar with. Mary Anne Kennan describes the concept as “the librarian with more” – traditional library skills with added knowledge of working with and manipulating data.

Many studies reported that generic skills were just as much, if not more so, in demand than discipline specific skills. A thorough knowledge of advocacy and outreach techniques is needed to spread the scholarly communication message to both library staff and researchers. Raju highlighted presentation skills for similar reasons in his 2014 paper.

The report “University Publishing in a Digital Age” further identified a need for library staff to better understand the publishing process and this is something that we have argued at the OSC in the past.

There is also a need to be cautious when demanding new skills. Bresnahan and Johnson (article pay-walled) caution against trying to become the mythical “unicorn librarian” – an individual who possesses every skill an employer could ever wish for. This is not realistic and is ultimately doomed to fail.

In their 2013 paper Jaguszewski and Williams instead advocate a team approach with members drawn from different backgrounds and able to bring a range of different skills to their roles. This was also the argument put forward by Dr Sarah Pittaway at the recent UKSG Forum where her paper addressed the issue of current library qualifications and their narrow focus

Training deficit

Existing library roles are being adapted to include explicit mention of areas such as Open Access whilst other roles are being created from scratch. This work provides a good fit for library staff but it can be challenging to develop the skills needed. As far back as 2008 it was noted that the curricula of most library schools only covered the basics of digital library management and little seems to have changed since with Van House and Sutton identifying barriers to “the ability of LIS educational programs to respond” to changing needs such as the need to produce well-rounded professionals.

Most people working in this area learn their skills on the job, often from more experienced colleagues. Kennan’s study notes that formal education could help to fill the knowledge gap whilst others look to more hands-on training as this helps to embed knowledge.

The question then becomes should the profession as a whole be doing more to prepare their new recruits for the career path of the 21st century academic librarian? This is something we have been asking ourselves in Office of Scholarly Communication (OSC) at Cambridge. Since the OSC was established at the start of 2015 it has made a concerted effort to educate staff at the one hundred plus libraries in Cambridge through both formal training programmes and targeted advocacy. However we are aware that there is still more to be done. We have begun by distributing a survey to investigate the educational background of those who work in scholarly communications. The survey was popular with over five hundred responses and many offers of follow up interviews which means that we have found an area of interest amongst the profession. We will be analysing the results of the survey in the New Year with a view to sharing them more widely and further participating in the scholarly communication process ourselves.

Conclusion

Wherever the skills gaps are there is no doubt that the training needs of academic librarians are changing. The OSC survey will provide insight into whether these needs are currently being met and give evidence for future developments but there is still work to be done. Hopefully this project will be the start of changes to the way academic library staff are trained which will benefit the future of the profession as a whole.

Published 29 November 2016
Written by Claire Sewell and Dr Danny Kingsley

Creative Commons License

Are we achieving our OA goals?

This post was written for Hindawi for Open Access Week and published by them on 28 October. It is reposted here.

Recently I spent a day in two consecutive weeks travelling to London to meet with colleagues to discuss the implementation of the Wellcome Trust (COAF) and RCUKOpen Access policies. In both cases the discussions were centred on compliance with their policies.

Certainly it makes sense that a funder should ensure that its policies are being implemented properly. But this focus on compliance raises the more fundamental question about whether we are actually achieving the underlying goal of these policies – which is to open up access to UK research so more people can access, read and use this work.

After all, having huge swathes of research in repositories under embargoes* or spending literally millions of pounds annually to make particular articles in subscription journals available open access is not in itself the end goal.

We should be taking stock. Have the past three and a half years of the RCUK and over a decade of the Wellcome Trust policies meant our researchers are more engaged in open access? Has there been a movement by publishers towards flipping their journals? Indeed, is UK research being read and used more now? These are very pertinent questions that simply do not appear to be discussed at the moment.

*Cambridge has managed to address this issue by providing a Request a Copy button – see here.

Big bucks

There is a lot of money in this ecosystem. Cambridge University has been allocated £1,269,318.59 by RCUK in the 2016-2017 year, and have a £403,138 underspend which will be directed to this year’s Open Access activities. In addition Wellcome Trust have allocated us £902,243.

So Cambridge University has £2,574,699.59 allocated by funders to pay for Open Access APCs and related staff and systems costs (we recently made all of our expenditure available). Cambridge University spends about £4.8 million annually on subscriptions, so the cost for Open Access at our institution is over half of our subscription cost.

These are serious amounts of money. Surely it is a good idea to ask whether this process is actually achieving what it set out to do.

So what has actually happened?

Embargo changes

The RCUK Open Access policy has allowances for green Open Access with a sufficient embargo period and the decision tree at the Office of Scholarly Communication reflects the actual wording and rules of the policy – that is choose green options if you can. However the emphasis of RCUK is decidedly towards gold Open Access – see their decision tree which is actually slightly misleading.

So when the RCUK announces a policy where cash for article processing charges will flow to publishers dependent on embargo periods, what happens? The embargo periods lengthen.

According to a study published this year “What does ‘green’ open access mean? Tracking twelve years of changes to journal publisher self-archiving policies” (Open Access version here) there is “a clear link between the introduction of Gold open access and the increasing restrictions around Green open access”. The study also includes a graph mapping embargo periods over time which shows a very clear and defined ‘Finch effect’.

This was entirely predictable. When the RCUK Open Access policy was announced in response to the Finch Report I wrote (in my previous role) “Clearly it is advantageous for journals to offer a hybrid option and to extend their embargo periods in response to this policy.” And they did.

Springer and Emerald both extended their embargoes beyond the RCUK limits. (Of course Springer has since redeemed itself by experimenting with new business models).

Those embargo extensions were particularly galling at the time for me because they were worldwide and affected everyone – including in Australia, where I was based. Other publishers have responded to the RCUK rules by creating particular embargoes for UK authors. Elsevier is a clear example.

Institutional pressure

About the time the RCUK policy came into force I wrote about the difficulty of anyone staying up to speed on copyright agreements. Since then it has got worse. At Cambridge we do not expect our researchers to try and wade through this – we provide a service to help them. But this means staff and that costs money.

The pressure on research institutions to manage the UK Open Access policies is significant. Analyses of the total cost of publication (Open Access version here) associated with the administration of making research open access show a huge staffing load. The cost of processing a gold Open Access article was shown to be 2.5 times that for the processes of making an article available in a repository.

The RCUK do allow some of their block grant to be spent on staffing and infrastructure. At Cambridge we have reported that we spent 4.6% of the year’s allocation on staff costs and 5.1% on systems support. The general understanding is that RCUK don’t want the total spend on these costs to be more than 10% of the grant and it appears some institutions have spent more than this in previous years.

This highlights the overall lack of funding for support costs for managing Open Access. There are no specific funds for managing the HEFCE Open Access policy, or the COAF policies. While both the Wellcome Trust and HEFCE provide considerable funds to UK institutions for research, this is not directed to the Libraries. Certainly at Cambridge there is a robust process required to argue for funds to support these types of activities.

The 2012 Finch Report talked about a “transition to open access” and acknowledged that this will mean additional costs. Certainly the funders have channelled significantly more funds to publishers through the institutional block grants, and those institutions are having to channel internal resources to support the staff supporting the policies.

But the Finch Report also mentioned “seeking efficiency savings and other reductions in costs from publishers and other intermediaries”. It is safe to say that this has yet to actually occur.

Taking stock

So, more than four years on from the Finch Report, are we any closer to full Open Access? The answer is yes in the UK – because we have poured millions of extra (taxpayers’) pounds into the system. But if the RCUK policy were to end tomorrow, would the publishing landscape be any different? Has any other country in the world followed this model?

And are the Open Access policies achieving their end goal? Is UK research more visible in the world now? Are people actually finding these articles? Is it being read more?

Is anyone even asking these questions? Who is monitoring this? If we don’t ask and measure these parameters we will never know.

What we do know is we have extended embargo periods, forcing funded researchers down the gold Open Access path, which is more expensive to process in terms of staff time. We have spent millions, the majority of which is spent in hybrid journals – which is itself another issue. And there is little if any evidence that publishers are moving towards fully Open Access models.

A glimmer

Unfortunately the discussions held recently about the Wellcome Trust and RCUK policies were solely focused on compliance. This has become the narrative in the Open Access space in the UK and does nothing to help ‘sell’ the idea of Open Access.

Indeed it would be hugely helpful if there were communication about the underlying goals of these policies and whether they are being met. But the lack of monitoring of these goals means we have nothing to say. We can’t communicate what we don’t know about.

There is some hope. At least one publisher is interested in whether this is making a difference. At the Frankfurt Book Fair last week I attended a discussion of the German Serials Interest Group where a colleague from Springer said that Springer is assessing the success or otherwise of the Springer Compact. They had specifically compared the readership of Open Access articles against subscription only articles. According to this work, the percentage of non-institutional affiliated people reading the Open Access articles was dramatically higher than the subscription-only.

This type of information is hugely valuable to Open Access advocates, and I am hoping that Springer will release these findings publicly.

The team at the Office of Scholarly Communication strongly believe that all Cambridge research should be available, and we are working hard towards that goal (recently celebrating 10,000 submissions to the repository). It would help us enormously if we could offer evidence to our community of the value and benefits of this effort.

Published 3 November 2016
Written by Dr Danny Kingsley

Creative Commons License

Mission Open Access: the Apollo repository launches

IMG_2298To celebrate Open Access Week 2016, the Office of Scholarly Communication (OSC) officially launched ‘Apollo’, the University of Cambridge’s upgraded open access repository.

Researchers, University research staff and librarians gathered at the University’s Engineering Department to see a demonstration of the new features of Apollo, speak to some of the University’s Open Access Champions and raise a glass to launch the service.

The repository stores a range of content and provides different levels of access, but its primary focus is on providing open access to the University’s research publications.  Apollo forms an important part of the University’s provision for meeting research funder requirements for open access, enabling ‘Green’ access to publications.  The launch of the upgrade comes at an exciting time for the Office of Scholarly Communication, as the repository has recently received its 10,000th upload.

The Cambridge University Office of Scholarly Communication looks after all aspects of scholarly communication within the University. This ranges across the entire research lifecycle from searching for information and collaborators, through to authoring and copyright issues and finally the publication and dissemination process, leading into assessment. The OSC has responsibility for the open access and open data programs at the University in terms of compliance with funders’ policies, and delivers and manages the University’s digital repository, Apollo.

Cambridge University was one of a handful of ‘testbed ‘ institutions that participated in the early deployment and development of DSpace, and has been running a DSpace repository for over a decade. Over that time, Apollo has participated in a number of externally funded projects intended to better understand researcher requirements or improve the services it offers. These include: Incremental, DataTrain and PrePARe, which developed resources to support research data management and EPIC and Keeping Research Data Safe (KRDS), which focused on the repository’s preservation services.

IMG_2297Upgraded features

With the support of RCUK, the OSC have spent £43,000 to upgrade the repository. Cambridge is now leading the country by running DSpace Version 5.4, the most recent and most stable version of the application. This has given Apollo a modern and improved user-friendly interface.

Since the upgrade in May 2016, the repository has had close to 2 million views from actual people (not machines!)

The upgrade means we can now increase the services offered by the repository.  Digital Object Identifiers, or DOIs, can be minted in-house. The Open Access team has minted over 6000 DOIs since May for articles, theses, datasets and other research outputs.

In addition, people identifiers – Author ORCIDs – are now displayed in the repository. The repository is interoperable with other systems and sends ORCIDs  to Datacite, which might allow repository items to be automatically populated into Authors’ ORCID profiles in the future.

Perhaps the most exciting integration is with the University’s publication management system Symplectic, allowing for easier reporting of Open Access compliance.

Request a Copy

Part of the upgrade involved the introduction of a new feature called ‘Request a Copy,  designed to open up the University’s most current research to a wider audience.  ‘Request a Copy’ operates on the principle of peer-to-peer sharing – if an item in Apollo is not yet available to the public, a repository user can ask the author for a copy of the item.  Authors sharing copies of their work on an individual basis falls outside the publisher’s copyright restrictions; here, the repository is acting as a facilitator to a process which happens anyway.

The Request a Copy button has been much more successful than we anticipated, particularly because there is no actual ‘button’.  By the end of September 2016 (four months after the introduction of ‘Request a copy’), we had received 1120 requests (approximately 280 requests per month), with two thirds for articles. Apart from a small number of requests for datasets, the remaining third were for theses.

Of the requests for articles during this period, 38% were fulfilled by the author sending a copy via the repository, and 4% were rejected by clicking the ‘Don’t send a copy’ button.

Of the articles requested during this period 45% were yet to be published.  The large number of requests made prior to publication indicates the value of having a policy where articles are submitted to the repository on acceptance rather than publication – there is clearly interest in quickly accessing this research, rather than waiting for publication.

Open Access Week

The Apollo launch was the closing event of Open Access Week at the OSC.  Established by SPARC and partners in the student community in 2008, International Open Access Week is an opportunity to take action in making openness the default for research—to raise the visibility of scholarship, accelerate research, and turn breakthroughs into better lives.  The OSC also released a daily programme of announcements, blog posts and live-streamed events, which are spotlighted on the OA Week webpage, and celebrated this year’s theme of ‘Open in Action’.

Stay in touch with news from the OSC through the monthly newsletter

Published 28 October 2016
Written by Hannah Haines

Creative Commons License

Are academic librarians getting the training they need?

As part of Open Access Week 2016, the Office of Scholarly Communication is publishing a series of blog posts on open access and open research. In this post Claire Sewell looks at the training of library staff in areas relating to scholarly communication.

The problem

Few people would deny that the world of the academic library is changing. Users are becoming more and more sophisticated in their information gathering techniques and the role of the academic librarian needs to adapt accordingly or risk being left behind. Librarians are changing from the traditional gatekeeper role to one which helps their research community to disseminate the outputs of their work.

This shift offers academic library staff new opportunities to move into research support roles. An increasing number of libraries are establishing scholarly communication departments and advertising for associated roles such as Repository Managers and Data Specialists.  It’s also becoming common to see more traditional academic library roles advertised asking for at least a working knowledge of areas such as Open Access and Research Data Management.

This is an issue that we have been considering in the Office of Scholarly Communication for a while. My role as Research Skills Coordinator involves up-skilling Cambridge library staff in these areas so I’m more aware than most that it is a full time job. But what happens to those who don’t have this type of opportunity through their work? How do they find out about these areas which will be so relevant to their future careers?

For many new professionals studying is their main chance to get a solid grounding in the information world but with the profession undergoing such rapid change is the education received via these degrees suitable for working in 21st century academic libraries? This is a question that has been raised many times in the profession in recent years so it’s time to dig a bit deeper.

Hypothesis

Our hypothesis is simple: there is a systematic lack of education on scholarly communication issues available to those entering the library profession. This is creating a time bomb skills gap in the academic library profession and unless action is taken we may well end up with a workforce not suited to work in the 21st century research library.

In order to test this hypothesis we have designed a survey aimed at those currently working in scholarly communication and associated areas. We hope that asking questions about the educational background of these workers we can work to determine the suitability of the library and information science qualification for these types of role into the future and how problems might be best addressed.

After a process of testing and reworking, our survey was launched to the scholarly communication community on October 11th 2016. In less than 24 hours there were over 300 responses, clearly indicating that the subject had touched a nerve for people working in the sector. (And thank you to those who have taken the time to respond).

Preliminary findings

We were pleased to see that even without prompting from the survey, respondents were picking up on many of the issues we wanted to address. For example, the original focus of the survey was the library and information science qualification and its impact on those working in scholarly communication.

When we piloted the survey with members of our own team we realised how diverse their backgrounds were and so widened the survey to target those who didn’t hold an LIS qualification but worked in this area. This has already given us valuable information about the impact that different educational backgrounds have on scholarly communication departments and has gained positive feedback from survey respondents.

Many of the respondents talk of developing the skills they use daily ‘on the job’. Whilst library and information professionals are heavily involved in lifelong learning and it’s natural for skills to develop as new areas emerge, the formal education new professionals receive also needs to keep pace. If even recent graduates have to develop the majority of skills needed for these roles whilst they work this paints a worrying picture of the education they are undertaking.

The survey responses have also raised the issue of which skills employers are really looking for in library course graduates and how these are provided. Respondents highlighted a range of skills that they needed in their roles – far more than were included in the original survey questions. This opens up discussions about the vastly differing nature of jobs within scholarly communication and how best to develop the skill set needed.

A final issue highlighted in the responses received so far is that a significant number of people working in scholarly communication roles come from outside the library sector. Of course this has benefits as they bring with them very valuable skills but importing knowledge in this way may also be contributing to a widening skills gap for information professionals that needs to be addressed.

Next steps

The first task at the end of the collection period (you have until 5pm BST Monday 31 October) will be to analyse the results and share them with the wider scholarly communication community. There are plans for a blog post, journal article and conference presentations. We will also be sharing the anonymised data via the Cambridge repository.

Following that our next steps depend largely on the responses we receive from the survey. We have begun the process of reaching out to other groups who may be interested in similar issues around professional education to see if we can work together to address some of the problems. None of this will happen overnight but we hope that by taking these initial steps we can work to create academic libraries geared towards serving the researchers of the 21st century.

One thing that the survey has done already is raise a lot of interesting questions which could form the basis of further research. It shows that there is scope to keep exploring this topic and help to make sure that library and information science graduates are well equipped to work in the 21st century academic library.

Published 27 October 2016
Written by Claire Sewell
Creative Commons License

Theses – releasing an untapped resource

As part of Open Access Week 2016, the Office of Scholarly Communication is publishing a series of blog posts on open access and open research. In this post Dr Matthias Ammon looks at theses and their use.

It may sound obvious, but PhD theses are a huge reservoir of original research content, given that each thesis represents at least three or four years’ focussed engagement with a specialised research topic. Traditionally, however, the results of this work have not been easily accessible.

A print copy of the approved thesis would be deposited in the library of the university where the PhD was undertaken so that access was mainly restricted to other members of that university. Interested readers have to travel to visit the library or rely on frequently costly interlibrary loans. While some of the research contained in theses would be published in articles or monographs, this still means that an enormous amount of research was and is effectively locked away.

Increasing access

With the changes in technology in recent decades allied with the rise of Open Access and institutional repositories, the accessibility of PhD theses in general has improved. In Australia, the Australian Digital Theses program began in 1998, expanding to the Australasian Digital Theses program in 2005. This used VT-ETD software to host digital theses at individual institutions which were collated to one search engine. The ADT website, a central metadata repository, was hosted at the University of New South Wales. This was decommissioned in 2011 as theses were migrated to their various institutional repositories. All Australian theses are now findable in Trove, the National Library of Australia’s Trove service. There are 334, 000 theses listed in Trove of which over 119,000 are available online.

A significant number of UK universities now require the deposit of a digital copy of a thesis in the university’s repository as a condition for awarding the PhD degree. Usually this entails making the thesis openly available although embargoes may be placed for reasons of confidentiality or commercial concerns. In addition, PhD students funded by any of the UK research councils under the RCUK Training Grant are required to make their theses available Open Access.

Although it is not yet mandatory at the University of Cambridge for PhD students to provide a digital copy of their thesis, students can voluntarily upload their approved dissertations to the institutional repository, Apollo. Approximately one in 10 PhD students do so. In the next couple of weeks, the Office of Scholarly Communication is embarking on a pilot for the systematic submission of digital theses with selected departments.

Finding theses

There are national and international repositories that aggregate access to PhD theses, such as the British Library’s EThOS (for the UK) or DART-Europe (for European universities), making it easier for interested researchers to find relevant material without having to trawl through individual repositories.

Open Access Theses and Dissertations aims to be the best possible resource for finding open access graduate theses and dissertations published around the world. Metadata (information about the theses) comes from over 1100 colleges, universities, and research institutions. OATD currently indexes 3,422,634 theses and dissertations.

NDLTD, the Networked Digital Library of Theses and Dissertations provides information and a search engine for electronic theses and dissertations (ETDs), whether they are open access or not. The service also provides ‘Guidance Briefs’ on topics such as Copyright and Preserving and Curating ETD Research Data and Complex Digital Objects.

Proquest Theses and Dissertations (PQDT) is a database of dissertations and theses published digitally or in print. Note these are made available for a fee that does not benefit the author. [In September 2017 ProQuest contacted us to say they do pay royalties. Their policy is here.] In addition access to PQDT may be limited depending on local library licensing arrangements.

Looking to the past

So while it is looking likely that most future PhD theses will be available online (either freely or requestable), what about the vast number of PhD theses written up to this point? For context, Cambridge alone holds over 40,000 printed theses, with approximately 1100 being added every year. Approximately 2,000 of these have been digitised at the request of individuals wishing to have access to the theses.

Last year we ran an ‘Unlocking Theses’ project to increase the number of Open Access theses in the repository, which stood at about 600 at the beginning of 2015. The Library also held over 1200 scanned theses on an internal server. The Unlocking Theses project added all of these scanned theses held by the Library into the University repository. The Development and Alumni Office were able to provide contact details for just over 600 of these authors. The majority of these authors have now been contacted and we have had a 35% positive response rate from them.

As of today we hold 2257 theses in the repository of which half are Open Access. The remaining theses are currently held in a Restricted Theses Collection but the biographical information about these theses is searchable. Approximately one third of requests we have from our Request a Copy service is for these theses. In addition some authors have found their restricted thesis online and requested we open access to it.

Cambridge is currently working with the British Library to digitise some of the 14,000 Cambridge theses they hold on microfilm. Our finances do not stretch to the whole corpus, so we have decided to digitise ten percent. This has meant a process to determine which theses we choose to have digitised. Considerations have included the quality of digitisation from microfilm for typeset versus typewritten theses (and indeed whether the thesis is printed single or double sided because of shadowing). We have also chosen theses on the basis of those disciplines are highly requested from our Digital Content Unit. This has proved to be challenging, not least because of the difficulty of determining disciplines of theses from our library catalogue.

We are hoping to upload these theses to the repository towards the end of the year, and with the addition of several hundred theses that have been digitised this year from the Digital Content Unit will double the number of theses we hold in the repository.

Considerations

There are several issues that need to be considered before theses can be made available openly. The first concerns third party copyright, that is to say the inclusion of quotations, images, photographs or other material that does not represent original work on behalf of the thesis author but has been taken from previously published work. There is generally no problem with including such material in the copy of the thesis submitted for examination and the print version deposited in the University library, but making the thesis freely available online constitutes a change of use and requires separate permissions. This is a problem that applies to both current and older theses and requires checks on behalf of the author and possibly the library.

Another issue related to copyright is the author’s permission to make the thesis available which is necessary because the author retains the copyright for his work. For current theses, this permission can be incorporated into the submission process, either as part of the requirement for the PhD or by the author signing an agreement when the thesis is voluntarily uploaded.

However, it is not so easy to obtain permission for retrospective digitisation as we discovered during our Unlocking Theses project. The contact details of alumni are not always known and in cases where the original author is deceased it may be challenging to establish the copyright holder, making it difficult to obtain an explicit ‘opt-in’ permission. Finally, there are financial considerations as the digitisation of large number of theses requires a significant outlay for staff, equipment and administrative costs.

Big projects

In recent years, a number of universities have undertaken large-scale digitisation projects of their holdings of PhD theses and have dealt with the permission issue in different ways.

The experience of these UK universities also appears to indicate that alumni are for the most part happy to see their theses made openly available. If more institutions follow suit and dedicate funding to opening up the research undertaken by generations of students this large reservoir of research will no longer remain untapped.

There are other challenges related to digital theses that still remain to be solved, such as the problem of linking theses to their associated data and the question of persistent identifiers to seamlessly integrate the output of both individual researchers and institutions. In the future, consideration should be given to non-text or multimedia PhDs, as was debated at a recent panel discussion at the British Library.

For now though, opening up access to decades’ or even centuries’ worth of scholarship sitting on university library shelves in the form of physical copies of PhD theses sounds like a good start.

Published 26 October 2016
Written by Dr Matthias Ammon and Dr Danny Kingsley
Creative Commons License

Walking the talk- reflections on working ‘openly’

As part of Open Access Week 2016, the Office of Scholarly Communication is publishing a series of blog posts on open access and open research. In this post Dr Lauren Cadwallader discusses her experience of researching openly.

Earlier this year I was awarded the first Altmetric.com Annual Research grant to carry out a proof-of-concept study looking at using altmetrics as a way of identifying journal articles that eventually get included into a policy document. As part of the grant condition I am required to share this work openly. “No problem!” I thought, “My job is all about being open. I know exactly what to do.”

However, it’s been several years since I last carried out an academic research project and my previous work was carried out with no idea of the concept of open research (although I’m now sharing lots of it here!). Throughout my project I kept a diary documenting my reflections on being open (and researching in general) – mainly the mistakes I made along the way and the lessons I learnt. This blog summarises those lessons.

To begin at the beginning

I carried out a PhD at Cambridge not really aware of scholarly best practice. The Office of Scholarly Communication didn’t exist. There wasn’t anyone to tell me that I should share my data. My funder didn’t have any open research related policies. So I didn’t share because I didn’t know I could, or should, or why I would want to.

I recently attended The Data Dialogue conference and was inspired to hear many of the talks about open data but also realised that although I know some of the pitfalls researchers fall into I don’t quite feel equipped to carry out a project and have perfectly open and transparent methods and data at the end. Of course, if I’d been smart enough to attend an RDM workshop before starting my project I wouldn’t feel like this!

My PhD supervisor and the fieldwork I carried out had instilled in me some practices that are useful to carrying out open research:.

Lesson #1. Never touch your raw data files

This is something I learnt from my PhD and found easy to apply here. Altmetric.com sent me the data I requested for my project and I immediately saved it as the raw file and saved another version as my working file. That made it easy when I came to share my files in the repository as I could include the raw and edited data. Big tick for being open.

Getting dirty with the data

Lesson #2. Record everything you do

Another thing I was told to do during my PhD lab work was to record everything you do. And that is all well and good in the lab or the field but what about when you are playing with your data? I found I started cleaning up the spreadsheet Altmetric.com sent and I went from having 36 columns to just 12 but I hadn’t documented my reasons for excluding large swathes of data. So I took a step back and filled out my project notebook explaining my rationale. Documenting every decision at the time felt a little bit like overkill but if I need to articulate my decisions for excluding data from my analysis in the future (e.g. during peer review) then it would be helpful to know what I based my reasoning on.

Lesson #3. Date things. Actually, date everything

I’d been typing up my notes about why some data is excluded and others not so it informs my final data selection and I’d noticed that I’d been making decisions and notes as I go along but not recording when. If I’m trying to unpick my logic at a later date it is helpful if I know when I made a decision. Which decision came first? Did I have all my ‘bright ideas’ on the same day and now the reason they don’t look so bright is was because I was sleep deprived (or hungover in the case of my student days) and not thinking straight. Recording dates is actually another trick I learnt as a student – data errors can be picked up as lab or fieldwork errors if you can work back and see what you did when – but have forgotten to apply thus far. In fact, it was only at this point that I began dating my diary entries…

Lesson #4. A tidy desk(top) is a tidy mind

Screen Shot 2016-10-24 at 13.21.11I was working on this project just one day a week over the summer so every week I was having to refresh my mind as to where I stopped the week before and what my plans were that week. I was, of course, now making copious notes about my plans and dating decisions so this was relatively easy. However, upon returning from a week’s holiday, I opened my data files folder and was greeted by 10 different spreadsheets and a few other files. It took me a few moments to work out which files I needed to work on, which made me realise I needed to do some housekeeping.

Aside from making life easier now, it will make the final write up and sharing easier if I can find things and find the correct version. So I went from messy computer to tidy computer and could get back to concentrating on my analysis rather than worrying if I was looking at the right spreadsheet.

 

Lesson #5. Version control

One morning I had been working on my data adding in information from other sources and everything was going swimmingly when I realised that I hadn’t included all of my columns in my filters and now my data was all messed up. To avoid weeping in my office I went for a cup of tea and a biscuit.

Upon returning to my desk I crossed my fingers and managed to recover an earlier version of my spreadsheet using a handy tip I’d found online. Phew! I then repeated my morning’s work. Sigh. But at least my data was once again correct. Instead of relying on handy tips discovered by frantic Googling, just use version control. Archive your files periodically and start working on a new version. Tea and biscuits cannot solve everything.

Getting it into the Open

After a couple more weeks of problem free analysis it was time to present my work as a poster at the 3:AM Altmetrics conference. I’ve made posters before so that was easy. It then dawned on me at about 3pm the day I needed to finish the poster that perhaps I should share a link to my data. Cue a brief episode of swearing before realising I sit 15ft away from our Research Data Advisor and she would help me out! After filling out the data upload form for our institutional repository to get a placeholder record and therefore DOI for my data, I set to work making my spreadsheet presentable.

Lesson #6. Making your data presentable can be hard work if you are not prepared

I only have a small data set but it took me a lot longer than I thought it would to make it sharable. Part of me was tempted just to share the very basic data I was using (the raw file from Altmetric.com plus some extra information I had added) but that is not being open to reproducibility. People need to be able to see my workings so I persevered.

I’d labelled the individual sheets and the columns within those sheets in a way that was intelligible to me but not necessarily to other people so they all needed renaming. Then I had to tidy up all the little notes I’d made in cells and put those into a Read Me file to explain some things. And then I had to actually write the Read Me file and work out the best format for it (a neutral text file or pdf is best).

I thought I was finished but as our Research Data Advisor pointed out, my spreadsheets were returning a lot of errors because of the formula I was using (it was taking issue with me asking it to divide something by 0) and that I should share one file that included the formulae and one with just the numbers.

If I’d had time, I would have gone for a cup of tea and a biscuit to avoid weeping in the office but I didn’t have time for tea or weeping. Actually producing a spreadsheet without formulae turned out to be simple once I’d Googled how to do it and then my data files were complete. All I then needed to do was send them to the Data team and upload a pdf of my poster to the repository. Job done! Time to head to the airport for the conference!

Lesson #7. Making your work open is very satisfying.

Just over three weeks have passed since the conference and I’m amazed that already my poster has been viewed on the repository 84 times and my data has been viewed 153 times! Wowzers! That truly is very satisfying and makes me feel that all the effort and emergency cups of tea were worth it. As this was a proof-of-concept study I would be very happy for someone to use my work, although I am planning to keep working on it. Seeing the usage stats of my work and knowing that I have made it open to the best of my ability is really encouraging for the future of this type of research. And of course, when I write these results up with publication in mind it will be as an open access publication.

But first, it’s time for a nice relaxed cup of tea.

Published 25 October 2016
Written by Dr Lauren Cadwallader
Creative Commons License

Who is paying for hybrid?

In our related blog ‘Hybrid Open Access  – an analysis‘ we explored the origins and issues with hybrid open access. Here we describe what funders are allowing or not in relation to payments for hybrid Open Access APCs.

Funding agencies and hybrid

Of the 179 Open Access funds listed in the Open Access Directory, 99 (55%) do not allow hybrid publishing; 78 (44%) do, or do not specify. The two remaining funds (1%) allow hybrid but either discourage it or require that the publisher have an offsetting scheme in place. This shows a strong move away from hybrid since 2014, when only 39% of funds rejected hybrid – a rejection of hybrid is now the majority position.

What’s more, these anti-hybrid funders now include some major organisations, particularly in Europe. The EU FP7 post-grant pilot, for example, is only open to authors publishing in fully Open Access journals, and the Netherlands Organization for Scientific Research (NWO) has considered hybrid ineligible for funds since December 2015.

According to a news story in Nature in January this year, the Norwegian Research Council and the German Research Foundation both pay Open Access fees for researchers but do not permit the payment of hybrid costs. The Austrian Science Fund has capped Open Access payments at a certain level; if researchers want to publish in more expensive journals (often the hybrids), they must find the extra cash themselves.

In 2013 Science Europe declared in a position statement that:

The Science Europe member organisations […] stress that the hybrid model, as currently defined and implemented by publishers, is not a working and viable pathway to Open Access. Any model for transition to Open Access supported by Science Europe member organisations must prevent ‘double dipping’ and increase cost transparency.

UK funders’ position on hybrid

The Wellcome Trust, while not yet abandoning hybrid entirely, voiced considerable wariness in its 2014-15 report, and has warned that stricter action will follow if there is not an improvement in publisher behaviour:

We believe declaring that Wellcome funds cannot be used to pay for hybrid OA is too blunt an instrument, unfairly penalising those publishers which provide a good service at a reasonable price, and that it would slow down the transition to a fully OA world – the position we ultimately want to get to.

However, doing nothing is no longer a valid option.  If hybrid publishers are unable to commit to the Wellcome Trust’s set of requirements and do not significantly improve the quality of the service, then classifying those hybrid journals as “non-compliant” will be an inevitable next step.

In 2015 RCUK published an independent review into the implementation of their Open Access policy which, while notably less combative on the issue of hybrid, nevertheless noted the expensiveness of the option and suggested potential future action:

The panel noted that average APCs for articles published in hybrid journals were consistently more expensive than in fully open access journals (despite the fact that hybrid journals still enjoyed a revenue stream through subscriptions). The panel recommends that RCUK continues to monitor this and if these costs show no sign of being responsive to market forces, then a future review should explore what steps RCUK could take to make this market more effective.

In the Universities UK Open Access Coordination Group’s report “Open access to research publications – Independent advice” the author, Professor Adam Tickell noted:

An alternative approach would be to consider whether funding Gold Open Access in Hybrid Journals where there are no equivalent offsets in subscription costs is a good use of public funds. During the course of working on this report, I met with the Publishers Association and Elsevier and I do not believe that the major publishers would find this slight change of course challenging.

Library funds and hybrid

In January this year the Canadian Association of Research Libraries (CARL) published Library Open Access Funds in Canada: review and recommendations. Amongst the summary of fund management recommendations was  ‘do not  fund hybrid journals‘.

SPARC maintains an Open Access Campus Funds page, which provides advice. The document “Campus-based open-access publishing funds: a practical guide to design and implementation” contains a whole section on deciding whether to support hybrid, noting “Many institutions that have functioning Open-access Funds have indicated that the toughest decision they made concerned hybrid journal eligibility”.

US library-run funds

Zuniga, H. & Hoffecker, L. (2016). Managing an Open Access Fund: Tips from the Trenches and Questions for the Future. Journal of Copyright in Education and Librarianship, 1(1), 1-13 discusses the thinking behind a library-run Open Access fund at University of Colorado Health Sciences Library and specifies that funding will only be available for fully Open Access journals and not hybrid ones.

A recent discussion on one of the lists (which is dominated by American institutions) about library funds for open access revealed the very strong preference to support only fully Open Access journals. Of the responses from the US libraries, nine funds did not support hybrid and two did under particular circumstances. The US is not subject to the gold Open Access policies that the UK is:

  • University of Rhode Island only supports “articles published in fully open access, peer-reviewed scholarly journals” that are listed in the DOAJ with its Open Access Fund
  • Texas A&M University Libraries’ Open Access to Knowledge Fund (OAKFund) notes “”Hybrid” Open Access publication venues and publication venues with delayed Open Access models are ineligible.”
  • The University of Pittsburgh’s Open Access Fee Author Policy states “Journals with a hybrid open-access model or delayed open-access model are not eligible.”
  • The One University Open Access (OA) AuthorFund at the University of Kansas supports only publication in “an entirely open access journal. Journals with a hybrid open-access model or delayed open-access model are not eligible”. A definition of hybrid journal is provided. – 2015 article in JLSC “Campus Open Access Funds: experiences of the KU “One University” Open access author fund”.
  • Cornell University’s Open Access Publication Fund does not mention hybrid specifically but the wording implies the fund supports only fully Open Access journals, noting “Since open access publishers do not charge subscription or other access fees, they must cover their operating expenses through other sources.”
  • Concordia University’s Open Access Author Fund states “the article must be published in a fully open access journal. Traditional subscription-based or ‘hybrid’ journals that offer an open access option for a fee are not eligible.”
  • University of Oklahoma’s Open Access (OA) Subvention Fund Policy refers to “true open access journals”, noting “Articles with a hybrid or delayed OA model are not eligible through this fund”
  • The information about University of California San Francisco’s Open Access Publishing Fund includes a section about why it does not support hybrid
  • Northwestern University’s Open Access Fund describes an acceptable open access journal as a “journal published in a fully open access format based on a published schedule of article processing fees”

That said, there were a couple that are considering support for hybrid:

  • Wayne State University’s Scholars Cooperative Open Access Fund states “Hybrid open access arrangements (“paid open access” or “open choice”) may be considered on a case-by-case basis”.
  • Wake Forest University Open Access Fund does support hybrid, but the cost for all open access is split three ways between the Library the Research Office and the author.
UK library-run funds

In November last year the UCL, Newcastle and Nottingham Universities published the results of a survey with Jisc: “Institutional policies on the use of Open Access Funds“. The report noted that of the respondents 18 institutions in the UK had a central institutional fund (not provided by RCUK/COAF). The report noted there were different approaches to using these central funds. At the time four institutions paid for papers in fully Open Access journals only; four paid for papers in both fully Open Access and hybrid journals, without encouraging authors in favour of Green or Gold; and five institutions encourage authors to choose Green where possible.

In response to a list query in October 2016 (which is not a comprehensive survey by any means), there was a mixture of arrangements in the UK library-run funds. Four funds did not support hybrid, four did, and there were three that supported them in particular circumstances.

Some UK funds are primarily non-hybrid with a small number of exceptions.

  • University College London has a fund which provides limited funds “for other UCL corresponding authors who are full (not honorary or visiting) members of staff or students where the funder does not cover open access charges”. This fund generally only pays for papers in fully OA journals. When it comes to hybrids the policy is very much to recommend Green, but the fund does occasionally pay for papers in hybrid journals “where the author makes a case for it”.
  • The University of Bath has a Bath open access fund  for journals that operate “a ‘Gold’ or paid Open Access model only AND the journal is a Q1 title as measured by Journal Citation Reports or SciMago Journal Indicators”. Note that this fund will support hybrid by exception, with Associate Dean agreement.
  • Lancaster University has a small fund available with strict criteria for when it can be used.  The research paper must both be likely to be rated as 4* in the next REF and be the most appropriate place to publish and does not offer a compliant green route or is an open access only journal. Applications need approval from the Heads of Department and Associate Dean for Research.

Other funds do not distinguish between hybrid and fully OA journals:

  • King’s College London are in the second pilot year of an Open Scholarship Fund which currently does not distinguish between hybrid and full open access journals – but this may be considered if the funds are exhausted.
  • Northumbria University Newcastle has an institutional Open Access fund to cover APCs in both fully gold and hybrid journals.
  • Liverpool University has an institutional open access fund here that has very minimal criteria (CC BY, no retrospective OA, no page or colour charges) that pays both hybrid and fully OA APCs. The fund is reviewed every six months.
  • Queen Mary University will be starting to offer a small institutional fund this year to cover non funded research which will support hybrid

There are some UK institutions where no central fund exists but Departments or Faculties have established their own funds with their own rules.

Conclusions

The increase in funds that do not allow payment for hybrid since 2014 indicates that increasingly the gloss has come off hybrid. Originally considered to be a transition method towards fully Open Access journals, the lack of movement towards this outcome has meant a tightening by funders on what can be spent on hybrid. It will be interesting to revisit this in another two years’ time.

Published 24 October 2016
Written by Dr Danny Kingsley and Dr Philip Boyes 
Creative Commons License

Hybrid open access – an analysis

Welcome to Open Access Week 2016. The Office of Scholarly Communication at Cambridge is celebrating with a series of blog posts, announcements and events. In today’s blog posts we revisit the issue of paying for hybrid open access. We have also published a related post “Who is paying for hybrid?” listing funder policies on hybrid.

Recent years have seen a proliferation of funder open access mandates, the terms of which can differ markedly, adding to the confusion of an already complex area. The Registry of Open Access Repository Mandates and Policies (ROARMAP) lists 80 funders with open access requirements, and the list continues to grow.

Within the UK, policies fall into three broad categories: those that mandate green Open Access without paying a fee, such as the HEFCE policy; those that prefer gold but make no additional funds available, such as the NIHR policy, and those that have a preference for gold and offer block grants to institutions to help cover the associated costs, such as the Research Councils UK (RCUK) and Charities Open Access Fund (COAF) policies.

Accompanying this expansion of mandates, unsurprisingly, has been an increase in the amount being spent to support Open Access. The Open Access Directory lists 179 funds for OA journal articles worldwide, compared with 81 in early 2014.

All this brings into sharper relief the question of how open access funds support hybrid publishing. But first a quick history lesson.

Hybrid origins

Hybrid journals provide open access to specific articles where an Article Processing Charge has been paid in an otherwise subscription journal. A few learned societies offered hybrid options in the early 2000s. Hybrid open access options were first offered by large publishers in 2004 with Springer’s Open Choice product charging USD3000 per article. This price has not changed in the past 12 years. In the UK the Springer Compact now pays for hybrid under a different model.

Wiley Online Open’s trial began the same year, charging USD2500. Today the price ranges from USD1,500 – 5,200. Oxford Open launched in 2005, and in 2006 Elsevier Open Access and Sage Choice began. In 2007, Taylor & Francis Open Select, Cambridge Open and Nature Publishing Group’s open access offering began.

The uptake of hybrid began slowly. It is very difficult to obtain statistics on what percentage of journals have hybrid Open Access content but in his 2012 analysis The hybrid model for open access publication of scholarly articles – a failed experiment?, (open access version here ) Bo-Christer Bjork found the number of hybrid journals had doubled in the previous couple of years to over 4,300, and the number of such articles was around 12,000 in 2011. This represented a small proportion of eligible authors (1-2 %).

That analysis was published the same year as the Finch Report which recommended a gold path to Open Access. The resulting RCUK Open Access Policy and RCUK Block Grants to fund Open Access APCs has dramatically increased the  expenditure on hybrid in the UK since 2013. According to a report published in 2015, “the UK’s profile of OA take-up is significantly different from the global averages: its use of OA in hybrid journals and of delayed OA journals is more than twice the world average in both cases, while its take-up of fully OA journals with no APC (Gold-no APC) is less than half the world average and falling.”

At Cambridge University we have spent literally millions of pounds on hybrid Open Access – which constitutes approximately 85% of our total APC spend. This is a higher percentage than estimates across the country, which are a 76% spend on hybrid Open Access.

Double dipping

Hybrid represents a second income stream to publishers and has raised questions about ‘double dipping’. Some publishers manage this by reducing the cost of subscriptions in proportion to the percentage of hybrid in a given journal, such as Nature Publishing Group. However ‘big deals’ for subscriptions can render this relatively ineffective, and the reduction is spread across all subscribers, regardless of who has paid the article processing charge. This means research intensive institutions (such as Cambridge) are contributing heavily to the system but not receiving a relative reduction.

To address this issue at a local level, several publishers have created offsetting arrangements, where discounts or refunds are provided in proportion to the contribution the institution has made in APC payments above subscriptions. However, each of these schemes operates differently and they can be complicated to administer, or have other preconditions such making large prepayments to publishers.

The biggest problem from an implementation perspective, however, is that they are by no means universal. By far the biggest publisher, Elsevier, for example, offers no form of offsetting at all, although they nevertheless assert that they do not double dip. The result is that in very many cases, institutions and authors continue to have to pay twice for material in hybrid journals, swelling publisher coffers at the expense of research funding.

Very expensive

One of the problems with hybrid is that even ignoring the added cost of subscriptions to the non Open Access material in those journals, hybrid Open Access charges are more expensive than those for fully Open Access journals.

In March last year both the Wellcome Trust and the RCUK undertook a review of their Open Access policies. The Reckoning: An Analysis of Wellcome Trust Open Access Spend 2013 – 14  noted: “The average APC levied by hybrid journals is 64% higher than the average APC charged by a fully OA title”.  In Wellcome’s data, the average APC for a hybrid article in 2014-15 was £2104, compared with only £1396 for fully OA journals. Worryingly, the data showed that fully OA APC costs had risen more than their hybrid counterparts since the previous year.

Similarly in the Research Councils UK 2014 Independent Review of Implementation the observation was that article processing charges for hybrid Open Access were “significantly more expensive” than fully OA journals, “despite the fact that hybrid journals still enjoyed a revenue stream through subscriptions”.

A Max Planck Digital Library Open Access Policy White Paper published on 28 April 2015 noted that The Wellcome Trust had a significantly higher average APC cost than German, Austrian and SCOAP3 figures. This was because the Wellcome Trust pays for hybrid APCs, “which are not only much higher than most pure open access costs but are also widely considered not to reflect a true market value. In Germany and many other countries, hybrid APCs are excluded from the central funding schemes.”

A study undertaken last year considered APCs in the five-year period between 2010 and 2014 found the mean for fully-OA journals published by non-subscription publishers was£1,136 compared with £1,849 for hybrid journals. The same study also found that traditional subscription publishers are capturing most of the APC market. The top-10 publishers in terms of numbers of APCs received from participant institutions (who received 76% of the total APCs paid from the sample) “only included two fully-OA publishers (PLOS and BMC). The others were established publishers (Elsevier, Wiley, Springer and so on) who are mostly gaining APC income from hybrid journals.”

The 2014 report Developing an effective market for open access article processing charges was written for a consortium of research funders comprising Jisc, Research Libraries UK, Research Councils UK, the Wellcome Trust, the Austrian Science Fund, the Luxembourg National Research Fund and the Max Planck Institute for Gravitational Physics. The authors noted of the hybrid journal market that it is “highly dysfunctional, with very low uptake for most hybrid journals and a relatively uniform price in most cases without regard to factors such as discipline or impact“.

Value for money?

A second issue which has become apparent as open access mandates have expanded is the extent to which publishers – mostly of hybrid journals – do not deliver the Open Access option that has been paid for. In many cases, the ‘immediate’ Open Access for which an author or institution has paid an APC may take months or even years to be made Open Access; some articles are never made Open Access at all. Even when articles are made available, there is no guarantee that it will have the appropriate licence. It is by no means uncommon for articles to carry more restrictive licences than those requested, or for the appropriate licence to appear on a journal website while the PDF of the article itself bears only a publisher copyright notice and a prominent ‘All rights reserved’.

In March 2016 the Wellcome Trust published a report into compliance among its paid-for articles in 2014-15, concluding:

The good news is that we have seen an improvement in correct and programmatically identifiable licences (from 61% of papers in ’13-‘14, to 70% in ’14-‘15) and a similar increase in overall compliance from 61% to 70%.  The bad news, however, is that in 30% of cases we are not getting what we are paying for.

The source of this non-compliance was overwhelmingly hybrid journals, and the largest publishers were the worst offenders: in the Wellcome data, 31% of Elsevier hybrid articles (and 26% of their ‘fully OA’ articles!) were non-compliant, as were 54% of Wiley’s.

One might conclude, then, that hybrid Open Access represents a bad deal for funders and institutions, with poor service and double-dipping.

Other hybrid issues

To further complicate matters, some have argued that the open access/hybrid dichotomy is too stark. Some journals, particularly coming from learned societies, (e.g. Plant Physiology, from the American Society of Plant Biologists) make all articles open access after a certain period, but charge an optional APC to make them available sooner. This would generally be considered hybrid publishing, but could be seen as a rather different category from the majority of corporate hybrid journals, in which articles never become Open Access unless an APC is paid. There is a possibility that strict funder mandates against hybrid could close off such journals to researchers, exacerbating the anxieties regarding open access felt by many learned societies.

Where does this leave authors and institutions? It’s clear that the situation remains very much in flux. The problems that have existed with hybrid since the beginnings of Open Access are far from resolved, despite the expansion of journal offsetting schemes. Meanwhile, prices continue to rise and while many funders have taken the step of allowing their funds to be used only for fully Open Access journals, it is still a minority of the largest and most powerful funding bodies.

The result is confusion for researchers and an increased administrative burden for institutions, who have to manage and advise on a proliferation of divergent funder and publisher policies, as well as conducting regular and extremely resource-intensive compliance-checking of hybrid publications to ensure publishers have delivered what has been paid for. As numbers of Open Access publications increase, it is questionable how sustainable this will be.

Published 24 October 2016
Written by Dr Philip Boyes and Dr Danny Kingsley 
Creative Commons License

Request a copy: process and implementation

This blog post looks at a recent feature implemented in our repository called ‘Request a copy’ and discusses the process and management of the service. There is a related blog post which discusses the uptake and reaction to the facility.

As part of our recent upgrade to the University’s institutional repository (now renamed ‘Apollo‘), we implemented a new feature called ‘Request a copy’. ‘Request a copy’ operates on the principle of peer-to-peer sharing – if an item in Apollo is not yet available to the public, a repository user can ask the author for a copy of the item. Authors sharing copies of their work on an individual basis falls outside the publisher’s copyright restrictions; here, the repository is acting as a facilitator to a process which happens anyway – peer to peer sharing.

The main advantage of the ‘Request a copy’ feature is to open up the University’s most current research to a wider audience. Many of our users do not necessarily come from an academic background, or may be based within another discipline, or an institution where journal subscriptions are more limited. The repository is often their first port of call to find new research as it ranks highly in Google search results. We hope that these users will benefit from ‘Request a copy’ by being able to access new outputs early, at researchers’ discretion. Additionally, this may provide an added benefit to researchers by introducing new contacts and potential collaborations.

How it works

Screen Shot 2016-10-06 at 13.53.30Items in Apollo that are not yet accessible to the wider public are indicated by a padlock symbol that appears on the thumbnail image and filename link which users can usually click to download the file.

Reasons why the file may not yet be publicly available include:

  • Some publishers require that articles in repositories cannot be made available until they are published, or until a specified time after publication
  • We hold a number of digitised theses in the repository, and for some we have been unable to contact the author to secure permission to make their thesis available
  • Authors may choose to make their dataset available only once the related article is published

When a user clicks on a thumbnail or filename link containing a padlock, they are directed to the ‘Request a copy’ form. Here, they provide their name, email address and a message to the author. On clicking ‘Request copy’, an email is sent to the person who submitted the article, containing the user’s details. The recipient of this email then has the option to approve or deny the user’s request, to contact the user for more information, or (if they are not the author) to forward the request to the author.

How it really works

In practice, the process is slightly more complicated. For most of the content in the repository, the person who submitted an item will be a member of repository staff, rather than the item’s author. This means that for the most part, emails generated by the ‘Request a copy’ form were initially sent to members of the Office of Scholarly Communication team. In some cases, these requests were sent to people who have left the University, and we have had to query the system to retrieve these emails. As an interim measure, we have now directed all emails to support@repository.cam.ac.uk. These still need manual processing.

Theses

For theses where we have not received permission from the author to make them available, we forward requests to the University Library’s Digital Content Unit, who have traditionally provided digitised copies of theses at a charge of £65. We have  found however, that once information about this charge is communicated to the requester, very few (approximately 1%) actually complete the process of ordering a thesis copy.

We have been working with the Digital Content Unit on a trial where thesis copies were offered at £30, then £15. However, even at these cheaper prices, uptake remained low (it increased to 10%, but due to the small size of the sample, this only equated to two and three requests at each price point, and therefore may not be statistically significant). This indicates that the objection was to being charged at all, rather than to the particular amount. Work in this area remains ongoing to try and offer thesis copies as cheaply as possible to requesters, while allowing the Digital Content Unit to cover their costs.

Articles

If the request is for an article, we first need to check whether the article has actually been published and is already available Open Access. Although we endeavour to keep all our repository records up to date, unless we are informed that an article has been published, repository staff need to check each article for which publication is pending. This is a time-consuming manual process, and when we have a large backlog, sometimes it can take a while before an article is updated following publication.

If we found that the article has indeed been published and can be made Open Access, we amend the record, make the article available and email the requester to let them know they can now download the file directly from the repository.

On the other hand, if the article is still not published, or if it is under an embargo, we need to forward the request to the corresponding author(s). Sometimes their name(s) and email address(es) will be included within the article itself, and sometimes we have a record of who submitted the article via the Open Access upload form. However, if it is not clear from the article who the corresponding author is, or if their contact details are not included, and if the article was submitted by an administrator rather than one of the authors, we then need to search via the University’s Lookup service for the email addresses of any Cambridge authors, and search the internet for email addresses of any non-Cambridge authors, before we can forward on the request.

As a result, it can take repository staff up to 30 minutes to process an individual request. This is quicker if the article has been requested previously and the author’s contact details are already stored, but can take longer when we need to search. Sometimes, there is also repeat correspondence if the author has any queries, which adds to the total time in processing each request.

Amending our processes

Since introducing ‘Request a copy’, we have started collecting the email addresses of corresponding authors when an article is submitted, and we have commissioned a repository development company to ensure that ‘Request a copy’ emails can be sent directly to those authors for whom we have an email address – a feature that we are hoping to implement in the next few weeks.

However, if the author moves institution, their university email address will no longer be valid, and any requests for their work will again need to come via repository staff. One way to solve this would be to ask for an external (non-university) email address for the corresponding author at the point where they upload the article to the repository. However, this would introduce an extra step to an already onerous process and may act as a further barrier to authors submitting articles in the first place.

Generally, ‘Request a copy’ is a great idea and provides many benefits to the research community and beyond. But the implementation of this service has been challenging. The amount of time taken by each request has meant that some staff members have been redeployed from their usual jobs to facilitate these requests, which also has an impact on the backlog of articles in the repository that need to be checked in case they have since been published. If an article is published but still in the backlog (and therefore not publicly available in the repository), unnecessary requests for it could result in a reputational issue for the Office of Scholarly Communication and the University.

We will continue to look at our processes over the coming academic year, to see how we can improve our current workflows, and identify and resolve any issues, as well as determining where best to focus any further development work. In the related blog post on ‘Request a copy’, I’ll be talking about usage statistics for the service so far, some more unexpected use cases we have encountered, and feedback from our users that will help us to shape the service into the future.

Published 7 October 2016
Written by Sarah Middle
Creative Commons License

Request a copy: uptake and user experience

This post looks at the University of Cambridge repository  ‘Request a copy’ service from the user’s perspective in terms of uptake so far, feedback we have received, and reasons why people might request a copy of a document in our repository. You may be interested in the related blog post on our ‘Request a copy’ service, which discusses the concept behind ‘Request a copy’, the process by which files are requested, and how this has been implemented at Cambridge

Usage Statistics

The Request a Copy button has been much more successful than we anticipated, particularly because there is no actual ‘button’. By the end of September 2016 (four months after the introduction of ‘Request a copy’), we had received 1120 requests (approximately 280 requests per month), the vast majority of which were for articles (68%) and theses (28%). The remaining 4% of requests were for datasets or other types of resource. We are aware that this is a particularly quiet time in the UK academic year, and expect that the number of requests will increase now term has started again.

Of the requests for articles during this period, 38% were fulfilled by the author sending a copy via the repository, and 4% were rejected by clicking the ‘Don’t send a copy’ button. However, these figures could be misleading as a number of authors have also advised us that they have entered into correspondence with the requester to ask them for further information about who they are and why they are interested in this research. Eventually, this correspondence may result in the author emailing a copy of the paper to the requester, but as this happens outside the repository, it does not appear in our fulfilment statistics. Therefore, we suspect the figure for accepted requests is in actual fact slightly higher.

Of the articles requested during this period, 45% were yet to be published, and 55% were published but not yet available to those without a subscription to the journal. The large number of requests made prior to publication indicates the value of having a policy where articles are submitted to the repository on acceptance rather than publication – there is clearly interest in accessing this research among the wider public, and if they are able to make use of it rather than waiting during the sometimes lengthy period between acceptance and publication, this can make the research process more efficient.

Author Survey

To find out why authors might not be fulfilling requests through the repository links, Dr Lauren Cadwallader, one of our Open Access Research Advisors, sent a survey on 6 July 2016 to the 113 authors who had received requests but had not clicked on the repository link or been in touch with repository staff to advise of an alternative course of action. This survey had a 13% response rate, with 15 participants, as well as eight email responses from users who provided feedback but did not complete the survey.

The relatively low response rate is indicative of either a lack of engagement with or awareness of the process – it is possible that the request emails and survey email were dismissed as spam, or that researchers were unable to respond due to an already heavy workload. One way of addressing this could be to include some information about ‘Request a copy’ in our existing training sessions, in particular to emphasise how quick the process can be in cases where the author is happy to approve the request without needing any further information from the user. We have also been developing the wording of the email sent to the author, to explain the purpose of the service more clearly, and to make it sound like a legitimate message that is less likely to be dismissed as spam.

Of the 15 people who participated in the survey, the majority were aware that they had received an email, which shows that lack of response is not always due to emails being lost in spam filters. When asked for the reason why they did not fulfil the request via the repository link, 35% of authors replied that they had emailed the requester directly, either to send the file, to request more information, or to explain why it was not possible for them to share the file at this time. This finding is quite positive, as it indicates that over a third of these requests are indeed being followed up. Although it would be helpful to us to be able to keep track of approvals through the system, at least this means that the service is fulfilling its purpose in providing a way for authors to interact with other interested researchers, and to share their work if appropriate. In fact, one of the aspects that participants liked best about the ‘Request a copy’ service was the ability to communicate directly with the requestor.

Two authors did not respond to the request because the article was available elsewhere on the internet, such as their personal / departmental website, or a preprint server (where the restrictions relating to repositories do not apply), although they did not communicate this to the requestor. In these cases, it is definitely positive that the authors are happy to share their work; however, it does show that there is often an assumption among researchers that people interested in reading their articles will be restricted to those already in their specific disciplinary communities.

Requests from people who are unaware of sites where the research might also be made available demonstrates that there is indeed an appetite among those outside of academia, or from different subject areas. This is generally a really positive thing, as it facilitates the University’s research outputs to educate and inspire a new audience beyond the more traditional communities, and could potentially lead to new collaboration opportunities. To ensure that requestors are able to access the material, and that researchers are not bombarded with requests for documents that are already freely available, authors can provide links to any external websites that are hosting a preprint version of the article, and we will add them to the repository record.

Other responses indicated that we were not necessarily emailing the right person, as participants said that they had not approved the request because they were not the corresponding author, or because they thought a co-author had already responded. At the outset of the service, we felt that emailing as many authors as possible would increase the likelihood of receiving a response; however, the survey results show that it would be better to send requests to the corresponding author(s) only, at least in cases where it is clear who they are.

An issue we have encountered on a semi-regular basis since HEFCE’s Open Access policy came into force is that of making an article’s metadata available prior to its publication. Although HEFCE and funder policies state that an article’s repository record should be discoverable, even if the article itself must be placed under embargo based on publisher restrictions, there is concern among some authors that metadata release breaches the publisher’s press embargo. You can read about this issue in some detail here.

Receiving requests for an article via the ‘Request a copy’ service can be unsettling for authors as it demonstrates how easily the repository record can be accessed, and rather than respond to the request, they contact the Open Access team to ask for the metadata record to be withdrawn until the article is published. This demonstrates a need to communicate more clearly, both on our website and within the ‘Request a copy’ pages in the repository, what is required of authors as part of HEFCE and funder Open Access policies. We will also be more explicit in the ‘Request a copy’ emails sent to authors in stating that sharing their articles via this service will not be seen as a breach of the publisher’s embargo. In cases where the author does not wish to disseminate their article before it is published, they have the option to deny any requests they receive.

Facilitating requests

There have been several instances where press interest around an article at the point of publication has generated a large number of requests, each of which must be responded to individually by the author. This has resulted in several authors asking that we automatically approve every request rather than forwarding them on. Unfortunately this is not possible for us to do, due to the legal issues surrounding ‘Request a copy’.

It is perfectly acceptable for an author to send a copy of their article to an individual, but if a repository makes that article available to everyone who requests it before the embargo has been lifted, this would be a breach of copyright because it would be ‘systematic distribution’. While responding to multiple requests is likely to be seen as an annoyance by an already overstretched researcher, we hope that a large volume of requests will also be viewed in a positive light, as it demonstrates the interest people have in their work.

Use cases

An interesting example of a request we received was actually from one of the authors of the article, as they did not have access to a copy themselves. This raises some questions about communication between the researchers in this case, if the ‘Request a copy’ service was seen to be a better way of gaining access to the author’s own research, rather than contacting one of their co-authors.

A more surprising use case is that of a plaintiff who had lost a legal case. The plaintiff was requesting an as-yet unpublished article that had been written about the case, because the article appears to argue in favour of the plaintiff and could potentially inform a future appeal. This is a good example of how the ‘Request a copy’ service could be of direct benefit in the world outside academia.

Although the vast majority of requests have been for research outputs such as articles, theses and datasets, we also occasionally receive requests for images that belong to collections held in different parts of the University, where high-quality versions are stored in the repository under restricted access conditions. With these requests, it can be more difficult to find who the copyright-holder is, which sometimes requires detective work by the repository team. In one case, permission had to be sought from a photographer who only has a postal address, and therefore required more explanation about the repository more generally, as well as the specific request.

Looking to the future

We will use this research and any further feedback we receive to improve the experience of our ‘Request a copy’ service for both authors and requestors, including implementing the ideas suggested above. Usage statistics will continue to be monitored, and we may run a user survey again to determine how far the service has improved, as well as to identify any new issues.

In the meantime, if you have any comments or questions about our ‘Request a copy’ service, either as an author or a requester (or both), please send us an email to support@repository.cam.ac.uk .

Published 7 October 2016
Written by Sarah Middle
Creative Commons License