Tag Archives: Jisc

How do you know if you’re achieving cultural change?

On 15th November 2017, the University of Cambridge held its first research data management (RDM) conference, Engaging Researchers in Good Data Management. The Office of Scholarly Communication, in collaboration with SPARC Europe and Jisc, hosted the one-day event at St Catharine’s College. In attendance were researchers, administrators, and librarians, all sharing their experiences with promoting good RDM. Having a mixture of people from various disciplines and backgrounds allowed many different points of view on engaging researchers to be discussed. In the afternoon, the attendees split off into focus groups to concentrate on a number of nagging questions.

Our group’s topic of discussion: How do we effectively measure cultural change in attitudes towards data management? Leading the discussion was Marta Teperek from Delft University of Technology. There was a mixture of around 30 librarians and researchers from all over the world discussing strategies for engaging with researchers.

How do we set about achieving ‘cultural change’?

Marta started the conversation off by asking what everyone present was already doing at their institutions to engage researchers. Many shared their experiences and some frustrations at pushing good data management habits. One person shared that at his university the initial push toward better data management was achieved by creating and delivering RDM workshops for PhDs and young researchers in the Digital Humanities. These students were already interested in digital preservation, so they were a keen audience. Targeting PhD students and early career researchers may be a more effective strategy because they could develop good data management habits early in their careers. The earlier the intervention, the easier it would (hopefully) be.

Overall, most agreed that directly speaking to researchers is more effective than having initiatives relayed from the top down. Attendees perceived compliance as a driver rather than a useful stick to persuade researchers to take data management seriously. Even if only a few researchers turned up to data management events, it was still increasing exposure.

Some argued for a multi-pronged strategy. Initiatives like the Data Stewards at TU Delft and the Data Champions at the University of Cambridge were perceived as good ways to reach out to researchers in their departments and provide more customized advice. At the same time, having expectations of good data management relayed from on high could help create greater impetus.

What do we mean by ‘cultural change’?

Naturally, the conversation progressed to what the phrase ‘cultural change’ actually means. It was difficult to determine in 45 minutes what kind of ‘cultural change’ we wanted to see within our different institutions. We started by asking some questions. What were our goals? What would need to happen before we said yes, the culture is changing? Which really meant: what do we measure to find evidence of cultural change? Is it better metadata, more awareness of copyright, researchers reaching out to us for help, or an increase in the number of grants awarded that would signal an actual change? It would seem that there could be many definitions of ‘cultural change’, but the crucial takeaway is that it is essential to define what your parameters of cultural change will be in the planning stages of any RDM programme.

Where is the evidence?

The conversation progressed to how we find and gather evidence. With all of the work being done by researchers, librarians, and administrators, how do we know what is actually effective? We cannot state that engaging with researchers (which can be time-consuming) is working without having actual evidence to confirm it. A number of different ideas were discussed, with the time when feedback was gathered being a particular point of variance.

Quantifiable information such as number of datasets deposited, number of datasets downloaded and re-used, and number of grants with a Data Management Plan could be collected. For example, the University of Illinois conducted a detailed analysis of 1,260 data management plans using a controlled vocabulary list and looked at possible correlations between solutions for data management listed in funded and unfunded proposals.

Another method of benchmarking included asking researchers to periodically complete short surveys on data management practice in order to measure any noticeable changes. In that way, an institution can assess whether its engagement strategies work and whether they achieve the desired effects (improvement of data management practice). Delft, EPFL, Cambridge and Illinois collaborated on the development of an agreed set of survey questions. Conducting this same survey across different institutions enables benchmarking and comparison of the different techniques and how effective they are in achieving cultural change in data management. In addition to this survey, the team also interviews some researchers in order to gather additional qualitative data and more detailed insights into data management practice. The hope is that carrying out these quantitative surveys and qualitative interviews periodically will correct for the potential problem of self-selecting participants.

In the future

Ultimately, it turned out that most of those attending the focus group discussion were already working actively to develop systems to measure impact and gather feedback. However, the possibility of carrying out long-term cross-institutional research that would allow comparisons between different data management programmes is very tantalising. The final takeaway from this focus group discussion was that the majority of those attending would be very keen to take part in such research, so watch this space!

Published 18 December 2017
Written by Katie Hughes and Lucy Welch
Creative Commons License

Engaging Researchers with Good Data Management: Perspectives from Engaged Individuals

We need to recognise good practice, engage researchers early in their career with research data management and use peers to talk to those who are not ‘onboard’. These were the messages five attendees took away from the Engaging Researchers in Good Data Management conference held on the 15th of November.

The Data Champions and Research Support Ambassadors programmes are designed to increase confidence in providing support to researchers in issues around data management and all of scholarly communications respectively. Thanks to the generous support of the Arcadia Foundation, five places were made available to attend this event. In this blog post the three Data Champions and two Research Support Ambassadors who were awarded the places give us the low-down on what they got out of the conference and how they might put what they heard into practice.

Recordings of the talks from the event can be found on the Cambridge University Library YouTube channel.

Financial recognition is the key

Dr Laurent Gatto, Senior Research Associate, Department of Biochemistry, University of Cambridge and Data Champion

As a researcher who cherishes good and reproducible data analysis, I naturally view good data management as essential. I have been involved in research data management activities for a long time, acting as a local data champion and participating in open research and open data events. I was interested in participating in this conference because it gathered data champions, stewards and the like from various British and European institutions (Cambridge, Lancaster, Delft), and I was curious to see what approaches were implemented and issues were addressed across institutions. Another aspect of data championship/stewardship I am interested in is the recognition these efforts offer (this post touches on this a bit).

Focusing on the presentations from Lancaster, Cambridge and Delft, it is clear that direct engagement from active researchers is essential to promote healthy data management. There needs to be an enthusiastic researcher, or somebody that has some experience in research, to engage with the research community about open data, reproducibility, transparency, security; a blunt top-down approach leads to limited engagement. This is also important due to the plurality of what researchers across disciplines consider to be data. Informal settings, ideally driven by researchers and/or in collaboration with librarians, focusing on conversations, use-cases, interviews, … (I am just quoting some successful activities cited during the conference) have been the most successful, and have sometimes also led to new collaborations.

Despite the apparent relative success of these various data championing efforts and the support that the data champions get from their local libraries, these activities remain voluntary and come with little academic reward. Being a data champion is certainly an enriching activity for young researchers that value data, but it comes with relatively little credit and without any reward or recognition, suggesting that there is probably room for a professional approach to data stewardship.

With this in mind, I was very interested to hear the approach that is currently in place at TU Delft, where data stewards hold a joint position at the Centre for Research Data and at their respective faculty. This defines research data stewardship as an established and official activity, allows the stewards to pursue a research activity, and, explicitly, links research data to research and researchers.

I am wondering if this could be implemented more broadly to provide financial recognition to data stewards/champions, offer incentives (in particular for early-career researchers) to approach research data management professionally and seriously, make data management a more explicit activity that is part of research itself, and move towards a professionalisation of data management posts.

Inspiration and ideas

Angela Talbot, Research Governance Officer, MRC Biostatistics Unit and Data Champion

Tasked with improving and updating best practice in the MRC Biostatistics Unit, I went along to this workshop not really knowing what to expect but hopeful and eager to learn.

Good data management can meet with resistance: while it’s viewed as an altruistic and noble thing to do, many researchers worry that making their research open and reproducible exposes them to criticism and the theft of ideas and future plans. What I wanted to learn were ways to overcome this.

And boy did this workshop live up to my expectations! From the insightful opening comments to the thought-provoking closing remarks I was hooked. All of the audience were engaged in a common purpose: to share their successes and strategies for overcoming the barriers so that this becomes best practice.

Three successful schemes were talked through: the data conversations in Lancaster, the Data Champion scheme at the University of Cambridge and the data stewards in TU Delft. All of these successful schemes had one thing in common: they all combine a cross-department/faculty approach with local expertise.

Further excellent examples were provided by the lightning talks and for me, it was certainly helpful to hear of successes in engaging researchers on a departmental level.

The highlight for me was the focus groups – I was involved in Laurent Gatto’s group discussing how to encourage more good data management by highlighting what was in it for researchers who participate, but I really wish I could have been in them all, as the feedback indicated they had given useful insights and tips.

All in all I came away from the day buzzing with ideas. I spent the next morning jotting down ideas of events and schemes that could work within my own unique department and eager to share what I had learnt. Who knows, maybe next time I’ll be up there sharing my successes!!

We need to speak to the non-converted

Dr Stephen Eglen, Reader in Computational Neuroscience, Department of Applied Mathematics & Theoretical Physics, University of Cambridge and Data Champion

The one-day meeting on Engaging Researchers in Good Data Management served as a good chance to remind all of us about the benefits, but also the responsibilities we have to manage, and share, data. On the positive side, I was impressed to see the diversity of approaches led by groups around the UK and beyond. It is heartening to see many universities now with teams to help manage and share data.

However, and more critically, I am concerned that meetings like this tend to focus on showcasing good examples to an audience that is already mostly convinced of the benefits of sharing. Although it is important to build the community and make new contacts with like-minded souls, I think we need to spend as much time engaging with the wider academic community. In particular, it is only when our efforts can be aligned with those of funding agencies and scholarly publishing that we can start to build a system that will give due credit to those who do a good job of managing, and then sharing, their data. I look forward to future meetings where we can have a broader engagement of data managers, researchers, funders and publishers.

I am grateful to the organisers for giving me the opportunity to speak about our code review pilot in Neuroscience. I particularly enjoyed the questions. Perhaps the most intriguing question to report came in the break when Dr Petra ten Hoopen asked me what happens if during code review a mistake is found that invalidates the findings in the paper. To which I answered (a) the code review is supposed to verify that the code can regenerate a particular finding; (b) that this is an interesting question and it would probably depend on the severity of the problem unearthed; (c) we will cross that bridge when we come to it. Dr ten Hoopen noted that this was similar to finding errors in data that were being published alongside papers. These are indeed difficult questions, but I hope in the relatively early days of data and code sharing, we err on the side of rewarding researchers who share.

Teach RDM early and often

Kirsten Elliott, Library Assistant, Sidney Sussex College, University of Cambridge and Research Support Ambassador

Prior to this conference, my experience with Research Data Management (RDM) was limited to some training through the Office of Scholarly Communication and Research Support Ambassadors programme. This however really sparked my interest and so I leapt at the opportunity to learn more about RDM by attending this event. Although at times I felt slightly out of my depth, it was fascinating to be surrounded by such experts on the topic.

The introductory remarks from Nicole Janz were a fascinating overview of the reproducibility crisis, and how this relates to RDM, including strategies for what could be done, for example setting reproducing studies as assignments when teaching statistics. This clarified for me the relationship between RDM and open data, and transparency in research.

There were many examples throughout the day of best practice in promoting good RDM, from the “Data Conversations” held at Lancaster University, international efforts from SPARC Europe and even some from Cambridge itself! Common ground across all of them included the necessity of utilising engaged researchers themselves to spread messages to other researchers, the importance of understanding discipline specific issues with data, and an expansive conception of what counts as “data”.

I am based in a college library and predominantly work supporting undergraduate students, particularly first years. In a way this makes it quite a challenge to present RDM practices as many of the issues are most obviously relevant to those undertaking research. However, I think there’s a strong argument for teaching about RDM from very early in the academic career to ingrain good habits, and I will be thinking about how to incorporate RDM into our information literacy training, and signposting students to existing RDM projects in Cambridge.

Use peers to spread the RDM message

Laura Jeffrey, Information Skills Librarian, Wolfson College, University of Cambridge and Research Support Ambassador

This inspirational conference was organised and presented by people who are passionate about communicating the value of open data and replicability in research processes. It was valuable to hear from a number of speakers (including Rosie Higman from the University of Manchester, Marta Busse-Wicher from the University of Cambridge and Marta Teperek from TU Delft) about the changing role of support staff, away from delivering training to one of coordination. Peers are seen to be far more effective in encouraging deeper engagement, communicating personal rather than prescriptive messages (evidenced by Data Conversations at Lancaster University). A member of the audience commented that where attendance is low for their courses, the institution creates video of researcher-led activities to be delivered at point of need.

I was struck by two key areas of activity that I could act on with immediate effect:

Inclusivity – Beth Montagu Hellen (Bishop Grosseteste) highlighted the pressing need for open data to be made relevant to all disciplines. Cambridge promotes a deliberately broad definition of data for this reason. Yet more could be done to facilitate this; I’ll be following @OpenHumSocSci to monitor developments. We’re fortunate to have a Data Science Group at Wolfson promoting examples of best practice. However, I’m keen to meet with them to discuss how their activities and the language they use could be made more attractive to all disciplines.

Communication – Significant evidence was presented by Nicole Janz, Stephen Eglen and others, that persuading researchers of the benefits of open data leads to higher levels of engagement than compulsion on the grounds of funder requirements. This will have a direct impact on the tone and content of our support. A complementary approach was proposed: targeted campaigns to coincide with international events in conjunction with frequent, small-scale messages. We’ll be tapping into Love Data Week in 2018 with more regular exposure in email communication and @WolfsonLibrary.

As a result of attending this conference, I’ll be blogging about open data on the Wolfson Information Skills blog and providing pointers to resources on our college LibGuide. I’ll also be working closely with colleagues across the college to timetable face-to-face training sessions.

Published 15 December 2017
Written by Dr Laurent Gatto, Angela Talbot, Dr Stephen Eglen, Kirsten Elliott and Laura Jeffrey
Creative Commons License

Planning scholarly communication training in the UK

In June 2017 a group of people (see end for attendees) met in London to discuss the issues around scholarly communication training delivery in the UK. Representatives from RLUK, UKSG, SCONUL, UKCoRR, Vitae, Jisc and some universities had a workshop to nut through the problem. Possibly because of the nature of the attendees of the group, the discussion was very library-centric, but this does not preclude the need for training outside the library sector. This blog is a summary of the discussion from that day.

Background

The decision to hold a meeting like this came out of a library skills workshop run at UKSG recently. In ensuing discussions, it was agreed that it would be a good idea to get stakeholders together for a symposium of some description to try and nut out how we could collaborate and provide training solutions for scholarly communication across the sector. There is plenty of space in this area for multiple offerings but we do want to make sure we are covering the range of areas and the types of delivery modes and levels required. In preparation for the discussion the group created a document listing scholarly communication training on offer currently.

What is scholarly communication?

An informal survey of research libraries in the UK earlier this year showed that while all respondents had some kind of service that supports aspects of scholarly communication, only half actually used the term ‘scholarly communication’ to describe those services.

A discussion around the table concluded that the term scholarly communication encompasses a wide range of definitions. Some libraries take the boundary that it refers to post-publication. Others address the pre-publication aspect and meet the need of Early Career Researchers for advice on publishing. Services can focus on the academic’s profile of themselves and their research, or the research lifecycle. In some cases there is a question about whether research data management is part of the equation.

The failure of library schools to deliver

It is fairly universally acknowledged that it is a challenge to engage with library schools on the issue of scholarly communication, despite repositories being a staple part of research library infrastructure for well over a decade. There are a few exceptions but generally open access or other aspects of scholarly communication are completely absent from the curricula. (Note: any library school that wishes to challenge this statement, or provide information about upcoming plans, is welcome to send these through to info@osc.cam.ac.uk)

This raises the question – if library schools are not providing, how do we recruit and train the staff we need? Indeed, who are we actually recruiting? Is it essential for staff to have a library degree, or experience in an academic library? Or are our requirements more functional such as the ability to manipulate large data sets, or experience working with academics, or an understanding of the Higher Education environment?

While libraries are starting to employ post-graduate researchers because they can lend skills to the library, library culture is a consideration. Employing researchers who are not librarians has the benefit of bringing in expertise from outside, but there are challenges to integrate their work into the library culture. We need to look at competencies in terms of the structure and size of the organisation, both for current staff and staff of the future.

In the absence of scholarly communication instruction within the basic qualification, skills training in this space would appear to need to be addressed at the profession level.

One possible route to prepare the next generation is offering some modular approach of on the job learning with very practical experience. An option could be to work with people who have come from outside the library space. Given libraries seem to be starting to bring skill sets in, we need to consider how this sits with the existing profession.

Audiences and their training needs

The goal of the meeting was to resolve what kinds of training the sector needs, for whom and how it is delivered. For example, with many general library staff there is a basic need to understand the issues with scholarly communication. The number one question is ‘what is scholarly communication’? Possibly it is enough for these people to just be familiar with the terminology.

It is possible we need lots of short courses on the general topic of: this is what OA is, basics of RDM etc (that could potentially be delivered online), but probably fewer more complex courses on issues like analysing publisher and funder policies. There are also debates and higher order areas which require face to face debate.

  • Front facing staff
    • Need an overview so the language is familiar and they can refer queries on
  • People working in scholarly communication
    • Day to day practicalities of funder open access compliance
  • Specialist roles in scholarly communication
    • Specific areas
  • Senior managers
    • Very much need a refresher so they can help their staff.
    • Similar overview training, leadership is around the advocacy
    • Need conceptual framework for scholarly communication – how do the technical parts sit together for the infrastructure and governance of institutions
    • Stakeholder management skills.

Skill sets in scholarly communication

It was agreed that budgetary, presentation and negotiation skills are needed in this area as general skills. When it comes to specialist skills these include:

  • Research Integrity
  • Bibliometrics
    • Involved in providing specialist advice on metrics within a school discussion
    • Providing advice on impact
  • Pushing the open research agenda
  • Academic reward structure
  • Technical and infrastructure eg: integrating ORCIDS etc

Considerations – Lack of perceived need?

There appears to be a problem with a lack of perceived need for training in this space. We are encountering issues where people in libraries are saying ‘I don’t think this is our job’. This points to the question of what we should be presenting librarianship as – what kind of people do we want in the profession? The job of a ‘traditional librarian’ of 20 years ago is not the same job now; the skills are different. Today much of an academic librarian’s job is about winning over people who don’t want to hear the message. It is possible there does need to be a different sort of person who is pushing an open access agenda.

There have been other innovations in library work that required engaging different behaviours and tasks in the past. For example, is this move towards a scholarly communication future different from when the discovery search was introduced? The eResources experience is similar in terms of new competencies required in the profession. However the difference in the scholarly communication environment is there is an external driver – we need to understand the politics of how open access can move forward in the UK.

Considerations – budgets

There is a mismatch between what people would love to have, what can be designed and what people can afford. Anecdotally the group heard that training budgets are really squeezed so priority and focus might be heavily influenced by this, with geography and travelling costs being central to decisions.

The group discussed the need to make training accessible to all. Even free events can be prohibitive in terms of travel, and hosting them in off-peak periods can be helpful with costs. The blockage is not just money, it includes time – in terms of loss of a team member while they are away. This is particularly problematic if scholarly communication is only a part of their job. Most of the need comes from really small institutions where the work is part of a bigger role, however that is where there is little money. This also raises challenges for the time available for those people to self educate.

UKSG run events in London which is expensive for organisations north of London to attend. To increase participation UKSG are now trying to put regional events on, and have shifted their training to a webinar programme rather than face to face.

SCONUL has done basic copyright training and this has thrown up price sensitivity. One solution is trying to keep it local, and members can volunteer staff in kind.

One option could be online training where participants log on at a certain time once a week for 10 weeks. Many of the people in scholarly communication work in universities, and have distance education software available to them. An alternative is having courses done in house – that could be part of a modular package (but how do you link this?). The course content needs to be agnostic enough to be useful (not discussing DSpace or PURE for example) before delving into institutional specifics. Make it modular with core principles and then have options.

There was a suggestion that we create a non-profit-making shared collaborative service. The costs of developing this type of deliverable include the development of the training materials, infrastructure costs, room hire, catering etc. Can we make it all online and available? This could work if it were modular.

Next steps

We have not yet bottomed out the need – perception of needs at the practitioner level and senior management might be different. Cost is an issue here. Universities need to work out how much it costs to do in-house training – what is the opportunity cost to employ a staff member without experience or training and then get them up to speed?

It would be useful to have an understanding of what training is happening within institutions. What subjects/topics are being taught, who is doing it, what language is being used, and is there a dedicated staff member? Where else do people get information and support?

The general plan is to reconvene in September.

Useful Resources

Skill sets analyses

Here are links to work that has already been done on the required skill sets:

Organisations providing or coordinating training

Organisations are running similar events and then participants have to choose what to focus on. If we divvy it up across the sector it might help the situation.

The Society of College, National and University Libraries (SCONUL) does basic copyright training. There is more focus on the leadership end of the equation. The Collaboration Strategy Group is considering a shared service. People come from non-traditional groups and this reflects the broader skill sets required in libraries than traditional library courses give you. SCONUL are about to scope out where those services might be and try to identify needs into the future. There are challenges in recruiting people given the slightly moralistic nature of library culture and whether it is welcoming of people from different backgrounds. How do we promote, retain and incentivise people who may not come from this area?

Research Libraries UK (RLUK) don’t do direct training, but they do have programmes of works and networks around these issues. The RLUK board recently had a meeting to look at a new strategy – updating the existing 2014-2017 RLUK Strategy. They are looking at the bigger picture for scholarly communication – the infrastructure challenges, the bigger picture related to licensing and costs and how to leverage members in the consortia. Their role is very much supporting and helping out.

UK Serials Group (UKSG) runs a conference programme. One-day events are a mix of standing repeated courses and one-off sessions. In conferences, the breakout sessions are often the things that people find really valuable. These include soft skills like mindfulness in leadership. The audience tends to be practitioners, people in their mid-career. Traditional areas such as library have been focused around collection management because that is where publishers are. But it is not just about traditional publishing. They are our members and that is moving our agenda to meet those needs. UKSG cannot get anywhere in contributing to university publishing courses. Libraries are starting to employ people who have publishing backgrounds.

The Association of Research Managers and Administrators (ARMA) has special interest groups in open access. (Note: ARMA were invited to this meeting but unfortunately couldn’t attend.)

The Chartered Institute of Library and Information Professionals (CILIP) conducts training at a local level. It was agreed we can’t have the conversation without having CILIP in the room – they are wanting to offer more support for academic libraries and seem to be recognising that the library schools program for CILIP is not the be-all and end-all any more. This is partly why they have developed a recognised trainer programme. (Note: CILIP were invited to this meeting but unfortunately couldn’t attend.)

Representatives attending the discussion

  • Helen Dobson – Manchester University
  • Danny Kingsley – Cambridge University
  • Claire Sewell – Cambridge University
  • Anna Grigson representing UKSG
  • Fiona Bradley – RLUK
  • Ann Rossiter – SCONUL
  • Katie Wheat – Vitae
  • Sarah Bull – UKSG
  • Stephanie Meece – UKCoRR
  • Frank Manista – Jisc
  • Helen Blanchett – Jisc (a member of the group coordinating the meeting, but was unable to attend on the day)

ARMA and CILIP were also invited but were not able to send a representative.

Published 15 August 2017
Written by Dr Danny Kingsley 

Making the connection: research data network workshop

During International Data Week 2016, the Office of Scholarly Communication is celebrating with a series of blog posts about data. The first post was a summary of an event we held in July. This post looks at the challenges associated with financially supporting RDM training.

Following the success of hosting the Data Dialogue: Barriers to Sharing event in July, we were delighted to welcome the Research Data Management (RDM) community to Cambridge for the second Jisc research data network workshop. The event was held in Corpus Christi College with meals held in the historical dining room. (Image: Corpus Christi)

RDM services in the UK are maturing and efforts are increasingly focused on connecting disparate systems, standardising practices and making platforms more usable for researchers. This is also reflected in the recent Concordat on Research Data which links the existing statements from funders and government, providing a more unified message for researchers.

The practical work of connecting the different systems involved in RDM is being led by the Jisc Research Data Shared Services project which aims to share the cost of developing services across the UK Higher Education sector. As one of the pilot institutions we were keen to see what progress has been made and find out how the first test systems will work. On a personal note it was great to see that the pilot will attempt to address much of the functionality researchers request but that we are currently unable to fully provide, including detailed reporting on research data, links between the repository and other systems, and a more dynamic data display.

Context for these attempts to link, standardise and improve RDM systems was provided in the excellent keynote by Dr Danny Kingsley, head of the Office of Scholarly Communication at Cambridge, reminding us about the broader need to overhaul the reward systems in scholarly communications. Danny drew on the Open Research blogposts published over the summer to highlight some of the key problems in scholarly communications: hyperauthorship, peer review, flawed reward systems, and, most relevantly for data, replication and retraction. Sharing data will alleviate some of these issues but, as Danny pointed out, this will frequently not be possible unless data has been appropriately managed across the research lifecycle. So whilst trying to standardise metadata profiles may seem irrelevant to many researchers it is all part of this wider movement to reform scholarly communication.

Making metadata work

Metadata models will underpin any attempts to connect repositories, preservation systems, Current Research Information Systems (CRIS), and any other systems dealing with research data. Metadata presents a major challenge both in terms of capturing the wide variety of disciplinary models and needs, and in persuading researchers to provide enough metadata to make preservation possible without putting them off sharing their research data. Dom Fripp and Nicky Ferguson are working on developing a core metadata profile for the UK Research Data Discovery Service. They spoke about their work on developing a community-driven metadata standard to address these problems. For those interested (and GitHub-literate) the project is available here.

They are drawing on national and international standards, such as the Portland Common Data Model, trying to build on existing work to create a standard which will work for the Shared Services model. The proposed standard will have gold, silver and bronze levels of metadata and will attempt to reward researchers for providing more metadata. This is particularly important as the evidence from Dom and Nicky’s discussion with researchers is that many researchers want others to provide lots of metadata but are reluctant to do the same themselves.

We have had some success with researchers filling in voluntary metadata fields for our repository, Apollo, but this seems to depend to a large extent on how aware researchers are of the role of metadata, something which chimes with Dom and Nicky’s findings. Those creating metadata are often unaware of the implications of how they fill in fields, so creating consistency across teams, let alone disciplines and institutions, can be a struggle. Any Cambridge researchers who wish to contribute to this metadata standard can sign up to a workshop with Jisc in Cambridge on 3rd October.

Planning for the long-term

A shared metadata standard will assist with connecting systems and reducing researchers’ workload, but if replicability, a key problem in scholarly communications, is going to be possible, digital preservation of research data needs to be addressed. Jenny Mitcham from the University of York presented the work she has been undertaking alongside colleagues from the University of Hull on using Archivematica for preserving research data and linking it to pre-existing systems (more information can be found on their blog).

Jenny highlighted the difficulties they encountered getting timely engagement from both internal stakeholders and external contractors, as well as linking multiple systems with different data models, again underlining the need for high quality and interoperable metadata. Despite these difficulties they have made progress on linking these systems and in the process have been able to look into the wide variety of file formats currently in use at York. This has led to conversations with The National Archives about improving the coverage of research file formats in PRONOM (a registry of file formats for preservation purposes), work which will be extremely useful for the Shared Services pilot.

In many ways the project at York and Hull felt like a precursor to the Shared Services pilot, highlighting both the potential problems in working with a wide range of stakeholders and systems and the massive benefits possible from pooling our collective knowledge and resources to tackle the technical challenges which remain in RDM.

Published 14 September 2016
Written by Rosie Higman
Creative Commons License

Disruptive innovation: notes from SCONUL winter conference

On Friday 27 November Danny Kingsley attended the SCONUL Winter Conference 2015, which addressed the theme of disruptive innovation and looked at the changes in policy and practice which will shape the scholarly communications environment for years to come. This blog is a summary of her notes from the event. The hashtag was #sconul15 and there is a link in Twitter.

Disruptions in scholarly publishing

Dr Koen Becking, President of the Executive Board, Tilburg University, spoke first. He is the lead negotiator with the publishers in the Netherlands. Things are getting tight as we count down to the end of the year given the Dutch negotiations with Elsevier (read more in ‘Dutch boycott of Elsevier – a game changer?’).

Koen asked: what is the role of a university – is it knowledge to an end, knowledge in relation to learning or knowledge in relation to professional skills? He said that 21st century universities should face society. While Tilburg University is still tied to traditional roots, it is now focused on the idea of the ‘third generation university’, in which the work needs to have an impact on society.

The Dutch are leading on the open access fight and Koen said they may look at legislation to force the government goal of open access to research articles of 40% by 2016 and 100% by 2024. [Note that the largest Dutch funder NWO has just changed their policy to say funds can no longer be used to pay for hybrid OA and that green OA must be available ‘the moment of’ publication].

Koen noted that the way the Vice Chancellors got involved in the publisher discussions in the Netherlands was that the library director came to him to ask about increasing the subscription budget, and he asked why it was going up so much given the publisher’s profit levels. Money talks.

Managing the move away from historic print spend

Liam Earney from Jisc said there were several drivers for the move from historic print spend and we need models that are transparent, equitable, sustainable and acceptable to institutions. They have been running a survey on acceptable metrics on cost allocation (note that Cambridge has participated in this process). Jisc will shortly launch a consultation document to institutions on new proposals.

Liam noted that their research made it apparent that, both across and within Jisc bands, there are profound differences in what institutions are paying for the same material – sometimes a difference of hundreds of thousands of pounds to access the same content in similar institutions.

They also worked out that if they took a mix of historical print spend and a new metric it would take over 50 years to migrate to a new model. This is not realistic.

Jisc is supported by an expert group of senior librarians (including members at Cambridge) who are working on an alternative. Liam noted that historical print spend is harmful to the ability of a consortium to negotiate coherently. Any new solution needs to meet the needs of academics and institutions.

Building a library monograph collection: time for the next paradigm shift?

Diane Bruxvoort from the University of Aberdeen comes from the US originally and talked about collaborative collection development – we can move together. Her main argument was that for years we have built libraries on a ‘just in case’ model and we can no longer afford to do that. We need to refine our ‘just in time’ purchasing to take care of faculty requests, and also have another strand working across the sector to develop the ‘for the ages’ library.

She mentioned the FLorida Academic REpository (FLARE), which is the statewide shared collection of low use print materials from academic libraries in Florida. Libraries look at what is in FLARE and move the FLARE holding into their cataloguing. It is a one copy repository for low use monographs. The Digital Public Library of America (DPLA) is open to any digitised content, which can be put in the DPLA portal; this deals with the problem of items being siloed.

Libraries are also taking books off the shelf when there is an electronic version. This is a pragmatic decision not made because lots of people are reading the electronic one preferentially, it is simply to save shelf space.

Diane noted a benefit of the UK compared to the US is the size – it is possible to do collaborative work here in ways you can’t in the US. We need collaborative storage and to create more opportunities for collaborative collections development.

The Metric Tide

Professor James Wilsdon of the University of Sussex spoke about the HEFCE report he contributed to, The Metric Tide: Report of the Independent Review of the Role of Metrics in Research Assessment and Management.

This report looked at responsible uses of quantitative data in research management and assessment. He said we should not turn our backs on big data and its possibilities, but we know from our experience in the research system that these can be used as blunt tools in the process. He felt that across the community at large the discussion about metrics was unhelpfully polarised. The debate is open to misunderstanding and needs a more sophisticated understanding of the ways metrics can be used more responsibly.

The agreement is that peer review is the ‘least worst’ form of academic governance that we have. Metrics should support not supplant academic management. This underpins the design of assessment exercises like the REF.

James noted that the metrics review team was getting back together that afternoon to discuss ‘section d’ in the report. He referred to this as being ‘like a band reunion’.

A new era for library publishing? The potential for collaboration amongst new university presses

This workshop was presented by Sue White, Director of Computing and Library Services and Graham Stone, Information Resources Manager, University of Huddersfield.

Sue talked about the Northern Collaboration of libraries looking at joining forces to create a publishing group. They started with a meeting in October 2014. There is a lot of uncertainty in the landscape, with a big range of activity from well established university presses to those doing no publishing at all. She said the key challenge to the idea of a joint press was the competition between institutions. But they decided the idea merited further exploration.

Discussions were around the national monograph strategy roadmap that advocated university publishing models. The Northern Collaboration took a discussion paper to Jisc – and they suggested three areas of activity. They were:

  • Benchmarking and data gathering to see what was happening in the UK.
  • Identifying best practice and possible workflow efficiencies – common ground.
  • Exploring the potential for a library publishing coalition.

The project is about sharing and providing networks for publishing ventures. In the past couple of days Jisc has agreed to take the first two forward and welcome input. They want some feedback about taking it forward.

Graham then spoke about the Huddersfield University Press which has been around since 2007 – but was re-launched with an open access flavour. They have been publishing open access material for three to four years. They publish three formats – monographs, journal publications and sound recordings.

The principles governing the Press are that everything is peer reviewed, that as a general rule everything should be open access, and that they publish via the (ePrints) open access repository, which gets lots of downloads. The Press is managed by the library but led by the academics. The business model is not-for-profit, as it is scholarly communication. If there were any surplus it would be reinvested in the Press. In the last four years they have published 12 monographs, of which six are open access.

Potential authors have to come with their own funding. This tends to be an arrangement sponsored by an institutional funder. The proposal form has a question, ‘how is this going to be funded?’ This point is ignored for the peer review process. Having money means a proposal will be looked at but does not guarantee publishing. The money pays for a small print run and copy editing, not staff costs. A monograph of about 70,000 words costs in the region of £3000-£4000.

Seven journals are published in the repository – there is an overlay on the repository, and they are preserved in Portico. The journals are discoverable through Google and through library webscale discovery (via the repository’s OAI-PMH compliance), and are included in the DOAJ. Their ‘Teaching and lifelong learning’ journal has every tickbox on DOAJ.

The enablers for this Press have been senior support in the university at DVC level and the capacity and enthusiasm of an Information Resource Manager to absorb the Press into an existing role, as well as having an editorial board with people from across the institution. Operating the Press on a shoestring is hard. It is difficult to establish a reputation, convince potential stakeholders and demonstrate impact. A lack of long term funding means it is difficult to forward plan.

They also noted that there are not very many professional support networks out there and it would be good to have one. They need specialist help with author contracts and licences.

Who will disrupt the disruptors?

The last talk was by Martin Eve, Senior Lecturer in Literature, Technology and Publishing, Birkbeck, University of London. This was an extremely entertaining and thought provoking talk. The slides are available.

Martin started with critical remarks about the terminology of ‘disruptive’, arguing that often the word is used so the public monopoly can be broken up into private hands. That said, there are parts of the higher education sector which are broken and need disruption.

Disruption is an old word – from Latin, first used in the 15th century. Now it actually means the point at which an entire industry is shifted out. What we see now is just a series of small increments. The changes happening in the higher education sector are not technological, they are social, and it is difficult to break that cycle of authority and how it works.

Martin argued that libraries need to be strategic and pragmatic. We have had a century long breakdown of the artificial scarcities in trading of knowledge coming to a head in the digital age. There are new computational practices with no physical or historical analogy – the practices don’t fit well with current understandings. He gave a couple of historical examples where in the 1930s people made similar claims.

The book as a product of scholarly communication is so fetishized that when we want the information we need the real object – we cannot conceive of it in another form.

Universities in the digital age just don’t look like they did before. We have an increasingly educated populace – more people can actually read this stuff so the argument that ‘the public’ can’t understand it is elitist and untrue. Institutional missions need to be to benefit society.

Martin discussed the issues with the academic publication market. A reader always needs a particular article – the traditional discourses around the market play out badly. You don’t know if you need a particular article until you read it and if you do need it you can’t replace it with anything else.

Certain publications can have a rigorous review practice because they are receiving high quality submissions. But they only get high quality submissions if you have lots of them and they get that reputation because of a rigorous review practice. So early players have the advantage.

He noted that different actors care about the academic market in different ways. Researchers produce academic products for themselves – to buy an income and promotion. Publishers frame their services as doing things for authors – but they don’t do enough for readers and libraries. Who pays? Researchers have zero price sensitivity. Libraries are stuck between a rock and a hard place. They have the cash but are told how to spend it. The whole thing is completely dysfunctional as a market. In the academy, reading is underprivileged. Authorship is rewarded.

Martin then talked about open access and how it affects the Humanities. He noted that monographs are acknowledged as different – eg: HEFCE mandate. There are higher barriers for entry to new publishers – people don’t have a good second book to give away to an OA publisher. There are different employment practices, for example creative writers are often employed on a 0.5 contract – they are writing novels and selling them and commercial publishers get antsy about requirements for open access because there is a cross over with trade books.

The subscription model exists on the principle that if enough people contribute then you have enough centrally to pay for what the costs are. It assumes a rivalrous mode – the assumption is there will always be greedy people who won’t pay in if they don’t get an exclusive benefit.

The Open Library of the Humanities is funded by a library consortium. It is based on the arXiv funding model and Knowledge Unlatched. Libraries pay into a central fund in the same way as a subscription. Researchers who publish with us do not have to be at an institution that is funding the model, or even at an institution at all. There are 128 libraries financially supporting the model (as of Monday should be 150). The rates are very low – each one only has to pay about $5 per article. They are publishing approximately 150 articles per year.
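As a back-of-envelope illustration of what those figures imply, the short Python sketch below multiplies them out. It assumes, purely for illustration, that every supporting library pays the same flat $5 for every article published in a year; the talk notes do not say how the fees are actually structured, and the function and variable names are hypothetical.

```python
# Rough arithmetic on the figures quoted in the talk notes.
# Assumption (not from the talk): a flat per-article rate paid by
# every supporting library for every article published that year.

def olh_funding_estimate(libraries, rate_per_article_usd, articles_per_year):
    """Return (annual cost per library, total annual fund) in US dollars."""
    per_library = rate_per_article_usd * articles_per_year
    total_fund = per_library * libraries
    return per_library, total_fund


per_library, total_fund = olh_funding_estimate(
    libraries=128, rate_per_article_usd=5, articles_per_year=150
)
print(f"Each library pays roughly ${per_library} per year")
print(f"Total pooled fund is roughly ${total_fund} per year")
```

On these assumptions the cost to any one library is small, while the pooled fund works out at several hundred dollars per article published.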

Published 28 November 2015
Written by Dr Danny Kingsley
Creative Commons License

Where to from here? Open Access in Five Years

As part of the Office of Scholarly Communication Open Access Week celebrations, we are uploading a blog a day written by members of the team. Thursday is a piece by Dr Arthur Smith looking to the future.

Introduction

Academic publishing is not what it used to be. Open access has exploded on the scene and challenged the established publishing model that has remained largely unchanged for 350 years. However, for those of us working in scholarly communications, the pace of change feels at times frustratingly slow, with constant roadblocks along the way. Navigating the policy landscape provided by universities, funders and publishers can be maddening, yet we need to remain mindful of how far we have come in a relatively short time. There is no sign that open access is losing momentum, so it’s perhaps instructive to consider the direction we want open access to take over the next five years, based upon the experiences of the past.

So how much is the University of Cambridge publishing and is it open access? Since 1980, according to Web of Science, the University’s publications increased from 3000 articles per year to more than 11,000 in 2014 (Fig. 1). The proportion of gold open access articles has risen steadily since they first appeared on the scene in the late 1990s. Thus far in 2015 nearly one in ten articles is available gold open access, although this ignores the many articles available via green routes.

Fig. 1. Publications at the University of Cambridge since 1980 according to WoS (accessed 14/10/2015).

 

The HEFCE policy

By far the most important development for open access in the UK has been the introduction of HEFCE’s open access policy. As the policy applies to all higher education institutions it affects every university researcher in the UK. While the policy doesn’t formally start until April 2016, so far progress has been slow (Fig. 2). We believe that less than a third of all the University’s articles that are published today are currently compliant with the HEFCE policy, and despite a strong information campaign, our article submission rate has stagnated at around 250 articles per month, well off the monthly target of 930.

Fig. 2. Publications received to the University of Cambridge open access service. The target number of articles per month is 930.

It’s understandable that some papers will fall through the cracks, but even for high impact journals many papers still don’t comply with the policy. But let’s be clear, aside from any policy compliance issues and future REF eligibility, these numbers reveal that fully two thirds of research papers produced at the University cannot be read without a journal subscription. And if readers can’t afford to pay for access then they’ll happily find other means of obtaining research papers.

What about inviting authors to make their research papers open access? Since June I have tracked five high impact journals and monitored the papers published by University of Cambridge authors (Fig. 3). Upon first discovery of a published paper, only 29% of articles were compliant with the HEFCE policy, which is consistent with our overall experience in receiving author’s accepted manuscripts (AAMs). But even after inviting authors to submit their accepted manuscripts to the University’s open access repository, the number of compliant articles rose to only 42%. Less than a third of authors who were directly contacted and asked to make their work open access eventually submitted their manuscripts. Clearly, the merits of open access are not enough to convince authors to act and distribute their manuscripts.

Fig. 3. Compliant articles published in five high impact journals. Even after direct intervention less than half of all articles are HEFCE compliant.
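As a rough check on how those two percentages relate to the response rate, the sketch below works out what share of the initially non-compliant papers became compliant after contact. It is only an approximation: it assumes the 29% and 42% figures refer to the same set of papers and that each non-compliant paper led to one invitation, neither of which is stated above. The function name is hypothetical.

```python
def response_rate_pct(compliant_before_pct, compliant_after_pct):
    """Share of initially non-compliant papers that became compliant.

    Assumes both percentages are measured over the same set of papers.
    """
    newly_compliant = compliant_after_pct - compliant_before_pct
    initially_non_compliant = 100 - compliant_before_pct
    return 100 * newly_compliant / initially_non_compliant


rate = response_rate_pct(29, 42)
print(f"About {rate:.1f}% of non-compliant papers were deposited after contact")
```

Under those assumptions the result sits well below one third, which is in line with the observation that most contacted authors did not submit a manuscript.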

SCOAP³

The SCOAP3 initiative is a publishing partnership that makes journals in the field of particle physics open access. This innovative scheme brings together multiple universities, funders and publishers and turns traditional journals, that are already widely respected by the physics community, into purely open access journals. No intervention is required by either authors or university administrators, making the process of publishing open access as simple as possible. The great advantage of this scheme is that authors don’t need to worry about choosing an open access option from the publisher, nor deal with messy invoices or copyright issues. All of these problems have been swept away.

Jisc Springer Compact

Like SCOAP3 the recently announced Jisc Springer Compact is a coalition of universities in the UK that have agreed a publishing model with Springer that makes ~1600 journals open access. Following a similar Dutch agreement, this publishing model means that any authors with qualifying institutional affiliations will have their publications made open access automatically. We’ve already started receiving our first requests under this scheme. However, unlike the SCOAP3 initiative which ‘flips’ entire journals to gold OA, the journals under the UK Jisc Springer Compact are still hybrid and only content produced by qualifying authors is open access. While this is great for those universities signed up to the deal, it still leaves a great many papers languishing under the subscription model.

Affiliation vs. Community

So which of these strategies will prove to be the most successful? Will universities take ownership of open access publishing or will subject based communities come together in publishing coalitions?

The advantage of subject based initiatives is they flip entire journals for the benefit of a whole research community, making all the work within a specific discipline open access. However, without sufficient cohesion and drive within an academic community it’s likely that adoption will be fragmented across the myriad of disciplines. It’s no surprise that SCOAP3 emerged out of the particle physics community, given this scholarly community’s involvement in the development of arXiv, but it’s unrealistic to expect this will be the case everywhere.

Publishing agreements based around institutional affiliations will undoubtedly become more common, but until all universities have agreements in place with all the major publishers (Elsevier, Wiley, Springer, etc.) then a large fraction of scholarly outputs will still remain locked down.

What does the future hold?

Ultimately I want to do myself out of a job. As odd as that sounds, the current system of paying publishers for individual papers to be made open access is a laborious and time consuming process for authors, publishers and universities. Similarly the process of making accepted manuscripts available under the green model is equally ridiculous. Publishers should be automatically depositing AAMs on behalf of authors. There is no evidence that making AAMs available has ever killed a journal, and besides, the sooner we can reach agreements with all the major publishers and research funders that result in change on a global scale the better it will be for everyone.

Published 22 October 2015
Written by Dr Arthur Smith
Creative Commons License

Cambridge expenditure on APCs in 2014

Cambridge (along with many other institutions) was recently approached by Jisc to report on our article processing charges (APC) payments for 2014 as part of Jisc’s APC data collection project to address the Total Cost of Ownership of scholarly communication. Stuart Lawson, who is compiling these datasets, has made the files available on Figshare.

A couple of caveats – This dataset only contains APCs which were paid centrally; there will be many other APCs paid by the University of Cambridge and its staff which are not included in this dataset.

Also we ended up listing the publications that were submitted to our system in 2014 because that was our starting point, rather than considering the payments from 2014 and working back. This might be an issue for the analysis – it will depend on which way people have interpreted ‘2014’. I should note that 74 (12.13%) of the invoices listed in this data were actually paid in January 2015.

Headline numbers

  • 610 funded articles were submitted in 2014 to our system for publication
  • 495 have been invoiced and paid as at March 2015
  • The amount spent on APCs (including VAT) for these invoices was £936,224.86
  • This gives an average cost per APC paid (including VAT if charged) of £1,891.36
  • The range of APCs is from £94.61 for an article published by Magnolia Press, to £3,869.72 for an article published by Wiley
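For anyone wanting to reproduce the derived figures, the short Python sketch below recomputes them from the counts and total quoted in this post. The variable and function names are illustrative only and are not fields from the Jisc template; the percentage paid in January 2015 is taken against the 610 listed articles, which is an assumption about how that figure was calculated.

```python
# Figures quoted in this post.
SUBMITTED = 610                # funded articles submitted in 2014
PAID = 495                     # invoiced and paid as at March 2015
TOTAL_SPEND_GBP = 936_224.86   # including VAT
PAID_IN_JAN_2015 = 74          # invoices actually paid in January 2015


def summarise(submitted, paid, total_spend_gbp, paid_in_jan_2015):
    """Return the derived headline figures, rounded to two decimal places."""
    return {
        "average_apc_gbp": round(total_spend_gbp / paid, 2),
        "paid_share_pct": round(100 * paid / submitted, 2),
        "paid_in_jan_2015_pct": round(100 * paid_in_jan_2015 / submitted, 2),
    }


summary = summarise(SUBMITTED, PAID, TOTAL_SPEND_GBP, PAID_IN_JAN_2015)
for name, value in summary.items():
    print(f"{name}: {value}")
```

The same function could be pointed at another institution's counts to produce comparable figures, provided '2014' has been interpreted in the same way.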

What does this mean?

It means we are spending a lot of (RCUK) money on APCs.  We also have supported payment of page and colour charges and have paid for researchers to join memberships that offer a discount for APCs out of the RCUK fund – neither of those categories of expenditure was captured in this data set.

The University is participating in the various Jisc Collections series of offsetting programs with publishers and we are discussing other ways of managing this expenditure. However, we really need to consider whether this is the way of the future.

Issues with reporting

Pulling the information together for this list revealed a few issues. First, while we agree with the collection of data to allow aggregation across the sector, for us to pull the required information together was challenging because we do not collect the information in this way.

However, there are some indications this type of detail will be requested on a standard basis for reporting. Certainly Jisc is suggesting this as a way forward. In their ‘APC data collection’ blog they state:

HEIs will be able to benchmark their APC data. Using a standard template will help to produce comparable data between institutions which can be more easily aggregated. The data fields to be completed have been chosen from careful analysis of HEI needs. This means that the spreadsheet can be used for both internal reporting and also external reporting including to the Wellcome Trust for compliance monitoring of the Charity Open Access Fund, and potentially RCUK.

We therefore need to consider this information when designing new systems.

Issues with invoices

We have a considerable block of Purchase Orders that have not been invoiced. While there will always be a delay because of the length of time between acceptance and publication in some instances, some of these are very old.

The issue of items not being invoiced can partially be explained by the cancellation of Purchase Orders. In some cases the team has contacted the author and found that the email is bouncing because the author has moved to a different institution. In other cases the author decided not to go ahead with open access publication, so we have raised a Purchase Order against something that no longer exists.

Long standing Purchase Orders (over 14 months) are potentially a problem because it is money being held as committed funds. We are now adding the process of checking older un-invoiced Purchase Orders to the ever-growing list of things to do in the workflow for ensuring compliance.

Published 26 March 2015
Written by Dr Danny Kingsley
Creative Commons License