Tag Archives: research data management

Data Diversity Podcast #2 – Dr Alfredo Cortell-Nicolau

In our second instalment of the Data Diversity Podcast, we are joined by archaeologist Dr Alfredo Cortell-Nicolau, a Senior Teaching Associate in Quantitative and Computational Methods in Archaeology and Biological Anthropology at the McDonald Institute for Archaeological Research and Data Champion.

As is the theme of the podcast, we spoke to Alfredo about his relationship with data and learned from his experiences as a researcher. The conversation also touched on the different interpersonal, and even diplomatic, skills that an archaeologist must possess to carry out their research, and how one’s relationship with individuals such as landowners and government agents might impact their access to data. Alfredo also sheds light on some of the considerations that archaeologists must go through when storing physical data and discussed some ways that artificial intelligence is impacting the field. Below are some excerpts from the conversation, which can be listened to in full here.

I see data in a twofold way. This implies that there are different ways to liaise with the data. When you’re talking about the actual arrowhead or the actual pot, then you would need to liaise with all the different regional and national laws regarding heritage and how they want you to treat the data because it’s going to be different for every country and even for every region. Then, of course, when you’re using all these morphometric information, all the CSV files, the way to liaise with the data becomes different. You have to think of data in this twofold way.

Dr Alfredo Cortell-Nicolau

Lutfi Othman (LO): What is data to you?

Alfredo Cortell-Nicolau (ACN): In archaeology in general, there are two ways to see the data. In my case for example, one way to see it is that the data is as the arrowhead and that’s the primary data. But then when I conduct my studies, I extract lots of morphometric measures and I produce a second level of data, which are CSV files with all of these measurements and different information about the arrowheads. So, what is the data? Is it the arrowhead or is it the file with information about the arrowhead? This raises some issues in terms of who owns the data and how you are going to treat the data because it’s not the same. In my case, I always share my data and make everything reproducible. But when I share my data, I’m sharing the data that I collected from the arrowheads. I’m not sharing the arrowheads because they are not mine to share.

This is kind of a second layer of thought when you’re working with Archaeology. When you’re studying, for example, pottery residues, then you’re sharing the information of the residues and not the pot that you used to obtain those residues. There are two levels of data. Which is the actual data itself? The data which can be reanalyzed in different ways by different people, or the data that you extracted only for your specific analysis? I see data in this twofold way. This implies that there are different ways to liaise with the data. When you’re talking about the actual arrowhead or the actual pot, then you would need to liaise with all the different regional and national laws regarding heritage and how they want you to treat the data because it’s going to be different for every country and even for every region. Then, of course, when you’re using all these morphometric information, all the CSV files, the way to liaise with the data becomes different. You have to think of data in this twofold way.

On some of the barriers to sharing of archaeological data

ACN: There are some issues in how you would acknowledge that the field archaeologist is the one who got the data. Say that you might have excavated a site in the 1970s and some other researcher comes later, and they may be doing many publications after that excavation, but you are not always giving the proper attribution to the field archaeologist because you cited the first excavation in the first publication, and you’re done. Sometimes, that makes field archaeologists reluctant to share the data because they don’t feel that their work is acknowledged enough. This is one issue which we need to try to solve. Take for example a huge radiocarbon database of 5000 dates: if I use that database, I will cite whoever produced that database, but I will not be citing everyone who actually contributed indirectly to that database. How do I include all of these citations? Maybe we can discuss something like meta-citations, but there must be some way in which everyone feels they are getting something out of sharing the data. Otherwise, there might be a reaction where they think “well, I just won’t share. There’s nothing in for me to share it so why should I share my data”, which would be understandable.

On dealing with local communities, archaeological site owners and government officials

ACN: When we have had to deal with private owners, local politicians and different heritage caretakers, not everyone feels the same way. Not everyone feels the same way about everything, and you do need a lot of diplomatic skills to navigate through this because to excavate the site you need all kinds of permits. You need the permit of the owner of the site, the municipality, the regional authorities, the museum where you’re going to store the material. You need all of these to work and you need the money, of course. Different levels of discussion with indigenous communities is another layer of complexity which you have to deal with. In some cases, like in the site where we’re excavating now, the owner is the sweetest person in the world, and we are so lucky to have him. I called him two days ago because we were going to go to the site, and I was just joking with him, saying I’ll try not to break anything in your cave, and he was like, “this is not my cave. This is heritage for everyone. This is not mine. This is for everyone to know and to share”. It is so nice to find people like that. That may happen also with some kinds of indigenous communities. The levels of politics and negotiation are probably different in every case.

On how archaeologists are perceived

LO: When you approach a field or people, how do they view the archaeologists and the work?

ACN: It really depends on the owner. The one that we’re working with now, he’s super happy because he didn’t know that he had archaeology in his cave. When we told him, he was happy because he’s able to bring something to the community and he wants his local community to be aware that there is something valuable in terms of heritage. This is one good example. But we have also had other examples, for instance, where the owner of the cave was a lawyer and the first thing he thought was “are there going to be legal problems for me? If something happens in the cave, who’s the legal responsibility.” In another case there was there was another person that just didn’t care, she said “you want to come? Fine. The field is there, just do whatever you want.” So, there are different sensibilities to this. Some people are really happy about the heritage and don’t see it as a nuisance that they have to deal with. 

LO: How about yourself as a researcher, archaeologist: do you see yourself as the custodian of sorts, or someone who’s trying to contribute to this or local heritage for the place? Or is it almost scientific and you’re there to dig.

ACN: When I approach the different owners, I think the most important thing is to let them know that they have something valuable to the local community and they can be a part of that. They can be a part of being valuable to the local community. Also, you must make it clear that it’s not going to be a nuisance for them and they don’t have to do anything. I think the most important part is letting them know how it can be valuable for the community. I usually like them to be involved, and they can come and see the cave and see what we are doing. In the end it’s their land and if they see that we are producing something that is valuable to the community then it is good for them. In this case, the type of data that we produce is the primary type of data, that is, the actual different pottery sherds, the different arrowheads, etcetera. In this current excavation, we got an arrowhead that is probably some 4- or 5000 years old and you get (the land owners) to touch this arrowhead that no one in 5000 years has seen. If you can get the owners to think of it in this way, that they’re doing something valuable for your community, then they will be happier to participate in this whole thing and to just let us do whatever we want to do, which is science.

LO: How do you store physical data? Or do you let the landowner store it?

ACN: That depends on the national and regional laws and different countries have different laws about this. The cave where I’m working right now is in Spain, so I’m going to talk about the Spanish law, which is the one that I that I follow and it’s going to be different depending on every country. In our case, with the different assemblages that you find, you have a period of up to 10 years where you can store them yourself in your university and that period is for you to do your research with them. After that period, it goes to whichever museum they are supposed to be going, which depends on the law that says that it has to be the museum that is the closest to the cave or site where they were excavated. Here, the objects can then be displayed and the museum is the ones responsible for managing them, and storing them long term.

There is one additional thing: If you are excavating a site that has already been excavated, then there is a principle of keeping the objects and assemblages together. For example, there is this cave that was excavated in the 1950s and they store all the assemblages in the Museum of Prehistory of Valencia, which was the only museum in the whole region. Now, they excavated it again a few years ago and now there are museums that are closer to the cave but because the bulk of the assemblages are in Valencia and they don’t want to have it separated in two museums, they still have to go to Valencia. This is the principle of not having the assemblages separated and it is the most important one.


As always, we learn so much by engaging with our researchers about their relationship with data, and we thank Alfredo for joining us for this conversation. Please let us know how you think the podcast is going and if there are any question relation to research data that you would like us to ask!

The September 2023 Data Champion Forum

The Cambridge Data Champions had a fantastic September Forum at the West Hub. The forum started with an introduction to the West Hub by  Library Manager Daniele Campello and we welcomed Clair Castle as the new interim Research Data Manager with the Office of Scholarly Communication (University Library).

Dr Mandy Wigdorowitz kicked off the presentations by sharing with the Data Champions what she aims to achieve as the University’s Open Research Community Manager. This includes raising the profile of Open Research at the University and ensuring that scholarly and research outputs that are deemed to be open are indeed accessible and interoperable in accordance with FAIR principles.  As Open Research Community Manager, Mandy advocates for Open Research among University researchers from both the STEMM and AHSS (Art, Humanities and Social Sciences) disciplines. The latter proves to be more challenging as researchers in AHSS may often have valid reasons from refraining from making their research data open, such as working with sensitive data or working with interlocutors who object to their data being shared. Such issues will be addressed at the Cambridge Open Research Conference that she is organising, which takes place on 17th November 2023 at Downing College, Cambridge as well as online. To end, Mandy invited the Data Champions to join her Open Research initiative, a community of advocates for Open Research across the University.

Before lunch, Madeleine Taylor (Information Security Risk and Governance Manager with University Information Services, UIS) presented a follow up to a webinar session on monitoring the Information and Cybersecurity (ICS) risks for research data across the university, which she conducted with the Data Champions a couple weeks prior. After a brief introduction of what she has done so far to protect Cambridge’s research communities against ICS threats, she asked the Data Champions for help in her task of securing research data against ICS risks. They can do so by providing her with a sense of what data their own research communities are working with and how they were storing them. As the Data Champions ate the delicious lunch of sandwiches and cakes provided by the West Hub caterers, they provided feedback to Madeleine on two forms that she proposed as methods of gathering the information she needed: a 3-minute research data impact assessment form and a research data cyber security risk form. Maddy will continue to work with the Research Data Team and the Data Champions to refine, and gather information, through these forms.

Thank you to the West Hub and Daniele Campello for hosting the Data Champions Forum in your welcoming building!

If you are a member of the University of Cambridge and are interested in attending the Data Champions Forum, please join us as a Data Champion. If you are passionate about research data management and data sharing or you would like to find out more about what being a Data Champion entails, please visit the Data Champions webpage. We welcome applications from those working in all academic subjects across AHSS and STEMM disciplines. If you are unsure about how being a Data Champion would impact your research, please get in touch with the Research Data Team!

Cartoon by Clare Trowell CC-BY-NC-ND



Open Research in Cambridge: 2022 in review

2022 has been another fantastic year for Open Research in Cambridge and I’m so proud of what we have achieved together as a community of researchers, library staff, technicians, administrators, publishers and more. I’d like to highlight some of the key themes in our work this year and thank all who have contributed to this work in any way throughout the year (though I have limited myself to naming chairs of workstrands below). The following video by our Pro-Vice-Chancellor (Research), Prof Anne Ferguson-Smith, gives an indication of the importance that the university places on this work.

Understanding disciplinary differences

I know that I’m not alone in hearing that researchers in Arts, Humanities & Social Sciences disciplines often feel a disconnect between the language and priorities of “Open Science” and their experiences of how research is conducted – this is one of the reasons we choose to frame it as “Open Research” here in Cambridge. I see a strong desire from many to engage with open research practices, paired with frustration with the challenges of translating the terminology of open science to other areas. In order to better understand these issues, we established two working groups (Open Research in the Humanities and Open Qualitative Research), each of which was tasked with forgetting what they think they should do due to how open science is generally described, and instead describe what they see as the opportunities for open research within their disciplines.

The Open Research in the Humanities group was chaired by Prof Emma Gilby and supported by Dr Matthias Ammon. Their excellent report is already available on Apollo and through a series of blog posts here on Unlocking Research. The Open Qualitative Research group was chaired by Dr Meg Westbury and their report is due to come to the university’s Open Research Steering Committee in January. We will be sharing this more widely in early 2023 – it’s well worth watching out for! Both reports will inform how we talk about Open Research at Cambridge and will shape the transformative programme that we are in the process of developing.

Research data management

Our small but dedicated Research Data team, led by Dr Sacha Jones, has had another impressive year. Our Data Champions Network goes from strength to strength, and has expanded into departments that have not been represented in previous years. Other key projects have included a review of our research data services with recommendations for future development, a project on electronic research notebooks, and lots of work to support open research system developments, all while continuing to support researchers with data deposits and writing data management plans. This team is expanding next year which will enable even more work to meet the needs of different disciplines.

The future of scholarly publishing

We hosted a series of three strategic workshops on the future of scholarly communication earlier this year, developed in collaboration between Cambridge University Libraries and Cambridge University Press. Led by independent facilitator Mark Allin, participants across disciplines and career stages came together to discuss the problems of scholarly communication, potential long-term solutions to these problems and a strategy to help Cambridge get us there. The proposals emerging from the meeting are currently being developed and include newly developed infrastructures for diamond open access publishing projects and a series of high-level strategic meetings aimed at strategic improvements to equity in academic publishing. There are already diamond publishing initiatives within Cambridge, and projects will start in early 2023 to understand existing initiatives in greater detail and to provide the infrastructure to establish additional diamond journals. 

The library’s annual Open Research Conference took a similar visionary approach in its focus on the future of open access. Titled Open Access: Where Next?, the conference featured expert speakers on how we can think beyond open access toward more innovative, sustainable and equitable open futures. We heard from researchers excluded by certain approaches to open access, how other researchers are addressing issues through their own scholar-led approaches, alongside how openness fits into changing research cultures and can facilitate experimental publishing projects. A full round up with videos of each session is available on the Unlocking Research blog.  My thanks to Dr Bea Gini for her leadership in planning this conference.

Open Access now

While we are actively working towards a new future for scholarly publishing, we also need to ensure that our researchers have ways to make their work open access right now. We do this in a number of ways, engaging with the academic community and contributing expert open access advice on publishing agreements that are negotiating across the sector and administering the block grants that are provided by funders and the university to cover the costs of publishing in fully open access venues. All of this requires close reading and interpretation of funder requirements to ensure that we are able to support our researchers in what they are required to as well as what they would like to do. I’d like to specifically thank Alexia Sutton, who leads our Open Access team, and Dr Samuel Moore, our Scholarly Communication Specialist, for their leadership in this area.

We are particularly pleased with the engagement from across the university with the ongoing Rights Retention Pilot, which provides a route to open access for articles that cannot be made immediately available through existing publishing deals, are not eligible for the block grants mentioned above or where the publisher simply does not provide any route to immediate open access. We are now consulting on the development of a Self-Archiving Policy which is buit on what we have learned throught he pilot and will sit within our Open Access Publications Policy Framework. Members of the university can find out more by reading this document (accessible to Raven users only). It has been an honour to lead a dedicated group of library and research staff on this project.

Open research systems

Everything we do requires that we have the right technical infrastructure in place. The Open Research Systems team is led by Dr Agustina Martinez-Garcia and based within Cambridge University Libraries’ Digital Initiatives directorate. This year has seen projects to upgrade links between Symplectic Elements and Apollo, technical changes to support the rights retention pilot, a review of the open research systems landscape, contributing to thinking around future publishing platforms, electronic research notebooks and data infrastructure, and planning ahead for the upgrade to DSpace 7, improvements in the thesis service, and building connections between DSpace repositories and Octopus. This is not a comprehensive list and we plan to showcase more of their work on the blog in 2023.

Research enquiries, briefings and training

I want to end with huge thanks to the library staff based both in the Office of Scholarly Communication and in the Faculty & Department Libraries who do so much throughout the year, answering frontline research support queries, signposting as required, providing tailored briefings and training on highly complex and constantly changing topics. We especially value the disciplinary insights we get through working closely with the Research Support Librarians that are based within the Schools.

Join our team!

Open Research is an incredibly rewarding area to work in and the scale of what we’re trying to achieve is really ambitious. I’m delighted that the importance of what we are doing is recognised by both Cambridge University Libraries and the wider university and as a result we are expanding our team!

We are currently recruiting for an Open Research Community Manager to establish and develop a Cambridge Open Research Community, bringing researchers across the university community together through regular online and in person events to enable exchange of expertise in open and rigorous research practices. In January, we plan to advertise for two Research Data Coordinators and an Open Research Administrator, with a Research Services Manager post following later in the year. All of these roles will be listed on the university’s jobs site as well as on LinkedIn, mailing lists etc. If you’re interested in our work and would like to find out more about these opportunities please get in touch at info@osc.cam.ac.uk!