EU-Citizen.Science: A Platform for Mainstreaming Citizen Science and Open Science in Europe

Citizen Science (CS) is a prominent field of application for Open Science (OS), and the two have strong synergies, such as: advocating for the data and metadata generated through science to be made publicly available [1]; supporting more equitable collaboration between different types of scientists and citizens; and facilitating knowledge transfer to a wider range of audiences [2]. While primarily targeted at CS, the EU-Citizen. Science platform can also support OS. One of its key functions is to act as a knowledge hub to aggregate, disseminate and promote experience and know-how; for example, by profiling CS projects and collecting tools, resources and training materials relevant to both fields. To do this, the platform has developed an information architecture that incorporates the public participation in scientific research (PPSR)—Common Conceptual Model①. This model consists of the Project Metadata Model, the Dataset Metadata Model and the Observation Data Model, which were specifically developed for CS initiatives. By implementing these, the platform will strengthen the interoperating arrangements that exist between other, similar platforms (e.g., BioCollect and SciStarter) to ensure that CS and OS continue to grow globally in terms of participants, impact and fields of application.


INTRODUCTION
The Citizen Science (CS) landscape is growing rapidly, gaining new actors and expanding into more diverse fields. National initiatives to promote and develop CS are emerging in many countries, and its scientific, societal and political impacts are becoming more widely recognized. This is evident in the European Union's (EU) Horizon 2020 program, which has seen the number of projects funded under its Science with and for Society (SwafS) stream rise from 21 in 2014 to 35 in 2019 [3]. This growth has largely been uncoordinated, however, and taken place without a shared agenda-and this creates challenges. For example, it has led to a huge range of approaches, tools, resources, projects and impacts. While this diversity is welcome, it can result in duplicated efforts, or make it difficult for newcomers to navigate the range of methods and approaches found across different disciplines, sectors and regions.
While CS is grounded in the scientific process, CS approaches to research often require considering additional factors ranging from the ethical engagement of volunteer communities to tailed data quality practices. To address the unique practices associated with CS, the EU-Citizen.Science platform  was created as a central place to exchange experiences, gather information and network around CS-and wider Open Science (OS) practices. This three-year EU-funded project (2019-21) is being led by a consortium of 21 partners from universities, non-governmental organizations (NGOs), local authorities, civil society organizations, small and medium-sized enterprises and natural history museums. The platform's alpha version was released in March 2020 as a mutual learning space where CS tools and resources, best practice examples and project profiles are collected, curated and made available for anyone interested in CS, such as researchers (including those not current leveraging CS), public citizens, the media, scientific institutions, policy-makers and funding bodies. By 2021, EU-Citizen.Science aims to be the central knowledge-sharing hub for CS in Europe. Section 1.1 provides a fictional example of how EU-Citizen.Science can be used to find projects and resources.
By advocating for, and supporting efforts to realize greater citizen involvement in science, EU-Citizen.Science supports many aims of OS. The ethos described in the 10 principles of CS [4] is a key foundational reference point for the platform, and is consistent with and supportive of the central aims of OS for transparency, accessibility, sharing and collaborative inclusion. For example, involvement in CS projects can enhance public interest in science [5].
It is important to note that unlike open repositories for documents and data such as B2SHARE, Zenodo, or OSF, EU-Citizen.Science does not host any data gathered by CS initiatives; it only gathers descriptive metadata for projects and resources, along with the open-repository-generated DOI to signpost the hosted

EU-Citizen.Science: A Platform for Mainstreaming Citizen Science and Open Science in Europe
location of the resource, or permalink to the project. EU-Citizen.Science does, however, provide guidelines for making citizen-generated data publicly available and, where possible, published in an open-access format. In keeping with the FAIR principles (findable, accessible, interoperable, and reusable) for data handling and processing, and to ensure the interoperability of the platform with other CS platforms, and to enable the exchange of information about projects and resources via an application programming interface (API), EU-Citizen.Science has implemented: (1) the Project Metadata Model to describe CS projects, (2) the Digital Documents ontology from Schema.org (with vocabulary from Dublin Core) to describe CS resources, and (3) the Courses ontology from Schema.org to describe CS training materials. This information architecture not only supports such network collaborations; it also greatly aids the platform's search functions for users.
This article first explains how the two fields of CS and OS intersect (Section 2), and then explores the information architecture behind the platform in detail (Section 3). It concludes by considering the challenges the platform faces (Section 4) and plans for its future use and development (Section 5).

EU-Citizen.Science in Use: A Practical Example
Sally and James plan to set up an NGO to monitor air quality in their town. They want to know if there are similar initiatives in Europe that could provide them with a starting point. On EU-Citizen.Science, they search for projects and resources that include the term "air quality" in their metadata. Their search yields several results, including D-NOSES  , a project about odour pollution. On the platform, they can read about the project's aims and discover its free-to-use tools: the OdourCollect app  and the International Odour Observatory  . Data already collected through D-NOSES is openly accessible on the D-NOSES app, alongside open-access information and resources to further their knowledge on how citizens can monitor odour pollution. Through EU-Citizen.Science, Sally and James can also connect with other CS projects, providing them with a wealth of experience and knowledge to build upon. This helps to kick-start their own initiative and they use the open-access tools, which have been tested by others, to start collecting data on odours in their region. Once their own NGO has been set up, they add it as a new project to EU-Citizen.Science, using relevant metadata to ensure that others can find them in the same way they found D-NOSES.

A Brief Overview of CS
While CS has a tradition dating back centuries [6], the concept was not formally established until the 1990s. Two of the definitions developed then are still applicable today. Bonney [7] defined it as projects in which non-professional scientists voluntarily contribute scientific data to professionally led research projects, while Irwin [8] described it as a way of developing concepts of scientific citizenship, and opening up science and science policy processes to the public.

EU-Citizen.Science: A Platform for Mainstreaming Citizen Science and Open Science in Europe
These early definitions highlight the continuing challenge in pinning down exactly what CS is. While Bonney focuses on the acceleration of research through new data, and gains in topical knowledge by citizens, Irwin prioritizes participation, the opening up of science and two-way knowledge exchange. This relative ambiguity about CS is further complicated by the numerous terms and typologies that have emerged in recent years, which include, among others: community science, civic science, people-powered science, volunteered (or crowd-sourced) geographic information, community remote sensing, citizen observatories, crisis mapping and citizen-generated data  .
Indeed, Haklay et al. [9] attest to reluctance within the CS community to establish a uniform, binding definition: rather, actors are more interested in an open discussion about which criteria and practices describe many phenomena incorporated within CS. This dynamic, flexible approach allows for constant development and adaptation, and also means that CS-and its definitions-encompass and promote an open understanding of the diverse research practices and participatory activities it can include. Yet this heterogeneity of terms, approaches and understandings makes it all the more important to develop a common metadata standard for the field, one that builds a shared frame of reference independent of designation and discipline.

Where CS and OS Intersect
CS and OS are related paradigms that have seen recent growth in practical activities and policy support. The Organization for Economic Co-operation and Development [12] defines OS as "efforts to make the output of publicly funded research more widely accessible in digital format to the scientific community, the business sector, or society more generally". There have been dedicated efforts to explore the connections between OS and CS. A policy brief by DITOs Consortium by 2018 [13] performs an analytical consideration of the relationship between the two fields, and identifies key synergies and shared challenges, such as diversity and inclusion, education and training, funding, and incentives or rewards. The authors portray them as complementary, overlapping approaches, and identify the "accessibility of research results and processes" as a shared central tenet ( Figure 1).
Hecker et al. [11] also locate CS in the immediate environment of OS, specifically as a rapidly growing field in OS and open innovation. This conceptualization suggests a slightly different relationship between the two terms ( Figure 2), where CS falls under the broader OS, or open innovation, umbrella  . Common to both definitions is the emphasis on common areas, including broadening access to research and data (Section 2.3 provides an example of this).  Eitzel et al. [10] and Hecker et al. [11]

EU-Citizen.Science: A Platform for Mainstreaming Citizen Science and Open Science in Europe
In 2020, the CS and OS Community of Practice collaborated on a whitepaper  . A major finding was that CS can strengthen OS through an emphasis on process, particularly by contending with the opportunities and challenges of involving a much wider range of stakeholders than traditional OS may allow. This includes several key elements. The OS approach to open access often focuses on making the outputs and processes of academic research more open and available by improving existing processes. In CS, open access is also seen as a valuable way to gain greater knowledge about a research topic, and therefore participate more comprehensively in a research study. Open Data (OD) is a second key element. In general, OS has a more advanced technical infrastructure to support the collection and management of open and FAIR information. CS, meanwhile, is required to consider the ethics of OD, for example privacy concerns.
In CS, open software and hardware are important not only for enabling more professionally led research, but also to democratize the tools of science and enable more diverse public stakeholder groups to participate in or lead research activities. The same is valid for Open Educational Resources (OER), which include many open access, OD, and open software and hardware tools designated for educational purposes. In OS generally, OER suggests that high quality educational tools should be made available to a large number of learners. In CS, there is also an emphasis on the diversity of educational tools, to encompass both teaching aids (e.g., lesson plans) and resources for informal or individual learning (e.g., tutorials or instructional videos).

Ensuring Citizen-generated Data Are Open
Opening up science and research processes to greater participation by citizens increases the awareness and uptake of, and interest in, scientific results and outcomes; it can also lead to larger volumes of data [16]. To ensure citizen-generated data are open and accessible, and their long-term storage, management and secondary use are not dependent on institutional support, several OD management and storage services are being established. A leading initiative is the European Open Science Cloud (EOSC) [16].
Since 2018, the EOSC has aimed to accelerate the deployment and consolidation of an open, trusted, virtual, federated environment in Europe to store, share and reuse research data across borders and scientific disciplines, and to provide access to a wide range of related services. Within the field of CS, the EU-funded Cos4Cloud project  is building upon the experience of citizen observatories, and especially the technological challenges they have faced in offering new or improved services, to increase the quantity and quality of CS data within the EOSC framework. In this way, Cos4Cloud is, like EU-Citizen.Science, supporting and promoting the growth and impact of CS and OS. Many CS projects also share their data via domain-specific repositories, such as the Global Biodiversity Information Facility for biodiversity data.

THE VALUE OF METADATA IN EU-CITIZEN.SCIENCE
For EU-Citizen.Science, the key to aggregating and further disseminating open tools, resources, data and training effectively, and to a broad audience, was to implement a well-structured information architecture and data layer within the platform: one that supports a logical content structure, useful content filtering, relevant keywords and an effective global search function. The platform's metadata structure plays a crucial role in the discovery and findability of its content, and assists the users in identifying content relevant to their needs.

How EU-Citizen.Science Uses Metadata
Given the nature of EU-Citizen.Science-as a signposting knowledge hub that profiles content hosted or stored elsewhere-its metadata models focus on basic descriptive information, such as: the title and author of resources; keywords to describe the content; and enriched information that is context and content specific. This enhances the platform's value to the CS and OS communities. The implemented ontology provides the formal naming of categories, their definitions, and their inter-relationships, which are thus the subject of the metadata. The goal of having ontologies is to share knowledge more easily, by organizing content in a system that reduces complexity according to terminology and typologies that have been agreed upon by the fields of CS and OS. The main ontology components relevant to the platform are class ontologies (types of content) and attribute ontologies (aspects, properties, features and characteristics of the content). The three relevant content class ontologies for EU-Citizen.Science are: CS resources, CS training materials, and CS projects.

Metadata for CS Resources
For the description of resources, the platform looks to the Dublin Core Metadata Initiative schema  10 , which is specifically designed to describe creative content such as digital resources (e.g., video, images, Web pages), physical resources (e.g., books, CDs) and physical objects (e.g., artworks). The platform uses the Dublin Core controlled vocabularies, such as those for "resourceType", in combination with the ontology proposed by Schema.org for CreativeWorks  11 and Things  12 . Schema.org vocabulary can be used with many different encodings, including RDFa, Microdata and JSON-LD. These vocabularies cover entities, relationships between entities and actions, and can easily be extended through a well-documented extension model. This provides EU-Citizen.Science with interoperability potential across a wide range of linked data cloud and semantic Web implementations.
According to Schema.org  13 , over 10 million sites use their standard to markup their Web pages and email messages. Founded by Google, Microsoft, Yahoo and Yandex, Schema.org vocabularies are developed by  10 http://dublincore.org  11 https://schema.org/CreativeWork  12 https://schema.org/Thing  13 https://schema.org an open community process and are maintained by the Schema.org initiative. Within these, the Resource Description Framework (RDF) is a specification developed and maintained under the auspices of the W3C  14 . The RDF schema provides mechanisms for describing groups of related resources and the relationships between these resources. Using this schema within the information architecture of the platform enhances its interoperability with other such platforms and resources, and any future potential linked data cloud or semantic Web implementations.

Metadata for CS Training Materials
Although training materials can be described using the Dublin Core metadata for resources outlined above, they also require specific metadata fields that relate more specifically to their educational character and purpose, such as training videos and Massive Online Open Courses. For training materials profiled on the platform, the Schema.org Course  15 metadata was used to describe educational courses, education offered through different media or modes of study, educational events, and/or creative works that aim to build the knowledge, competence or ability of learners; and the Dublin Core LRMI terms  16 developed to markup and describe educational resources. This builds on the extensive vocabulary provided by Schema.org and other standards to increase the platform's value and interoperability.

Metadata for CS Projects
For the description of projects, the platform implemented the PPSR-Common Conceptual Model (see Section 3.2), which consists of the Project Metadata Model (PMM), the Dataset Metadata Model (DMM) and the Observation Data Model (ODM). The PMM captures information primarily at the CS project level, including research topic, audience and intended outcome. The DMM documents considerations such as intellectual property concerns. The ODM is being developed in tandem with work on the Open Geospatial Consortium Sensor Things API standard, and also takes into consideration existing metadata standards, particularly Dublin Core.
The PPSR-Common Conceptual Model  17 specifically describes CS projects, providing a shared vocabulary for a range of metadata fields. Implementing the PMM empowers EU-Citizen.Science's interoperability and enables it to enter into collaborative data-sharing aggregation networks with other key regional platforms, such as BioCollect  18 , SciStarter  19 and the EU's Joint Research Center repository of CS projects  20 .

Metadata for CS Data
Although the EU-Citizen.Science platform does not serve as a repository for data that have been generated by citizens participating in CS initiatives, it is worth noting that the PPSR DMM does provide a schema for describing how observations are collected, how those data may be used, and other information on the data as a whole; and the PPSR ODM provides a schema for describing the structure of individual observations including: location, date, observer, and attributes specific to the research.

How the PPSR-Common Conceptual Model Was Developed
In 2010, DataONE, funded by the US National Science Foundation, chartered a working group on a PPSR conceptual model. Under the umbrella of an organization devoted to increasing scientific data sharing, the PPSR working group focused on a handful of projects, which included identifying relevant metadata standards for CS. Meanwhile, at the inaugural meeting of the Australian Citizen Science Association in 2015, researchers from Australia, Europe and the US came together to discuss challenges associated with data sharing and interoperability. The culmination was an agreement to co-found a "Data and Metadata" working group within the US-based Citizen Science Association (CSA). Many representatives of the DataONE PPSR working group quickly joined.
Since these developments, the CSA Data and Metadata working group has become the primary driver behind the conceptual model. Since 2015, it has been developed by an international group of CS researchers and practitioners, from the CSA's "Data and Metadata" working group, the European Citizen Science Association's (ECSA) "Projects, Data, Tools, and Technology" working group, and the European COST Action on Citizen Science's working group to "Improve data standardization and interoperability". This work has been further supported by a project funded by the Alfred P. Sloan Foundation.

How to Remain up to Date and Relevant
As with all technology-based projects, EU-Citizen.Science will face a continuous challenge to remain up to date. In this case, however, the challenge is not limited to new technological developments; the platform will also have to remain relevant to the fields of both CS and OS-which, as noted in Section 2, are constantly evolving. The platform's metadata model will play a key role here. Its robustness for wider use within these fields will also be tested; it will need to prove it works for imprecise and moving targets, and that it can help users with differing needs and priorities to find what they are looking for.
Another measure to help in this regard is the constant feedback loops that have been built into the platform's development, which will remain an operational feature moving forward. By communicating regularly with the platform's users, and with the wider CS and OS communities, it should be possible for the platform to maintain a searchability that is fit for purpose. Indeed, EU-Citizen.Science should initiate such discussions.

EU-Citizen.Science: A Platform for Mainstreaming Citizen Science and Open Science in Europe
However, not all CS and OS projects are familiar with the need for metadata; nor is it always a priority for projects, especially those run largely by volunteers or those facing time and budget limitations. The platform may need to advocate for the importance of robust metadata to remain relevant: Its function as the best place to find projects and resources is only as strong as the metadata uploaded to it, and depends on content being kept up to date. To achieve this, it will need to maintain a user-friendly interface and continually demonstrate added value for its community.

Metadata Becoming a Burden for, or Barrier to, Users
In its initial release, EU-Citizen.Science sought a balance between a desire to gather thorough metadata for all projects, and the drop-off effect of asking for too much information (and therefore making a bigger demand on people's time). This balance is especially relevant for the platform, as its value to a wider community of citizens and researchers relies on a mix of rich information-sharing and active participation on the platform.
Looking forward, possible solutions to managing this trade-off include Metadata Description As a Service (MDaS) tools, such as the "Metadata Wizard". These tools provide a user-friendly and highly efficient environment for creating, editing, previewing and validating metadata for each project. MDaS includes tools that can automate and facilitate the creation of high-quality metadata records [17]. For EU-Citizen.Science, these could, for example: (1) automate the population of contact information for ECSA affiliates, and keywords from controlled vocabularies; (2) be a validator to highlight any missing or error elements, whether directly on a dedicated graphical user interface (GUI) and/or in a printable report suitable for metadata review; (3) be a tool to Copy/Paste or Drag-and-Drop entire sections, subsections or individual content between different records or other tools (including XML-Notepad and text editors); (4) offer built-in documentation which guides users through questions about metadata.

CONCLUSIONS AND FUTURE EXPECTATIONS
The EU-funded project to develop the EU-Citizen.Science platform will end in December 2021, after which it will be managed by ECSA. The platform will continue to bring greater openness to science: by facilitating the sharing and reuse of, and access to, critical resources; by bringing clarity and order to the ever-growing number and diversity of projects that fall under the umbrellas of CS and OS; by capturing information about the accessibility and licensing of resources and training materials; by signposting users to OD assets generated by projects; and by highlighting the availability of open access publications, open hardware and software, and OER.
To increase its relevance for OS further, the platform should strengthen its links with open access e-infrastructures, particularly within the EOSC framework. Another important role will be to demonstrate the benefits of implementing a FAIR-based platform, lastly, the EU-Citizen.Science platform should increase its advocacy for CS and OS. As well as gathering and curating projects, tools and other resources, it can be a "mouthpiece" for the many benefits of CS and OS: for research and researchers, for project participants,

EU-Citizen.Science: A Platform for Mainstreaming Citizen Science and Open Science in Europe
and for society more widely. This can be achieved by initiating and moderating discourse and discussions between all the key stakeholders (especially the policy audience), sharing success stories, and establishing itself as the central platform for CS and OS-in Europe and beyond.

EU-Citizen.Science: A Platform for Mainstreaming Citizen Science and Open Science in Europe
Simone Rüfenacht is a project officer at the European Citizen Science Association (ECSA) in Berlin, Germany. Simone has an academic background in ecology; she received a Master's degree from the Swiss Federal Institute of Technology Zurich in 2012 and a PhD from the University of New South Wales in Sydney, Australia, in 2016. She first came in contact with CS through leading the Sydney Chapter of the Society for Conservation Biology in a volunteering capacity. Before deciding to focus on CS, Simone worked as a postdoc in molecular ecology. ORCID: 0000-0001-8561-3751 Luigi Ceccaroni is a scientist at Earthwatch, UK. Previously, he was founder and research lead at 1000001 Labs, where he designed and developed decision support systems in the healthcare and environment domains; a senior member of research staff at Eurecat / Barcelona Digital Technology Centre; adjunct professor of the Software Department, at the Technical University of Catalonia (UPC); director of research at TMT Factory (a Spanish SME); and a member of research staff at Fujitsu Laboratories of America. He obtained a BSc degree in Environmental Sciences from the University of Bologna, an MSc degree in Information-Technology Languages and Systems, and a PhD degree in Artificial Intelligence at UPC. Luigi is the author of more than 100 publications peer-reviewed and managed more than 16 M ¤ as lead researcher in the period 2009-2021. He currently coordinates and is the principal investigator of the "MICS" European Horizon 2020 project (2019-2021) about measuring the impact of CS, and previously coordinated and was the principal investigator of the "Citclops" and the "Synergy-COPD" European FP7 projects, and wrote the winning proposal for the BackHome project, an FP7 European project about brain-neural computer interfaces. ORCID: 0000-0002-3116-0811 Jaume Piera is the head of the EMBIMOS (EnvironMental and sustainaBility particIpatory inforMatiOn systemS) research group at the Institute of Marine Sciences, Spain. EMBIMOS is an interdisciplinary research group that focuses on understanding, creating and implementing participatory information systems for decision-making in favor of environmental sustainability. Citizen Observatories, as the platforms to develop CS, is one of the central research topics for the group. ORCID: 0000-0001-5818-9836