Skip to main content

From fragments to insights: highlights from SciLake’s closing event


Event

From fragments to insights: highlights from SciLake’s closing event

By Stefania Amodeo

Scientific knowledge is richer than ever, yet hard to navigate. Research outputs are spread across articles, datasets, software, protocols, workflows, and many other sources. Even when content is openly available, it can remain functionally fragmented: difficult to connect, query, reuse, and translate into evidence for new research or policy decisions. Designed to help close this gap, SciLake brought the consortium together for its final dissemination event on Tuesday 10 March 2026, a half‑day online gathering showcasing practical approaches to tackling fragmentation through customisable Scientific Knowledge Graphs (SKGs) piloted in real, domain-specific contexts (neuroscience, cancer, transportation, and energy). This post captures the key takeaways shared across the event.

The SciLake journey and the fragmentation problem

In the opening talk, Thanasis Vergoulis (ARC) framed the challenge: valuable scientific knowledge is abundant, but scattered across heterogeneous sources and encoded in formats that make it hard to connect at scale.

SciLake’s response was presented as a customisable ecosystem of components that helps communities:

  • ingest and transform domain corpora,
  • generate domain-specific SKGs aligned with standards, especially the RDA Scholarly Interoperability Framework (SKG‑IF)
  • expose the graphs via APIs and graph technologies,
  • and build value-added services on top.

Two example services illustrated the concept end-to-end:

  • a knowledge discovery service using citation-network analysis, and
  • a reproducibility assistance service to help identify reproducible studies and flag reproducibility issues.

SciLake’s work is also especially timely in the era of large language models (LLMs) and generative AI: SKGs can provide reliable context and help reduce hallucinations in AI‑assisted discovery and summarisation.

Slides

Panel: Turning scattered knowledge into “living” knowledge graphs

Three key themes that emerged from the discussion:

  • Build on open backbones, then specialise with communities. A rich, domain-agnostic base like the OpenAIRE Graph can be curated into community-specific subsets through clear criteria and workflows.
  • Combine knowledge graphs and large language models (LLMs). SKGs offer traceability, updates, and explicit reasoning; LLMs are strong at conversational access and extraction. Together, LLMs can provide interaction while SKGs provide reliable, auditable knowledge base.
  • Make trust practical with provenance and correction loops. Linking research objects is essential, but automated enrichment and linking can introduce errors. Trust therefore depends on being able to trace each link back to its source (provenance) and on having feedback loops so communities can flag issues, correct them, and improve the graph over time.

Panel resource page

Pilot demonstrations: SciLake tools through domain case studies

The demo session showed SciLake’s approach in practice across five pilots, each starting from a curated collection and moving toward an interoperable, enriched SKG.

Key highlights across the pilots:

  • Energy planning: extracting geographic “objects of study” from publications, linking them to places in OpenStreetMap, and enabling spatial discovery.
  • Cancer: a cancer-specific KG integrating ontologies and gene/drug resources, supporting pathway exploration, interaction networks, and literature ranking.
  • Maritime transport: stakeholder-driven monitoring and discovery with emphasis on open access and reproducibility. Ranking signals (e.g., influence/PageRank and recency-biased popularity) were computed and released openly.
  • CCAM (automated mobility): building targeted subgraphs and reducing noise compared with generic search. Entity extraction and linking for CCAM concepts supported trend spotting, with human-in-the-loop validation.
  • Neuroscience: building an SKG enriched with neuroscience entities and impact indicators, enabling discovery and query-based meta-analysis, and helping explore how dataset citation relates to citation impact.

Demo resource page

Panel: Unlocking hidden patterns in knowledge graphs

This panel shifted from what is already possible in pilots to the harder questions: what it takes to scale SKGs into sustainable services.

Key takeaways:

  • Trust and verification come first. Researchers need to avoid both under-trust (not using tools) and over-trust (uncritical adoption).
  • The next step is richer representations. Progress depends on models that support reasoning and synthesis, not only similarity search and summarisation.
  • Data ingest remains a bottleneck. PDFs, handwritten tables, and missing abstracts still limit what can become machine actionable.
  • Scaling is more than algorithms. It requires sustained investment in infrastructure, interoperability, and long-term sustainability.

Panel resource page

Beyond SciLake: takeaways for Open Science

Interoperable SKGs are becoming a practical instrument for Open Science. They help move from “open but scattered” to “open and reusable” by enabling discovery, querying, and evidence-based navigation.

Community knowledge makes the difference. Domain experts are essential for deciding which entities and relationships matter, what “quality” means, and which evidence paths are meaningful.

Provenance and governance are non‑negotiable. Especially when AI is involved, users need to trace claims back to sources, inspect curation decisions, and correct errors.

SciLake’s final event clarified that solving knowledge fragmentation is not about “one more platform.” It is about enabling communities to create and sustain interoperable, trustworthy knowledge structures that evolve with research and that make open research outputs more usable for people, services, and responsible AI.

Read more …From fragments to insights: highlights from SciLake’s closing event

From Fragments to Insights: Making Scientific Knowledge Discoverable


Event

The culminating event of the SciLake project was a half-day online event showcasing practical solutions to scientific knowledge fragmentation through customizable Scientific Knowledge Graphs piloted in real-world, domain-specific contexts.

  • Date

    Tuesday, 10th March 2026

  • Time

    9:00-12:30 CET

  • Location

    Online via Zoom, hosted by OpenAIRE


Scientific research today faces a critical challenge: valuable knowledge is scattered across countless sources, making discovery difficult and hindering informed decision-making. After three years of collaborative research and innovation, SciLake has created practical solutions that give researchers the tools to turn their domain knowledge into standardized, interoperable Scientific Knowledge Graphs (SKGs).

Bridging Domain Expertise and Knowledge Management

SciLake bridges the gap between domain expertise and technical knowledge management. While scientists understand their fields deeply, they often lack the specialized skills to organize knowledge effectively. Conversely, technical experts need domain insights to structure information meaningfully.

Through collaboration with experts in neuroscience, cancer, transportation, and energy research, we have created a highly customizable ecosystem of components for seamless data access and knowledge discovery.

Program Highlights

  • Two expert panels on
    • Turning scattered knowledge into living knowledge graphs
    • Unlocking hidden patterns in knowledge graphs
  • Live demonstrations from four domain-specific pilot applications
  • Q&A sessions

Agenda

Times are CET

09:00 - 09:05 5'

Welcome address

09:05 - 09:20 15'

Introduction to the problem of scientific knowledge fragmentation - the SciLake project journey

Thanasis Vergoulis (ARC)

+
09:20 - 10:10 50'

Panel: "Turning scattered knowledge into living knowledge graphs"

Stefania Amodeo (Chair, OpenAIRE), Miriam Baglioni (ISTI-CNR), Nicolau Duran (SIRIS Academic), Eric Jeangirard (MESR), Haris Papageorgiou (ARC)

+
10:10 - 10:20 10'

Break

10:20 - 11:20 60'

Demonstrations of SciLake tools through pilot case studies

+
11:20 - 11:30 10'

Break

11:30 - 12:20 50'

Panel: "Unlocking hidden patterns in knowledge graphs"

Natalia Manola (Chair, OpenAIRE), Bianca Kramer (Sesame Open Science), Andreas Kolleger (Neo4j), Francesco Osborne (Open University), Nick Yakovets (TUe)

+
12:20 - 12:30 10'

Closing remarks

Read more …From Fragments to Insights: Making Scientific Knowledge Discoverable

GraphAlg Presented at Dutch-Belgian Database Day 2025


Workshop

GraphAlg Presented at Dutch-Belgian Database Day 2025

By Stefania Amodeo

SciLake partner Daan de Graaf from TU Eindhoven presented his work on graph algorithm support in AvantGraph at the Dutch-Belgian Database Day (DBDBD) 2025, held on December 12, 2025, at the University of Antwerp.

His paper, titled Graph Algorithms for Everyone, Everywhere, introduced GraphAlg, an open-source project developed within SciLake, designed to make graph algorithms accessible to researchers, developers, and students. During the conference, Daan presented a poster and conducted live demos, allowing participants to interact with the GraphAlg tutorial and explore its capabilities firsthand.

About GraphAlg

GraphAlg is a domain-specific language for graph algorithms that compiles to relational algebra, making advanced graph analysis accessible directly within database systems.

GraphAlg is now open source, making it easier than ever to work with graph algorithms. We built it for researchers, developers, and students who want to dive into graph theory without the usual steep learning curve. The project includes hands-on tools and tutorials to help you get started quickly.

Key Resources

Congratulations to the TUe team on this achievement! We are proud to see our research making an impact in the database and graph algorithm communities.

Read more …GraphAlg Presented at Dutch-Belgian Database Day 2025

SciLake at EBRAINS Summit 2025


Workshop

SciLake at EBRAINS Summit 2025

By Archana Golla and Stefania Amodeo

In December 2025, we presented the SciLake project at the EBRAINS Summit in Brussels, a major event bringing together researchers, clinicians, and tech developers from across Europe's brain-science community. Maya Kobchenko and Sophia Pieschnik represented our team, sharing how we are building a Scientific Knowledge Graph specifically designed for neuroscience research.

About the EBRAINS Summit

The EBRAINS Summit 2025 – Transforming Brain Research and Medicine took place from December 8-11 in Brussels. The event brought together researchers, clinicians, and technology developers to showcase how EBRAINS' infrastructure advances neuroscience and drives innovation across the field. The program featured scientific talks, poster sessions, a science market exhibition, and a dedicated public day. A key theme throughout was the importance of cross-disciplinary collaboration, FAIR data practices, and interoperable tools, all values that align closely with SciLake's mission.

Our Contribution: A Neuroscience-Focused Scientific Knowledge Graph

SciLake's poster presentation was delivered by Maya Kobchenko and Sophia Pieschnik from the University of Oslo, highlighting our merged service architecture that powers a domain-specific Scientific Knowledge Graph for neuroscience.

Built on the SciLake ecosystem, our workflow integrates a neuroscience gateway derived from the OpenAIRE Graph, maps metadata to the SKG-IF format, and loads it into graph engines including Neo4J and AvantGraph for efficient visualization, querying, and analytics.

The application tier delivers impact-driven discovery through BIP! Finder's indicators, measuring popularity, influence, impulse, and citation count. To strengthen reproducibility, we employed a trained entity-recognition model that classifies openMINDS terms, enriching the relationships among research products and enabling more nuanced exploration of the neuroscience literature landscape.

For SciLake, the Summit represented an invaluable opportunity to showcase the advances in our neuroscience pilot to domain experts. 

Resources

Our poster abstract (no. 45) can be found in the official Book of Abstracts.

We invite researchers interested in exploring our neuroscience-focused Scientific Knowledge Graph to learn more through the SciLake neuroscience case study page.

Read more …SciLake at EBRAINS Summit 2025

Open Science in Transport Research: SciLake Pilots at ICTR 2025


Workshop

Open Science in Transport Research: SciLake Pilots at ICTR 2025

By Afroditi Anagnostopoulou, Athanasios Ballis, Xenophon Kitsios, Stefania Amodeo

The SciLake project made a significant impact at the 12th International Congress on Transportation Research (ICTR 2025), held October 16–18, 2025, in Thessaloniki, Greece. Our partners from CERTH/HIT and ICCS organized a dedicated workshop and presented a conference paper highlighting how SciLake tools are advancing open science practices in transport research.

Workshop: Fairness & Transparency in Transport Research

On October 16, SciLake partners co-organized an engaging workshop titled "Fairness & Transparency in Transport Research." The event brought together 15 stakeholders from academia, industry and public authorities to explore how FAIR principles can transform transport research.

The workshop featured:

  • Keynote presentations on Open Science and FAIR principles in transport research
  • Live demonstrations of SciLake tools through the Maritime and CCAM (Cooperative Connected Automated Mobility) pilot cases, showcasing how Knowledge Graphs enable semantic connections between research data
  • Interactive discussions where participants shared insights and recommendations for implementing FAIR principles

Afroditi Anagnostopoulou from CERTH/HIT presented the Maritime pilot, demonstrating how SciLake services, powered by Knowledge Graph infrastructure, enable researchers to access and share maritime transport data more effectively while maintaining semantic relationships. Athanasios Ballis from ICCS showcased the CCAM pilot, highlighting tools that promote transparency in autonomous vehicle research through graph-based data visualization.

Conference Paper: Open Science Insights from Maritime and CCAM Pilots

We also presented a research paper entitled "Open Science in Transport Research: Insights from the Maritime and CCAM Pilot Cases," co-authored by Afroditi Anagnostopoulou (CERTH/HIT), Athanasios Ballis (ICCS), Xenophon Kitsios (CERTH/HIT), Anastasia Bolovinou (ICCS), Vassilis Kappatos (CERTH/HIT), and Angelos Amditis (ICCS).

The paper demonstrates how FAIR data principles and Open Science practices can be implemented in transport research through two pilot cases. Central to this implementation is the use of semantic Knowledge Graphs that connect disparate data sources and enable researchers to discover relationships across datasets. The Maritime pilot addresses challenges of interoperability and data access, aiming to improve operational efficiency and situational awareness through digital platforms, open standards, and graph-based data structures. The CCAM pilot focuses on secure and equitable data sharing for automated transportation systems, promoting better collaboration between academia, industry, and public stakeholders through semantic knowledge representation.

Key findings from the paper include:

  • Trust and transparency are operationalized through provenance tracking, citation context extraction, and graphical visualization of data links enabled by Knowledge Graph technologies
  • Common challenges across both sectors include fragmented data ownership, limited interoperability, and institutional reluctance to share data
  • SciLake's modular digital tools, including AvantGraph, Lake API, and SciNoBo, significantly mitigated these hurdles by offering explainable analyses, interoperable data access, and semantic Knowledge Graph capabilities that reveal hidden connections in research data
  • Legal frameworks such as GDPR and EU data law play a crucial role alongside technical infrastructure in realizing FAIR principles

The paper's recommendations emphasize:

  • Expanding standardization efforts to ensure consistency across transport sub-sectors, leveraging Knowledge Graphs to harmonize domain-specific semantics
  • Involving broader stakeholder groups in co-designing data governance models
  • Investing in infrastructure that supports transparency, interoperability, and reuse through semantic technologies
  • Policy incentives to encourage private sector participation while ensuring data privacy and ethical compliance

Rather than presenting Open Science as a one-size-fits-all solution, the paper highlights the importance of domain-specific configurations of FAIR principles. Knowledge Graphs emerge as a critical enabler for capturing domain semantics while maintaining interoperability. The authors conclude that the shift towards more open, inclusive, and data-driven transport research is not only possible, but already underway when supported by aligned strategies, adaptive technologies, and semantic knowledge representation.

About ICTR 2025

ICTR 2025, co-organized by the Hellenic Institute of Transport (CERTH-HIT) and the Hellenic Institute of Transportation Engineers (HITE), focused on "Transportation in the Era of Artificial Intelligence." The conference has been a cornerstone of transportation research for 23 years, bringing together local and international communities to bridge research findings with policymaking efforts.

The event explored how AI is transforming transportation through advancements in autonomous vehicles, smart infrastructure, sustainability, and supply chain optimization: themes that align closely with SciLake's mission to make research data more accessible and actionable through Knowledge Graph technologies.

Impact and Next Steps

Feedback from participants emphasized the need for improved data accessibility and transparency in an increasingly digitalized transport sector. The results will be documented in SciLake deliverables, with follow-up activities including working with stakeholders to promote implementation of FAIR and Open Science principles through semantic Knowledge Graph infrastructure.

All materials (agenda and presentations) from the workshop are available at https://doi.org/10.5281/zenodo.17481960.

The conference paper is also available at https://doi.org/10.5281/zenodo.17481746

For more information about SciLake Pilots visit https://scilake.eu/case-studies

Read more …Open Science in Transport Research: SciLake Pilots at ICTR 2025