Ruben Verborgh, a renowned professor of Semantic Web technology at Ghent University and a technology advocate for Inrupt, is at the forefront of reshaping how we understand data ownership and the semantic web. In part one of our conversation, Ruben discussed his journey to reclaiming his data from Facebook. In this installment, we dive deeper into the concept of the semantic web and its potential impact on privacy, innovation, and the future of data control.
Andrew Baker: Tell us a bit about your professional journey so far.
Ruben Verborgh: I’m a professor of computer science at Ghent University in Belgium, with a focus on researching how data is managed on the web. Unlike many who concentrate on big data—centralizing massive amounts of information—I’ve been more interested in distributed data. My work explores how we can make sense of data that isn’t stored in one central place but is scattered across the web. Initially, my research was rooted in public and open data, but over time, my focus has shifted to broader questions of control and interoperability.
AB: Much of your work revolves around the semantic web. Can you explain what that term means?
RV: Forget the jargon—at its core, the semantic web is about structuring data in a way that machines can understand and interpret. This is crucial because, without semantics, we’re reliant on a handful of organizations dictating how data should be stored and accessed. The semantic web gives individuals and companies the power to control their own data while ensuring it remains interoperable across systems. It’s about creating a world where data can flow seamlessly while remaining under the user’s control.
AB: What technologies are you working on to help make this vision of the semantic web a reality?
RV: One of the most exciting projects I’m involved in is Solid, which focuses on personal data stores—or “Data Pods.” A Data Pod is like a private data vault where all the information you generate or that others produce about you is securely stored. You control what happens to that data, whether to share it, use it, or keep it private.
Solid isn’t the only approach to personal data stores, but it stands out because of its commitment to interoperability. We don’t dictate how people or companies should structure their data. Instead, the system allows users to organize their data however they choose while maintaining compatibility with other platforms. This combination of flexibility and interoperability is a major step forward.
AB: How does the semantic web intersect with data ownership and innovation?
RV: The semantic web has two main pillars. The first is privacy—it’s about empowering people to take control of their data. The second is innovation. For app developers, one of the biggest hurdles today is obtaining user data, which typically involves harvesting it from users or third-party platforms. This process pits them against tech giants like Facebook in a competition they can’t win.
But with personal data vaults, users already have control of their data. New apps can simply connect to these vaults, use the data with the user’s permission, and focus on delivering value. There’s no need for invasive data collection. This model benefits not only individuals but also businesses, as it allows them to sidestep the costly and ethically fraught data race.
AB: Can the semantic web address challenges in data alignment and ownership?
RV: Absolutely. One of the key challenges today is the lack of standardization. For example, tools like Tapmydata spend a significant amount of effort aligning data from different companies, each of which represents people and their information in unique ways. This alignment is currently a manual and repetitive process.
Semantic technologies can automate much of this work. By creating a common framework for representing and exchanging data, the semantic web reduces redundancy and streamlines operations. This isn’t just about making life easier for companies—it’s about ensuring that data ownership remains firmly in the hands of users while fostering a more interconnected and innovative digital ecosystem.
The Takeaway
The semantic web represents a paradigm shift in how we think about data ownership and interoperability. With projects like Solid paving the way, the future holds the promise of a web where individuals truly control their information, and innovation thrives without the need for intrusive data harvesting. As Ruben Verborgh emphasizes, the semantic web isn’t just a technological development—it’s a vision for a more equitable and privacy-respecting digital future.