Daniel Fernández-Álvarez

PhD Student

Biography

[BUILDING SITE!] Daniel Fernández-Álvarez is a PhD student of Computer Science at the University of Oviedo. His research interests include everything related to Semantic Web technologies and Linked Open Data, as well as record linkage, data analysis and data normalization. He is linked to WESO Reseacrh Group since 2014.

Daniel has participated in several insternational and national projects related to Semantic Web as a technician or Researcher. He is also working as Associate teacher at the University Of Oviedo.

Interests

Semantic Web
Linked Open Data
Record lnkage

Education

PhD in Computer Science, Studying

University of Oviedo
Master in Web Engineering, 2015

University of Oviedo
Bachelor of Computer Science, 2012

University of Oviedo

Selected Publications

Challenges in RDF validation

The RDF data model forms a cornerstone of the Semantic Web technology stack. Although there have been different proposals for RDF serialization syntaxes, the underlying simple data model enables great flexibility which allows it to be successfully employed in many different scenarios and to form the basis on which other technologies are developed. In order to apply an RDF-based approach in practice it is necessary to communicate the structure of the data that is being stored or represented. Data quality is of paramount importance for the acceptance of RDF as a data representation language and it must be enabled by the use of tools that can check if some data conforms to some specific structure. There have been several recent proposals for RDF validation languages like ShEx and SHACL. In this chapter, we describe both proposals and enumerate some challenges and trends that we foresee with regards to RDF validation. We devote more space to what we consider one of the main challenges, which is to compare ShEx and SHACL and to understand their underlying foundations. To that end, we propose an intermediate language and show how ShEx and SHACL can be converted to it.

Jose Emilio Labra Gayo, Herminio García-González, Daniel Fernández-Álvarez, Eric Prud’hommeaux

In book: Current Trends in Semantic Web Technologies: Theory and Practice. doi: 10.¹⁰⁰⁷⁄₉₇₈-3-030-06149-4_6, 0001

PDF

Inference and Serialization of Latent Graph Schemata Using ShEx

Shape Expressions have recently been proposed as a high-level language to intuitively describe and validate the topology of RDF graphs. Current implementations of Shape Expressions are focused on checking which nodes of certain graph ﬁt in which deﬁned schemata, in order to get automatic typings or to improve RDF data quality in terms of completion and consistency. We intend to reverse this process, i.e., we propose to study the neighborhood of graph nodes that have already been typed in order to induce templates in which most of the individuals ﬁt. This will allow to discover latent schemata of existing graphs, which can be used as a guideline for introducing coherent information in existing structures or for quality veriﬁcation purposes. We consider that collaborative or general-purpose graphs are specially interesting domains to apply this idea.

Daniel Fernández-Álvarez, Jose Emilio Labra Gayo

SEMAPRO 2016, 0001

PDF

Inference of Latent Shape Expressions Associated to DBpedia Ontology

In order to perform any operation in an RDF graph, it is recommendable to know the expected topology of the targeted information. Some technologies and syntaxes have been developed in the last years to describe the expected shapes in an RDF graph, such as ShEx and SHACL. In general, a domain expert can use these syntaxes to define shapes in a graph, with two main purposes: data validation and documentation. However, there are some scenarios in which the schema cannot be predicted a priori, but it emerges at the same time that the graph is filled with new information. In those cases, the shapes are latent in the current content. We have developed a prototype which is able to infer shapes of classes in a knowledge graph and used it with classes of DBpedia ontology. We serialize our results using ShEx.

Daniel Fernández-Álvarez, Herminio García-González, Johannes Frey, Sebastian Hellmann, Jose Emilio Labra Gayo

International Semantic Web Conference 2018 (ISWC2018), 0001

PDF

MERA: A Musical Entities Reconciliation Architecture Based on Semantic Technologies

In this paper, the authors describe Musical Entities Reconciliation Architecture (MERA), an architecture designed to link music-related databases adapting the reconciliation techniques to each particular case. MERA includes mechanisms to manage third party sources to improve the results and it makes use of semantic technologies, storing and organizing the information in RDF graphs. They have implemented a prototype of their approach and have used it to link sources with different levels of data quality. The prototype has been effective in more than 94% of the cases under the conditions of their experiments. The authors have also compared their prototype with a well-known music-specialized search engine, outperforming the search results in the two experiments that they performed.

Daniel Fernández-Álvarez, Jose Emilio Labra Gayo, Daniel Gayo-Avello, Patricia Ordoñez de Pablos

International Journal on Semantic Web and Information Systems (IJSWIS), doi: 10.4018/IJSWIS.2017100103, 0001

journal

ShExML: An heterogeneous data mapping language based on ShEx

Data interoperability is currently a problem that we are facing more intensely due to the appearance of fields like Big Data or IoT. Many data is persisted in information silos with neither interconnection nor format homogenisation. Our proposal to alleviate this problem is ShExML, a language based on ShEx that can map and merge heterogeneous data formats into a single RDF representation. We advocate the creation of this type of tools that can facilitate the migration of nonsemantic data to the Semantic Web.

Herminio García-González, Daniel Fernández-Álvarez, Jose Emilio Labra Gayo

CEUR Workshop Proceedings, 0001

PDF

Selected projects

sheXer

Inferring latent schemata in RDF graphs with ShEx

Wesnomed

Tele-medicine app based in SNOMED vocabulary

Urban Red

Urban-RED is funded by Call of Energizing Actions ‘Networks of Excellence’ of the Spanish Ministry of Economy and Competitiveness. Urban-RED aims to promote reserch activity about sustainable urban development

ClassRank

Algorithm to measure the class importance in RDF graphs

MERA

Architecture highly configurable designed to link music-related data-suources using string comparison techniques and semantic technologies.

Landportal

The Portal allows for the collection, sourcing, and searching of otherwise fragmented and inaccessible data and information on land governance and land use from diverse sources, produced by governments, academia, international organizations, indigenous people and NGOs.

WebIndex

Designed and produced by the World Wide Web Foundation, the Web Index is the world’s first measure of the World Wide Web’s contribution to social, economic and political progress in countries across the world. WESO has contributed to the project developing the portal and managing linked data for 2014 and older editions.

Teaching

Daniel has taught or he’s currently theaching in the following subjects at the University of Oviedo:

Entertainment Software and Computer Games. Degree in Computer Science.
Information and Communication Technologies for Art Historians. Degree in Art History.
Programming Methodology. Degree in Computer Science.
Basic Programming. Degree in Geomatics.

Contact

fernandezalvdaniel@uniovi.es
dani.miw
Science Faculty, St. Federico García Lorca, Oviedo, Spain
Thrusday 10:00 to 13:00 or email for appointment