KIT-IA (Knowledge-drIven Techniques for Intelligent Applications in heterogeneous contexts)

Introduction

Knowledge management plays a key role in Artificial Intelligence (AI). For half a century, the use of knowledge in expert systems, semantic networks, and frames has made it possible to solve complex real-world problems in many domains, the best-known examples being in medical diagnosis. Knowledge Representation and Reasoning (KRR) remains one of the key subfields of AI, and many strategies for managing knowledge have been proposed and successfully evaluated in practical applications. More modern approaches to representing knowledge include ontologies, Linked Open Data and, more recently, knowledge graphs (KGs). These techniques were mainly developed by the Semantic Web (SW) community (hence they are usually called semantic technologies), but they can also be applied to non-Web-based applications. Complex AI systems usually need to combine a KRR module with other AI techniques, such as Machine Learning (ML) and Natural Language Processing (NLP), to achieve their goals.

The main objective of this project is to propose novel knowledge-driven and multilingual-aware techniques that improve the services offered by intelligent systems, regardless of the application domain. For example, we want to link connected data to their actual meanings, access them according to the users' intended meanings regardless of their language, support flexible queries and queries expressed in natural language, and extract information intelligently from unstructured sources, to name a few. Our techniques will address heterogeneous contexts and different knowledge domains, thus broadening the applicability of our proposals. To show the usefulness of our developments, we will also ground them in specific real-world scenarios.

In particular, we plan to address different research problems:

Improvement of knowledge-based querying and information access. This includes extracting the actual structure of a KG, using graph embeddings to build ontology axioms, computing the semantic difference between ontologies, extending existing query answering systems to accept formal queries or natural language and to exploit KGs, developing novel methods for flexible query answering, etc.
Building knowledge-driven Natural Language Processing (NLP). This includes combining semantic technologies and different language models (both non-contextualized, e.g., word2vec, and contextualized, e.g., transformer-based, ones), using KGs to model multilingual data and to improve translations or cross-lingual access to information, etc.
Development of intelligent applications for mobile users. This includes adapting all the previous techniques to work on mobile devices, providing support for KGs on mobile devices, developing novel techniques for adaptive semantic reasoning on mobile devices, or applying the paradigm of Personal Knowledge Graphs to mobile computing.

Team members


Eduardo Mena (Main researcher 1)
Fernando Bobillo (Main researcher 2)
Carlos Bobed
Ignacio Huitzil
Jorge Bernad
Jorge Gracia
Lacramioara Dranca

Collaborators

José Hilario Canós
Sébastien Ferré
Ángel Luis Garrido
Alfredo Goñi
María Granados Buey
Miguel López-Otal
Álvaro Peiró
Lucía Pitarch
María del Mar Roldán
Umberto Straccia
Jeff Z. Pan

Related publications: journals

Julián Moreno Schneider, Georg Rehm, Elena Montiel-Ponsoda, Víctor Rodríguez-Doncel, Patricia Martín-Chozas, María Navas-Loro, Martin Kaltenböck, Artem Revenko, Sotirios Karampatakis, Christian Sageder, Jorge Gracia, Filippo Maganza, Ilan Kernerman, Dorielle Lonke, Andis Lagzdins, Julia Bosque-Gil, Pieter Verhoeven, Elsa Gomez Diaz, Pascual Boil Ballesteros. Lynx: A knowledge-based AI service platform for content processing, enrichment and analysis for the legal domain. Information Systems 106:101966, May 2022.
Shashwat Goel, Jorge Gracia, Mikel Lorenzo Forcada. Bilingual dictionary generation and enrichment via graph exploration. Semantic Web Journal 13(6): 1103-113. September 2022.
Anas Fahad Khan, Christian Chiarcos, Thierry Declerck, Daniela Gifu, Elena González-Blanco García, Jorge Gracia, Maxim Ionov, Penny Labropoulou, Francesco Mambrini, John P. McCrae, Émilie Pagé-Perron, Marco Passarotti, Salvador Ros Muñoz, Ciprian-Octavian Truică. When linguistics meets web technologies. Recent advances in modelling linguistic linked data. Semantic Web Journal 13(6): 987–1050, September 2022.
Ignacio Huitzil and Fernando Bobillo. Fuzzy Ontology Datatype Learning using Datil. Expert Systems with Applications 228:120299, pp. 1-16, October 2023.
Ángel Luis Garrido, María Soledad Pera and Carlos Bobed. SJORS - A Semantic Recommender System for Journalists. Business & Information Systems Engineering 65:6, pp. 1-18, December 2023.
Carlos Bobed, Fernando Bobillo, Ernesto Jiménez-Ruiz, Eduardo Mena and Jeff Z. Pan. Praedixi, Redegi, Cogitavi: Adaptive Knowledge for Resource-aware Semantic Reasoning. Expert Systems with Applications 250:123838, pp. 1-14, March 2024.
José Félix Yagüe, Ignacio Huitzil, Carlos Bobed and Fernando Bobillo. FUKG: Answering Flexible Queries over Knowledge Graphs. The Electronic Library 42(3): 368-392, June 2024.
Carlos Bobed, Jorge Bernad and Pierre Maillot. Language-Model Based Informed Partition of Databases to Speed Up Pattern Mining. Proceedings of the ACM on Management of Data 2(3):184, pp. 1-24, June 2024.
Francisco Navarrete, Ángel Luis Garrido, Carlos Bobed and Antonio Vallecillo. Ontology-driven automated reasoning about property crimes. Business & Information Systems Engineering, August 2024.
Dagmar Gromann, Elena-Simona Apostol, Christian Chiarcos, Marco Cremaschi, Jorge Gracia, Katerina Gkirtzou, Chaya Liebeskind, Liudmila Mockiene, Michael Rosner, Ineke Schuurman, Gilles Sérasset, Purificação Silvano, Blerina Spahiu, Andrius Utka, Ciprian-Octavian Truica, Giedrė Valūnaitė Oleškevičienė. Multilinguality and LLOD: A Survey Across Linguistic Description Levels. Semantic Web Journal 15(5):1915-1958, October 2024.
Ishak Riali, Fareh Messaouda and Fernando Bobillo. ProbFuzOnto: A fuzzy ontology-Driven uncertainty approach using Fuzzy Bayesian Networks. International Journal of Fuzzy Systems. In press.

Related publications: conferences

Ignacio Huitzil, Fernando Alegre, Fernando Bobillo. CAEPIA-APP Competition: GimmeHop. Actas de la XIX Conferencia de la Asociación Española para la Inteligencia Artificial (CAEPIA 20-21), pp. 989-992. Málaga (Spain), September 2021.
Álvaro Cristóbal, Ignacio Huitzil, Fernando Bobillo. Learning OWA Weights by Combining Fuzzy quantifiers with Empirical Data. Actas del XX Congreso Español sobre Tecnologías y Lógica Fuzzy (ESTYLF 20-21), pp. 351-356. Málaga (Spain), September 2021.
Ángel Luis Garrido, Carlos Bobed. DRESS: Data-Repository Enhancer through Semantic Sources. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (JCDL 2021), ACM, pp. 242-245. Champaign, IL (USA), September 2021.
Fernando Bobillo, Julia Bosque-Gil, Jorge Gracia, Marta Lanau-Coronas. Fuzzy Lemon: Making lexical semantic relations more juicy. Proceedings of the 8th Workshop on Linked Data in Linguistics (LDL 2022) @ LREC2022, pp. 45-51. Marseille (France), June 2022.
Fahad Khan, Christian Chiarcos, Thierry Declerck, Maria Pia Di Buono, Milan Dojchinovski, Jorge Gracia, Giedrė Valūnaitė Oleškevičienė, Daniela Gifu. A Survey of Guidelines and Best Practices for the Generation, Interlinking, Publication, and Validation of Linguistic Linked Data. Proceedings of the 8th Workshop on Linked Data in Linguistics (LDL 2022) @ LREC2022. Marseille (France), June 2022.
Jorge Gracia, Besim Kabashi, Ilan Kernerman. TIAD 2022: The Fifth Translation Inference Across Dictionaries Shared Task. Proceedings of the Globalex Workshop on Linked Lexicography @ LREC2022, pp. 19–25. Marseille (France), June 2022.
Michael Rosner, Sina Ahmadi, Elena Simona Apostol, Julia Bosque-Gil, Christian Chiarcos, Milan Dojchinovski, Katerina Gkirtzou, Jorge Gracia, Dagmar Gromann, Chaya Liebeskind, Giedre Valunaite Oleskeviciene, Gilles Sérasset, Ciprian-Octavian Truica. Cross-Lingual Link Discovery for Under-Resourced Languages. Proceedings of the 13th Language Resources and Evaluation Conference (LREC 2022), pp. 181-192, Marseille (France), June 2022.
Lucía Pitarch, Lacramioara Dranca, Jorge Bernad, Jorge Gracia. Lexico-Semantic Relation Classification With Multilingual Finetuning. Proceedings of LLOD Approaches For Language Data Research And Management (LLODREAM 2022), pp. 86-89, Vilnius (Lithuania), September 2022.
Javier Vela, Jorge Gracia. Cross-lingual ontology matching with CIDER-LM: results for OAEI 2022. Proceedings of the 17th International Workshop on Ontology Matching (OM 2022), CEUR Workshop Proceedings 3324, pp. 158-165, Hangzhou (China), October 2022.
José Félix Yagüe, Ignacio Huitzil, Carlos Bobed, Fernando Bobillo. Flexible queries over knowledge graphs. Proceedings of the 4th Iberoamerican and 3rd Indo-American Knowledge Graphs and Semantic Web Conference (KGSWC 2022), Communications in Computer and Information Science 1686, pp. 192-200, Springer. Madrid (Spain), November 2022.
Lucía Pitarch. Metaphor Processing in the Medical Domain via Linked Data and Language Models. Proceedings of ESWC 2023 Satellite Events, Lecture Notes in Computer Science 13998, pp. 213-223, Hersonissos (Greece), May 2023.
Lucía Pitarch, Jorge Bernad, Lacramioara Dranca, Carlos Bobed and Jorge Gracia. No clues, good clues: Out of context lexical relation classification. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), pp. 5607-5625, ACL, Toronto (Canada), July 2023.
Ignacio Huitzil and Fernando Bobillo. Some Properties of the Left Recursive Form of the Convex Combination Linguistic Aggregator. Proceedings of the 15th International Conference on Flexible Query Answering Systems (FQAS 2023), Lecture Notes in Artificial Intelligence 14113, pp. 50-62, Springer, Palma de Mallorca (Spain), September 2023.
Ignacio Huitzil, Giuseppe Mazzotta, Rafael Peñaloza and Francesco Ricca. ASP-based Axiom Pinpointing for Description Logics. Proceedings of the 36th International Workshop on Description Logics (DL 2023), CEUR Workshop Proceedings 3515, CEUR-WS.org, September 2023.
Lucía Pitarch, Jordi Bernad and Jorge Gracia. MEAN: Metaphoric Erroneous ANalogies dataset for PTLMs metaphor knowledge probing. Proceedings of the 4th Conference on Language, Data and Knowledge (LDK 2023), pp. 147-152, September 2023.
Ignacio Huitzil, Giuseppe Mazzotta, Rafael Peñaloza and Francesco Ricca. Axiom Pinpointing in DLs via ASP. Proceedings of the 4th Workshop on Explainable Logic-Based Knowledge Representation (XLoKR 2023), September 2023.
Ángel Luis Garrido, Norman U. Bellorín, Álvaro Peiró, and Eduardo Mena. Verification Tasks through Deep Learning in a Semantic Information Extraction System. Proceedings of the 2023 IARIA Annual Congress on Frontiers in Science, Technology, Services, and Applications (IARIA Congress 2023), pp. 111-112, IARIA Press, Valencia (Spain), November 2023.
Ángel Luis Garrido, Jonathan Rodríguez, Mariano Sánchez, José Manuel Antón, Roberto Castán, Susana Sangiao, Carlos Bobed and Eduardo Mena. Sensorization and Optimization of Industrial Graphic Arts Machinery using Artificial Intelligence Techniques. Proceedings of the 2023 IARIA Annual Congress on Frontiers in Science, Technology, Services, and Applications (IARIA Congress 2023), pp. 121-122, IARIA Press, Valencia (Spain), November 2023.
Daniel Huici, Ricardo J. Rodríguez and Eduardo Mena. Apotheosis: Bringing Approximate K-NN and Similarity Digest Algorithms to Digital Forensics (Poster). Digital Forensics Research Conference Europe (DFRWS EU 2024), Zaragoza (Spain), March 2024.
Lucía Pitarch, Carlos Bobed, David Abián, Jorge Gracia and Jorge Bernad. Building MUSCLE, a Dataset for MUltilingual Semantic Classification of Links between Entities. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 2580-2594, ELRA and ICCL, Torino (Italy), May 2024.
Dagmar Gromann, Hugo Gonçalo Oliveira, Lucia Pitarch, Elena Simona Apostol, Jordi Bernad, Eliot Bytyçi, Chiara Cantone, Sara Carvalho, Francesca Frontini, Radovan Garabík, Jorge Gracia, Letizia Granata, Anas Fahad Khan, Timotej Knez, Penny Labropoulou, Chaya Liebeskind, Maria Pia di Buono, Ana Ostroski Anic, Sigita Rackeviciene, Ricardo Rodrigues, Gilles Sérasset, Linas Selmistraitis, Mahammadou Sidibé, Purificação Silvano, Blerina Spahiu, Enriketa Sogutlu, Ranka Stankovic, Ciprian-Octavian Truica, Giedre Valunaite Oleskeviciene, Slavko Zitnik, Katerina Zdravkova. MultiLexBATS: Multilingual Dataset of Lexical Semantic Relations. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 11783-11793, ELRA and ICCL, Torino (Italy), May 2024.
Ivana Filipović Petrović, Miguel López Otal, and Slobodan Beliga. Croatian Idioms Integration: Enhancing the LIdioms Multilingual Linked Idioms Dataset. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 4106–4112, Torino (Italy), ELRA and ICCL, May 2024.
Ignacio Huitzil and Fernando Bobillo. Datil: Herramienta para el aprendizaje de tipos datos difusos en ontologías difusas. Actas del XXII Congreso Español sobre Tecnologías y Lógica Fuzzy (ESTYLF 2024), pp. 355-356, A Coruña (Spain), June 2024.
Carlos Bobed, Fernando Bobillo and Eduardo Mena. Razonamiento semántico adaptado a los recursos. Actas de las XXVIII Jornadas de Ingeniería del Software y Bases de Datos (JISBD 2024), pp. 1-4, SISTEDES, A Coruña (Spain), June 2024.
Ángel Luis Garrido, María Soledad Pera and Carlos Bobed. SJORS - A Semantic Recommender System for Journalists. Actas de las XXVIII Jornadas de Ingeniería del Software y Bases de Datos (JISBD 2024), SISTEDES, A Coruña (Spain), June 2024.
Fernando Bobillo, Carlos Bobed, Eduardo Mena and Umberto Straccia. A Fuzzy Logic-based Approach to Semantic Query Answering with Missing Values. Proceedings of the 33rd IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2024), pp. 1-8, Yokohama (Japan), IEEE, July 2024.
Ángel Luis Garrido, Jonathan Rodríguez, Mariano Sánchez, José Manuel Antón, Carlos Bobed and Eduardo Mena. HEAT-SEER - A Real-Time Data Integration Experience through Semantic Technologies in a Graphic Arts Company. Proceedings of the 2024 IEEE 22nd International Symposium on Intelligent Systems and Informatics (SISY 2024), Pula (Croatia), IEEE, pp. 121-122, September 2024.