Pierre-Henri Paris

Pierre-Henri Paris pjɛʁ‿ɑ̃ʁi paʁis

Associate Professor in Artificial Intelligence

Paris-Saclay University

Biography

I am a Maître de Conférences (Associate Professor) at Paris-Saclay University since September 2024. Previously, I worked at Télécom Paris, Institut Polytechnique de Paris. During my PhD at Conservatoire National des Arts et Métiers and Sorbonne University, I focused on entity linking and data quality of knowledge graphs.

My research interests include artificial intelligence, knowledge graphs, and natural language processing.

Interests
  • Artificial Intelligence
  • Knowlegde Graphs
  • Information Extraction
  • Knowledge Representation
  • NLP
Education
  • PhD in Artificial Intelligence, 2020

    Sorbonne University and CNAM

  • M.Sc. in Artificial Intelligence, 2016

    CNAM

  • M.Sc. in Mathematics, 2008

    CY Cergy Paris University (incomplete)

Research Interests

Experience

 
 
 
 
 
Associate Professor (Maître de Conférences)
September 2024 – Present Orsay, France
 
 
 
 
 
Postdoctoral Researcher
Telecom Paris
September 2020 – August 2024 Palaiseau, France

Works on LLM, logic, and knowledge graphs.

Responsibilities include:

  • Collaborations with industrial and academic partners
  • Managing meetings with partners
  • PhD and student supervision
 
 
 
 
 
Researcher
Sorbonne University and CNAM
October 2016 – August 2020 Paris, France
Entity linking and data quality of knowledge graphs.
 
 
 
 
 
Researcher
IGN
March 2016 – September 2016 Saint-Mandé, France
Approach aimed at exploring historical texts describing a diachronic phenomenon based on named-entity linking.
 
 
 
 
 
Software Engineer
LGC
September 2007 – February 2016 Cergy-Pontoise, France

Responsibilities include:

  • Requirements analysis, solution design, development, testing and validation, maintenance, documentation to meet the company’s needs for various projects: e-commerce sites, inventory management, embedded systems, etc.
  • Student supervision
  • Team management

Supervision involvement

Projects

Yago 4.5
The last version of the Yago knowledge base is out!
This enhanced version of the YAGO knowledge base integrates extensive elements of the Wikidata taxonomy, providing a cleaner, logically consistent structure for improved information retrieval and automated reasoning.
MAFALDA
MAFALDA is a new benchmark for fallacy classification that consolidates previous datasets into a unified taxonomy. The benchmark includes manual annotations with explanations, a specialized annotation scheme for subjective NLP tasks, and a novel evaluation method to handle subjectivity. We assessed both language models and human performance on MAFALDA to evaluate their fallacy detection and classification capabilities.
NoRDF
The NoRDF Project is a scientific project at Telecom Paris that aims to model and extract complex information from natural language text. More precisely, we want to enrich knowledge bases with events, causation, conditions, precedence, stories, negation, and beliefs.

Posts

📚 The Vagueness of Vagueness in Noun Phrases

This vision and survey paper is part of the NoRDF project and is a joint work between Pierre-Henri Paris, Syrine El Aoud and Fabian Suchanek (Télécom Paris, DIG team).

For more details, you can check the full AKBC paper here.

📚 Non-Named Entities – The Silent Majority

This is a joint work between Pierre-Henri Paris and Fabian Suchanek (both at Télécom Paris, DIG team).

Non-Named Entities?

Human texts usually contain a lot of noun phrases that often function as verb subjects and objects, as predicative expressions and as the complements of prepositions. When building knowledge bases, proper nouns in noun phrases are extracted and used as entities equipped with facts to populate knowledge bases. During this process, noun phrases that are not proper nouns are left unmapped, thus ignoring a vast amount of information. Consider the following example which contains a named entity and two non-named entities:

Recent Publications

Quickly discover relevant content by filtering publications.
(2024). MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), NAACL 2024, Mexico City, Mexico, June 16-21, 2024.

Cite DOI URL

(2024). Neurosymbolic Methods for Dynamic Knowledge Graphs. CoRR.

Cite DOI URL

(2024). YAGO 4.5: A Large and Clean Knowledge Base with a Rich Taxonomy. Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2024, Washington DC, USA, July 14-18, 2024.

Cite DOI URL

(2023). Integrating the Wikidata Taxonomy into YAGO. CoRR.

Cite DOI URL

(2023). MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification. CoRR.

Cite DOI URL