FRELS: Fast and Reliable Estimated Linguistic Summaries

Grégory Smits; Pierre Nerzic; Marie-Jeanne Lesot; Olivier Pivert

Communication Dans Un Congrès Année : 2019

FRELS: Fast and Reliable Estimated Linguistic Summaries

(1) , (1) , (2) , (1)

1
2

Grégory Smits

Fonction : Auteur
PersonId : 967948

Symbolic and Human-centric view of dAta MANagement

Pierre Nerzic

Fonction : Auteur
PersonId : 959719

Symbolic and Human-centric view of dAta MANagement

Marie-Jeanne Lesot

Fonction : Auteur
PersonId : 14208
IdHAL : marie-jeanne-lesot
ORCID : 0000-0002-3604-6647
IdRef : 085526282

Learning, Fuzzy and Intelligent systems

Olivier Pivert

Fonction : Auteur
PersonId : 881826

Symbolic and Human-centric view of dAta MANagement

Résumé

The linguistic summarization of a dataset is a process whose complexity depends linearly on the size of the dataset and exponentially on the size of the fuzzy vocabulary. To efficiently summarize large datasets stored in Relational DataBases, reliable estimated cardinalities can be derived from statistics about the data distribution maintained by the RDB Management System, with no expensive data scans. This paper proposes to improve the precision of such estimated summaries while preserving their efficiency, by enriching the statistics-based approach with local scan-based corrections when needed: the proposed FRELS method provides efficient strategies both for identifying the needs and performing the corrections. Experiments conducted on real data show that FRELS remains incomparably more efficient than data-scan-based approaches to data summarization and offers a better precision than purely statistics-based approaches. The generation of estimated linguistic summaries takes a couple of seconds, even for datasets containing millions of tuples, with a reliability of more than 95%.

Domaines

Recherche d'information [cs.IR] Base de données [cs.DB]

Fichier principal

versionSoumise.pdf (376.42 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Grégory SMITS : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02116137

Soumis le : mardi 30 avril 2019-17:10:26

Dernière modification le : samedi 7 octobre 2023-21:36:22

Dates et versions

hal-02116137 , version 1 (30-04-2019)

Identifiants

HAL Id : hal-02116137 , version 1

Citer

Grégory Smits, Pierre Nerzic, Marie-Jeanne Lesot, Olivier Pivert. FRELS: Fast and Reliable Estimated Linguistic Summaries. IEEE International Conference on Fuzzy Systems, Jun 2019, New-Orleans, United States. ⟨hal-02116137⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES ENSSAT IRISA LIP6 CENTRALESUPELEC IRISA-D7 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES SORBONNE-UNIVERSITE SU-SCIENCES UR1-MATH-NUM

96 Consultations

286 Téléchargements

FRELS: Fast and Reliable Estimated Linguistic Summaries

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager