News
- Delighted to be joining MIT LIDS as a visiting scientist in the DataToAI group leaded by K. Veeramachaneni
- Paper accepted at KDD Explorations (To appear in June 2022) on "The Need for Interpretable Features: Motivation and Taxonomy" with A. Zytek, I. Arnaldo, D. Liu, and K. Veeramachaneni.
- Paper accepted at ICPRAI 2022 on "Stochastic pairing for contrastive anomaly detection on time series" with G. Chambaret, F. Bouchara, E. Bruno, V. Martin and F. Chaillan.
- Paper accepted at SIGMOD 2022 (ACM International Conference on Management of Data) on "Sintel: An Overarching Ecosystem for End-to-End Time Series Anomaly Detection" with S. Alnegheimish, D. Liu, C. Sala, and K. Veeramachaneni.
- Paper accepted at ICDE 2022 (International Conference on Data Engineering) on "Provenance-aware Discovery of Functional Dependencies on Integrated Views" with U. Comignani, N. Novelli, and A. Bonifati.
- Tutorial presented at KDD 2021 on August 15, 2021 on Challenges in KDD and ML for Sustainable Development with D. Dao, S. Ermon, and B. Goswami. [website] [abstract]
- Best Demo paper at CIKM 2021 on "DORA The Explorer: Exploring Data with Interactive Deep Reinforcement Learning" with A. Personnaz, S. Amer-Yahia, M. Fabricius, S. Subramanian. [pdf]
Research Focus
I am a Research Director in Data Analytics at IRD, the French research institute on Sustainable Development and my research is focused on designing methods, algorithms, and systems that assist the users in complex and necessary tasks for data intelligence and critical decision making. These tasks combine core data management techniques (including data integration, fusion, cleaning, and preparation) with statistical machine learning methods. The final goal is to let the users focus exclusively on the logic of their application, without being concerned by the underlying models or the execution details of data preprocessing, feature and ML model engineering. I design, code techniques and end-to-end analytical pipelines (in Python and R) on the following key aspects of Data Science and AI:
- Detection and automatic correction of anomalies and data quality problems;
- Data cleaning, integration, fusion, and preparation for data analytics and ML;
- Detection of falsified information (fake news), fact-checking, and truth discovery;
- Applied machine learning with use cases in sustainability science, healthcare and biomedical domains, environmental and Earth Observation sciences.
Publications
Partially on:
DBLP
ResearchGate
GoogleScholar
- A. Zytek, I. Arnaldo, D. Liu, L. Berti-Équille, and K. Veeramachaneni. The Need for Interpretable Features: Motivation and Taxonomy. KDD Explorations, June 2022.
Download: [pdf]
- S. Alnegheimish, D. Liu, C. Sala, L. Berti-Équille, and K. Veeramachaneni. Sintel: An Overarching Ecosystem for End-to-End Time Series Anomaly Detection. Proceedings of 2022 ACM SIGMOD International Conference on Management of Data (SIGMOD 2022).
Download: [pdf] [code]
- G. Chambaret, L. Berti-Équille, F. Bouchara, E. Bruno, V. Martin, and F. Chaillan. Stochastic pairing for contrastive anomaly detection on time series. Proceedings of the 3rd International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI 2022), June 2022, Paris, France.
Download: [pdf]
- U. Comignani, L. Berti-Équille, N. Novelli, and A. Bonifati. Provenance-aware Discovery of Functional Dependencies on Integrated Views. Proceedings of IEEE International Conference on Data Engineering (ICDE 2022).
Download: [pdf] [code]
- A. Chibah, S. Amer-Yahia, L. Berti-Équille. A Framework for Statistically Sound Customer Segment Search. Proceedings of IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA), Porto, Portugal, 2021.
Download: [pdf] [code]
- A. Personnaz, S. Amer-Yahia, L. Berti-Équille, M. Fabricius, S. Subramanian. Balancing Familiarity and Curiosity in Data Exploration with Deep Reinforcement Learning. Proceedings of the 4th Workshop in Exploiting AI Techniques for Data Management in conjunction with SIGMOD 2021, p. 16-23, 2021. Download: [pdf] [slides] [code]
- A. Personnaz, S. Amer-Yahia, L. Berti-Équille, M. Fabricius, S. Subramanian. DORA The Explorer: Exploring Data with Interactive Deep Reinforcement Learning [demonstration paper]. Proceedings of CIKM 2021.
Best Demo Download: [pdf] [application] [code] [video ]
- A. Chibah, S. Amer-Yahia, L. Berti-Équille. QeNoBi: a system for QuErying and miNing BehavIoral patterns [demonstration paper]. Proceedings of the 2021 IEEE 37th International Conference on Data Engineering (ICDE), pp. 2673-2676, 2021.
Download: [pdf] [code] [video]
- G. Chambaret, L. Berti-Équille, F. Bouchara, E. Bruno, V. Martin, F. Chaillan. Amélioration du pronostic par apprentissage profond pour des applications de maintenance prédictive, EGC 2021. Revue des Nouvelles Technologies de l’Information, vol. RNTI-E-37 pp. 325-332, Janvier 2021.
Download: [pdf]
- L. Berti-Équille, D. Dao, S. Ermon, B. Goswami. Challenges in KDD and ML for Sustainable Development. Tutorial at the International Conference on Knowledge Discovery and Data Mining (KDD 2021), Singapore, 15 August 2021.
Download: [website] [abstract]
- Robin Jarry, Marc Chaumont, Laure Berti-Équille, Gérad Subsol. Assessment of CNN-based Methods for Poverty Estimation from Satellite Images. Proc. of the 11th IAPR International Workshop on Pattern Recognition in Remote Sensing (PRRS) in conjunction with the International Conference on Pattern Recognition (ICPR 2020). [pdf]
- Ugo Comignani, Laure Berti-Équille, Noël Novelli. Discovering Multi-Table Functional Dependencies Without Full Join Computation. CoRR abs/2012.06237, 2020.
- Laure Berti-Équille: Active Reinforcement Learning for Data Preparation: Learn2Clean with Human-In-The-Loop. CIDR 2020 [pdf]
- Ugo Comignani, Noël Novelli, Laure Berti-Équille: Data Quality Checking for Machine Learning with MeSQuaL. EDBT 2020: 591-594 [pdf]
- Laure Berti-Équille. Truth Discovery. In: Sakr S., Zomaya A. (eds) Encyclopedia of Big Data Technologies. Springer, Cham, 2019. [Author Copy]
- Laure Berti-Équille. Reinforcement Learning for Data Preparation with Active Reward Learning. Proc. of INSCI 2019, LNCS 11938, 121-132.
- Laure Berti-Équille: ML-Based Knowledge Graph Curation: Current Solutions and Challenges. WWW (Companion Volume) 2019: 938-939
- Laure Berti-Équille. Learn2Clean: Optimizing the Sequence of Tasks for Web Data Preparation. Proceedings of the Web Conference 2019, pp. 2580-2586, San Francisco, May 2019. [pdf]
- Laure Berti-Équille. Reinforcement Learning for Data Cleaning and Data Preparation, HILDA workshop in conjunction with SIGMOD 2019, Amsterdam, The Netherlands, July 2019. [pdf] [slides]
- Laure Berti-Équille, Ji Meng Loh, Saravanan Thirumuruganathan. Are Outlier Detection Methods Resilient to Sampling? (2019) CoRR abs/1907.13276
- Laure Berti-Équille, Hazar Harmouch, Felix Naumann, Noël Novelli, Saravanan Thirumuruganathan. Discovery of Genuine Functional Dependencies from Relational Data with Missing Values. INFORSID 2019, pp. 287-288
- Andrés Troya-Galvis, Pierre Gançarski, Laure Berti-Équille. Remote Sensing Image Analysis by Aggregation of Segmentation-Classification Collaborative Agents. Pattern Recognition, 73: 259-274, 2018. [Publisher Site]
- Laure Berti-Équille, Hazar Harmouch, Felix Naumann, Noël Novelli, Saravanan Thirumuruganathan. Discovery of Genuine Functional Dependencies from Relational Data with Missing Values. Proceedings of the VLDB Endowment (PVLDB), Volume 11, No. 8, April 2018. [pdf]
- Laure Berti-Équille, Angela Bonifati, Tova Milo. Machine Learning to Data Management: A Round Trip. Tutorial at the International Conference on Data Engineering (ICDE 2018), Paris, April 2018. [Publisher Site], [slides]
- Laure Berti-Équille. Qualité des données. Techniques de l'Ingénieur, Dossier H3700v2, 10 oct. 2018.[Publisher Site]
- Andrés Troya-Galvis, Pierre Gançarski, Laure Berti-Équille. Interactions segmentation-classification dans un cadre multi-paradigme pour l'analyse d'images de télédétection. Revue d'Intelligence Artificielle 31(1-2): 133-152, 2017. [Publisher Site]
- Saravanan Thirumuruganathan, Laure Berti-Équille, Mourad Ouzzani, Jorge-Arnulfo Quiane-Ruiz and Nan Tang, UGuide – User-Guided Discovery of FD-Detectable Errors. Proceedings of the 2017 ACM SIGMOD/PODS Conference, Chicago, May 2017. [pdf]
- Eva C. Serrano Balderas, Laure Berti-Équille, Ma. Aurora Armienta Hernandez, and Corinne Grac, Principled Data Preprocessing: Application to Biological Aquatic Indicators of Water Pollution. Proceedings of the DEXA-BIOKDD'17 workshop, Lyon, France, August 2017. [pdf]
- Eva C. Serrano Balderas, Ma. Aurora Armienta Hernandez, Laure Berti-Équille, Corinne Grac and Jean-Christophe Desconnets, Evaluation of Heavy Metals, Pesticides and Emergent pollutants content in the Tula river Mexico. Proceedings of the the 10th European Symposium for Freshwater Sciences (SEFS10), Olomouc, Czech Republic, July 2017. [ResearchGate]
- Tahar Zanouda, Sofiane Abbar, Laure Berti-Équille, Kushal Shah, Abdelkader Baggag, Sanjay Chawla, Jaideep Srivastava. On the Role of Political Affiliation in Human Perception: The Case of Delhi OddEven Experiment. Proceedings of Social Informatics 2017, Oxford UK, September 2017. [pdf]
- Laure Berti-Équille, Yury Zhauniarovich. Profiling DRDoS Attacks with Data Analytics Pipeline. Proceedings of the 26th ACM International on Conference on Information and Knowledge Management (CIKM 2017), Singapore, November 2017. [pdf]
- Andrés Troya-Galvis, Pierre Gançarski, Laure Berti-Équille. A collaborative framework for joint segmentation and classification of remote sensing images. Advances in Knowledge Discovery and Management, 2016. [pdf]
- Laure Berti-Équille, Mouhamadou Lamine Ba. Veracity of Big Data: Challenges of Cross-modal Truth Discovery, ACM Journal of Data and Information Quality, July 2016. [pdf]
- Divy Agrawal, Lamine Ba, Laure Berti-Équille, Sanjay Chawla, Ahmed Elmagarmid, Hossam Hammady, Yasser Idris, Zoi Kaoudi, Zuhair Khayyat, Sebastian Kruse, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Mohammed J Zaki. Rheem: Enabling Multi-Platform Task Execution (Demo). Proceedings of the ACM SIGMOD Conference, 2016. [pdf]
- Hatim Chahdi, Nistor Grozavu, Isabelle Mougenot, Laure Berti-Équille and Younès Bennani. On the Use of Ontology as a priori Knowledge into Constrained Clustering, Proceedings of the 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA’2016), Montreal, Canada, October 2016. [HAL]
- Laure Berti-Équille, Monica Scannapieco. Quality of Web Data (Chapter). In the 2nd Edition of the book Data Quality: Concepts, Methodologies and Techniques, Springer, 2016. [Publisher Site][Author Copy]
- Laure Berti-Équille, Verikat N. Gudivada, Rihan Hai, Christoph Quix, Hongzhi Wang, (Eds) Proceedings of the International Quality in Databases workshop (QDB 2016) in conjunction with VLDB 2016, Delhi, India, September 2016. [RWTH Publication]
- Ismael Caballero, Manuel Ángel Serrano, Laure Berti-Équille (Eds). Proceedings of the 21st International Conference on Information Quality, Ciudad Real (Spain), June 2016. [pdf]
- Andrés Troya-Galvis, Pierre Gançarski, Laure Berti-Équille. Collaborative segmentation and classification for remote sensing image analysis. Proceedings of the International Conference on Pattern Recognition (ICPR 2016), Cancun, Mexico, December 2016. [Publisher Site]
- Hatim Chahdi, Nistor Grozavu, Isabelle Mougenot, Laure Berti-Équille and Younès Bennani. Towards Ontology Reasoning for Topological Cluster Labeling, Proceedings of the 23rd International Conference on Neural Information Processing (ICONIP 2016), Kyoto, Japan, October 2016. [HAL][pdf]
- Sofiane Abbar, Tahar Zanouda, Laure Berti-Équille, Javier Borge-Holthoefer. Using Twitter to Understand Public Interest in Climate Change: The case of Qatar. Proceedings of the 1st International Workshop on the Social Web for Environmental and Ecological Monitoring (SWEEM 2016) Workshop, Koln, Germany, May 2016. [arXiv]
- Raghvendra Mall, Laure Berti-Équille, Halima Bensmail. Metabolomic Data Profiling to Diabetes Research in Qatar. Proceedings of the 7th International Workshop on Biological Knowledge Discovery and Data Mining (BIOKDD'16) in conjunction with the 27th International Conference on Database and Expert Systems Applications (DEXA’16), June 2016. [Publisher Site]
- Eva Carmina Serrano Balderas, Laure Berti-Équille, Ma. Aurora Armienta Hernandez, Jean- Christophe Desconnets. Water Quality Data Analytics. Proceedings of the 8th International Congress on Environmental Modelling and Software, International Environmental Modelling and Software Society (iEMSs), Toulouse, France, 2016. [pdf]
- Hatim Chahdi, Nistor Grozavu, Isabelle Mougenot, Laure Berti-Équille, Younès Bennani. Génération de contraintes pour le clustering à partir d'une ontologie - Application à la classification d'images satellites. Proceedings of EGC 2016, Reims, France, pp. 81-92, January2016.
- Andrés Troya-Galvis, Pierre Gançarski, Laure Berti-Équille. Un cadre collaboratif pour la segmentation et la classification d'images de télédétection. Proceedings of EGC 2016, Reims, France, pp. 297-308, January 2016.
- Mouhamadou Lamine Ba, Laure Berti-Équille, Kushal Shah, Hossam M. Hammady. VERA: A Platform for Veracity Estimation over Web Data (demo). Proceedings of the 25th World Wide Web Conference (WWW 2016), Montréal, Canada, 2016. [pdf]
- Laure Berti-Équille, Javier Borge-Holthoefer, Veracity of Data: From Truth Discovery Computation Algorithms to Models of Misinformation Dynamics. Synthesis Lectures on Data Management, Morgan & Claypool Publishers, December 2015. [Publisher Site]
- Khalid Belhajjame, Domenico Beneventano, Laure Berti-Équille, James Cheney, Víctor Cuevas-Vicenttín, Tom De Nies, Helena Galhardas, Ashish Gehani, Boris Glavic, Paul T. Groth, Olaf Hartig, Scott Jensen, Andrea Maurino, Gianni Mecca, Renée J. Miller, Luc Moreau, Mourad Ouzzani, Jaehong Park (2015). Editorial. J. Data and Information Quality, 5(3): 8.
- Andres Troya-Galvis, Pierre Gançarski, Nicolas Passat, Laure Berti-Équille, Unsupervised Quantification of Under and Over Segmentation for Object Based Remote Sensing Image Analysis, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 8(4), JSTARS-2014-00825.R3, 2015. [pdf]
- Lilia Berrahou, Nathalie Lalande, Eva Serrano, Guilhem Molla, Laure Berti-Équille, Sandro Bimonte, Sandra Bringay, Flavie Cernesson, Corinne Grac, Dino Ienco, Florence Le Ber and Maguelonne Teisseire. A quality-aware spatial datawarehouse for querying hydroecological data, Computers & Geosciences, 2015. [pdf]
- Eva Carmina Serrano Balderas, Corinne Grac, Laure Berti-Équille, Ma. Aurora Armienta Hernandez. Potential application of biological indices based on macroinvertebrates on Mexican streams. Ecological Indicators Journal, 2015. [HAL]
- Laure Berti-Équille. Data veracity estimation with ensembling truth discovery methods. Proceedings of IEEE Big Data Conference 2015, pp. 2628-2636, 2015.[Publisher Site]
- Ismael Caballero, Laure Berti-Équille, Mario Piattini. Towards Principled Data Science Assessment - The Personal Data Science Process (PdsP). Proceedings of International Conference on Enterprise Information Systems, ICEIS, (1):374-378, 2015. [Publisher Site]
- Dalia Attia Waguih, Naman Goel, Hossam M. Hammady, Laure Berti-Équille. AllegatorTrack: Visualizing and Explaining Truth Discovery Results from Multisource Data (Demo). Proceedings of ICDE 2015, Seoul, Korea, April 2015. [pdf]
- Laure Berti-Équille, Ji Meng Loh, Tamraparni Dasu, A masking index for quantifying hidden glitches, Knowledge and Information Systems, Springer, July 2014, Online ISSN 0219-3116. [Publisher Site]
- Amrapali Zaveri, Andrea Maurino, Laure Berti-Équille. Web Data Quality: Current State and New Challenges. Int. J. Semantic Web Inf. Syst., 10(2): 1-6, 2014. [Publisher Site]
- Dalia Attia Waguih, Laure Berti-Équille, Truth Discovery Algorithms: An Experimental Evaluation. CoRR abs/1409.6428 (2014). [arXiv.org]
- Laure Berti-Équille, Ji Meng Loh, Tamraparni Dasu, A masking index for quantifying hidden glitches, Proceedings of ICDM 2013, Dallas, TX, USA, December 2013. [pdf]
- Mohamed Yakout, Laure Berti-Équille and Ahmed Elmagarmid, "Don't be SCAREd: Use SCalable Automatic REpairing with Maximal Likelihood and Bounded Changes", Proceedings of the 2013 ACM SIGMOD/PODS, New York, June 2013. [pdf]
- Xin Luna Dong, Laure Berti-Équille, Divesh Srivastava. Data fusion: Resolving conflicts from mutiple sources. Proceedings of the 14th nternational Conference on Web-Age Information Management (WAIM), 2013. [pdf]
- Vincent Moron, Barbero Renaud, Morgan Mangeas, Laurent Borgniet, Thomas Curt, Laure Berti-Équille. Prediction of September-December fire in New Caledonia (SW Pacific) using July Nino 4 sea surface temperature index. Journal of Applied Meteorology and Climatology, 52 (3), pp. 623-633. ISSN 1558-8424, 2013. [Publisher Site] [pdf]
- Fouzia Moussouni, Laure Berti-Équille. Cleaning, Integrating, and Warehousing Genomic Data from Biomedical Resources (Chapter). In Biological Knowledge Discovery Handbook: Preprocessing, Mining and Postprocessing of Biological Data. Wiley Book Series on Bioinformatics: Computational Techniques and Engineering, Wiley-Blackwell, John Wiley &Sons Ltd, USA. ISBN: 978-1-118-13273-9, September 2013. [pdf]
- Xin Luna Dong, Laure Berti-Équille, Divesh Srivastava, Data Fusion: Resolving Conflicts from Multiple Sources (Chapter). In Managing and Mining Uncertain Data. Springer, 2012. [pdf]
- Melanie Herschel, Laure Berti-Équille, Application de mesures de distance pour la détection de problèmes de qualité de données, In La qualité et la gouvernance des données au service de la performance des entreprises, Hermès-Lavoisier, September 2012. [Amazon]
- Alice Novello, Doris Barboni, Laure Berti-Équille, Jean-Charles Mazur, Pierre Poilecot, Patrick Vignaud. Phytolith Signal of Aquatic Plants and Soils in Chad, Central Africa. Review of Palaeobotany and Palynology, 178, pp. 43-58, 2012. [ResearchGate]
- Laure Berti-Équille, Multi-Scale Data Integration Challenges in the Open Science Data Space. Special issue on Data Integration of the iT - Information Technology Journal, Vol. 54, No. 3, 05/2012. [Publisher Site]
- Laure Berti-Équille, Isabelle Comyn-Wattiau, Monica Scannapieco (Eds), Proceedings of the 17th International Conference on Information Quality (ICIQ 2012), Paris, 16-17 November 2012. [pdf]
- Laure Berti-Équille (Ed), La qualité et la gouvernance des données au service de la performance des entreprises (in French). Hermès-Lavoisier, September 2012. [Amazon] [Publisher Site]
- Noel Novelli, Laure Berti-Équille, Christophe Hurter. ADVISU: Interactive Visualization of Anomalies and Dependencies from Massive Scientific Datasets (Demo). Proceedings of the National Conference on Extraction and Management of Knowledge (EGC-Extraction et Gestion des Connaissances), Bordeaux, France, January 2012. [pdf]
- Jérôme Azé, Nicolas Béchet, Laure Berti-Équille, Sylvie Guillaume, Mathieu Roche, Fatiha Saïs (Eds). Mesurer et évaluer la qualité des données et des connaissances, Proceedings of the 7th Workshop Data and Knowledge Quality (QDC Qualité des Données et des Connaissances) in conjunction with the French conference EGC 2011 (Extraction et Gestion des Connaissances - EGC), Éditions Hermann, RNTI-E-22, 2011. ISBN : 9782705682866 [pdf]
- Laure Berti-Équille, Isabelle Comyn-Wattiau, Mireille Cosquer, Zoubida Kedad, Sylvaine Nugier, Verónika Peralta, Samira Si-Saïd Cherfi, Virginie Thion-Goasdoué. Assessment and analysis of information quality: a multidimensional model and case studies. International Journal of Information Quality, vol. 2(4), pp. 300-323, 2011, (doi:10.1504/IJIQ.2011.043780). [pdf]
- César Guerra-García, Ismael Caballero, Laure Berti-Équille, Mario Piattini. DAQ_UWE : A Framework for Designing Data Quality Aware Web Applications. Proc. of the 16th International Conference on Information Quality (ICIQ), Adelaide, Australia, November 2011. (Best Paper Award). [Publisher Site]
- Laure Berti-Équille, Tamraparni Dasu, Divesh Srivastava : Discovery of complex glitch patterns : A novel approach to Quantitative Data Cleaning. Proceedings of the International Conference on Data Engineering, (ICDE 2011), pp. 733-744, Hannover, Germany, Avril 2011.
[Publisher Site]
- Minji Wu, Laure Berti-Équille, Amélie Marian, Cecilia M. Procopiuc, Divesh Srivastava. Processing Top-k Join Queries. Proceedings of VLDB (VLDB 2010) , Singapore, September 2010.[pdf]
- Xin Luna Dong, Laure Berti-Équille, Yifan Hu, Divesh Srivastava. SOLOMON: Seeking the Truth Via Copying Detection (Demo). Proceedings of VLDB (VLDB 2010), Singapore, September 2010. [pdf]
- Xin Luna Dong, Laure Berti-Équille, Yifan Hu, Divesh Srivastava. Global Detection of Complex Copying Relationships Between Sources. Proceedings of the VLDB Endowment (VLDB 2010), Singapore, September 2010.[pdf]
- Bernd Amann, Laure Berti-Équille, Zoé Lacroix, Maria-Esther Vidal. Challenges of Quality-Driven Resource Discovery. Resource Discovery - Third International Workshop, RED 2010, Paris, France, November 5, 2010, pp.181-189. Lecture Notes in Computer Science 6799 Springer 2012, ISBN 978-3-642-27391-9 [pdf]
- Jérôme Azé, Laure Berti-Équille, Sylvie Guillaume (Eds). Proceedings of the 6th Workshop QDC 2010 (Qualité des Données et des Connaissances) in conjunction with the French conference EGC 2010 (Extraction et Gestion des Connaissances - EGC), Hammamet, Tunisia, 26 Janvier 2010. [pdf]
- L. Berti-Équille, T. Dasu. New Directions for Data Quality Mining. Tutorial presented at the International Conference on Knowledge Discovery and Data Mining (KDD), Paris, France, 28 June 2009. [slides]
- L. Berti-Équille, T. Dasu. Data Quality Mining: New Directions. Tutorial presented at the 2009 IEEE International Conference on Data Mining (ICDM), Miami, Florida, USA, 7 December 2009. [slides]
- L. Berti-Équille, A. Das Sarma, X. L. Dong, A. Marian, and D. Srivastava. Sailing the information ocean with awareness of currents: discovery and application of source dependence. Proceedings of the Biennial Conference on Innovative Data Systems Research (CIDR), January 2009. [Publisher Site]
- X. L. Dong, L. Berti-Équille, and D. Srivastava. Integrating conflicting data: the role of source dependence. Proceedings of the International Conference on Very Large Databases (VLDB 2009), Lyon, France, August 2009. [pdf]
- X. L. Dong, Laure Berti-Équille, and D. Srivastava. Truth discovery and copying detection in a dynamic world. Proceedings of the International Conference on Very Large Databases (VLDB 2009), Lyon, France, August 2009. [pdf]
- V. Peralta, V. Thion-Goasdoué, Z. Kedad, L. Berti-Équille, , I. Comyn-Wattiau, S. Nugier, S. Sisaïd-Cherfi, Multidimensional Management and Analysis of Quality Measures for CRM Applications at EDF, Proceedings of the 14th International Conference on Information Quality (ICIQ’09), Hasso Plattner Institute, University of Potsdam, Germany, November 2009.
- Laure Berti-Équille, Melanie Herschel, Ahmed Elmagarmid (Eds). Proceedings of the 7th international workshop on Quality in Databases (QDB 2009) in conjunction with VLDB 2009, Lyon, France, 2009. [pdf]
- L. Berti-Équille. Tracing Data Pollution in Large Business Applications. Proceedings of the 13th International Conference on Information Quality (ICIQ’08), Massachusetts Institute of Technology, Cambridge, U.S.A., November 2008.
-
L. Berti-Équille. Measuring and Constraining Data Quality with Analytic Workflows. Proceedings of the 6th International Workshop on Quality in Databases (QDB'09) in conjunction with the International Conference on Very Large Databases (VLDB 2008), Auckland, New Zealand, August 2008.
-
J. Akoka, L. Berti-Équille, O. Boucelma, M. Bouzeghoub, I. Comyn-Wattiau, M. Cosquer, V. Goasdoué, Z. Kedad, S. Nugier V. Peralta, M. Quafafou, S. Sisaïd-Cherfi. Évaluation de la qualité des systèmes multisources. Une approche par les patterns. Proceedings of the 2nd Workshop on Data and Knowledge Quality (QDC 2008) in conjunction with the French National Conf. on Extraction and Management of Knowledge (Extraction et Gestion des Connaissances - EGC), Nice, France, January 29, 2008.
- Laure Berti-Équille, Quality-Extended Query Processing for Mediation Systems (Chapter). In Information Quality Management: Theory and Practice, pp. 23-50, Springer, 2007. [pdf]
- Laure Berti-Équille, Modelling and Measuring Data Quality for Quality-Awareness in Association Rule Mining (Chapter). In Quality Measures in Data Mining, pp. 101-126, Fabrice Guillet and Howard Hamilton (Eds.), Springer, 2007. [Publisher Site]
- Laure Berti-Équille, Quality Awareness for Managing and Mining Data, Habilitation à Diriger des Recherhes University of Rennes 1, France, 2007. [pdf] [slides]
- Laure Berti-Équille. Data Quality Awareness: a Case Study for Cost Optimal Association Rule Mining. Knowledge and Information Systems, 11(2):191-215, 2007.[pdf]
- Toufik Ahmed, Ahamed Asgari, Ahmed Mehaoua, Eugene Borcoci, Laure Berti-Équille, Georgious Kormentzas. End-to-End Quality of Service Provisioning Through an Integrated Management System for Multimedia Content Delivery. Special Issue of Computer Communications on Emerging Middleware for Next Generation Networks, 30(3), 638-651, 2007. [pdf]
- F. Moussouni, L. Berti-Équille, G. Rozé, O. Loréal and E. Guérin. Q-DEX : A Database Profiler for Generic Bio-Data Exploration and Quality Aware Integration, Proceedings of the Intl. Workshop on Approaches and Architectures for Web Data Integration and Mining in Life Sciences (WebDIM4LS) in conjunction with the 8th Intl. Conference on Web Information Systems Engineering (WISE2007), Nancy, France, December 2007. [pdf]
- J. Akoka, L. Berti-Équille, O. Boucelma, M. Bouzeghoub, I. Comyn-Wattiau, M. Cosquer, V. Goasdoué, Z. Kedad, S. Nugier V. Peralta, S. Si-Said-Cherfi, A Framework for Quality Evaluation in Data Integration Systems, Proceedings of the 9th International Conference on Enterprise Information Systems (ICEIS 2007), Madeira, Portugal, June 2007. [pdf]
- Laure Berti-Équille. Contributions to Quality-Aware Online Query Processing, IEEE Data Eng. Bull. 29(2):32-42, 2006. [PS]
- Laure Berti-Équille, Fabrice Guillet (Eds). Proceedings of the 2nd Workshop on Data and Knowledge Quality (QDC 2006) in conjunction with the French conference EGC 2006, Villeneuve d’Ascq, France, January 17, 2006. [pdf]
- Monica Scannapieco, Laure Berti-Équille. Report from the First and Second International Workshops on Information Quality in Information Systems - IQIS 2004 and IQIS 2005 in conjunction with ACM SIGMOD/PODS Conferences, ACM SIGMOD Record, 35(2):51-54, June 2006. [pdf]
- L. Berti-Équille. Qualité des données, Techniques de l’Ingénieur H3700, pp. 1-19, collection Technologies logicielles – Architecture des systèmes, 2006. [Publisher Site]
- J.-A. Benvenuti, L. Berti-Équille, E. Jacopin. Lessons Learned from Ontology Design. Proceedings of the 9th Intl. Protégé Conference, pp. 101-104, Stanford, CA, USA, July 23-26, 2006. [pdf]
- L. Berti-Équille, F. Guillet (Eds). Proceedings of the 2nd Workshop on Data and Knowledge Quality (QDC 2006) in conjunction with the French National Conf. on Extraction and Management of Knowledge (Extraction et Gestion des Connaissances - EGC), Villeneuve d’Ascq, France, January 17, 2006. [pdf]
- M. Scannapieco, L. Berti-Équille, F. Naumann, C. Batini, D. Srivastava. Report from the First and Second International Workshops on Information Quality in Information Systems - IQIS 2004 and IQIS 2005 in conjunction with ACM SIGMOD/PODS Conferences, ACM SIGMOD Record, 35(2):51-54, June 2006.
- L. Berti-Équille. Quality-Aware Association Rule Mining. Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2006), LNCS 3918, pp. 440-449, Singapore, April 9-12, 2006.
- L. Berti-Équille, Modèle décisionnel basé sur la qualité des données pour sélectionner les règles d’associations légitimement intéressantes. Actes de la Conférence Extraction et Gestion des Connaissances, (EGC 2006), Cépaduès, pp. 593-598, Villeneuve d’Ascq, France, January 17, 2006.
- Laure Berti-Équille, Carlo Batini, Divesh Srivastava (Eds). Proceedings of the 2nd ACM Workshop on Information Quality in Information Systems (IQIS 2005) in conjunction with ACM SIGMOD/PODS International Conference, Baltimore, MD, USA, June 17, 2005. [pdf]
- Laure Berti-Équille, Fabrice Guillet (Eds). Proceedings of the 1st Workshop on Data and Knowledge Quality (DKQ 2005) Workshop in conjunction with the French National Conference on Extraction and Management of Knowledge (Extraction et Gestion des Connaissances - EGC 2005), Paris, France, January 18, 2005. [pdf]
- L. Berti-Équille, C. Batini, D. Srivastava (Eds). Proceedings of the 2nd ACM Workshop on Information Quality in Information Systems (IQIS 2005) in conjunction with ACM SIGMOD/PODS International Conference, Baltimore, MD, USA, June 17, 2005. [pdf]
- L. Berti-Équille, F. Guillet (Eds). Proceedings of the 1st Workshop on Data and Knowledge Quality (DKQ 2005) Workshop in conjunction with the French National Conference on Extraction and Management of Knowledge (Extraction et Gestion des Connaissances - EGC 2005), Paris, France, January 18, 2005. [pdf]
- W. Jouve, B. Rousseau, L. Berti-Équille, Enriching Multimedia Content Description for Broadcast Environments: From A Unified Metadata Model to A New Generation of Authoring Tool. Proceedings of IEEE International Symposium on Multimedia (ISM 2005), pp. 87-94, IEEE Computer Society, Irvine, California, U.S.A., December 2005.
- L. Berti-Équille, F. Moussouni. Quality-Aware Integration and Warehousing of Genomic Data. Proceedings of the 10th Intl. Conference on Information Quality (ICIQ'05), pp. 442-454, Massachusetts Institute of Technology, Cambridge, MA, U.S.A., November 2005. [pdf]
- L. Berti-Équille, Assurer la qualité des données: un défi permanent pour les systèmes d'information, bases et entrepôts de données, Genie Logiciel, 74, pp. 13-20, Septembre, 2005.
- L. Berti-Équille. Cost of Low-Quality Data over Association Rules Discovery. Proceedings of International Symposium on Applied Stochastic Models and Data Analysis (AMSDA 2005), Brest, France, May 2005.
- J.-A. Benvenuti, L. Berti-Équille, E. Jacopin. When Protégé and Rules becomes Parsing for Learning. Proceedings of the First International Workshop Protégé with Rules in conjunction with the 8th Intl. Protégé Conference, Madrid, Spain, July 2005. [pdf]
- E. Guérin, G. Marquet, A. Burgun, O. Loréal, L. Berti-Équille, U. Leser, F. Moussouni. Integrating and Warehousing Liver Gene Expression Data and Related Biomedical Resources in GEDAW. Proceedings of the Second International Workshop on Data Integration in the Life Sciences (DILS 2005), pp. 158-174, San Diego, CA, U.S.A., July 2005. [pdf]
- A. Kouomou-Choupo, L. Berti-Équille, A. Morin. Optimizing Progressive Query-By-Example over Pre-Clustered Large Image Databases. Proceedings of the 2nd ACM SIGMOD International Workshop on Computer Vision meets DataBases (CVDB'05) in conjunction with ACM SIGMOD/PODS Conference, pp. 13-20, Baltimore, MD, USA, June 2005. Also published Actes des Journées Bases de Données Avancées (BDA 2005), Saint-Malo, France, pp. 215-226. [pdf]
- L. Berti-Équille. Recommendation of XML Documents Exploiting Quality Metadata and Views. Proceedings of the 2nd International Workshop on Data and Information Quality (DIQ 2005) in conjunction with the 17th Conference on Advanced Information Systems Engineering (CAiSE'05), Porto, Portugal, June 2005.
- L. Berti-Équille. Nettoyage des données XML: combien ça coûte ?, Actes du 1er Atelier Qualités des Données et des Connaissances (QDC 2005) en conjonction avec la conférence Extraction et Gestion des Connaissances (EGC 2005), Paris, France, pp. 11-18, Paris, Janvier 2005.
-
A. Kouomou-Choupo, L. Berti-Équille. Visual Feature Mining for Adapting Query-by-Example over Large Image Databases. Proceedings of International Workshop on Multidisciplinary, Video, and Audio Retrieval and Mining, Sherbrooke-Canada, October 2004.
-
L. Berti-Équille. Quality-Adaptive Query Processing over Distributed Sources. Proceedings of the 9th International Conference on Information Quality (ICIQ’04), pp. 285-296, Massachusetts Institute of Technology, Cambridge, MA, U.S.A., November 2004.
-
A. Kouomou-Choupo, L. Berti-Équille, A. Morin. Multimedia Indexing and Retrieval with Features Association Rules Mining. Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'2004), pp. 1299-1302, Taipei, Taiwan, June 2004.
-
L. Berti-Équille, A. Kouomou-Choupo, A. Morin. Feature Mining for Multimedia Indexing and Retrieval (Poster). Proceedings of the 5th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2004), Lisbon, Portugal, April 2004.
-
A. Morin, A. Kouomou-Choupo, L. Berti-Équille. Research in Image Databases: How Statistical Data Analysis Methods can enrich the Association Rules (Poster). Proceedings of the 16th Symposium of International Association for Statistical Computing (COMPSTAT'2004), Prague, Czech Republic, 2004.
-
L. Berti-Équille. Un état de l'art sur la qualité des données. Revue des Sciences et Technologies de l’information (RSTI), Numéro Spécial Qualité des Systèmes d'Information, Hermès, 9(5-6) :117-143, 2004.
-
A. Kouomou-Choupo, A. Morin, L. Berti-Équille. Recherche dans de grandes bases d'images fixes : une nouvelle approche guidée par les règles d'association. Revue Nationale des Technologies de l'Information RNTI-E-2 (Actes de conférence nationale Extraction et Gestion des Connaissances, EGC 2004), Cépaduès, pp. 65-70, 2004.
-
J-A. Benvenuti, L. Berti-Équille, É. Jacopin, Le projet SABRE : de l'ontologie а l'inférence, (Poster), Actes des 15èmes Journées francophones d'Ingénierie des Connaissances (IC’04), Lyon, France, May 2004. [pdf]
-
L. Berti-Équille. La qualité des données comme condition а la qualité des connaissances : un état de l'art. Mesures de qualité pour la fouille de données. Numéro spécial, Revue Nationale des Technologies de l'Information (RNTI-E), Cépaduès, 2004.
-
A. Kouomou-Choupo, A. Morin, L. Berti-Équille. Recherche par le contenu dans une base d’images fixes : l’intérêt des règles d’association, Actes de l’atelier Fouille de données complexes dans un processus d’extraction de connaissances (FDC 2004) en conjonction avec la conférence nationale Extraction et Gestion des Connaissances EGC 2004, pp. 73-82, Clermont Ferrand, France, 2004.
-
L. Berti-Équille. Quality-Based Recommendation of XML Documents. Journal of Digital Information Management, 1(3):117-128, September 2003.
-
L. Berti-Équille. Quality-Extended Query Processing for Distributed Sources. Proceedings of the International Workshop on Data Quality in Cooperative Information Systems (DQCIS 2003) in conjunction with the 9th International Conference on Database Theory (ICDT 2003), pp. 55-63, Siena, Italy, January 2003.
-
J.-A. Benvenuti, L. Berti-Équille, É. Jacopin. Ontological Parsing of XML Documents: A Use Case in the Domain of Training French Military Staff. Proceedings of the 21st World Conference on Open Learning and Distance Education, Hong-Kong, February 2003.
-
L. Berti-Équille. Renseigner la qualité des connaissances par la fusion d'indicateurs sur la qualité des données. Revue des Sciences et Technologies de l’information RSTI-RIA-ECA (Actes de la conférence nationale Extraction et Gestion des Connaissances, EGC 2003), Cépaduès, 17(1-2-3), 2003.
-
L. Berti-Équille. Annotation et recommandation collaboratives de documents selon leur qualité (in French). Revue Ingénierie des Systèmes d'Information (ISI-NIS), Numéro Spécial Recherche et Filtrage d'Information, Hermès, 7(1-2/2002):125-156, 2002.
-
L. Berti-Équille. Integration of Biological Data and Quality-driven Source Negotiation. Proceedings of the Intl. Conference on Conceptual Modeling (ER'2001), LNCS, Volume 2224, pp. 256-269, Yokohama, Japan, November 2001.
-
L. Berti-Équille, A. Arcade. Integration of Biological Data on Transcriptome. Revue Ingénierie des Systèmes d'Information (ISI-NIS), Numéro Spécial Interopérabilité et Intégration des Systèmes d'Information, Hermès, 6(3/2001):61-86, 2001.
-
L. Berti-Équille, D. Graveleau, Documents, données et méta-données : une approche mixte pour un système de veille, Actes du colloque Veille Stratégique, Scientifique et Technologique (VSST01), Barcelona, Spain, pp. 115-126, 2001.
-
E. Guérin, F. Moussouni, L. Berti-Équille, Intégration des données sur le transcriptome, Actes de la journée de travail bi-thématique du GDR-PRC I3, pp. 219-228, Lyon, France.
-
L. Berti-Équille. Quality and Recommendation of Multi-source Data for Assisting Technological Intelligence Applications. Proceedings of the Intl. Conference on Database and Expert Systems Applications (DEXA'99), LNCS, Volume 1677, pp. 282-291, Florence, Italy, September 1999.
-
L. Berti-Équille. Qualité des données multi-sources et recommandation multi-critère, Actes du Congrès francophone INFormatique des ORganisations et Systèmes d’Information Décisionnels (INFORSID'99), Toulon, France, pp. 185-204, 1999.
-
L. Berti-Équille. Qualité des données et leur recommandation : modèle conceptuel, formalisation et application а la veille technologique, Thèse, Université de Toulon et du Var, 1999.
-
L. Berti-Équille. From Data Source Quality to Information Quality: the relative dimension. Proceedings of the 3rd Conference on Information Quality (ICIQ'98), pp. 247-264, Massachusetts Institute of Technology, Cambridge, MA, U.S.A., October 1998.
-
L. Berti-Équille, J.-L. Damoiseaux, E. Murisasco. Combining the Power of Query Languages and Search Engines for On-line Document and Information Retrieval: the QIRi@D Environment. Proceedings of the Intl. Workshop on Principles of Digital Document Processing (PODDP'98), LNCS, Volume 1481, pp. 116-127, St-Malo, France, Mars 1998.
-
L. Berti-Équille. Out of Overinformation by Information Filtering and Information Weighting. Proceedings of the 2nd Conference on Information Quality (ICIQ'97), pp. 187-193, Massachusetts Institute of Technology, Cambridge, MA, U.S.A, October 1997.
-
L. Berti-Équille. Designing and Filtering On-Line Information Quality: New Perspectives for Information Service Providers, Proceedings of the 4th International Conference on Ethical Issues of Information Technology (ETHICOMP98), Utrecht, The Netherlands, pp. 79-88, 1998.
-
L. Berti-Équille, D. Graveleau. Contribution а la définition d’un vigiciel : quelle modélisation de l’information factuelle, événementielle et référentielle ? Actes du colloque français Veille Stratégique Scientifique et Technologique (VSST’98), Ile Rousse, France, pp. 227-240, 1998.
-
L. Berti-Équille. Merging an Active Database and a Reflective System: Modeling a New Several Architecture, (Poster), Proceedings of the 15th British National Conference on Databases (BNCOD’97), London, U.K., pp. 119-120, 1997.
Projects
 |
MPA-POVERTY: Can marine protected areas alleviate poverty in the context of land desertification?
6-partner project, PI: Prof. D. Mouillot (MARBEC, Univ. Montpellier 2, France)
ANR French National Agency for Research, Jan. 2020 – Jan. 2024.
Role: Work Package Leader
[info]
|
 |
QUALIHEALTH: Enhancing the Quality of Health Data
7-partner project, PI: Prof. A. Bonifati (LIRIS, Univ. Lyon 1, France)
ANR French National Agency for Research, Feb. 2019 – Jan. 2023.
Role: Work Package Leader
[info]
|
 |
COCLICO: COllaboration, Classification, Incrémentalité et COnnaisssances
6-partner project, PI: Prof. P. Gançarski (ICUBE, Univ. Strasbourg, France)
ANR French National Agency for Research, Feb. 2019 – Jan. 2023.
Role: Work Package Leader
[info]
|
 |
FRESQUEAU: Data mining for assessing and monitoring the hydrobiologic quality of running waters
6-partner project, PI: F. Le Ber (Univ. Strasbourg, France)
ANR French National Agency for Research, Nov. 2011 – Aug. 2013.
Role: Task Leader
[info]
|
 |
EXQUALIBUR: Quality-introspective Data Management System
European Marie Curie Outgoing International Fellowship (FP6-MOIF-CT-2006-041000)
EU European Commission, Sept. 2007 – Dec. 2010.
Role: Project Leader
|
 |
ENTHRONE: End-to-End QoS through Integrated Management of Content, Networks and Terminals, Phase 1
32-partner project, PI: Thales Broadcast France
EU European Integrated Project (FP6-2002-IST-2.3.1.8), Dec. 2003- Dec. 2005.
Role: Project Leader [info]
|
 |
QUADRIS: Quality of Multi-source Data and Information Systems
6-partner project, PI: Thales Broadcast (Univ. Rennes 1, France)
ANR French National Agency for Research, Dec. 2003 – Dec. 2008.
Role: Project Leader
|
Patents
- Detecting dependence between sources, United States Patent 8190546, issued 5/29/2012 and co-invented with X. L. Dong and D. Srivastava
- Scalable Automatic Repair for minimal change and maximal likelihood, European Patent 12724324.4 issued on May 25, 2012 co-invented with M. Yakout and A. K. Elmagarmid.
- Scalable Automatic Repair for minimal change and maximal likelihood, United States Patent: 9619494 - 13/115.253 issued on April 11, 2017, co-invented with M.Yakout and A. K. Elmagarmid.
Awards
- Promoted to ACM Senior Member Grade, July 2021
- Promoted to IEEE Senior Member Grade, Dec. 2018
- Recipient of Prime d’Encadrement Doctoral et de Recherche, 2018-2021
- Recipient of William Mong Visiting Research Fellowship, Hong-Kong University, Dec. 2018
- Recipient of Prix de la Ville de Marseille, Accueil de chercheur, 2018
- Recipient of ICIQ 2011 Best Paper Award for the paper entitled “DAQ_UWE: A Framework doe Designing Data Quality Aware Web Applications” with C. Guerra-García, I. Caballero, Laure Berti-Équille, M. Piattini. In Proceedings of the 16th International Conference on Information Quality (ICIQ 2011), Adelaide, Australia, November 2011
- Recipient of Marie Curie Outgoing International Fellowship 2006 (FP6-MOIF-CT-2006-041000), 3 years funding (2007-2010) from the European Commission (selection rate: 18.8% of 445 submissions)
- Best Junior Researcher paper INFORSID (French Conference on Information Systems) for the paper entitled “Qualité de données multi-sources et recommandation multi-critère”, INFORSID, 1999
Talks
- Laure Berti-Équille, David Dao, Stefano Ermon, Bedartha Goswami. Challenges in KDD and ML for Sustainable Development. Tutorial at the International Conference on Data Engineering (KDD 2021), Sinpapore, 15 August 2021. [website]
- Keynote entitled "Data Curation for ML : Towards a Principled Apporach" at the 1rst Workshop on Data Assessment and Readiness for AI in conjunction with the International Pacific-Asia Conference on Knowledge Discovery and Data Mining (PaKDD), New Dehli, India, 11-14 May, 2021.
- Keynote entitled "ML-Based Knowledge Graph Curation: Current Solutions and Challenges" at the 5th workshop on Managing the Evolution and Preservation of the Data Web (MEPDAW) in conjunction with The Web Conf 2019 [slides]
- Laure Berti-Équille. Reinforcement Learning for Data Cleaning and Data Preparation. Talk at HILDA workshop in conjunction with SIGMOD2019, Amsterdam, The Netherlands, July 2019. [slides]
- Keynote entitled "Machine Learning-Based Data Cleaning: Current Solutions and Challenges" at 21ème Edition du Colloque International sur le Document Numérique CiDE.21, Djerba, April 2019. [slides]
- Laure Berti-Équille, Angela Bonifati, Tova Milo. Machine Learning to Data Management: A Round Trip. Tutorial accepted at the International Conference on Data Engineering (ICDE 2018), Paris, April 2018. [slides]
- Laure Berti-Équille, Javier Borge-Holthoefer. Scaling Up Truth Discovery. Tutorial at the International Conference on Data Engineering (ICDE 2016), Helsinki, May 2016. [abstract] [slides]
- Laure Berti-Équille, Javier Borge-Holthoefer. Veracity of Big Data: From Truth Discovery Computation Algorithms to Models of Misinformation Dynamics. Tutorial at the 24th ACM International on Conference on Information and Knowledge Management (CIKM 2015), Melbourne, October 2015. [slides]
- Laure Berti-Équille, Tamraparni Dasu. New Directions for Data Quality Mining. Tutorial at the International Conference on Knowledge Discovery and Data Mining (KDD), Paris, France, June 2009. [slides]
- Laure Berti-Équille, Tamraparni Dasu. Data Quality Mining: New Directions. Tutorial at the 2009 IEEE International Conference on Data Mining (ICDM), Miami, Florida, USA, December 2009. [slides]
- Laure Berti-Équille. Analytics and Probabilistic Approaches for Data Quality, Tutorial presented at the Summer School Big Data CNRS 2014, Oléron, June 2014.
- Laure Berti-Équille. Data Quality for Big Data Analysis, Keynote presented at the Summer School Big Data Analysis in Earth Sciences EarthBiAS 2014, University of Aegean, Rhodes, Greece, July 2014.
- Laure Berti-Équille. Truth Discovery Methods, Keynote presented at the Summer School Big Data Analysis in Earth Sciences EarthBiAS 2014, University of Aegean, Rhodes, Greece, July 2014.
- Laure Berti-Équille. Evaluation of Data and Knowledge Quality in Data Mining. Tutorial presented at the éEGC Winter School on Statistical Machine Learning and Data Mining, Hammamet, Tunisia, February 2010.
- Laure Berti-Équille. "Evaluation of the quality of data and knowledge in data mining", éEGC Winter School on Statistical Machine Learning and Data Mining, Hammamet, Tunisia, February 3, 2010.
Mentoring
@ESPACE DEV, Montpellier, France
- Robin Jarry (Oct. 2020 – present, Ph.D. thesis of Montpellier Univ.) co-supervised with M. Chaumont, Univ. Nîmes, LIRMM, Montpellier, France and G. Subsol CNRS, LIRMM, Montpellier, France: "Assessment of CNN-based Methods for Poverty Estimation".
from Satellite Images
- Ali Ben Abbes (Oct. 2020 – Oct. 2021) Postdoc, PARSEC project (2019-2022): "Deep learning for poverty prediction from satellite images".
- Marouane Azibou (Dec. 2020 – Dec. 2021, Ph.D. thesis of Montpellier Univ.) co-supervised with S. Sellami, Univ. Aix-Marseille, France: "Automatiser la gestion de la qualité des flux de données inter-applications par apprentissage semi-supervisé" with INSEPTI company.
- Amina Ferhati (Oct. 2020 – Aug. 2021, PFE internship, ESI Algiers): "Détection de fake news par apprentissage profond".
- Dylan VOISIN (March 2021 – July 2021, internship Master 2 Recherche) with N. Novelli, Aix-Marseille Univ. : "Etude de la robustesse des représentations de type embedding (plongement) par rapport au bruit et au manque de données pour les graphes de connaissances".
- Vinciane LE MAGUET (Feb. 2021 – July 2021, internship Master 2 Recherche) : "Apprentissage par renforcement profond interactif et explicable appliqué à la détection d'anomalies".
- Abdelouahab Chibah (Oct. 2020 – March 2021, Ph.D. thesis of Grenoble Univ.) co-supervised with S. Amer-Yahia, CNRS, IMAG, Grenoble, France: "Mining and querying the evolution of behaviors over time".
- Hussein Khansa (Oct. 2019 – August 2021, Ph.D. thesis of Montpellier Univ.) co-supervised with C. Gervet, Univ. Montpellier, France: "Learning robust constrained model ensembles from large uncertain spatio-temporal data scenarios: the case of the climate impact study on agricultural planning".
@Aix-Marseille University, Marseille, France
- Guillaume Chambaret (Mars 2019 – present, Ph.D. thesis) co-supervised with F. Bouchara, Univ. Toulon, France: "Analyse et prédiction à partir de séries temporelles et recommandation sous contraintes pour la maintenance prédictive".
- Ugo Comignani (Oct. 2019 – August 2020) Postdoc, ANR QualiHealth project
- Victor Polizzi (Feb 2019 – July 2019) M.Sc. Intern Univ. Toulon, France, Projet SEAMED: "Analyse d’éléments de bioacoustique terrestre diurne et nocturne des stations d’enregistrement du Domaine du Rayol" (in French).
- Liliane Kong Win Chang (Mars 2019 – July 2019) M.Sc. Intern Univ. Lyon, France, "Robustness to noise of ML model-agnostic explanation: an empirical study".
@Qatar Computing Research Institute, Doha, Qatar
- Kushal Sha (October 2015 – July 2016) M.Sc. Intern, "Urban computing: traffic density near-future forecasting".
- Mouhamadou Lamine Ba (October 2015 – April 2016), Research Associate (Postdoc), "Truth discovery for Web data and spatio-temporal events".
- Aisha El-Allam (May 2015 – July 2015) M.Sc. Summer Intern, Qatar-CMU, "Traffic Density Prediction based on Bluetooth sensored data".
- Posha Dave (May 2015-July 2015) M.Sc. Summer Intern, Qatar-CMU, "Information Extraction for Truth Discovery".
- Naman Goel (October 2014 – February 2015, Internship) B.Tech, M.Tech in Computer Science & Engineering from Indian Institute of Technology, BHU: "Meta-classifier for truth discovery, explanation and allegation".
- Dalia Attia Waguih (November 2013 – August 2014, Internship), M.Sc. in Computer Science from Alexandria University, Egypt: "Truth discovery algorithms implementation and experimental study".
- Mahmood Neshati, Ph.D. in information retrieval from Sharif University of Technology, Tehran, Iran (May 2014 – August 2014, Internship): "Combining constraint-based approaches with data mining-based approaches for detecting and repairing anomalies".
@Institut de Recherche pour le Développement, France
- Andres Troya (October 2013 – October 2016, Ph.D. thesis) co-supervised with Prof. Gançarski, Strasbourg University, France: "Collaborative Approach and Data and Knowledge Quality in Multi-Paradigm Remote Sensing Image Analysis".
- Hatim Chahdi (October 2013 – July 2017, Ph.D. thesis) co-supervised with Dr Isabelle Mougenot, Montpellier University 2: "Ontology-enhanced data classification".
- Eva Serrano (October 2013 – January 2017, Ph.D. thesis) co-supervised with Prof. Armienta, UNAM, Mexico: "Impact of data quality on the interpretations of statistical analysis results of Environmental Studies: Application to the Evaluation of the impact of emerging pollutants on the quality of the water of the rivers Tula, Taxco, Culiacan and Humaya in Mexico".
- Cesar Guerra, Ph.D., (May 2011 – July 2011, Ph.D. Internship) co-supervised with Dr Ismael Caballero, Univ. Castilla-La Mancha: "A Framework for Designing Data Quality Aware Web Applications".
@University of Rennes 1, Rennes, France
- Anicet Kouomou-Choupo, University of Rennes 1 (December 2002 – February 2006, Ph.D. thesis): "Improving Similarity Search in Very Large Image Databases with Multimedia Mining Techniques".
- Jean-André Benvenuti, University of Rennes 1 (December 2002 – December 2008, Ph.D. thesis): "Intelligent Parsing of XML Pedagogical Materials for Military Staff Training".
- Ravi Jain, University of South Australia (February 2007 – September 2007, Post-Doc): "Quality-Awareness for Data Clustering".
- Yongluan Zhou, National University of Singapore (October 2005 – March 2006, Ph.D. Internship): "Quality-Driven Distributed Query Planning and Optimization Based on Data Quality Negotiation".
- Manuel Bes (September 2001 – June 2002, M.Sc. Internship): "Comparative Study of Association Rule Discovery Algorithms for Genomic Data Mining".
- Anne Charlery (September 2002 – June 2003, M.Sc. Internship): "Indexing Techniques for Genomic Data".
- Mehrez Chaikha-Douaihy (February 2005 – June 2005, M.Sc. Internship): "Optimizing Content-Based Image Retrieval".
- Wilfried Jouve (February 2005 – August 2005, M.Sc. Internship): "Enriching Multimedia Content Description for Broadcast Environments".
Biographical Sketch
Laure Berti-Équille is a Research Director at IRD, the French research institute for sustainable development. Before, she was a full professor in Computer Science at Aix-Marseille University in France, a senior scientist at Qatar Computing Research Institute, an associate professor at University of Rennes 1 in France, and a 2-years visiting researcher at AT&T Labs Research in New Jersey, as a recipient of the prestigious European Marie Curie Outgoing Fellowship. Her current research interests are on the inter-play of data management and machine learning with a focus on anomaly detection, data cleaning and preparation, and data fusion. She has more than 100 publications in major conferences or journals along with two books (edited by Morgan&Claypool in 2015 and Hermès-Lavoisier in 2012), and 10 co-edited proceedings. She has co-organized numerous workshops on information and data quality in conjunction with top conferences such as SIGMOD and VLDB. She has given several tutorials and keynote talks on data curation and data engineering for applied machine learning (KDD'21, ICDE'18, ICDE'16, CIKM'15, KDD'09, ICDM'09). She is an Associate Editor of the VLDB Journal, Frontiers in Big Data, and the ACM Information and Data Quality Journal and served in many program committees of international conferences. She has been leading projects with grants from the French National Agency of Research (ANR), the French National Research Council (CNRS), Belmont Forum, and the European Union. She is a IEEE and ACM senior member.